https://pinot.apache.org/ logo
Join Slack
Powered by
# general
  • r

    Rashid

    07/26/2021, 12:40 PM
    Is it possible to batch ingest via Apache Flink to the HDFS as deep store in Apache Pinot?
    m
    p
    +4
    • 7
    • 15
  • l

    Lakshmanan Velusamy

    07/27/2021, 4:06 AM
    Hi Group, We want to compute rolling window percentile on top of group by results. Is there an option to expose serialized MergingDigest which is used by PercentileTDigest, similar to DistinctCountRawHLL ?
    m
    j
    • 3
    • 13
  • m

    Map

    07/27/2021, 8:41 PM
    If I am not mistaken, Pinot's integration with kafka schema registry is only for avro decoder? Can pinot read table schemas directly from Kafka schema registry?
    m
    k
    • 3
    • 11
  • r

    robert zych

    07/28/2021, 7:32 PM
    Hi @User! I'm a Data Engineer and interested in real-time data processing, analytics, and anomaly detection. Some the nice folks from the Kafka community recommended StarTree/Pinot. It's great to such great performance, easy Kafka integration, and usability. Can't wait to get started and learn more about Pinot!
    ❤️ 2
    🍾 2
    👋🏻 1
    👋 3
    k
    • 2
    • 4
  • p

    Pei Wang

    07/28/2021, 7:36 PM
    Hi everyone, I am an engineer working on Presto development. I was interested when reading the tutorial using Pinot and Presto posted in medium. I am very exited about data analytics in general so definitely want to learn more about Pinot!
    👋🏻 1
    👋 3
    ❤️ 2
    k
    x
    • 3
    • 4
  • a

    Aash Viswanathan

    07/28/2021, 7:51 PM
    Howdy! I’m a senior data scientist at Atlassian. Formerly worked at LinkedIn where I used Pinot to deliver metrics to downstream apps. I’m here for the community and to learn from you all how you use pinot in your applications.
    👋 4
    👋🏻 1
    ❤️ 1
    x
    s
    +2
    • 5
    • 6
  • r

    Ryan Clark

    07/28/2021, 9:24 PM
    Debugging batch ingestion from S3 🧵 The job got submitted correctly to my cluster, but I suspect my input file format is to blame for the failure
    java.lang.IllegalStateException: Cannot flatten value node: null
    x
    • 2
    • 5
  • r

    robert zych

    07/29/2021, 2:10 AM
    @User gave a very interesting Kafka Summit APAC talk recently entitled "Real-time Analytics with Upsert Using Apache Kafka and Apache Pinot"
    👍 5
    k
    y
    m
    • 4
    • 10
  • p

    prateek nigam

    07/29/2021, 4:58 AM
    Is apache pinot support connection pool mechanism, if we use JDBC client.
    k
    • 2
    • 1
  • l

    Lars-Kristian Svenøy

    07/29/2021, 9:42 AM
    Hi everyone! I am an engineer working on implementing Pinot + Presto to modernise our current stack for realtime analytics. Our current stack is not well suited for realtime analytics, and this is when we came across Pinot. I’ve been looking into the stack for a while now, ran various performance tests and it looks really promising so far!
    👍 6
    🚀 4
    k
    • 2
    • 2
  • m

    Map

    07/29/2021, 4:51 PM
    if i am reading correctly, when
    segmentPushType
    set to
    REFRESH
    , no segment build will be triggered? Once a segment is flushed, it will be dropped? However, in my test, it seems segments still get built
    m
    • 2
    • 4
  • p

    Pala

    07/29/2021, 10:28 PM
    Hello folks. We are evaluating deploying Pinot. Do most folks here deploy Pinot on Kubernetes or directly on EC2 (or equivalent) nodes?
    m
    k
    • 3
    • 3
  • c

    Charles

    07/30/2021, 1:09 AM
    Hi All, Do we have some tools to migrate data from ClickHouse to Pinot? thx
    k
    x
    • 3
    • 6
  • r

    robert zych

    07/30/2021, 4:46 PM
    I've heard the Pinot is for "User-Facing, Real-Time Analytics" and was wondering who the "User" is in this context? Is it referring to external end-users? If so, is Pinot overkill for internal analytics?
    m
    k
    • 3
    • 2
  • c

    Carl

    07/30/2021, 5:34 PM
    Hi, we are ingesting a timestamp data into a offline Pinot offline table as a long data type and 1millisecondsepoch date time specs. But for all null records Pinot ingested as min long value. How should we define the schema so that null timestamp can be also ingested as a null value in Pinot?
    m
    k
    • 3
    • 6
  • w

    Will Gan

    07/30/2021, 7:01 PM
    Hi, does anyone know why I might be getting a
    java.net.BindException: Address already in use
    when I use
    bin/pinot-admin.sh
    ? For context I'm using Kubernetes and running the command from the Controller Pod
    x
    • 2
    • 9
  • x

    xtrntr

    07/31/2021, 12:23 AM
    hello, i’m trying to figure out how pinot can support the queries i’ll like to make. my schema has the following fields:
    Copy code
    userId | time (15 minutes) | cell
    a        0                      1
    a        1                      2
    a        2                      -1
    a        3                      -1
    a        4                      -1
    a        5                      3
    a        6                      3
    ..
    z        0                      1
    i’ll like to interpolate the missing values (-1), then query the result of that transformation. my understanding is that pinot doesn’t support subqueries but does so in conjunction with presto, is this the right channel to ask?
    Copy code
    # example of values after interpolation
    userId | time (15 minutes) | cell
    a        0                      1
    a        1                      2
    a        2                      2
    a        3                      2 or 3
    a        4                      3
    a        5                      3
    a        6                      3
    
    # example of values after forwardFill
    userId | time (15 minutes) | cell
    a        0                      1
    a        1                      2
    a        2                      2
    a        3                      2
    a        4                      2
    a        5                      3
    a        6                      3
    m
    k
    k
    • 4
    • 28
  • a

    Abhay Rawat

    08/02/2021, 12:58 PM
    Hi @User and everyone 👋, I am a data engineer at expediagroup and currently POC ing Pinot for one of our use case. Performance is very promising but we struggled a bit with setup, one of the reason was low kubernetes support in our environment. Another hurdle we encountered was lack of subquery/join support, guess we might have to use it in conjunction with presto for that. Still trialling.
    m
    b
    +2
    • 5
    • 5
  • p

    Pedro Silva

    08/02/2021, 2:47 PM
    Hello, does Pinot support initDB scripts when deploying in kubernetes? Akin to cassandra: https://artifacthub.io/packages/helm/bitnami/cassandra#cassandra-parameters
    m
    x
    • 3
    • 5
  • k

    Karin Wolok

    08/02/2021, 6:59 PM
    Hey all!!!! 🎉 TODAY IS A REALLY BIG DAY!!! 🎉 Pinot has officially graduated as a top-level Apache Software Foundation project! 🎓 🍷 🥳 One of the co-authors of the Pinot project (and now, founder of StarTree.ai) @User had published a blog about our journey here (URL below) 📢 Please help us spread the word on this monumental day by sharing this news!!!! 🔊 Big shoutout to all the contributors who helped bring this project to this place! Congrats to you all!!! ❤️ @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User https://www.startree.ai/blogs/apache-pinots-graduation-a-celebration-of-community-and-a-look-ahead/
    🎯 1
    🙌 12
    clap1 2
    ❤️ 25
    🎉 28
    🍷 16
    l
    • 2
    • 1
  • e

    Elon

    08/02/2021, 6:59 PM
    Congrats!!!
    🎉 1
    k
    • 2
    • 1
  • c

    Carl

    08/03/2021, 2:18 AM
    We have a offline table field truncating texts as it exceed the default 512 length. Is it possible to change the default maxlength without having to reingest all the records to Pinot?
    s
    m
    • 3
    • 7
  • s

    Saurabh Dwivedy

    08/03/2021, 8:03 AM
    Great news on Pinot being selected as Apache TL project
    k
    • 2
    • 1
  • b

    Bruce Ritchie

    08/03/2021, 7:35 PM
    I unfortunately had to miss the meetup today. Is it going to be made available on youtube/etc at some point?
    m
    • 2
    • 1
  • e

    Evan Galpin

    08/04/2021, 2:35 PM
    With respect to complex data types[1] (not yet available, not until 0.8 AFAIU), what would happen in the given example if there were an additional complex data type in the input object. Say that the example input object also had a
    friends
    array (arbitrary example) at the same level as
    group
    . I noted that the docs say this about the unnesting example (emphasis mine):
    For instance, the example in the beginning will transform into two rows with this configuration example.
    Are these then distinct rows in the table, or some kind of special nested rows associated with a top-level row? And if there were another nested/complex field, would that result in a combinatorial explosion in terms of number of rows? A related question: how does this approach work with conjunctions? In the example given where the topics are
    paddling
    and
    hiking
    , can a query be written to answer the question “which meetup events had talks about both paddling and hiking?” [1] https://docs.pinot.apache.org/basics/data-import/complex-type#handle-the-complex-type-with-ingestion-configurations
    k
    j
    • 3
    • 25
  • v

    Vibhor Jain

    08/05/2021, 2:49 PM
    Hi All, We are evaluating Apache Pinot around the security posture and wanted some info around the same. For encrypting Personally Identifiable Information (PII) data at rest, we need to handle this at two places. 1. Completed segments sitting on Pinot servers. 2. Segments sitting in Deep store say HDFS (looks like Pinot supports this) Is there any support from Pinot for segments sitting on the server's disk? On the Github issue, we see https://github.com/apache/pinot/issues/5762 that talks about this and is still open.
    k
    d
    • 3
    • 3
  • d

    Davis Treybig

    08/05/2021, 7:13 PM
    Hi all - I am really interested in Pinot and tooling like it which is enabling simpler development for building customer-facing analytical applications, especially cases where you are moving beyond a dashboard and want to really embed data into the application workflow. I have spent a few months surveying a lot of the different tools and approaches that are coming together to make this easier, including Pinot. I wrote up some of what I have seen here, in case it is of interest to anyone: https://towardsdatascience.com/the-analytical-application-stack-eead8ce6b70
    👍 10
    m
    k
    a
    • 4
    • 7
  • n

    Noureddine

    08/06/2021, 2:10 PM
    Hi guys, I am looking for a way to delete a record in a pinot table. I am aware that Pinot doesn't support this type of operation, but in our use case we have deletes once in a while so I am wondering if there is a Hack on how to delete a record without rebuilding the whole Table ?
    m
    k
    • 3
    • 6
  • c

    Charles

    08/09/2021, 3:11 AM
    Hi all can we add more than 500 columns in one table for some special scenarios ?
    k
    • 2
    • 2
  • c

    Charles

    08/10/2021, 1:26 AM
    Hi All do we have some example to extract attributes from json objects in transformFunction
    n
    • 2
    • 5
1...252627...160Latest