https://pinot.apache.org/ logo
Join Slack
Powered by
# general
  • m

    mod

    05/17/2021, 6:02 AM
    HI, Is the cloud pubsub driver available with apache pinot?
    k
    • 2
    • 3
  • r

    RK

    05/17/2021, 12:59 PM
    Can anyone explain me what is the issue here.i have started Pinot controller broker and server.since last 2 weeks it is working fine but today suddenly that UI part is not loading.
    a
    m
    d
    • 4
    • 7
  • r

    RK

    05/17/2021, 1:01 PM
    I have attached the video also.its not stable loading for milliseconds and then again showing blank
    m
    • 2
    • 2
  • m

    Mayank

    05/17/2021, 4:57 PM
    We have seen both models in deployment. It depends on your use case. If you are going to just server dashboards and are ok with sub-second latencies, Pinot+Presto might work (note that complex joins/nested queries may make the latency higher than expected). If you are looking for high-throughput at low latency, perhaps using Pinot would be better option.
    b
    • 2
    • 1
  • r

    RK

    05/18/2021, 6:42 AM
    In config file I have added like so i.am.only able to see the rows whose value is not null.
    x
    n
    • 3
    • 4
  • i

    irakov

    05/18/2021, 8:04 AM
    Hi, I have a question about StarTree indices: are there any recommendations about ordering in
    Copy code
    tableIndexConfig.starTreeIndexConfigs.dimensionsSplitOrder
    Thanks in advance :)
    n
    • 2
    • 3
  • k

    kauts shukla

    05/18/2021, 10:09 AM
    Hi All, My server is crashing every time due of OOM, My servers are r5.4xlarge and heap space i have provided is 110GB. I have 2 servers.
    d
    m
    • 3
    • 47
  • a

    Ambika

    05/18/2021, 10:12 AM
    Hello -- I created a default real time table . After consuming some 300k events i wanted to add a sorted inverted so i edited the table to add the sorted col. How can I check if my query is using the index and whether the index is successfully created or not.?
    m
    n
    • 3
    • 27
  • a

    Ambika

    05/18/2021, 10:12 AM
    is there any way to get the explain plan ?
    k
    • 2
    • 1
  • y

    Yupeng Fu

    05/18/2021, 4:45 PM
    Any idea why we convert division of long/long to double by default? for example, in presto
    select (1618900560000 - 1618437600000)/604800000*604800000
    returns 0, while
    SELECT (1618900560000 - 1618437600000)/604800000
    returns 0.76547 ?
    a
    s
    +2
    • 5
    • 39
  • k

    Karin Wolok

    05/18/2021, 5:49 PM
    šŸ· Welcome new Pinot community members! šŸ‘‹ We're so happy that you've joined us! If you haven't already, please take a moment and introduce yourself, let us know how you found Pinot and what you're working on! 😃 @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User
    m
    l
    g
    • 4
    • 3
  • n

    Neil Teng

    05/20/2021, 1:33 PM
    Hey, do we have any example querying/working on the multi-value column? I cannot find one in the document. e.g. get the first element in the multi-val column.
    m
    j
    +2
    • 5
    • 15
  • k

    kelv

    05/20/2021, 11:32 PM
    Hi, i'm streaming from Kafka and would like to have a way to uniquely identify messages. In Kafka consumer, I can do that with offset. Is it possible to expose Kafka metadata such as offset/timestamp to Pinot clients?
    m
    x
    +3
    • 6
    • 17
  • r

    RK

    05/21/2021, 10:51 AM
    How Pinot loads the segment.i.e. If I am consuming data from Kafka and if any new events come in Kafka so pinot will load the entire segment once again or only the new events.?
    m
    • 2
    • 5
  • h

    Hamza

    05/21/2021, 3:06 PM
    Hello, I'm new to Pinot and I'm trying to run some tests on the tool using a GCP Deployment. I have deployed GCP o GKE using the the official documentation and it seems to work well. The next step is to import batch data from Google Cloud Storage in order to run some queries. The data on Cloud Storage is 300 Gb of csv files. First thing : I couldn't figure out how to link Cloud Storage to Apache Pinot via the pinot plugin. Second thing : How can I transfrom the csv files into Pinot Tables ? Can I have some guidance on these two subjects ? Thanks in advance šŸ˜„
    m
    x
    • 3
    • 54
  • k

    Ken Krugler

    05/21/2021, 7:01 PM
    What’s the use case for running with multiple controllers? These are stateless, and don’t have a lot of load (if using something like HDFS for deep storage), right? So is it just zero downtime (assuming you have a LB in front of them) in case one goes down?
    m
    k
    • 3
    • 8
  • k

    Ken Krugler

    05/21/2021, 7:09 PM
    If I have one controller and two brokers, does the controller distribute the query load across the two brokers? I thought it would, but the Pinot in production page recommends ā€œHTTP load balancers for spraying queries across brokers (or other mechanism to balance queries)ā€œ. Or is that to automatically route traffic away from a broker which has gone down?
    m
    • 2
    • 6
  • k

    kauts shukla

    05/22/2021, 2:56 PM
    Hi All, everyone my pinot server keep getting lost on restart, have kept JVM 16GB and table [ā€œloadModeā€: ā€œMMAPā€] . Screenshot
    j
    • 2
    • 2
  • r

    Ricardo Bernardino

    05/25/2021, 9:30 AM
    Hi all! For anyone unfamiliar with Zookeeper operations (which we were), you will see that Zookeeper will keep increasing its disk usage. We found this odd since it only allows 1MB worth of data in its leaf nodes. Looking at our data directory we saw its disk usage mainly in the logs folder and the snapshots. After searching a bit, we found that there are two configurations that will automatically purge these files (
    autopurge.snapRetainCount
    and
    autopurge.purgeInterval
    ). The logs are transaction logs, not application level logs, and they relate to the snapshots such that Zookeeper can recover from a failure by using the latest snapshot and the transaction logs. The
    purgeInterval
    is 0 by default so it does not purge anything. and
    snapRetainCount
    is 3 - but again this is disabled. Depending on the docker image and helm chart you are using you already have an env var to change the `purgeInterval`: •
    ZOO_AUTOPURGE_PURGEINTERVAL
    for the official docker image •
    ZK_PURGE_INTERVAL
    for the zookeeper incubator helm chart Hope this helps!
    šŸ‘ 3
    x
    k
    l
    • 4
    • 4
  • k

    kauts shukla

    05/25/2021, 10:57 AM
    Hi All, Expecting that all the finished segments will be rebalanced among all 3 pinot servers, instead of just the consuming server post addition of 1 new server from 2 to 3 nodes now.
    m
    k
    • 3
    • 5
  • t

    troywinter

    05/26/2021, 3:59 AM
    Can I use an aggregation result as argument to a UDF during query?
    x
    j
    • 3
    • 12
  • r

    RK

    05/26/2021, 5:03 AM
    Someone kindly guide where can I find these 2 details.I am trying to use HDFS as deepatorage and need to set these 2 properties as per the pinot docs, but unable to find which path I should use. I am loading streaming data from Kafka topic and pinot is running in cluster-mode. Controller , server and brokers are running on diff diff nodes.
    x
    • 2
    • 7
  • r

    RK

    05/26/2021, 11:01 AM
    Is there any limit in Pinot to execute queries simentenoulsly. I.e. I have created a pipeline through pinot where we are ingesting data from Kafka topic and creating Pinot table.After that connecting from prestro using pinot-connector and in the last step connecting from superset and from there querying pinot table. Here I have generated 4 graphs using diff SQL queries and added all 4 in one dashboard.After sometime when I am trying refresh the dashboard page so it's firing 4 queries to Pinot and all are getting failed it's showing internal error couldn't reach host, and if I am refreshing one slice from the dashboard then it's working fine.i.e. when firing one query at a time. P.S. I am facing this issue when I have started pinot in cluster mode with 3 servers 2 controller and 2 broker. When I am doing the same using quick-start it's working fine and entire dashboard is loading/refreshing at same time.
    m
    • 2
    • 6
  • j

    Josh Highley

    05/26/2021, 3:16 PM
    are there any systemd scripts for starting / stopping the pinot components?
    m
    d
    • 3
    • 10
  • c

    Chethan UK

    05/26/2021, 4:59 PM
    Wow, great content. Excellent half hour spent. Great job @User @User
    šŸ· 3
    āž• 2
    a
    • 2
    • 2
  • k

    kelv

    05/26/2021, 5:35 PM
    Hi! What is the reason for having Pinot queries implicitly default to
    limit 10
    ? Is there a way to remove the limit in the query without specifying an arbitrary limit value?
    m
    • 2
    • 15
  • c

    Carl

    05/26/2021, 7:28 PM
    Hi, in Pinot docs, ā€œe.g. 365 DAYS in the example means that segments containing data older than 365 days will be deleted periodically.ā€ Does Pinot delete the segment containing ALL data or ANY data older than the retentionperiod?
    x
    • 2
    • 1
  • k

    Karin Wolok

    05/26/2021, 8:50 PM
    Welcome new Apache Pinot community members! šŸ‘‹ We're so happy you're here and joining the movement of speeeeeed for user-facing analytics. šŸ· Please take a moment, if you haven't already - to introduce yourself and tell us what brought you here! 😃 @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User @User
    chewbacca dance 2
    a
    f
    • 3
    • 2
  • k

    Kewei Shang

    05/28/2021, 4:34 PM
    Hi team, we’re considering Pinot for our customer-facing analytics (mainly dashboards). We also want to use the same data store (Pinot table) to export denormalized report (CSV file of 10k - 100k rows filtered from the table, with ~20 selected columns) to our clients. Is Pinot also a good fit in the report generating use case. For example, our backend service could query the Pinot table and generate the CSV report from the query result. Thanks!
    m
    k
    • 3
    • 6
  • j

    Juraj Komerički

    05/30/2021, 2:28 PM
    Hello everyone šŸ˜„ , is there a tutorial or some doc in which a multi-node setup is explained. i have 5 VMs and want to set it (Pinot) up on them. What are the best practices regarding infrastructure here?
    k
    • 2
    • 2
1...202122...160Latest