https://pinot.apache.org/ logo
Join Slack
Powered by
# general
  • j

    Joey Pereira

    08/19/2020, 6:25 PM
    Is there any way to get a query plan from Pinot, such as indexes used to pull data, or is this something that you can only really interpret from the query response stats on the Controller's query console?
    k
    k
    • 3
    • 10
  • r

    Ravikiran Katneni

    08/20/2020, 4:46 AM
    Yes, we are trying to benchmark, Presto-Pinot combination for 10GB to 200GB TPCH data. We want good query performance with fast loading capability. When compared with other OLAP dbs, Pinot seems to be taking long time for loading data. One observation is, the standalone job is using a single CPU(not sure how many threads) for a single upload job, even when there are having multiple files in import folder. Other OLAP dbs seem to load data using more than one CPU. Is there any setting to make Pinot import job use more than one CPU? Using HDFS ,S3 or GC is not in the scope of bench marking, We want to minimize the dependency on Hadoop or other Big systems because the data sizes we are targeting are not truly BigData terriroty.
    k
    • 2
    • 6
  • y

    Yugandhar

    08/24/2020, 4:40 PM
    Does it support any exactly once guarantees like Druid ?
    m
    • 2
    • 6
  • y

    Yugandhar

    08/25/2020, 3:56 PM
    Is there support for Piont with existing parquet files and if it supports, can it build it's own indexing on top of that ?
    🙌 1
    m
    • 2
    • 2
  • a

    Adrian Cole

    08/26/2020, 4:44 AM
    Hi, question about ServiceManager.. is it a feature or bug to have multiple instances of the same role in bootstrap services? eg. 2 minions
    k
    x
    • 3
    • 32
  • s

    samarth

    08/26/2020, 6:15 AM
    Is there a tool / utility to read segment data . modify it and upload . I was looking to generate bulk data for query performance testing pinot . Thanks ..
    k
    j
    • 3
    • 5
  • d

    Daniel Kocot

    08/26/2020, 9:30 PM
    Hi there 👋
    👋 3
    m
    • 2
    • 1
  • a

    Adrian Cole

    08/27/2020, 3:23 AM
    hi. wondering about composite health endpoint for servicemanager
    x
    d
    • 3
    • 15
  • a

    Adrian Cole

    08/28/2020, 3:10 AM
    mind pushing a snapshot here? I can use docker to get latest, but I want to play with using dependencies as flat classpath is so much faster to boot https://repository.apache.org/content/repositories/snapshots/org/apache/pinot/ I can use jitpack but it is awkward and doesn't work to get dep tree due to this: "<version>${revision}${sha1}</version>"
    x
    j
    • 3
    • 23
  • s

    Syed Zeeshan Ahmed

    08/28/2020, 11:22 PM
    I am new to this project, so I just wanted to know on how to build this project in debug mode?
    m
    • 2
    • 1
  • b

    Buchi Reddy

    08/31/2020, 8:37 PM
    Curious: Are there any plans of introducing per doc TTL?
    m
    k
    n
    • 4
    • 21
  • a

    Adrian Cole

    09/02/2020, 2:05 AM
    hi, what's the general advice to bootstrap pinot as it comes up (thinking servicemanager where you have everything you need in one container)
    k
    • 2
    • 8
  • a

    Adrian Cole

    09/02/2020, 6:10 AM
    Copy code
    public class IndexingConfig extends BaseJsonConfig {
      private List<String> _invertedIndexColumns;
      private List<String> _rangeIndexColumns;
      private boolean _autoGeneratedInvertedIndex;
    n
    j
    • 3
    • 8
  • m

    Mayank

    09/02/2020, 2:59 PM
    Hi Community, friendly reminder of the Pinot virtual meetup today (9/2 6pm). Hope to see you all there: https://linkedinpinotmeetup.splashthat.com/
    🍷 5
    👍 8
    d
    • 2
    • 1
  • a

    Adrian Cole

    09/03/2020, 6:58 AM
    I see that 0.5.0 rc1 is pushed to docker. this is exciting. Is it also pushed to a stable maven repo?
    👍 1
    k
    x
    • 3
    • 18
  • a

    Adrian Cole

    09/03/2020, 8:56 AM
    any way to get rid of this error nicely besides hushing the logger in a new cluster (with ServiceManager)
    pinot              | 08:53:07,679 WARN [HelixController-pipeline-default-hypertrace-views-(f3017057_DEFAULT)] controller.LogUtil (LogUtil.java:11) - Event f3017057_DEFAULT : Cannot confirm top state missing start time. Use the current system time as the start time.
    k
    j
    • 3
    • 4
  • a

    Adrian Cole

    09/03/2020, 9:01 AM
    It seems that controller.data.dir pinot.server.instance.dataDir and pinot.server.instance.segmentTarDir need to be absolute paths. this is a little annoying. can this be relative?
    k
    • 2
    • 6
  • k

    Kavin Kuppusamy

    09/03/2020, 4:19 PM
    @User Hey Pinot Team, that was great virtual meetup last night. Got to see some amazing developments going on with pinot. Could you please share the link to session recording if it has been uploaded somewhere?
    👍 6
    m
    • 2
    • 1
  • a

    Adrian Cole

    09/04/2020, 1:30 AM
    question.. is there a side-effect on ZooKeeper from running
    bin/pinot-admin.sh AddTable
    or is it contained to pinot's data directory
    x
    • 2
    • 29
  • a

    Adrian Cole

    09/04/2020, 5:41 AM
    It seems PinotAdministrator doesn't return exit code 1 on failure.. this makes use in shell scripts difficult
    x
    • 2
    • 8
  • s

    Sarabjeet

    09/05/2020, 8:07 PM
    Hello guys! Super-new here. Qq - do we still use Zookeeper inspector for local installation? I couldn’t find its reference in here but saw it mentioned as a pre-requisite in the

    YouTube▾

    video. I’m asking this because I have minor thing to add in the documentation.
    n
    • 2
    • 1
  • m

    Mayank

    09/08/2020, 2:32 PM
    And the slides can be found here: https://www.slideshare.net/MayankShrivastava25/apache-pinot-meetup-sept02-2020
    👍 2
    🍷 4
    🎉 3
    b
    h
    • 3
    • 3
  • s

    samarth

    09/08/2020, 6:16 PM
    Is it possible to get JVM metrics like gc time, number of threads , heap / non heap memory etc from pinot-server ? I looked at 1. https://github.com/apache/incubator-pinot/blob/master/pinot-common/src/main/java/org/apache/pinot/common/metrics/ServerMeter.java 2. https://github.com/apache/incubator-pinot/blob/master/pinot-common/src/main/java/org/apache/pinot/common/metrics/ServerGauge.java but did not find much other than
    REALTIME_OFFHEAP_MEMORY_USED
    k
    • 2
    • 1
  • t

    Tim Chan

    09/09/2020, 12:42 AM
    I don’t think this is possible. I am getting
    java.lang.IllegalStateException: Cannot read single-value from Collection
    in the server logs.
    n
    • 2
    • 6
  • j

    Joey Pereira

    09/09/2020, 4:39 PM
    How does Pinot's real-time ingestion handle out-of-orderness for the event timestamps? eg: if we have event timestamps that may have 30s out-of-order, how does Pinot address that? What about longer time-ranges (1min, 10min?) For context: we have publisher time timestamping that naively allows publishers to specify a "created" timestamp, but we force it to be the current timestamp if it's >30s from the current timestamp. This leads to some out-of-orderness.
    m
    k
    • 3
    • 12
  • j

    Joey Pereira

    09/09/2020, 6:15 PM
    Follow up question from (^): since segments are not time-partitioned, do hybrid table which need to combine star-indexes use the time-granularity to divide the star-index (ie: allowing for star-index on online to still be efficient?)
    k
    • 2
    • 5
  • j

    Joey Pereira

    09/09/2020, 6:21 PM
    Also, does the offline data generation intentionally try to shuggle and generate segment data so segments will contain only one time-interval (ie: generate well-partitioned segments)?
    k
    • 2
    • 3
  • k

    Kishore G

    09/09/2020, 9:02 PM
    numEntriesScannedInFilter, numEntriesScannedPostFilter
    s
    • 2
    • 4
  • t

    Tim Chan

    09/10/2020, 9:18 PM
    i was hoping that setting up deep store using GCS will save me from perpetually having to expand disk
    x
    • 2
    • 2
  • t

    Tim Chan

    09/10/2020, 9:19 PM
    or another question, is there a TTL for data in a table?
    n
    x
    • 3
    • 9
1...789...160Latest