# general
  • j

    Jagannath Timma

    04/01/2022, 5:33 PM
    General question: Why are the Pinot broker and controller StatefulSets in the Helm chart? Couldn’t those be Deployments? They don’t have any persistence afaik.
  • w

    Weixiang Sun

    04/01/2022, 5:44 PM
    If I want to move pinot table from one tenant to another, can I just change the tenant name inside table config? If yes, is there any downtime?
  • p

    Prateek Singhal

    04/01/2022, 10:57 PM
    Hi team, I believe it is possible to apply an inverted index on multiple columns. Any idea how it is stored? All the documentation I have seen so far gives examples of an inverted index on only 1 column. I would like to apply an inverted index on multiple columns in an optimized way.
  • a

    Alice

    04/02/2022, 5:30 PM
    Hi team, if two tables ingest the same Kafka topic, will they have the same copy of data?
  • a

    Alice

    04/02/2022, 5:43 PM
    What’s the best practice for setting stream.kafka.consumer.prop.auto.offset.reset? I haven’t found detailed info about it. I set it to smallest in case stream data is lost. But I ran into a situation where, when I restarted the controller, Pinot just stopped consuming data.
  • s

    Satyam Raj

    04/04/2022, 9:58 AM
    Hi everyone! I’ve been doing a POC on Pinot, and I’m currently facing an issue while ingesting ORC file data into Pinot. Filed a GH issue as well: https://github.com/apache/pinot/issues/8460 Can anyone help?
  • m

    Mohemmad Zaid Khan

    04/04/2022, 10:59 AM
    Does Pinot support using pinot-jdbc-client in JMeter, for perf testing Pinot query performance with JMeter’s JDBC Request Sampler?
  • d

    Diana Arnos

    04/05/2022, 8:44 AM
    Hey there, which metric can I use to check the consumption lag from the servers in comparison with the topic they are consuming from?
    👍 1
  • b

    Bordin Suwannatri

    04/05/2022, 9:19 AM
    Hello everyone, I am trying to configure TLS/SSL on my Apache Pinot cluster, but it is still not working. Can anyone share how to enable TLS/SSL on the Pinot cluster components (controller, broker, server, minion) and how to connect to Apache Pinot over TLS/SSL?
  • f

    francoisa

    04/05/2022, 2:25 PM
    Hi. Starting to get my hands dirty in the code to implement my GDPR purge process. I’m struggling a bit with the build time and with having no way to test. Is there any way to get a faster build to test it? I’ve used the following Maven command
    mvn install package -DskipTests -Pbin-dist
    and it takes more than 20 minutes to build 😕 Any way to make it faster?
  • d

    Daniel

    04/05/2022, 5:29 PM
    Hi there. Is there a way to transform a string field to a double/float upon ingestion? I'm looking at the supported transform functions and do not see a function that supports this https://docs.pinot.apache.org/users/user-guide-query/supported-transformations
  • a

    Ashwin

    04/06/2022, 5:02 AM
    Does anybody know the differences in use cases between Pinot, Druid, ClickHouse, and Rockset?
    👀 2
  • a

    Alice

    04/06/2022, 9:25 AM
    image.png
  • p

    Prashant Pandey

    04/06/2022, 1:57 PM
    Hi Pinot folks, we observed a peculiar incident today wherein consumption stopped from just 1 partition of a topic (this topic has 96 partitions, 95 are working fine). This segment was moved from CONSUMING to OFFLINE state due to some exception during consumption.
    2022/04/06 12:06:04.179 ERROR [LLRealtimeSegmentDataManager_span_event_view_1__50__287__20220406T1205Z] [span_event_view_1__50__287__20220406T1205Z] Exception while in work
    2022/04/06 12:06:04.365 INFO [FileUploadDownloadClient] [span_event_view_1__50__287__20220406T1205Z] Sending request: http://controller-0.controller-headless.pinot.svc.cluster.local:9000/segmentStoppedConsuming?reason=java.lang.NullPointerException&streamPartitionMsgOffset=1059610656&instance=Server_server-span-event-view-realtime-7.span-event-view-realtime-headless.pinot.svc.cluster.local_8098&offset=-1&name=span_event_view_1__50__287__20220406T1205Z to controller: controller-0.controller-headless.pinot.svc.cluster.local, version: Unknown
    2022/04/06 12:06:04.366 INFO [ServerSegmentCompletionProtocolHandler] [span_event_view_1__50__287__20220406T1205Z] Controller response {"isSplitCommitType":false,"streamPartitionMsgOffset":null,"buildTimeSec":-1,"status":"PROCESSED","offset":-1} for http://controller-0.controller-headless.pinot.svc.cluster.local:9000/segmentStoppedConsuming?reason=java.lang.NullPointerException&streamPartitionMsgOffset=1059610656&instance=Server_server-span-event-view-realtime-7.span-event-view-realtime-headless.pinot.svc.cluster.local_8098&offset=-1&name=span_event_view_1__50__287__20220406T1205Z
    2022/04/06 12:06:04.366 INFO [LLRealtimeSegmentDataManager_span_event_view_1__50__287__20220406T1205Z] [span_event_view_1__50__287__20220406T1205Z] Got response {"isSplitCommitType":false,"streamPartitionMsgOffset":null,"buildTimeSec":-1,"status":"PROCESSED","offset":-1}
    I have attached the server logs when this happened.
    server-span-event-view-realtime-7
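When triaging a log like this one, it can help to split the segmentStoppedConsuming request into its query parameters and read off the reported reason, offset, and segment name. The following is a minimal sketch using only the Python standard library. The URL is copied from the log; the assumption that the segment name splits on `__` into table, partition, sequence number, and creation time is inferred from the names in this log and is not confirmed in the thread.

```python
from urllib.parse import urlparse, parse_qs

# URL copied from the "Sending request" log line.
url = (
    "http://controller-0.controller-headless.pinot.svc.cluster.local:9000/"
    "segmentStoppedConsuming?reason=java.lang.NullPointerException"
    "&streamPartitionMsgOffset=1059610656"
    "&instance=Server_server-span-event-view-realtime-7."
    "span-event-view-realtime-headless.pinot.svc.cluster.local_8098"
    "&offset=-1&name=span_event_view_1__50__287__20220406T1205Z"
)

# parse_qs returns a list per key; each key appears once here, so keep the first value.
params = {key: values[0] for key, values in parse_qs(urlparse(url).query).items()}

# Assumption: segment name layout is table__partition__sequence__creationTime.
table, partition, sequence, created = params["name"].split("__")

summary = {
    "reason": params["reason"],
    "stream_offset": int(params["streamPartitionMsgOffset"]),
    "table": table,
    "partition": int(partition),
    "sequence": int(sequence),
    "created": created,
}
print(summary)
```

Under that assumption, the affected partition would be 50 and the failure reason reported to the controller is a NullPointerException, which matches the description of a single partition going from CONSUMING to OFFLINE.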
  • n

    Nicolas Kovacs

    04/06/2022, 2:47 PM
    Hello everyone, I’m struggling to create a Kafka stream ingestion with the HLC consumer and a custom consumer group id. Has anyone worked on a similar case?
  • d

    dmitry H

    04/06/2022, 5:13 PM
    Hello everybody. Does Pinot support data export out of Pinot via bulk extract or CDC?
  • f

    Facundo Bianco

    04/06/2022, 6:14 PM
    Hi All, do you know how Pinot stores data between brackets? (not JSON). Let me explain: I have this data
    id,timestamp,application
    1,1649268351,"{'app_name': 'foo', 'version': '1.0.0', 'app_id': None, 'business': 'ponzico'}"
    And when I load that info I got
    | id | timestamp  |    application    |
    |----|------------|-------------------|
    | 1  | 1649268351 | foo,1.0.0,ponzico |
    (In table-schema.json, the "application" column is configured as "STRING".) Is there a way to query the "application" column based on one of the values inside (i.e. SELECT * FROM testing WHERE application.app_name = "foo")? Thanks in advance!
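The sample value is a Python dict literal (single quotes, `None`), not JSON, so one option is to convert it to valid JSON before ingestion. The following is a minimal sketch, assuming the CSV can be preprocessed before it is loaded; the helper name `python_literal_to_json` is hypothetical and not part of Pinot.

```python
import ast
import json


def python_literal_to_json(raw: str) -> str:
    """Convert a Python-literal dict string into a JSON string."""
    # ast.literal_eval only evaluates literals (dicts, strings, numbers, None),
    # so it is safer than eval for untrusted input.
    parsed = ast.literal_eval(raw)
    return json.dumps(parsed)


# Value of the "application" column from the sample row.
raw = "{'app_name': 'foo', 'version': '1.0.0', 'app_id': None, 'business': 'ponzico'}"
converted = python_literal_to_json(raw)
print(converted)
```

Once the column holds valid JSON, a JSON function such as JSONEXTRACTSCALAR may be usable to filter on `app_name`. Whether that fits this table's setup is an assumption to verify against the Pinot documentation.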
  • d

    Diana Arnos

    04/07/2022, 2:39 PM
    Hey there! I have a different type of question this time: If I had to give a presentation to my company advocating for us to start using Pinot as a go-to tool for user-facing real-time analytics, which arguments or points of view would you recommend I speak about?
    👍 1
  • a

    Arkadiusz Chmura

    04/07/2022, 4:42 PM
    Hi everyone! I was wondering whether the dataset used in this talk is available somewhere (Airline data from 1987-2008). I am currently working on my Master's thesis and I would like to run some experiments on that dataset. Thanks
  • a

    abhinav wagle

    04/07/2022, 5:18 PM
    Hi there, checking for community reviews. We are in the process of setting up a Kubernetes-based deployment of a Pinot cluster. Has anyone seen significant performance gains from using SSDs with instance store instead of EBS for server pods?
  • a

    Alice

    04/08/2022, 8:56 AM
    Hi, is it supported to use JSONEXTRACTSCALAR function in the transformConfigs?
  • s

    Satyam Raj

    04/08/2022, 9:12 AM
    Hey guys, is there a way to connect Pinot to a secure Kafka cluster using a JAAS configuration?
  • f

    francoisa

    04/08/2022, 9:17 AM
    Hi, getting deeper into my purge process dev. Many thanks to the Pinot team for the awesome support and code quality 😉. Is there any way to upload a realtime segment? Like purging it and refreshing it? I do not really understand why this is not possible if the segment is not consuming 😕 And I’m a bit surprised that the API gets an OFFLINE table for the upload process instead of my original type, REALTIME.
  • p

    Peter Pringle

    04/08/2022, 11:28 AM
    Any plans for Pinot to support 'RealtimeToOfflineSegmentsTask' for upsert-enabled tables? The use case is this:
    1. Normal queries: the key contains a business date, and we only want to see the last value for a day (the current upsert config works well for this).
    2. Detail / as-of queries: we want to see records up to a given timestamp for a given business date (skipUpsert and LASTWITHTIME(col, asOfTime)), e.g. compare 11am yesterday vs. today.
    3. Historical: we want to move data older than a week to OFFLINE.
    4. Historical: we want to drop data points, e.g. milliseconds -> hours per day, like reducing data resolution over time through some data-cleaning job.
  • w

    Weixiang Sun

    04/08/2022, 5:15 PM
    It seems that we have health API for controller. Do we have health API for table and broker?
  • e

    Evan Galpin

    04/08/2022, 7:41 PM
    Hey folks, does anyone have experience with “zombie” tables? I attempted to add a tableSpec but was missing a field in the config. This resulted in a null pointer exception when Pinot processed the config. As a result, the table doesn’t exist in the UI or when querying the list API, but I can’t create a table with the same name because I receive an error that “table already exists”. Presumably the spec reached ZK or something, but nowhere else? What can be done to remove the zombie table?
  • a

    Alice

    04/09/2022, 9:08 AM
    Hi, does transformConfigs work when used along with complexTypeConfig in the ingestionConfig section?
  • s

    sunny

    04/11/2022, 1:14 AM
    Hi, I checked Authorization in broker level.
    pinot.broker.access.control.principals.<user>.tables=test_table
    It seems that restarting the broker is required whenever a privileged table is added. Do you restart every time you grant table privileges in the production environment? Or is there any other way?
  • s

    suraj kamath

    04/11/2022, 7:42 AM
    Hi Team, As part of a POC, We are trying to load pinot table data into a spark dataFrame using the spark JDBC option. However when we try we are seeing the following error:
    Exception in thread "main" java.sql.SQLFeatureNotSupportedException
    	at org.apache.pinot.client.base.AbstractBaseStatement.setQueryTimeout(AbstractBaseStatement.java:167)
    	at org.apache.spark.sql.execution.datasources.jdbc.JDBCRDD$.resolveTable(JDBCRDD.scala:60)
    	at org.apache.spark.sql.execution.datasources.jdbc.JDBCRelation$.getSchema(JDBCRelation.scala:226)
    	at org.apache.spark.sql.execution.datasources.jdbc.JdbcRelationProvider.createRelation(JdbcRelationProvider.scala:35)
    	at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:355)
    	at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:325)
    	at org.apache.spark.sql.DataFrameReader.$anonfun$load$3(DataFrameReader.scala:307)
    	at scala.Option.getOrElse(Option.scala:189)
    	at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:307)
    	at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:225)
  • a

    Alice

    04/11/2022, 1:00 PM
    When I add a table, the following error occurs. What are the possible reasons for it? {"code":500,"error":"org.apache.kafka.common.KafkaException: Failed to construct kafka consumer"}