coco
07/06/2022, 6:20 AMDV Kumar
07/06/2022, 7:37 AMA_Phil
07/06/2022, 8:46 AMtimestamp
, id
and `value`; the id
describes the value
being ingested; here value
can hold both INT
and STRING
for my use case.
I wanted to know which of these options are feasible:
1. Create columns value_int
and value_string
and use a filtering function in Pinot that can save records in value_int
if value
is INT
, and vice-versa for STRING
values of value
. I tried this, but the filter function as shown in the docs, does not allow this
2. Store all values as STRING
and use a pinot-specific CAST
or CONVERT
function to do aggregations. But I could not find a cast/convert function in Pinot. Thus I am not able to do sum
operations on the data
I would welcome any ideas/workaround for the same.David Gregory
07/06/2022, 11:34 AMJohn Peter S
07/06/2022, 12:13 PMAmanda Robson
07/06/2022, 2:57 PMKevin Xu
07/07/2022, 1:17 AMDiogo Baeder
07/07/2022, 2:04 AMcoco
07/07/2022, 5:01 AMabhinav wagle
07/07/2022, 5:51 PMPrashant Pandey
07/10/2022, 5:29 PMRohan Pednekar
07/11/2022, 5:49 PMYarden Rokach
Dan DC
07/12/2022, 1:08 PMScott deRegt
07/12/2022, 6:55 PMCASE ...
statements. I found this thread that case statements will not work with star tree - since the thread is a couple of years old just wanted to double-check if this is still the case?Tony Zhang
07/12/2022, 11:03 PMHuaqiang He
07/13/2022, 9:08 AMSergii Balganbaiev
07/13/2022, 2:22 PMScott deRegt
07/14/2022, 12:08 AM<s3://bucket/metrics/country=US/files.parquet>
2. From this thread, it seems there is no way currently to make pinot aware of columns associated with partition filepath metadata. i.e. in above example, pinot table cannot contain a country
column.
Am I understanding that correctly? If so, how does partition-based segment pruning help in this case if the partition column cannot be part of the query issued to pinot?Deepika Eswar
07/14/2022, 11:07 AMDeepika Eswar
07/14/2022, 11:08 AMDeepika Eswar
07/14/2022, 11:08 AMSlackbot
07/14/2022, 12:57 PMYarden Rokach
Deepika Eswar
07/15/2022, 6:42 AMAbhishek Gupta
07/15/2022, 10:22 AMchandarasekaran m
07/17/2022, 2:55 AMAbdullah Jaffer
07/17/2022, 5:14 AMJohn Peter S
07/18/2022, 7:17 AMAbhishek Gupta
07/18/2022, 8:59 AMUpsert table maintains an in-memory map from the primary key to the record location
- The "record location" could be either in-memory or in segment store, so does this map maintain both kinds of locations? By storing all primary keys, will this map keep growing indefinitely in memory and will require vertical scaling of servers at some point?
3. If a record in a segment is updated, all servers need to reload it, I guess. Does it make updates expensive?
4. Overall, is our use-case well suited for Pinot (where data updates/deletes of a record are pretty common)?