Map
07/19/2021, 9:18 PMKishore G
Map
07/19/2021, 10:12 PMAn important requirement for the Pinot upsert table is to partition the input stream by the primary key. For Kafka messages, this means the producer shall set the key in theIs it to say that if the primary key column has thousands or even millions of distinct values, I need to create these many partitions in my kafka topic?API. If the original stream is not partitioned, then a streaming processing job (e.g. Flink) is needed to shuffle and repartition the input stream into a partitioned one for Pinot's ingestion.send
Kishore G
Kishore G
Map
07/20/2021, 2:54 PMKishore G
Map
07/21/2021, 5:16 PMprimaryKeyColumns
and segmentAssignmentStrategy
and the value strictReplicaGroup
for instanceSelectorType
are not documented in https://docs.pinot.apache.org/configuration-reference/table or https://docs.pinot.apache.org/configuration-reference/schemaKishore G
Jackie
07/21/2021, 5:57 PM