Hi team is it a requirement to enable partitioning in Pinot Apache Pinot #general

# general

Alice

04/29/2022, 3:14 AM

Hi team, is it a requirement to enable partitioning in Pinot to use the upsert feature?

Alice

04/29/2022, 3:26 AM

I mean, if I set "routing": { "instanceSelectorType": "strictReplicaGroup" } and "upsertConfig": { "mode": "FULL" } in the table config, and set the primary key in the schema, will upsert take effect without setting segmentPartitionConfig?

Mayank

04/29/2022, 3:29 AM

Yes partitioning is a requirement

Alice

04/29/2022, 3:33 AM

At the moment, does Pinot set a default segmentPartitionConfig for the upsert feature if it’s missing from the table config?

Mayank

04/29/2022, 3:33 AM

https://docs.pinot.apache.org/basics/data-import/upsert

Alice

04/29/2022, 3:35 AM

I think there is no segmentPartitionConfig in this example?

Mayank

04/29/2022, 3:36 AM

SegmentPartitionConfig is separate from upsert

Mayank

04/29/2022, 3:37 AM

For upsert, the requirement is that the upstream is partitioned by the upsert primary key

Mayank

04/29/2022, 3:38 AM

SegmentPartitionConfig is for specifying the partitioning that was done upstream (what function was chosen, etc). This is used during query execution to only query the partitions for the key in the query. It is separate from upsert, and not needed for upsert.
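
To make the distinction concrete, here is a rough sketch of a table config fragment, built as a Python dict and printed as JSON. The routing and upsertConfig values come from the question above; the column name `orderId`, the function name, and the partition count are placeholders for illustration and would need to mirror whatever the upstream actually uses. Only the routing and upsertConfig parts relate to upsert; the segmentPartitionConfig and partition pruner entries are the optional, query-time part.

```python
import json

# Hypothetical values: "orderId", "Murmur" and 8 are placeholders and must
# match the partitioning that was really applied upstream.
table_config_fragment = {
    "tableIndexConfig": {
        # Optional: describes upstream partitioning so queries can skip
        # segments that cannot contain the key. Not needed for upsert.
        "segmentPartitionConfig": {
            "columnPartitionMap": {
                "orderId": {"functionName": "Murmur", "numPartitions": 8}
            }
        }
    },
    "routing": {
        # From the question above: used together with upsert.
        "instanceSelectorType": "strictReplicaGroup",
        # Enables partition-based segment pruning at query time.
        "segmentPrunerTypes": ["partition"],
    },
    # From the question above: turns on full upsert.
    "upsertConfig": {"mode": "FULL"},
}

print(json.dumps(table_config_fragment, indent=2))
```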

Alice

04/29/2022, 3:38 AM

I see. Thanks.

Alice

04/29/2022, 3:40 AM

The following config is just for querying?

Mohemmad Zaid Khan

04/29/2022, 5:29 AM

You need to push data to Kafka with a key that is also the primary key in the Pinot schema. For example, if there are two columns in primaryKeys of the Pinot schema, then in your Kafka producer (maybe Flink, Spark, or any other job) you need to use both of the corresponding attributes of the Kafka message as the partitioning key, so that messages with the same values for those two columns land in the same Kafka partition.
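
A minimal sketch of that idea, assuming two hypothetical primary key columns named `col_a` and `col_b`: build one message key from both values, so that records sharing a primary key always map to the same partition. The CRC32 hash here is only a stand-in to show the determinism; it is not the hash Kafka's own partitioner uses.

```python
import zlib

# Hypothetical primary key columns; use the ones listed in your Pinot schema.
PRIMARY_KEY_COLUMNS = ["col_a", "col_b"]


def composite_key(record, key_columns=PRIMARY_KEY_COLUMNS):
    """Serialize the primary key values in a fixed order into message key bytes."""
    return "|".join(str(record[col]) for col in key_columns).encode("utf-8")


def pick_partition(key_bytes, num_partitions):
    """Stand-in partitioner: deterministic hash of the key modulo partition count."""
    return zlib.crc32(key_bytes) % num_partitions


first = {"col_a": "u1", "col_b": 42, "value": 10}
update = {"col_a": "u1", "col_b": 42, "value": 99}  # same primary key, new value

# Both versions of the row produce the same key, hence the same partition.
same_partition = (
    pick_partition(composite_key(first), 8)
    == pick_partition(composite_key(update), 8)
)
print(same_partition)
```

With a real Kafka client, the bytes from `composite_key` would be passed as the message key and the client's partitioner would choose the partition.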

Alice

04/29/2022, 5:33 AM

Yes, we’re working on it. Thanks. @User
