# general
t
Hi team, when using upsert with a realtime table, can we compact or merge committed segments, for example merging multiple small segments into one large segment? If not, how should we deal with too many small segments when using upsert? @User @User
y
Upsert in Pinot does not use compaction; it uses metadata to track records that share the same key. You can find the design details in this doc: https://docs.google.com/document/d/1qljEMndPMxbbKtjlVn9mn2toz7Qrk0TGQsHLfI--7h8/edit
Segment size can be controlled via the flush thresholds, which is a separate concern.
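To illustrate what "use metadata to track the records of the same key" can mean, here is a minimal sketch in Python. It is not Pinot's implementation: the class `UpsertMetadata`, its fields, and the segment names are hypothetical, and the real design is described in the linked doc. It assumes each record has a primary key and a comparison value (here a timestamp), and that the older record stays in its segment but is marked invalid rather than rewritten.

```python
from dataclasses import dataclass, field
from typing import Dict, Set, Tuple


@dataclass
class UpsertMetadata:
    """Hypothetical, simplified model of upsert bookkeeping (not Pinot's classes)."""

    # primary key -> (segment name, doc id, comparison value) of the latest record
    latest: Dict[str, Tuple[str, int, int]] = field(default_factory=dict)
    # segment name -> doc ids that are still valid, i.e. visible to queries
    valid_docs: Dict[str, Set[int]] = field(default_factory=dict)

    def add_record(self, key: str, segment: str, doc_id: int, ts: int) -> None:
        self.valid_docs.setdefault(segment, set())
        prev = self.latest.get(key)
        if prev is not None and ts < prev[2]:
            # Late-arriving older version: stored in the segment, never visible.
            return
        if prev is not None:
            # Mark the previous location invalid; the old segment is not rewritten.
            self.valid_docs[prev[0]].discard(prev[1])
        self.latest[key] = (segment, doc_id, ts)
        self.valid_docs[segment].add(doc_id)


meta = UpsertMetadata()
meta.add_record("order-1", "seg_0", 0, ts=100)
meta.add_record("order-2", "seg_0", 1, ts=101)
meta.add_record("order-1", "seg_1", 0, ts=105)  # newer version lands in another segment
print(meta.valid_docs)  # {'seg_0': {1}, 'seg_1': {0}}
```

In this model the superseded row for `order-1` still occupies space in `seg_0`, which is why tracking validity through metadata is a different thing from compacting or merging segments.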
t
I see, so in the current design there is no way to merge multiple small segments into larger segments for an upsert table? We can only control segment size during ingestion?
y
"realtime.segment.flush.threshold.size": "0",
"realtime.segment.flush.threshold.time": "24h",
"realtime.segment.flush.desired.size": "50M",
j
These are two separate topics. Segment merge is not supported yet, but the feature is on our roadmap.
Once segment merge is supported, we should be able to merge segments for upsert tables as well.