Hi team, Can I use upsert in realtime table and use realtimetoofflinesegmentstask at the same time? will this task make sure rows in offline table upserted?
x
Xiaobing
06/24/2022, 3:04 AM
To my best knowledge, I don’t think the task does upsert when moving segments to offline table.
Under the hood, it downloads the raw segments of realtime table from the deep store, applies transformations and indices defined in the offline table and then uploads the segments.
Xiaobing
06/24/2022, 3:07 AM
The task can do rollup/dedup while moving the segments, but I don’t think they deal with upsert
Xiaobing
06/24/2022, 3:10 AM
cc @Neha Pawar in case I missed anything
a
Alice
06/24/2022, 3:17 AM
I noticed dedup is based on all column values are same. Does it support primary key?😅
x
Xiaobing
06/24/2022, 3:47 AM
Yeah it’s full row right now. Pk based dedup is not supported here yet.