Looks like I missed your question here. It depends on how you push the data into Pinot. If you are ingesting everything via a realtime table, then you can enable upserts and de-dup to handle duplicates. I'm not sure how this works for offline tables, though.
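
For reference, here is a rough sketch of what the relevant config pieces look like for a realtime table, written as Python dicts that dump to JSON. The key names (`primaryKeyColumns`, `upsertConfig`, `dedupConfig`, `instanceSelectorType`) are the ones I remember from the Pinot docs, so double-check them against your Pinot version. The column name `event_id` is made up for illustration. As far as I know, upsert and dedup are configured as separate options, so the sketch shows one fragment for each.

```python
import json

# Hypothetical primary key column, used only for illustration.
PRIMARY_KEY_COLUMNS = ["event_id"]


def schema_fragment():
    """Schema piece: the primary key that upsert/dedup use to spot duplicates."""
    return {"primaryKeyColumns": list(PRIMARY_KEY_COLUMNS)}


def upsert_table_fragment():
    """Table config piece for upsert: the newest record per key replaces older ones."""
    return {
        "tableType": "REALTIME",
        "upsertConfig": {"mode": "FULL"},
        # Assumption: routing must keep all segments of a partition on one server.
        "routing": {"instanceSelectorType": "strictReplicaGroup"},
    }


def dedup_table_fragment():
    """Table config piece for dedup: repeated keys are dropped at ingestion time."""
    return {
        "tableType": "REALTIME",
        "dedupConfig": {"dedupEnabled": True, "hashFunction": "NONE"},
        "routing": {"instanceSelectorType": "strictReplicaGroup"},
    }


if __name__ == "__main__":
    for name, fragment in [
        ("schema", schema_fragment()),
        ("upsert table config", upsert_table_fragment()),
        ("dedup table config", dedup_table_fragment()),
    ]:
        print(f"# {name}")
        print(json.dumps(fragment, indent=2))
```

These are only fragments; you would merge them into your full schema and table config, and the input stream generally needs to be partitioned by the primary key for either feature to behave correctly.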