@User Thanks so much! 😍
https://github.com/apache/pinot/blob/c6ad763a5013825810a0af5448bb4a1d8be0e230/pinot-core/src/test/java/org/apache/pinot/core/minion/SegmentPurgerTest.java#L127
I checked Pinot's Purge Task, but I didn't mention it because I wanted to see if there was any other way.
However, I think it's only a way to use PurgeTask, so I have a few questions about PurgeTask.
Q1) Can PurgeTask also delete individual rows in the committed segment (not concealing) of the realtime table?
Q2) PurgeTask does not seem to delete individual rows by random accessing segment files.
If PurgeTask downloads, regenerates, and uploads segment files, what is the difference from injection job?
I am trying to understand this difference.
We will prefer an injection job to using Minion and implementing a task code. because it is more familiar to develop an injestion job.
Q3) We can service large amounts of data and large numbers of segments. If PurgeTask works in a download and regenerative manner, regeneration and reload of segments will likely affect clusters or services, regardless of whether PurgeTask or InjectionJob is used. How will the cluster or service be affected?