Punish Garg
06/11/2021, 5:07 PMMayank
Mayank
Punish Garg
06/11/2021, 5:26 PMMayank
Mayank
Mayank
Mayank
Punish Garg
06/11/2021, 5:36 PMMayank
Mayank
Punish Garg
06/13/2021, 11:52 AMPunish Garg
06/13/2021, 11:54 AMMayank
1. Yes, see <https://docs.pinot.apache.org/configuration-reference/job-specification#segment-name-generator-spec>.
2. Pinot uses crc check to figure out if data has changed. If you regenrate a segment and push again without changing the input, Pinot will see same crc and won't overwrite (as expected).
3. Not sure if I follow, what file do you need to modify again and again? May be check <https://docs.pinot.apache.org/basics/components/minion#starting-a-minion> that can be used to write scheduled tasks.
You can overwrite all segments every day. But note that if your retention is 1 year, you want to regenerate and push 1 years worth of segments each day?
Punish Garg
06/13/2021, 3:28 PM/container_data/examples/rawdata/2021-05-01/raw_data1.csv -- total records : 2
/container_data/examples/rawdata/2021-05-01/raw_data.csv -- total records : 1
First time when i ran pinot ingestion job, i can see two segments files were created
/container_data/examples/segments/2021-05-01/dim_meta_eg_OFFLINE_2021-05-01_2021-05-01_1.tar.gz ---- total records : 2
/container_data/examples/segments/2021-05-01/dim_meta_eg_OFFLINE_2021-05-01_2021-05-01_0.tar.gz -- total records : 1
Now my input data can be changed, lets assume in next run, i have only single file for 2021-05-01
/container_data/examples/rawdata/2021-05-01/raw_data.csv -- total records : 3
when i tried to load this file again, i am expecting that all segments should be overwrite but its not happening
Actual: i still see one of segments were overwrite but other one still remain the same:
/container_data/examples/segments/2021-05-01/dim_meta_eg_OFFLINE_2021-05-01_2021-05-01_1.tar.gz ---- total records : 2 (from previous run)
/container_data/examples/segments/2021-05-01/dim_meta_eg_OFFLINE_2021-05-01_2021-05-01_0.tar.gz -- total records : 3( newer created)
But i was expecting like this
/container_data/examples/segments/2021-05-01/dim_meta_eg_OFFLINE_2021-05-01_2021-05-01_0.tar.gz -- total records : 3
Punish Garg
06/13/2021, 3:29 PMPunish Garg
06/13/2021, 3:29 PMMayank
Punish Garg
06/13/2021, 3:55 PMMayank
Punish Garg
06/13/2021, 4:13 PMPunish Garg
06/13/2021, 4:14 PMMayank
Punish Garg
06/13/2021, 4:20 PMMayank