If I understand right, when I batch ingest a set o...
# general
a
If I understand right, when I batch ingest a set of parquet files, the job will create a segment for each parquet file and then will upload it all to Pinot? Is that right? If so, are there any guidelines about picking segment sizes for optimal query performance?
m
Yes, all data is internally stored in Pinot’s columnar indexed format.
You want to avoid large number of tiny segments. If your data allows, few hundred MB per segment is a good size