Evan Galpin
07/15/2021, 3:35 PM
`Segment Metadata Push` bulk ingestion strategy[1], which seems to imply writing segments to one of a few distributed file systems first, and then informing the controller about the segments and their associated metadata. I suppose I’m looking for the generic internals to create a segment from input data. Is `SegmentGenerationUtils.java`[2] the right starting place?
Thanks!
[1] https://docs.pinot.apache.org/basics/data-import/batch-ingestion#3-segment-metadata-push
[2] https://github.com/apache/incubator-pinot/blob/master/pinot-common/src/main/java/o[…]che/pinot/common/segment/generation/SegmentGenerationUtils.java
Ken Krugler
07/15/2021, 3:41 PM
`SegmentCreationMapper` class in the pinot-hadoop sub-project, it gives a fairly self-contained overview of what code is called to convert an input file to a segment. The key bit (in the `map()` method) is the call to `SegmentIndexCreationDriver.build()`.
Evan Galpin
07/15/2021, 3:42 PM
Mayank