Ken Krugler
11/17/2021, 7:42 PMSubbu Subramaniam
11/17/2021, 9:41 PMRichard Startin
11/17/2021, 10:57 PMKen Krugler
11/17/2021, 11:06 PMFile inputFile = new File("/Users/kenkrugler/Downloads/adbeat/pinot-segments-ads/ads_us_2020-11_00.tar.gz");
File outputFile = new File("./build/metadata.properties");
TarArchiveInputStream tis = new TarArchiveInputStream(new GZIPInputStream(new FileInputStream(inputFile)));
TarArchiveEntry tarEntry;
while ((tarEntry = tis.getNextTarEntry()) != null) {
if (tarEntry.isFile() && tarEntry.getName().endsWith("/metadata.properties")) {
FileOutputStream fos = new FileOutputStream(outputFile);
IOUtils.copy(tis, fos);
fos.close();
break;
}
}
tis.close();
Given the ordering of files in segments I’ve seen, the metadata.properties occurs before the big pieces (columns.psf & star_tree_index), e.g. 15K of reading/decompression vs. 200-300MB for my segments.Ken Krugler
11/17/2021, 11:10 PMSegmentPushUtils.generateSegmentMetadataFile()
for where it downloads and unpacks/untars the entire segment to get a file used by sendSegmentUriAndMetadata()
.Subbu Subramaniam
11/17/2021, 11:12 PMKen Krugler
11/17/2021, 11:13 PMKen Krugler
11/17/2021, 11:13 PMSegmentPushUtils.generateSegmentMetadataFile()
Subbu Subramaniam
11/17/2021, 11:14 PMKen Krugler
11/17/2021, 11:15 PMSubbu Subramaniam
11/17/2021, 11:22 PMKen Krugler
11/17/2021, 11:23 PMSubbu Subramaniam
11/17/2021, 11:24 PMKen Krugler
11/17/2021, 11:25 PMSubbu Subramaniam
11/17/2021, 11:34 PMRichard Startin
11/18/2021, 12:28 AMKen Krugler
11/18/2021, 12:29 AMRichard Startin
11/18/2021, 12:33 AMKen Krugler
11/18/2021, 12:35 AMRichard Startin
11/18/2021, 12:35 AMSubbu Subramaniam
11/18/2021, 1:15 AM