@here was wondering whether you might be able to point out to some rule-of-thumb compression benchmarks using Pinot? Cheers
Daniel Molnar
02/03/2021, 5:41 PM
I'm like catching Snappy, runlength and so, but do you care about this, or are they like under-the-hood defaults?
k
Kishore G
02/03/2021, 6:08 PM
it depends on the input data format
• it its row format like csv, json, avro, proto you can see anywhere between 3x to 10x compression
• If its columnar like orc/parquet then its you dont see a lot of compression - 0.9x to 1.1x
d
Daniel Molnar
02/04/2021, 8:12 AM
@Kishore G thank you so much for your insight! I really appreciate it!