Compression depends on a variety of factors (eg cardinality of columns, data type, type of indexing used, etc). But even so, when compared to text based input (eg CSV/JSON), the compressions should be quite a lot. If you want an accurate number, just take one sample JSON, and create a pinot segment out of it (using pinot-admin)