Hi :wave: How much storage is on average required ...
# general
s
Hi 👋 How much storage is on average required for Pinot relative to source data size? For instance if my source database is 1TB, how much EBS do I need on AWS for the Pinot cluster? I'm imagining it'd be more than the source size due to replication and indexing?
k
whats the source format?
s
Mysql database
Mysql binlogs get consumed by Debezium and get published as Avro files to Kafka topics which are then ingested by Pinot
k
so avro files you will see 3x compression
so plan for originalsize/3 *replication factor/num-servers for each ebs volume
s
ok makes sense, thank yoiu