Pavel Stejskal
11/29/2021, 7:29 PMMayank
a) As of now, Pinot serving nodes store a local copy on the attached disk (both realtime as well as offline). The persistent storage can be in HDFS/S3 or similar such deepstore. For realtime, each Pinot server is assigned a sub-set of partitions from the topic to consume and store.
b) RT nodes periodically flush the in-memory index to persistent store (HDFS in your case). But note that it will need to maintain a copy in the local disk as well, for serving.
c) No, all data currently is local to the serving nodes.
d) 200TB size is in what format? As I mentioned, serving nodes need local storage to serve the data from.
Mayank
Pavel Stejskal
11/29/2021, 8:05 PMMayank
Jyoti Gambhir
02/26/2024, 11:05 AMXiang Fu
Jyoti Gambhir
02/27/2024, 4:06 AM