Hi Team -- A basic question , If we use S3 for sto...
# general
a
Hi Team -- A basic question , If we use S3 for storing the segments, how does pinot take care of query latency since there will be netw call involved?
p
Someone with more experience can chime in but Pinot controller & server instances will have a local copy of the segments to handle queries.
k
Only Pinot server will have the local copy, not controller
a
ok.. so does it mean the cluster must have enough memory combined to host all the data that is getting ingested?
and that would keep growing in production, so how do we generally scale that ? Should we be looking at sacling up number of servers as traffic and data grows?
k
No, all the data is stored on disk and mmaped. So you need enough disk space
a
ok.. what is we use S3 ?
k
s3 the backup store
a
oh ok.. so we will have data in disk of the servers +the backup store.
m
Correct
p
@User So when configuring S3 or some other object store as deep store for segments means that disk space in servers is for caching purposes only?
Are there any metrics/ways to see if see if a given server is under-provisioned for caching segments?
m
Yes, servers use local disk for better performance, as of now
There’s an endpoint in server to get table size.
✔️ 1