Nisheet
05/13/2022, 8:20 AMNisheet
05/13/2022, 8:20 AMDiogo Baeder
05/13/2022, 12:52 PMloadMode
? Is it MMAP
?Mayank
Mayank
Kartik Khare
05/13/2022, 2:01 PMJackie
05/13/2022, 5:10 PMJackie
05/13/2022, 5:21 PMNisheet
05/14/2022, 5:44 AMCould the segments be too large, perhaps?I have set maximum segment size to be of max 500MB or 4m records.
Also, what's the table indexYes it is MMAP.? Is itloadMode
?MMAP
How many partitions do you have? Have many pk you expect per partition?We have currently set 10 partitions. We have almost 4B records so 4B primary keys. I have set this value as 10 because this 4B is complete data since inception. The new record rate would be around 10 million per day so 1million per partition(approx). If we fan out partitions more then the new segment size would be smaller
If total ram is 64GB, heap should be a small percentage.The instance type in r5a.4x large with 128 gb of ram. Out of this I had allocated 64GB for heap usage
can I know what is the primary key you are using?In this case primary key is a string type with 14 characters.
What is your primary key column? What is its approximate cardinality? Do you use upsert feature to dedup the records or update the records?In this case primary key is a string type with 14 characters. Current pk count is around 4b and it would increase with a rate of approx 10-15 million per day We are using upsert Mode as
FULL
as we are rewriting the complete data for the given record. This should replace the pointer in in-memory in the memory to new segment locationNisheet
05/17/2022, 3:44 PMKartik Khare
05/17/2022, 3:46 PMJackie
05/17/2022, 5:39 PMJackie
05/17/2022, 5:41 PMNisheet
05/18/2022, 6:03 AMNisheet
05/20/2022, 9:54 AMNisheet
05/20/2022, 11:57 AMNisheet
05/23/2022, 7:57 AMJackie
05/23/2022, 7:14 PMJackie
05/23/2022, 7:14 PMNisheet
05/23/2022, 7:17 PMJackie
05/23/2022, 7:28 PM