# troubleshooting
t
@Kishore G Could I get a suggestion on ZooKeeper cluster deployment? We will have 50W segments for 4 tables. Thanks.
k
what is 50W?
t
500K segments
We are estimating how many ZooKeeper nodes we need to support that number of segments. Thanks.
I noticed the comment below, but could you please share more info about how many ZooKeeper nodes and what configurations are needed to support such scale? Thanks. @Kishore G
> At LinkedIn, we do run a cluster with several million segments (and thousands of tables), and 100s of servers. Over time, we have made improvements to pinot that helps us handle this type of load. The load is on zookeeper. Increasing bandwidth on zookeeper, separating the Helix and Pinot controller instances are things you can do.
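As an aside on the "separating the Helix and Pinot controller instances" point above: Pinot exposes this through the `controller.mode` setting (`DUAL` by default, plus `PINOT_ONLY` and `HELIX_ONLY`). A minimal sketch of the split, assuming plain controller property files; the hostnames, ports, and cluster name are placeholders, not values from this thread:

```
# Sketch only: two separate controller config files shown together.
# Hostnames, ports, and the cluster name are placeholders.

# Controllers dedicated to Helix cluster management:
controller.mode=HELIX_ONLY
controller.zk.str=zk-0:2181,zk-1:2181,zk-2:2181
controller.helix.cluster.name=PinotCluster

# Controllers dedicated to Pinot duties (segment management, periodic tasks, REST API):
controller.mode=PINOT_ONLY
controller.zk.str=zk-0:2181,zk-1:2181,zk-2:2181
controller.helix.cluster.name=PinotCluster
controller.port=9000
```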
m
50k segments per table or across all tables?
For production, folks usually run 5 ZK nodes (to ensure consensus). If it is 50k segments across all servers, that is not that big.
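If you do move from 3 to 5 ZK nodes with a Helm-based deployment like the one shared later in this thread, it is mostly a matter of replica count plus sizing. A minimal sketch of the ZooKeeper values, assuming the same chart keys that appear below; the resource figures are illustrative, not a tested recommendation:

```yaml
# Sketch only: 5-node ZK ensemble via Helm values; resources are illustrative.
zookeeper:
  replicaCount: 5   # odd ensemble size; keeps quorum even with two nodes down
  resources:
    requests:
      cpu: 2
      memory: 8Gi
    limits:
      cpu: 2
      memory: 8Gi
```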
t
Thanks @Mayank. It is not 50K, it is 500K segments, and the biggest table is estimated to have 200K segments, so I am not sure how we can support such a case.
We have also tried rolling up segments, which has been a nightmare for the team: it takes a very long time and proved to be a resource-consuming task.
m
500k overall is still small. What’s the segment size for the table with 200k segments?
t
Here is a segment example. For now, we set the segment flush threshold to 15 minutes or 500M:
```json
{
  "segment.realtime.endOffset": "140557099793",
  "segment.start.time": "1660055340000",
  "segment.time.unit": "MILLISECONDS",
  "segment.flush.threshold.size": "1966448",
  "segment.realtime.startOffset": "140555344567",
  "segment.end.time": "1660056581000",
  "segment.total.docs": "1966933",
  "segment.realtime.numReplicas": "1",
  "segment.creation.time": "1660055859719",
  "segment.index.version": "v3",
  "segment.crc": "1224605237",
  "segment.realtime.status": "DONE",
  "segment.download.url": "s3://xxxxxx"
}
```
m
What’s the on-disk size for this segment? (It seems to have 1.9M rows.)
t
It is about 56MB per segment
m
OK, you can definitely increase the 15 minutes to 1-2 hours. That will reduce the number of segments by a factor of 4-8.
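For context, that 15-minute threshold lives in the table's streamConfigs. A minimal sketch of just the relevant flush-threshold keys, assuming the standard low-level stream config names; the enclosing table config is omitted, and the 2h value simply mirrors the suggestion above:

```json
{
  "realtime.segment.flush.threshold.time": "2h",
  "realtime.segment.flush.threshold.segment.size": "500M",
  "realtime.segment.flush.threshold.rows": "0"
}
```

Setting the rows threshold to 0 lets the segment-size threshold drive flushing, so segments are cut by time or size rather than row count.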
t
The segment size of 56MB is the size shown in S3.
How many ZK nodes should I use? Currently we use 3, and it seems a little slow even to fetch table metadata.
m
For these, ZK is not going to be the bottleneck. What’s the CPU/memory on each node?
Also for the controllers?
t
Here are some configurations:
```yaml
zookeeper:
  ## Replicas
  replicaCount: 3

  autopurge:
    ## The time interval in hours for which the purge task has to be triggered. Set to a positive integer (1 and above) to enable the auto purging.
    ##
    purgeInterval: 1
  resources:
    requests:
      memory: 8Gi
      cpu: 2
    limits:
      cpu: 2
      memory: 8Gi
```
```yaml
controller:
  replicaCount: 3
  persistence:
    size: "600Gi"
    mountPath: /var/pinot/server/data

  resources:
    requests:
      memory: "24Gi"
      cpu: "6"
    limits:
      cpu: "6"
      memory: "24Gi"
```
We set this up on AWS EKS.
m
It might help to vertically scale the controllers first.
t
Any advice on controller sizing? It is currently 6 cores / 24 GB.
m
Increase to 16 cores / 64 GB, or around that range, first.
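In the Helm values posted above, that vertical bump would look roughly like the following; the figures are the suggestion from this thread, not a benchmarked setting:

```yaml
# Sketch only: controller vertical sizing per the suggestion above.
controller:
  replicaCount: 3
  resources:
    requests:
      cpu: "16"
      memory: "64Gi"
    limits:
      cpu: "16"
      memory: "64Gi"
```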
t
For the controller?
And any suggestions for the Brokers and Servers?
m
That depends on your workload
t
I know Brokers and Servers are horizontally scalable, but is there any suggestion for vertical resource configuration?