# troubleshooting
s
You can control how many threads the historicals use to download segments with `druid.segmentCache.numLoadingThreads`.
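For reference, a minimal historical `runtime.properties` sketch of where that setting lives (the cache path and size here are illustrative, not from the thread):

```properties
# Historical runtime.properties (illustrative values)
druid.segmentCache.locations=[{"path":"/var/druid/segment-cache","maxSize":"300g"}]
# Number of threads used to download segments in parallel
druid.segmentCache.numLoadingThreads=5
```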
y
🤦 makes sense 🙂
s
Also, `maxSegmentsInNodeLoadingQueue` controls how many pending operations are given to each historical. The default of 100 should be plenty.
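For context, `maxSegmentsInNodeLoadingQueue` is part of the coordinator dynamic configuration (editable in the console or via the coordinator API) rather than a `runtime.properties` setting; a sketch of the relevant fragment, with the default value:

```json
{
  "maxSegmentsInNodeLoadingQueue": 100
}
```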
Let us know how it goes.
y
Same. I set `numLoadingThreads` to 5, and `maxSegmentsInNodeLoadingQueue` was already 200.
s
Do you see pending loads on the historicals in the Druid Console?
y
Yes
s
hmmm what else could throttle the download?
just to confirm, did you restart the historicals after the change?
y
Yes I did
a
Is the location hosting the segments able to do ~25Gb/s? Outside of Druid, are you able to get that kind of bandwidth with a CLI tool?
y
Yes sir… even getting data via s3cli runs ~10Gb+
a
That includes writing and flushing to disk? Do you have an NVMe RAID setup for Druid? I am assuming disk IO is the bottleneck here, but maybe I am missing something.
g
IIRC, there's some additional config needed to actually parallelize loading beyond `numLoadingThreads`; I believe there are also gates on feeding the threads.
For HTTP load queues (`druid.coordinator.loadqueuepeon.type=http`), IIRC you also want to set `druid.coordinator.loadqueuepeon.http.batchSize=N` (where `N <= numLoadingThreads`).
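Put together, a coordinator `runtime.properties` fragment for this might look like the sketch below (the batch size of 5 is illustrative, picked to match a `numLoadingThreads` of 5):

```properties
# Coordinator runtime.properties (illustrative values)
druid.coordinator.loadqueuepeon.type=http
# Should not exceed druid.segmentCache.numLoadingThreads on the historicals
druid.coordinator.loadqueuepeon.http.batchSize=5
```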
I don't remember how it works for ZK; `druid.coordinator.loadqueuepeon.curator.numCallbackThreads` seems relevant from reading the docs, although I don't have personal experience with it.
Mostly use HTTP these days 🙂
y
This was it my friend 🙂
druid.coordinator.loadqueuepeon.type=http
druid.coordinator.loadqueuepeon.http.batchSize=50
g
nice! good to hear