Apache Druid #general

William Montgomery

05/29/2025, 6:50 PM

I have a question re: metrics. Does

query/failed/count

include timeout queries that are also included in

query/timeout/count

Eyal Yurman

05/30/2025, 4:42 PM

Hi, I was looking at the state of ingestion in Apache Druid. https://druid.apache.org/docs/latest/ingestion/ I noticed there seems to be quite a lot of options, which is normal for open-source, but was wondering if we are thinking of some convergence. Here's some duplicated/parallel options in different tracks: • Ingestion methods: 4-5 ◦ Streaming (Kafka and Kinesis) ◦ Native batch ◦ SQL (MSQ) ◦ Hadoop-based (deprecated) • task orchestrators/managers: - 2-3 (at least) ◦ middleManager ◦ Kubernetes-based (MM-less). ◦ To some extent, YARN (In case of Hadoop). Although this have been deprecated. • Schedulers: 2 (ok, actually 1 + Null) ◦ Supervisors for streaming. ◦ No scheduler for batch (user relies on external scheduler). I was wondering if there is a roadmap or any thoughts where it goes long-term and perhaps converges: • Should native batch be replaced with MSQ? • Should streaming use SQL (INSERT) as well, instead of native spec? • Should a batch scheduler be introduces and should it share API similarity with streaming schedule (Supervisor).

Nick Marsh

06/02/2025, 4:20 AM

Hello all. I’m trying to figure out how to migrate a supervisor from one Kafka topic to another. I tried updating the supervisor to read from both topics, using a flag on the Kafka payload to signal to the indexer about which topic to use, but when the indexer tries to do the handoff I got the “Inconsistent metadata state” state. The only way I’ve seen to fix this is to reset the supervisor, but that will lead to me processing the same Kafka payloads multiple times. Is there a way to change a supervisor’s topic without resetting it? Alternatively is there any other way to migrate to another topic without the risk of double-processing the Kafka payloads?

Asit

06/03/2025, 4:05 AM

Hi All ,

Asit

06/03/2025, 4:06 AM

We are looking for a Druid Consultant who can help us in scaling and managing workloads . If anyone is interested please DM me

Sachit Swaroop NB

06/03/2025, 5:40 AM

Hi, while setting up DRUID on EKS using helm, we want to use authentication. using druid-basic-security extension for this case, As per the documentation, the following is the given one but it is not accepted by the pods. druid.auth.authenticatorChain: '["MyBasicMetadataAuthenticator"]' druid.auth.authenticator.MyBasicMetadataAuthenticator.type: "basic" druid.auth.authorizers: '["MyBasicMetadataAuthorizer"]' druid.auth.authorizer.MyBasicMetadataAuthorizer.type: "basic" druid.escalator.type: "basic" druid.escalator.internalClientUsername: "druid_system" druid.escalator.internalClientPassword: "your_internal_password" druid.escalator.authorizerName: "MyBasicMetadataAuthorizer" any is there any specific format we need to maintain ? Ref - https://github.com/asdf2014/druid-helm

06/03/2025, 5:54 AM

Hi Team, What is the impact of having Swap memory enabled on the historical servers? In our org, completely disabling Swap is not allowed so we have set the "swappiness=1" which is the smallest possible value. But we see that the Swap memory is 100% utilized even though there is free memory on the host and this is raising alerts at h/w level (as 100% Swap utilization usually indicates that the host may be running out of memory). Does Druid use the Swap memory to read the memory mapped files in addition to the free main memory? Or could this be happening because the OS is swapping in all the files that Druid is reading during query execution? Is the swap actually getting thrashed? Trying to figure if we need to take any action or if this can be safely ignored. Thanks, AR.

Daniel Augusto

06/03/2025, 10:04 AM

We are having trouble with using pod identity in EKS by setting

AWS_CONTAINER_CREDENTIALS_FULL_URI

so Druid can use S3. Minimum AWS SDK is 1.12.746 and Druid uses 1.12.638 in latest.

Copy code

AWS_CONTAINER_CREDENTIALS_FULL_URI has an invalid host. Host should resolve to a loopback address or have the full URI be HTTPS

I've search slack for similar errors, but I could find none. Has anyone seen this too? Do we plan to bump AWS SDK for next release?

✅ 1

sandy k

06/04/2025, 1:02 AM

we are running cluster data, master, broker nodes, running into frequent crashes for broker, overlord. off late zookeeper seems to be unresponsive due connection. ui doesnt show up the segments and just get hung. overlord crashes 3-4 times a day. how to improve on this

Kevin C.S

06/04/2025, 3:11 PM

Hi team, we recently migrated JSON data to avro in apache druid (v31) and we have integrated schema registry to the supervisors. Since we wanted schema evolution we added useSchemaDiscovery to true. After which we noticed Druid started to edit dimensionExclusions and started to add values there on its own. Is this behaviour documented somewhere? As this caused a lot of missing data to get ingested

kn3jox

06/06/2025, 10:34 AM

hello. i have time series data in Druid and am using Superset to chart that data. some charts look at the entire time series, but for some charts i only want to look at the last day's data. i can't configure the time range to be the "Last Day" or any of the other time range filters because it's possible i don't have data from "yesterday" in relation to "today". can someone help me with a custom SQL that would get the last day's (in the data, not in time) data for my chart?

Cristina Munteanu

06/10/2025, 1:18 PM

Hey everyone! 👋 Join us for a Real-Time Analytics & AI at Scale meetup in New York City on June 18! It’s a casual, in-person gathering for devs working on big data, distributed systems, AI infra, or just curious about how modern stacks scale real-time analytics. 📍New York City 🗓️ Wednesday June 18th 🍕 Talks + food + networking https://www.pingcap.com/event/real-time-analytics-and-ai-at-scale-meetup/ No fluff — just solid tech talks, cool people, and hands-on lessons from the field. Hope to see you there! 🙂

Juan Pablo Egido

06/10/2025, 8:54 PM

Hi everyone, does it make sense to create an MCP Server for Druid? Has anyone done it?

Aryan Mullick

06/17/2025, 6:54 AM

hey everyone, ive been using druid to write queries on a large datasource. but recently when i’ve tried to run it on a datasource with a size of 5.2GB its started showing bad gateway. i should mention im using a single server small set up. if anyone can help me out i would really appreciate it

Asit

06/17/2025, 7:10 PM

Hi everyone, we have a usecase where we suspend and resume supervisors continuously to manage resources used for kafka ingestion . Lately we have seen that the overload uses a lot of memory (90GB plus ) so quick question does suspended supervisors add to overload memory ?

anish

06/18/2025, 6:09 AM

Hi everyone, getting this issue with msq

Copy code

RowTooLarge: Encountered row that cannot fit in a single frame (max frame size = 1,000,000)

How does one increase the maxFrameSize, tried setting this in contextParam but of no use

Mirza Munawar

06/19/2025, 5:17 AM

Exception while seeking to the [earliest] offset of partitions in topic [schema_history_remote_postgres]: Timeout expired while fetching topic metadata while i'm trying to fetch data from kafka

Mirza Munawar

06/19/2025, 5:26 AM

idk much about druid i just started learning it

Mirza Munawar

06/19/2025, 5:26 AM

can anyone say me what i'm doing wrong here ?

Mirza Munawar

06/19/2025, 6:52 AM

ohh it worked with Docker container name instead of localhost

👍 1

Eyal Yurman

06/20/2025, 11:01 PM

Im looking to assign different run priority to different ingestion tasks. (Meaning, which task gets to run, not related to task locking priority). We're ingesting data in a Lambda fashion: new data is ingested in streaming and then re-ingested after a week with batch. Since the batch re-ingestion has a much more relaxed SLA, we were thinking to let both workload share the same hardware resources, but protect the streaming by giving it higher run priority. But I couldn't find anything about such configuration - does this mechanism exist? I wonder if MM-less can support it, as Kubernetes has these mechanisms built-in.

Abdul Ahad Munaf

06/24/2025, 5:22 PM

Hi guys, Hope you're doing well, Is there any way to ingest data from 3 kafka servers into 1 datasource, docs says that we can only have 1 kafka per datasource and submitting another spec file would just replace the previous one, but is there any other way? Any help would be really appreciated (beginner on druid), Thank you

Abdul Ahad Munaf

06/24/2025, 5:24 PM

Any other alternatives would help aswell, where we can ingest data from mutiple kafka servers to 1 data source 🙂

Aryan Mullick

06/25/2025, 12:01 PM

is there a way i can check how much memory is being used during a query

Robert Galloway

06/27/2025, 9:23 AM

Hi all, hope this is the place to ask, if not then could you point me in the correct direction, it would be appreciated. We’ve have two time fields, __time (which is the time the event is processed in our system) and the transaction time, when the event actually occurred. The transaction time is stored as is laid out in the docs (having it as millisecond timestamps stored as long types): https://druid.apache.org/docs/latest/ingestion/schema-design/#secondary-timestamps. The issue is that we are now wanting to filter on the transaction time which can differ to __time by years in the past and future, meaning if we want to filter on this time field we end up performing a full table scan, which ends up taking a long time to run. Reading previous questions around this and the documentation it looks like we should have a second partition on the transaction time field: https://druid.apache.org/docs/latest/ingestion/partitioning/#secondary-partitioning. I just wanted to check that is correct and I’m not overlooking something. Many thanks, Robert

Cristina Munteanu

06/30/2025, 3:36 PM

Hi everyone, 🌉 Are you in San Francisco or the surrounding area? Come check out the Real-Time Data Lakes featuring ClickHouse® and StarRocks meetup on July 8! Featuring talks from Altinity, PostHog, and CelerData. • Tuesday July 8th | San Francisco • Great talks, networking, and food 🍻 Register: https://lu.ma/dccyqxyu

Doaa Deeb

07/02/2025, 4:00 PM

Are there any plans to add support to PromQL in Druid? Thanks

Tim Frey

07/04/2025, 7:55 PM

hi everyone. We are currently building an MCP Server for Druid and wonder if we can get some feedback here.

Sivakumar Karthikesan

07/05/2025, 2:21 PM

Team, does anyone implement druid emitter prometheus for monitoring ? can you help to share the some of the metrics?

Asit

07/06/2025, 5:17 PM

Hi All, I wanted to check if there is a way to clean up only the older segment versions from metadata and deep storage, while retaining the latest versions safely in deep storage. We have a use case where we want to maintain retention in Druid as per policy. However, since we are running frequent compactions, we do not want to retain older segment versions unnecessarily. I came across the following properties: •

druid.coordinator.kill.on

•

druid.coordinator.kill.period

•

druid.coordinator.kill.durationToRetain

From what I understand, these properties delete all unused segments after a certain period, but not specifically older versions if newer versions exist. Is there a way to configure Druid to only delete older segment versions while keeping the latest one for each interval?