Apache Druid #troubleshooting

Join Slack

Vijay Narayanan

01/16/2023, 8:02 AM

Hdfs can be a source for druid and also as deep storage but you cannot store metadata in hdfs

Slackbot

01/16/2023, 8:04 AM

This message was deleted.

Vijay Narayanan

01/16/2023, 8:05 AM

Equivalent meaning same capacity as local disk? Yes

Vijay Narayanan

01/16/2023, 8:05 AM

https://druid.apache.org/docs/latest/development/extensions-core/hdfs.html

swetha

01/16/2023, 8:41 AM

Copy code

2023-01-16T08:38:21,728 INFO [task-runner-0-priority-0] org.apache.kafka.clients.consumer.internals.Fetcher - [Consumer clientId=consumer-kafka-supervisor-hjelfmnc-1, groupId=kafka-supervisor-hjelfmnc] Fetch position FetchPosition{offset=144333390, offsetEpoch=Optional.empty, currentLeader=LeaderAndEpoch{leader=Optional[kafka-1.kafka-headless.druid.svc.cluster.local:9092 (id: 1 rack: null)], epoch=0}} is out of range for partition druid-0, raising error to the application since no reset policy is configured
2023-01-16T08:38:21,728 WARN [task-runner-0-priority-0] org.apache.druid.indexing.kafka.IncrementalPublishingKafkaIndexTaskRunner - OffsetOutOfRangeException with message [Fetch position FetchPosition{offset=144333390, offsetEpoch=Optional.empty, currentLeader=LeaderAndEpoch{leader=Optional[kafka-1.kafka-headless.druid.svc.cluster.local:9092 (id: 1 rack: null)], epoch=0}} is out of range for partition druid-0]
2023-01-16T08:38:21,728 WARN [task-runner-0-priority-0] org.apache.druid.indexing.kafka.IncrementalPublishingKafkaIndexTaskRunner - Retrying in 30000ms
2023-01-16T08:38:51,733 INFO [task-runner-0-priority-0] org.apache.kafka.clients.consumer.internals.Fetcher - [Consumer clientId=consumer-kafka-supervisor-hjelfmnc-1, groupId=kafka-supervisor-hjelfmnc] Fetch position FetchPosition{offset=144333390, offsetEpoch=Optional.empty, currentLeader=LeaderAndEpoch{leader=Optional[kafka-1.kafka-headless.druid.svc.cluster.local:9092 (id: 1 rack: null)], epoch=0}} is out of range for partition druid-0, raising error to the application since no reset policy is configured
2023-01-16T08:38:51,734 WARN [task-runner-0-priority-0] org.apache.druid.indexing.kafka.IncrementalPublishingKafkaIndexTaskRunner - OffsetOutOfRangeException with message [Fetch position FetchPosition{offset=144333390, offsetEpoch=Optional.empty, currentLeader=LeaderAndEpoch{leader=Optional[kafka-1.kafka-headless.druid.svc.cluster.local:9092 (id: 1 rack: null)], epoch=0}} is out of range for partition druid-0]
2023-01-16T08:38:51,734 WARN [task-runner-0-priority-0] org.apache.druid.indexing.kafka.IncrementalPublishingKafkaIndexTaskRunner - Retrying in 30000ms

Slackbot

01/16/2023, 8:47 AM

This message was deleted.

Slackbot

01/16/2023, 3:53 PM

This message was deleted.

Slackbot

01/17/2023, 4:57 AM

This message was deleted.

Akhila Mukka

01/17/2023, 5:28 AM

Initially, without creating static looup In coordinator log, we could see below messages after every 2 mins 2023-01-17T043923,100 INFO [LookupCoordinatorManager--0] org.apache.druid.server.lookup.cache.LookupCoordinatorManager - Not updating lookups because no data exists 2023-01-17T044123,100 INFO [LookupCoordinatorManager--0] org.apache.druid.server.lookup.cache.LookupCoordinatorManager - Not updating lookups because no data exists But once the lookup is created, I could see the lookup not found exception error if we open the lookup immediately { "type": "map", "map": { "a": "alpha", "b": "beta" } } After some time, we could see the values. I don’t see the ‘ Not updating lookups because no data exists‘ log being logged anymore but nothing related lookup is getting logged after that. One more thing is if I update anything in the lookup , there is nothing being logged in the coordinator or broker or router related to it though the lookup get updated

D K

01/17/2023, 7:13 AM

Hi, When i ingest a table into druid, i see duplicate records. Exact same row is ingested twice

D K

01/17/2023, 7:13 AM

Here is the screenshot

Slackbot

01/17/2023, 7:13 AM

This message was deleted.

D K

01/17/2023, 7:14 AM

please let me know what is going wrong. Source data is always correct

Slackbot

01/17/2023, 8:43 AM

This message was deleted.

Slackbot

01/17/2023, 2:12 PM

This message was deleted.

Slackbot

01/17/2023, 3:52 PM

This message was deleted.

Bharat Thakur

01/17/2023, 3:52 PM

java.lang.IllegalArgumentException: Cannot construct instance of `org.apache.druid.segment.loading.L...

David Monical

01/17/2023, 9:10 PM

Hi all! Curious about an issue I'm running into where

null

values are stripped from multivalue fields when ingesting json data. Couldn't find anything in the documentation about this behavior so I filed a ticket here. Figured I'd open up the discussion here as well, thanks so much!

MarekM

01/18/2023, 10:33 AM

Hi i have problem with authorizer

Slackbot

01/18/2023, 10:33 AM

This message was deleted.

MarekM

01/18/2023, 10:34 AM

i put the configuration form https://druid.apache.org/docs/latest/operations/security-overview.html

Slackbot

01/18/2023, 4:08 PM

This message was deleted.

👀 1

Slackbot

01/19/2023, 12:51 PM

This message was deleted.

Shilpa Sivanesan

01/19/2023, 1:28 PM

Hi All Seeing a weird issue with druid historical the segment is expected to be replicated to 2 nodes as per retention rules but there are few segments which were replicated to most of the nodes and occupying huge storage Any ideas on what could have been wrong ?

Shilpa Sivanesan

01/19/2023, 1:30 PM

For eg : we have tiers (hot and cold) 2022-01-12 is expected to be in hot But it is available in cold and replicated almost to 36 cold historical nodes

Vijay Narayanan

01/19/2023, 1:38 PM

what is your retention rule?

Shilpa Sivanesan

01/19/2023, 4:23 PM

Last 3 months on hot and beyond that to cold

Slackbot

01/19/2023, 4:35 PM

This message was deleted.

Shilpa Sivanesan

01/19/2023, 4:46 PM

we are seeing this issue after switching from tranquility to kinesis ingestion