Ayhan Kaya
12/14/2024, 11:08 AMAyhan Kaya
12/14/2024, 11:11 AMLucas Capistrant
12/16/2024, 3:44 PMSoman Ullah
12/18/2024, 5:39 PMAshwin Tumma
12/31/2024, 11:43 PMAshi Bhardwaj
01/05/2025, 9:38 AMMaytas Monsereenusorn
01/07/2025, 8:38 PMMaytas Monsereenusorn
01/18/2025, 1:18 AMHazmi
01/20/2025, 10:03 AMSuraj Goel
01/27/2025, 5:10 PMSuraj Goel
01/28/2025, 7:02 AMinfo Advisionary
01/30/2025, 11:08 AMMaytas Monsereenusorn
01/31/2025, 6:05 AMSuraj Goel
02/13/2025, 10:13 AMAshwin Tumma
02/21/2025, 1:10 AMJamie Chapman-Brown
02/27/2025, 6:23 PMdruid_emitted_metrics{metric_name="ingest-rabbit-lag"}
. We've recently switched to using the Prometheus plugin, but I can't find any rabbit ingest stats.
Am I missing anything? Can anyone point me to what I would need to change to get these stats back?Mikhail Sviatahorau
03/05/2025, 3:51 PMcoordinator-issued
prefix, which helps identify auto-compaction and could be used to check and reject publishing segments that don’t match the current cluster state. The problem is that this prefix isn’t accessible on the level where segments are being chosen. The options are to add some preprocessing of the input source at generateAndPublishSegments in IndexTask or to send some flag to the InputSourceProcessor.prosess() method, but both feel like a last resort.
Curious to hear your thoughts — if not there, where do you think this could be best handled?Maytas Monsereenusorn
03/24/2025, 7:07 PMMaytas Monsereenusorn
04/16/2025, 9:38 PMMaytas Monsereenusorn
04/23/2025, 7:36 PMAbhishek Balaji Radhakrishnan
05/07/2025, 12:31 AMjson_merge()
function when someone gets a chance: https://github.com/apache/druid/pull/17983. Thanks!Abhishek Balaji Radhakrishnan
05/12/2025, 9:09 PMAbhishek Balaji Radhakrishnan
05/23/2025, 1:42 AMMaytas Monsereenusorn
05/28/2025, 2:03 AMint toAdjust = 0
if (violatesPeriodThreshold) {
toAdjust += adjustment;
}
if (violatesDurationThreshold) {
toAdjust += adjustment;
}
if (violatesSegmentThreshold) {
toAdjust += adjustment;
}
if (violatesSegmentRangeThreshold) {
toAdjust += adjustment;
}
if (toAdjust != 0) {
final int adjustedPriority = theQuery.context().getPriority() - toAdjust;
return Optional.of(adjustedPriority);
}
• What if we can set the adjustment value for each Threshold seperately?
i.e.
int toAdjust = 0
if (violatesPeriodThreshold) {
toAdjust += periodThresholdAdjustment;
}
if (violatesDurationThreshold) {
toAdjust += durationThresholdAdjustment;
}
if (violatesSegmentThreshold) {
toAdjust += segmentThresholdAdjustment;
}
if (violatesSegmentRangeThreshold) {
toAdjust += segmentRangeThresholdAdjustment;
}
if (toAdjust != 0) {
final int adjustedPriority = theQuery.context().getPriority() - toAdjust;
return Optional.of(adjustedPriority);
}
The motivation for the first change is that if a query that violate N thresholds, it should be penalize more (not equal) to another query that violate N-1 thresholds.
The motivation for the second change is that some violate are worst than other. i.e. periodThreshold is not that bad compare to segmentRangeThreshold.
The prioritization value would then carry over to the Historical and can help with resources prioritization on Historical processing threadpool (related to this discussion https://apachedruidworkspace.slack.com/archives/C030CMF6B70/p1745436989786489). CC:@Gian Merlino @Clint WylieAbhishek Balaji Radhakrishnan
05/28/2025, 3:02 AMSoman Ullah
05/28/2025, 7:15 PMJesse Tuglu
06/03/2025, 11:11 PMDataSourceMetadata
are solely supervisor-based (either materialized view or seekable stream). ObjectMetadata
seems to only be used in tests. The way I see it there are ≥ 2 options:
• Commit a datasource metadata row per supervisor (likely the easiest, but will take some re-workings on the SegmentTransactionalInsertAction
API and others, who assume these rows are keyed by datasource
) – I'm currently doing this and it seems to work fine.
• Commit a single row per datasource, storing partitions per supervisor ID and doing merges in the plus
minus
methods
◦ Something like the payload being:
▪︎ map[supervisor_id] = SeekableStreamSequenceNumbers
◦ This might suffer from write contention since N supervisors * M tasks per supervisor will be attempting to write new updates in the commit payload to this row in the DB.Allen Madsen
06/10/2025, 9:39 PMJesse Tuglu
06/13/2025, 1:46 AMJesse Tuglu
06/17/2025, 7:31 PM[INFO] ------------------< org.apache.druid:druid-quidem-ut >------------------
[INFO] Building druid-quidem-ut 34.0.0-SNAPSHOT [80/80]
[INFO] from quidem-ut/pom.xml
[INFO] --------------------------------[ jar ]---------------------------------
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-multi-stage-query/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-datasketches/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-orc-extensions/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-parquet-extensions/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-avro-extensions/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-protobuf-extensions/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-s3-extensions/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-kinesis-indexing-service/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-azure-extensions/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-google-extensions/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-hdfs-storage/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-histogram/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-aws-common/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-processing/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-sql/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-indexing-service/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-indexing-hadoop/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/mysql-metadata-storage/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-kafka-indexing-service/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-basic-security/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-lookups-cached-global/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/druid-testing-tools/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/extensions/simple-client-sslcontext/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-services/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-server/34.0.0-SNAPSHOT/maven-metadata.xml>
Downloading from datasets: <https://raw.githubusercontent.com/kgyrtkirk/datasets/repo/org/apache/druid/druid-gcp-common/34.0.0-SNAPSHOT/maven-metadata.xml>
Wondering if you knew about this, and whether this was intentional
cc @Gian Merlino