Prathamesh
08/09/2025, 9:52 AMSan Kumar
08/12/2025, 3:25 AMZaeem Arshad
08/12/2025, 3:47 AMArnav
08/12/2025, 4:23 AMarnavshi
08/12/2025, 7:05 AMForbidden: updates to statefulset spec for fields other than \'replicas\', \'ordinals\', \'template\', \'updateStrategy\', \'persistentVolumeClaimRetentionPolicy\' and \'minReadySeconds\' are forbidden\n'
While I understand that this is a Kubernetes issue/limitation, I wanted your guidance on what can be done to resolve this.San Kumar
08/12/2025, 11:09 AMam_developer
08/12/2025, 11:31 AMAbdulaziz Alqahtani
08/14/2025, 11:02 AMavailabilityLagMsMap
from /consumingSegmentsInfo
→ reports ~200–400 ms for me.
• endToEndRealtimeIngestionDelayMs
from Prometheus → shows a “saw-tooth” pattern, peaking around 5 seconds.
Can someone explain the difference between these two metrics, why they report different values, and whether the saw-tooth pattern is expected?Idlan Amran
08/18/2025, 2:38 AMprofile
, each JSON will have around 5M rows so it will have consistent JSON and segment size:
SELECT shop, svid, spid, type, profile, "key", message, product,
CAST(MAX(created_at) AS TIMESTAMP) AS created_at,
ARRAY_AGG(product_log, 'STRING', TRUE) AS product_log
FROM product_tracking
WHERE profile = {profile}
AND created_at >= CAST(DATE_TRUNC('DAY', timestampAdd(DAY,{-lookback_days},NOW()), 'MILLISECONDS','GMT-04:00') AS TIMESTAMP)
AND created_at < CAST(DATE_TRUNC('DAY', timestampAdd(DAY,0,NOW()), 'MILLISECONDS','GMT-04:00') AS TIMESTAMP)
GROUP BY shop, svid, spid, type, profile, "key", message, product
LIMIT 999999999
need help for any insights/feedback from other Pinot OSS users, thanks.Rishabh Sharma
08/18/2025, 12:37 PMSan Kumar
08/19/2025, 5:28 AMSan Kumar
08/19/2025, 5:54 AMkranthi kumar
08/19/2025, 1:29 PMMilind Chaudhary
08/20/2025, 5:49 AMIndira Vashisth
08/21/2025, 12:52 PMShubham Kumar
08/21/2025, 1:00 PMtar.gz
, such as zstd or Snappy?
2. I created an index on a column (col1
) and ingested data. Suppose a segment contains 50 records, and I run a query with the condition col1 = 'xyz'
. In this case, does Pinot load the entire segment into memory and then filter the records, or does it directly fetch only the matching data from the segment?Sandeep R
08/25/2025, 11:36 PMJan Siekierski
08/27/2025, 11:33 AMJohn Solomon J
08/28/2025, 7:17 PMVatsal Agrawal
08/29/2025, 5:43 AMArnav
08/29/2025, 5:52 PM{
"start_time_new": {
"long": 1756489188000
},
"event_time_new": {
"long": 1756489188000
}
}
i tried below configuration but it's not parsing
"ingestionConfig": {
"transformConfigs": [
{
"columnName": "start_time_new",
"transformFunction": "jsonPathLong(__raw__start_time_new, '$.long', 0)"
},
{
"columnName": "event_time_new",
"transformFunction": "jsonPathLong(__raw__event_time_new, '$.long', 0)"
}
],
"continueOnError": false,
"rowTimeValueCheck": false,
"segmentTimeValueCheck": true
}
Rajkumar
08/30/2025, 6:23 PMArnav
09/01/2025, 7:07 AM"stream.kafka.decoder.prop.format": "AVRO",
"stream.kafka.decoder.prop.schema.registry.schema.name": "schema-name",
"stream.kafka.decoder.class.name": "org.apache.pinot.plugin.inputformat.avro.confluent.KafkaConfluentSchemaRegistryAvroMessageDecoder",
"stream.kafka.decoder.prop.schema.registry.rest.url": "schema-url",
"stream.kafka.decoder.prop.key.format": "AVRO",
"stream.kafka.decoder.prop.key.schema.registry.schema.name": "schema-name-key",
"stream.kafka.decoder.prop.key.schema.registry.rest.url": "schema-url",
data is also properly deserialised. Only __key is blank. My guess is that below configs i added is not able to deserialise it. Is there any other way to deserialise the key?
"stream.kafka.decoder.prop.key.format": "AVRO",
"stream.kafka.decoder.prop.key.schema.registry.schema.name": "schema-name-key",
"stream.kafka.decoder.prop.key.schema.registry.rest.url": "schema-url",
Abdulaziz Alqahtani
09/01/2025, 7:17 PMtenant_id
(ULID). The column is low cardinality, and most queries include a tenant_id
predicate. What’s the best way to index this column?cesho
09/04/2025, 2:16 PMAbdulaziz Alqahtani
09/07/2025, 8:34 PMmg
09/08/2025, 8:09 PM<https://example.com/pinot/>
. The main UI works fine and most links are correctly routed. Those that works open on <https://example.com/pinot/#/>...
However, the Swagger REST API UI link is not. Swagger API button, it tries to access <https://example.com/help>
instead of <https://example.com/pinot/help>
, resulting in a 404 Not Found error.
I don't see an obvious way to enforce the swagger link subpath to something other than (/) ? I am using helm, and I have been looking for different options in https://github.com/apache/pinot/blob/master/helm/pinot/README.md but nothing worked..
thanks in advance..Soon
09/11/2025, 5:19 PMFILTER_SORTED_INDEX
would it be the same as using FILTER_INVERTED_INDEX
like sorted inverted index?Indira Vashisth
09/15/2025, 9:57 AMIndira Vashisth
09/15/2025, 10:02 AM