Elon
03/16/2022, 5:18 PMFacundo Bianco
03/16/2022, 8:18 PM"dateTimeFieldSpecs": [{
"name": "timestampCustom",
"dataType": "STRING",
"format" : "1:MILLISECONDS:SIMPLE_DATE_FORMAT:yyyy-MM-dd'T'HH:mm:ss.SSZZ",
"granularity": "1:MILLISECONDS"
}]
Table is generated successfully but POST command returns
I discovered is related to date format, could you kindly indicate how should it be? I used this site to generate the custom format. Thanks in advance!Copy code{ "code": 500, "error": "Caught exception when ingesting file into table: foo_OFFLINE. null" }
Grace Lu
03/16/2022, 11:26 PMException in thread "main" java.lang.NoSuchMethodError: org.apache.hadoop.yarn.api.records.impl.pb.ProtoUtils.convertToProtoFormat(Lorg/apache/hadoop/yarn/api/records/ExecutionType;)Lorg/apache/hadoop/yarn/proto/YarnProtos$ExecutionTypeProto;
at org.apache.hadoop.yarn.api.records.impl.pb.ExecutionTypeRequestPBImpl.setExecutionType(ExecutionTypeRequestPBImpl.java:73)
We wonder if pinot has also introduced this class in its dependencies and if it is conflicted with the library in our hadoop cluster itself? We are at spark 2.4.6, hadoop 2.9.1, pinot 0.9.2, and seems like pinot 0.9.2 is built with hadoop2.7.0 and spark 2.4.0, have we tested the compatible spark/hadoop version for running ingestion jobs?Jonathan Meyer
03/17/2022, 4:18 PMSUM
to not return 0 when there's actually no values to aggregate ? i.e. return null
in such caseTony Requist
03/17/2022, 7:22 PM"realtime.segment.flush.threshold.rows": "10000000",
"realtime.segment.flush.threshold.time": "6h",
"realtime.segment.flush.threshold.segment.size": "400M",
I changed these values two days ago, previously the "rows" limit was 0.
Pinot is generating segments with 3,333,333 rows, every ~90 minutes, 95-100MB -- significantly below any of the limits.
Server logs show Starting consumption on realtime consuming segment ... maxRowCount 33333
and Stopping consumption due to row limit nRows=3333333
- I am trying to figure out where that limit is coming from.Luis Fernandez
03/17/2022, 9:13 PMSandeep R
03/18/2022, 1:19 AM"name": "LOG_TS",
"dataType": "LONG",
"format": "1:MILLISECONDS:SIMPLE_DATE_FORMAT:yyyy-mm-ddThh:mm:ss.sssZ",
"granularity": "1:MILLISECONDS"
Luis Fernandez
03/18/2022, 4:12 PMTable Consuming Latency
I have been turning off and on various part of pinot to see how it behaves, this time i decided to turn off for sometime the kafka app that produces the records to pinot, i saw a latency increase when i turned off the app and at least for p99, it was 160ms and now is over a minute, when things like this happen when do you expect pinot to get back to its regular level does it ever get back? I was thinking as the day goes by maybe and this topic start to get less traffic then maybe things come down but I was wondering if that somehow can come back any other way. Ofc this is still pretty fast but I’m wondering what happens if I were to take down the app for a longer time how could that impact the p99 timesLuis Fernandez
03/18/2022, 7:00 PMEncountered exception while processing requestId 9610 from broker Broker_pinot-broker-1.pinot-broker-headless.pinot.svc.cluster.local_8099
java.lang.NullPointerException: null
at org.apache.pinot.core.util.trace.TraceContext.getTraceInfo(TraceContext.java:191) ~[pinot-all-0.10.0-SNAPSHOT-jar-with-dependenci
es.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at org.apache.pinot.core.query.executor.ServerQueryExecutorV1Impl.processQuery(ServerQueryExecutorV1Impl.java:223) ~[pinot-all-0.10.
0-SNAPSHOT-jar-with-dependencies.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at org.apache.pinot.core.query.executor.QueryExecutor.processQuery(QueryExecutor.java:60) ~[pinot-all-0.10.0-SNAPSHOT-jar-with-depen
dencies.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at org.apache.pinot.core.query.scheduler.QueryScheduler.processQueryAndSerialize(QueryScheduler.java:151) ~[pinot-all-0.10.0-SNAPSHO
T-jar-with-dependencies.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at org.apache.pinot.core.query.scheduler.QueryScheduler.lambda$createQueryFutureTask$0(QueryScheduler.java:137) ~[pinot-all-0.10.0-S
NAPSHOT-jar-with-dependencies.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?]
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
at shaded.com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListe
nableFutureTask.java:111) [pinot-all-0.10.0-SNAPSHOT-jar-with-dependencies.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at shaded.com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:58) [pinot-all-0.10.0-SNAPSHOT-jar-with-dep
endencies.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at shaded.com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:75) [pinot-all-0.10.0-S
NAPSHOT-jar-with-dependencies.jar:0.10.0-SNAPSHOT-b7c181a77289fccb10cea139a097efb5d82f634a]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
at java.lang.Thread.run(Thread.java:829) [?:?]
does anyone know what this NullPointer may refer to?Weixiang Sun
03/18/2022, 8:25 PMBordin Suwannatri
03/21/2022, 5:15 AMAli Atıl
03/21/2022, 7:22 AMDiana Arnos
03/22/2022, 2:01 PMConsumed 261 events from (rate:3.1030054/s), currentOffset=763096, numRowsConsumedSoFar=288096, numRowsIndexedSoFar=288096
....
[Consumer clientId=consumer-455, groupId=] Discovered group coordinator <redacted> (id: 2147483646 rack: null)
But the controller still show them with dead
status and when I try to query the data, I see in the Broker log:
No server found for request 1: select responseId from responseCount limit 1
And this is the response from the query API:
{
"exceptions": [],
"numServersQueried": 0,
"numServersResponded": 0,
"numSegmentsQueried": 0,
"numSegmentsProcessed": 0,
"numSegmentsMatched": 0,
"numConsumingSegmentsQueried": 0,
"numDocsScanned": 0,
"numEntriesScannedInFilter": 0,
"numEntriesScannedPostFilter": 0,
"numGroupsLimitReached": false,
"totalDocs": 0,
"timeUsedMs": 0,
"offlineThreadCpuTimeNs": 0,
"realtimeThreadCpuTimeNs": 0,
"segmentStatistics": [],
"traceInfo": {},
"minConsumingFreshnessTimeMs": 0,
"numRowsResultSet": 0
}
How can I make the Controller see they are alive? 👀Weixiang Sun
03/23/2022, 4:52 AMBordin Suwannatri
03/23/2022, 8:30 AMeywek
03/23/2022, 4:59 PMpurchase
, `page_view`…) and I would like to create a segment by event name (with a size limit ofc).
Since those events are user defined I can’t really know how many partitions I’ll have.
I’ve seen Murmur, Hashcode… partition config but it doesn’t insure me that each event type will have a dedicated segment (e.g. I don’t want page_view
and purchase
events to be in the same segments, to avoid loading any page_view
data when doing a query on page_view
ones)
Thank youWei Li
03/24/2022, 6:51 AMahsen m
03/24/2022, 5:45 PMLuis Fernandez
03/25/2022, 6:44 PMIdealState
in zookeeper?Diogo Baeder
03/25/2022, 11:35 PMDiogo Baeder
03/27/2022, 11:03 PMDiana Arnos
03/28/2022, 12:08 PMcontroller.extra.configs
, but every time I do it the Controller starts responding with 502 Bad Gateway
and I can't see anything wrong in the logs.
Results from helm template
on the thread.Bordin Suwannatri
03/28/2022, 3:56 PMLuis Fernandez
03/28/2022, 5:28 PMkubectl
however I noticed that when I run this command I always get a brand new server and have to issue rebalances again, is restarting servers something that requires rebalancing? I'm pretty sure it must be something funky going on with our configLakshmanan Velusamy
03/28/2022, 6:45 PMahsen m
03/29/2022, 1:34 AMpersistence:
enabled: true
accessMode: ReadWriteOnce
size: 2G
mountPath: /var/pinot/controller/data
storageClass: ""
extraVolumes:
- name: gcp-credentials-volume
secret:
secretName: gcp-credentials
items:
- key: gcp_creds_json
path: gcp_credentials.json
extraVolumeMounts:
- name: gcp-credentials-volume
mountPath: /opt/pinot/gcp
readOnly: true
but when i run helm template testing --debug . the template it generates does not have any volume mount named `gcp-credentials-volume`, any idea's?
sunny
03/29/2022, 1:43 AMMohammed Galalen
03/29/2022, 6:01 AMprotoc-gen-grpc-java-1.4.0-osx-x86_64
and the other com.github.eirslett:frontend-maven-plugin:1.1
I had to upgrade ``com.github.eirslett:frontend-maven-plugin`` to 1.11.0
and downloaded the protoc-gen-grpc-java-1.4.0-osx-x86_64
manually.
But I couldn't run the example, and I'm getting this error
Failed to start a Pinot [SERVER] at 15.16 since launch
java.lang.RuntimeException: java.util.concurrent.RejectedExecutionException: event executor terminated
at org.apache.pinot.core.transport.QueryServer.start(QueryServer.java:136) ~[pinot-all-0.10.0-SNAPSHOT-jar-with-dependencies.jar:0.10.0-SNAPSHOT-649f5988d5746869ef6a690f4747ff4d6fb9c607]
at org.apache.pinot.server.starter.ServerInstance.start(ServerInstance.java:165)
Kamal Chavda
03/29/2022, 5:50 PMDiogo Baeder
03/30/2022, 12:31 AMsegmentPartitionConfig
map doesn't accept the mapping of column to partition config directly, as the table configuration documentation says, but rather can only contain a columnPartitionMap
field it seems, and then this field in its turn can contain the mapping between column and partition config
• The segmentsConfig
seems to have had its old replicasPerPartition
renamed to replication
, if I understand correctly - or maybe I just don't understand where each should be used, if both are valid (although the config docs don't mention replicasPerPartition
anymore)
Should I open a ticket on GitHub about these? Or am I getting something wrong perhaps?