Venkat Boina(VB)
05/13/2023, 7:12 PM
Chris Han
05/15/2023, 7:47 PM
IDEAL STATE for a table in Zookeeper. The IDEAL STATE JSON I need to update is over 769,000 characters long (there are over 8000 segments), and when I try to update it I'm receiving a Bad Request response, presumably because the request data to Zookeeper is too long. I need to manually update the DEAD server IPs with ALIVE server IPs. I have over 8000 of these entries:
...
"table_OFFLINE_8697": {
"Server_10.193.7.135_8098": "ONLINE"
},
"table_OFFLINE_8698": {
"Server_10.193.7.135_8098": "ONLINE"
},
...
Is there a way I can iteratively update the IDEAL STATE that doesn't require me to upload the entire document?
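A minimal sketch of the DEAD-to-ALIVE replacement described above, done offline on the segment-to-server map. It assumes the entries have exactly the shape shown (segment name mapped to `{server: state}`); the function name `replace_instance` and the replacement host `Server_10.0.0.1_8098` are made up for illustration. It only rewrites the JSON and does not address the Bad Request on upload.

```python
import json


def replace_instance(segment_map, dead, alive):
    """Return a copy of segment_map with the `dead` server key renamed to `alive`."""
    updated = {}
    for segment, instances in segment_map.items():
        # Copy every assignment except the one on the dead server.
        new_instances = {i: s for i, s in instances.items() if i != dead}
        if dead in instances:
            # Move the dead server's state to the alive server,
            # unless the alive server already holds this segment.
            new_instances.setdefault(alive, instances[dead])
        updated[segment] = new_instances
    return updated


segments = {
    "table_OFFLINE_8697": {"Server_10.193.7.135_8098": "ONLINE"},
    "table_OFFLINE_8698": {"Server_10.193.7.135_8098": "ONLINE"},
}

# Server_10.0.0.1_8098 is a hypothetical ALIVE server.
migrated = replace_instance(
    segments, "Server_10.193.7.135_8098", "Server_10.0.0.1_8098"
)
print(json.dumps(migrated, indent=2))
```

The input map is left untouched, so the original and rewritten documents can be diffed before anything is written back.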
Is there another way I can "migrate" the segments from one server to another within the Zookeeper configs?
Ethan Huang
05/16/2023, 3:27 AM
DefaultIndexCreatorProvider#newRangeIndexCreator, RangeIndexHandler#handleNonDictionaryBasedColumn). However, the BitSlicedRangeIndexCreator relies on the min and max values of the indexed column, but minValue and maxValue are both null in ColumnMetadata when the column has no dictionary.
Is this a bug, or is additional configuration needed to avoid this exception? BTW, the version is the 0.12.1 release. Thanks.
Venkat Boina(VB)
05/16/2023, 7:44 AM
Lee Wei Hern Jason
05/16/2023, 8:44 AM
envFrom:
  - secretRef:
      name: pinot-secrets
extra:
  configs: |-
    pinot.set.instance.id.to.hostname=true
    pinot.minion.storage.factory.class.s3=org.apache.pinot.plugin.filesystem.S3PinotFS
    pinot.minion.storage.factory.s3.region=ap-southeast-1
    pinot.minion.segment.fetcher.protocols=file,http,s3
    pinot.minion.segment.fetcher.s3.class=org.apache.pinot.common.utils.fetcher.PinotFSSegmentFetcher
    segment.fetcher.auth.token=${PINOT_SEGMENT_FETCHER_AUTH_TOKEN}
    task.auth.token=${PINOT_SEGMENT_FETCHER_AUTH_TOKEN}
Lvszn Peng
05/16/2023, 12:19 PM
Exception in thread "main" java.lang.NoSuchFieldError: JAVA_11. Is the Java version too low?
Ehsan Irshad
05/16/2023, 1:21 PM
numSegmentsProcessed by Segment Pruning on broker
2. Reduce the numEntriesScannedPostFilter by adding more filters in query
3. Because of 2, numEntriesScannedInFilter will increase. So make it 0 by adding the indexes
Deepak Arumugham
05/16/2023, 1:43 PM
Chris Han
05/17/2023, 3:35 PM
Exception in thread "idle-connection-reaper" java.lang.OutOfMemoryError: Java heap space
Deepak Arumugham
05/17/2023, 10:54 PM
controller.data.dir=gs://pinot-data-dir
pinot.controller.storage.factory.class.gs=org.apache.pinot.plugin.filesystem.GcsPinotFS
pinot.controller.segment.fetcher.protocols=file,http,gs
pinot.controller.segment.fetcher.gs.class=org.apache.pinot.common.utils.fetcher.PinotFSSegmentFetcher
Even though we have provided the correct GCS data directory for the controller, the segments are being created locally on the Pinot cluster's disk, and we soon run into java.lang.OutOfMemoryError: Java heap space. Our Parquet files are in the 50-500 MB range.
We are under the impression that on ingestion, the data would be processed and the segments created in the GCS bucket. Am I missing something here?
How can we solve this? Any pointers would be helpful.
Michael Roman Wengle
05/18/2023, 5:34 AM
no native library is found for os.name=Linux and os.arch=aarch64
null
java.lang.NullPointerException
at xerial.larray.impl.LArrayLoader$NativeLib.extractLibraryFile(LArrayLoader.java:182)
[...]
java.lang.UnsatisfiedLinkError: 'long xerial.larray.impl.LArrayNative.mmap(long, int, long, long)'
at xerial.larray.impl.LArrayNative.mmap(Native Method) ~[pinot-all-0.13.0-SNAPSHOT-jar-with-dependencies.jar:0.13.0-SNAPSHOT-12d86902a84d4bc78b6f2f7bc8bd002659ee61cb]
The minions are deployed on Graviton nodes in k8s (official Pinot Helm chart).
Did anyone experience the same problem? Is there a way to solve the issue or do we need to switch to x86 k8s nodes?
Eaugene Thomas
05/18/2023, 7:48 AM
Deena Dhayalan
05/18/2023, 8:31 AM
Tommaso Peresson
05/18/2023, 11:48 AM
ConcurrentTasksPerWorker in the minion config runtime for a SegmentGenerationAndPushTask task? Thanks
Tanmay Varun
05/18/2023, 5:16 PM
helm install -n pinot-quickstart kafka kafka/kafka --set replicas=1,zookeeper.image.tag=latest
replicas --> replicaCount
Deepak Arumugham
05/19/2023, 5:51 AM
Caught Exception in state transition from OFFLINE -> ONLINE for resource
Can you please provide any insights on this? Once the ingestion is complete, the segment goes to BAD state.
And on trying to query, we are getting
{
"errorCode": 305,
"message": "null:\n1 segments unavailable: [xyz_OFFLINE_2021-11-16-17_2022-09-21-00_0]"
}
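A small sketch for pulling the unavailable segment names out of an error response like the one above, so they can be checked one by one. The helper name `unavailable_segments` is hypothetical, and the comma-separated handling for more than one segment is an assumption, since the response above lists only one.

```python
import re


def unavailable_segments(response):
    """Extract segment names from a '... segments unavailable: [...]' message."""
    message = response.get("message", "")
    match = re.search(r"segments unavailable: \[(.*?)\]", message)
    if not match:
        return []
    # Assumption: multiple names, if present, are comma-separated.
    return [name.strip() for name in match.group(1).split(",") if name.strip()]


error = {
    "errorCode": 305,
    "message": "null:\n1 segments unavailable: "
               "[xyz_OFFLINE_2021-11-16-17_2022-09-21-00_0]",
}
print(unavailable_segments(error))
```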
Sanjay
05/19/2023, 1:07 PM
standalone ingestion and it tries to copy the input files to the /tmp directory, which eventually causes the space issue. Is there any parameter to change this to some other mount path?
Tommaso Peresson
05/19/2023, 4:04 PM
SegmentGenerationAndPushTask tasks takes minutes and I don’t know how to debug and optimise it. I thought it was wildcards in the input format triggering a long scan on GCS (as it is a flat FS), but removing them doesn’t help. Can someone please help me with a checklist of things to look for to optimise this process?
J Vossler
05/19/2023, 5:16 PM
Raveendra Yerraguntla
05/20/2023, 6:59 PM
Tanmay Varun
05/20/2023, 10:07 PM
"segmentPartitionConfig": {
"columnPartitionMap": {
"merchantId": {
"functionName": "Murmur",
"numPartitions": 36
}
}
},
Ayush Chauhan (Tech)
05/21/2023, 6:09 AM
Abhijeet Kushe
05/21/2023, 1:35 PM
Abhijeet Kushe
05/21/2023, 1:35 PM
{
"tableName": "workflowEvents",
"tableType": "REALTIME",
"segmentsConfig": {
"timeColumnName": "eventTimestamp",
"timeType": "MILLISECONDS",
"schemaName": "workflowEvents",
"replicasPerPartition": "4",
"retentionTimeUnit": "DAYS",
"retentionTimeValue": "1826",
"segmentPushType": "APPEND"
},
"tenants": {
"broker": "DefaultTenant",
"server": "DefaultTenant"
},
"tableIndexConfig": {
"loadMode": "MMAP",
"streamConfigs": {
"streamType": "kinesis",
"stream.kinesis.topic.name": "prod-rel-cdp-dl-workflow-metrics-stream",
"region": "us-east-1",
"shardIteratorType": "LATEST",
"stream.kinesis.consumer.type": "lowlevel",
"stream.kinesis.fetch.timeout.millis": "30000",
"stream.kinesis.decoder.class.name": "org.apache.pinot.plugin.stream.kafka.KafkaJSONMessageDecoder",
"stream.kinesis.consumer.factory.class.name": "org.apache.pinot.plugin.stream.kinesis.KinesisConsumerFactory",
"realtime.segment.flush.threshold.size": "1000000",
"realtime.segment.flush.threshold.time": "1h"
}
},
"upsertConfig": {
"mode": "FULL"
},
"routing": {
"instanceSelectorType": "strictReplicaGroup"
},
"metadata": {
"customConfigs": {}
}
}
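A quick sanity check on the upsert and routing parts of a config like the one above, as a sketch. It relies on the documented Pinot requirement that upsert tables use `strictReplicaGroup` routing; the helper name `upsert_routing_ok` is made up, and only the relevant keys of the config are reproduced here.

```python
def upsert_routing_ok(table_config):
    """Return True if upsert is disabled, or enabled with strictReplicaGroup routing."""
    mode = table_config.get("upsertConfig", {}).get("mode", "NONE")
    if mode == "NONE":
        return True
    selector = table_config.get("routing", {}).get("instanceSelectorType")
    return selector == "strictReplicaGroup"


# Trimmed copy of the table config above (only the keys the check reads).
table_config = {
    "tableName": "workflowEvents",
    "tableType": "REALTIME",
    "upsertConfig": {"mode": "FULL"},
    "routing": {"instanceSelectorType": "strictReplicaGroup"},
}
print(upsert_routing_ok(table_config))
```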
Abhijeet Kushe
05/21/2023, 1:36 PM
pinot.server.instance.max.parallel.refresh.threads=3
Sid
05/22/2023, 10:20 AM
Tanmay Varun
05/22/2023, 9:58 PM
Sonit Rathi
05/23/2023, 4:02 AM
Ehsan Irshad
05/23/2023, 7:07 AM
Jatin
05/23/2023, 9:56 AM