raghav
10/16/2024, 4:35 PM
[85563.074s][info][gc ] GC(5548) Pause Full (G1 Evacuation Pause) 32705M->32631M(32768M) 3647.713ms
[85563.074s][info][gc,cpu ] GC(5548) User=45.55s Sys=0.00s Real=3.65s
[85563.111s][info][gc,start ] GC(5549) Pause Young (Concurrent Start) (G1 Evacuation Pause)
[85563.111s][info][gc,task ] GC(5549) Using 13 workers of 13 for evacuation
[85563.132s][info][gc,mmu ] GC(5549) MMU target violated: 20.6ms (20.0ms/21.0ms)
[85563.132s][info][gc ] GC(5549) To-space exhausted
[85563.132s][info][gc,phases ] GC(5549) Pre Evacuate Collection Set: 0.4ms
[85563.132s][info][gc,phases ] GC(5549) Evacuate Collection Set: 15.2ms
[85563.132s][info][gc,phases ] GC(5549) Post Evacuate Collection Set: 4.7ms
[85563.132s][info][gc,phases ] GC(5549) Other: 0.5ms
[85563.132s][info][gc,heap ] GC(5549) Eden regions: 12->0(204)
[85563.132s][info][gc,heap ] GC(5549) Survivor regions: 0->0(26)
[85563.132s][info][gc,heap ] GC(5549) Old regions: 3311->3323
[85563.132s][info][gc,heap ] GC(5549) Humongous regions: 773->773
[85563.132s][info][gc,metaspace ] GC(5549) Metaspace: 144441K(152832K)->144441K(152832K)
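To make the numbers in this log easier to read, here is a minimal sketch that parses the Full GC summary line and adds up the region counts reported for GC(5549). The regular expression and the `summarize` helper are illustrative assumptions, not part of any JVM tooling; the input values are copied from the log above.

```python
import re

# Full GC summary line copied from the log above.
FULL_GC_LINE = (
    "[85563.074s][info][gc ] GC(5548) Pause Full (G1 Evacuation Pause) "
    "32705M->32631M(32768M) 3647.713ms"
)

# Matches "before->after(capacity) pause" as printed in the summary line.
SUMMARY = re.compile(r"(\d+)M->(\d+)M\((\d+)M\) ([\d.]+)ms")


def summarize(line):
    """Return reclaimed MB, post-GC occupancy fraction, and pause in ms."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError("no heap summary found in line")
    before, after, capacity = (int(match.group(i)) for i in (1, 2, 3))
    return {
        "reclaimed_mb": before - after,
        "occupancy_after": after / capacity,
        "pause_ms": float(match.group(4)),
    }


stats = summarize(FULL_GC_LINE)
print(stats["reclaimed_mb"])  # 74

# Region counts from GC(5549): eden + survivor + old + humongous.
regions_before = 12 + 0 + 3311 + 773
regions_after = 0 + 0 + 3323 + 773
print(regions_before, regions_after)  # 4096 4096
```

Read this way, the 3.6 s Full GC freed only 74M of a 32768M heap, and the following young collection still ends with 4096 regions in use, 773 of them humongous. If those 4096 regions make up the whole heap, each region would be 8M; that is an inference from the counts, not something the log states.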
GC setting
-Xms24G -Xmx32G -XX:+UseG1GC -XX:MaxGCPauseMillis=20 -XX:G1ReservePercent=15 -XX:+ParallelRefProcEnabled -XX:+DisableExplicitGC -XX:InitiatingHeapOccupancyPercent=35
Is there any recommended setting for GC?
X G
10/17/2024, 6:22 AM
05:42:50.343 WARN [IdealStateGroupCommitTest] [pool-30-thread-100] IdealState updater 91 failed to commit.
java.lang.RuntimeException: Caught exception while updating ideal state for resource: potato_5_OFFLINE
at org.apache.pinot.common.utils.helix.IdealStateGroupCommit.updateIdealState(IdealStateGroupCommit.java:312) ~[pinot-common-1.3.0-SNAPSHOT.jar:1.3.0-SNAPSHOT-e782d0f645ff1c8a32bb0b9c5a0e07f991d50ec5]
at org.apache.pinot.common.utils.helix.IdealStateGroupCommit.commit(IdealStateGroupCommit.java:124) ~[pinot-common-1.3.0-SNAPSHOT.jar:1.3.0-SNAPSHOT-e782d0f645ff1c8a32bb0b9c5a0e07f991d50ec5]
at org.apache.pinot.controller.helix.IdealStateUpdater.run(IdealStateGroupCommitTest.java:149) ~[test-classes/:?]
at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:572) ~[?:?]
at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:317) ~[?:?]
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144) ~[?:?]
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642) ~[?:?]
at java.base/java.lang.Thread.run(Thread.java:1583) [?:?]
Caused by: org.apache.pinot.spi.utils.retry.AttemptsExceededException: Operation failed afte
This error is showing up everywhere in UT set 2 across many unrelated PRs, e.g. 14229 and 14244.
A few days back, we saw and fixed one such case, caused by a PR that was merged while its UT was failing. Could anyone help take a look? Thanks
Leo
10/17/2024, 3:35 PM
raghav
10/17/2024, 5:34 PM
samirun shaik
10/17/2024, 6:09 PM
Subbu Subramaniam
10/17/2024, 6:18 PM
samirun shaik
10/17/2024, 6:25 PM
Sajjad Moradi
10/17/2024, 8:09 PM
`/segments` endpoint:
$ curl -X POST -F segName=@segName.tar.gz controllerHost:controllerPort/segments
Mayank
Melvin
10/18/2024, 2:14 AM
Leo
10/18/2024, 2:02 PM
Nani
10/21/2024, 3:08 PM
Scott deRegt
10/21/2024, 6:38 PM
80 (finishConnect(..) failed: Connection refused: pinot-server-dev-1.example.com/{ip obfuscated}:80)? Nothing is listening on that server port. This configuration is using an unspecified `pinot.query.server.port`/`pinot.query.runner.port`.
Vũ Lê
10/22/2024, 10:08 AM
`values.yaml` file in my Helm deployment. However, after ingesting a real-time table, I don't see any segments being saved to MinIO. Below is my configuration:
Pinot Controller configuration:
# -------------------------------------------------------------------------
# Pinot Controller:
# -------------------------------------------------------------------------
...
data:
  dir: s3://pinot-segments
# Extra configs will be appended to pinot-controller.conf file
extra:
  configs: |-
    controller.disable.ingestion.groovy=false
    pinot.set.instance.id.to.hostname=true
    controller.task.scheduler.enabled=true
    controller.enable.split.commit=true
    pinot.controller.storage.factory.s3.accessKey=minioadmin
    pinot.controller.storage.factory.s3.secretKey=minioadmin
    pinot.controller.segment.fetcher.protocols=file,http,s3
    pinot.controller.segment.fetcher.s3.class=org.apache.pinot.common.utils.fetcher.PinotFSSegmentFetcher
    pinot.controller.storage.factory.class.s3=org.apache.pinot.plugin.filesystem.S3PinotFS
    pinot.controller.storage.factory.s3.endpoint=http://10.168.6.104:9000
    pinot.controller.storage.factory.s3.region=us-west-1
    pinot.controller.storage.factory.s3.disableAcl=false
    controller.local.temp.dir=/var/pinot/controller/data
...
Pinot Server configuration:
# -------------------------------------------------------------------------
# Pinot Server:
# -------------------------------------------------------------------------
...
dataDir: s3://pinot-segments
segmentTarDir: /var/pinot/server/data/segment
# Extra configs will be appended to pinot-server.conf file
extra:
  configs: |-
    pinot.set.instance.id.to.hostname=true
    pinot.server.instance.realtime.alloc.offheap=true
    pinot.query.server.port=7321
    pinot.query.runner.port=7732
    pinot.broker.disable.query.groovy=false
    pinot.server.storage.factory.class.s3=org.apache.pinot.plugin.filesystem.S3PinotFS
    pinot.server.storage.factory.s3.region=us-east-1
    pinot.server.storage.factory.s3.accessKey=minioadmin
    pinot.server.storage.factory.s3.secretKey=minioadmin
    pinot.server.storage.factory.s3.disableAcl=false
    pinot.server.storage.factory.s3.endpoint=http://10.168.6.104:9000
    pinot.server.segment.fetcher.protocols=file,http,s3
    pinot.server.segment.fetcher.s3.class=org.apache.pinot.common.utils.fetcher.PinotFSSegmentFetcher
...
Aryaman Vinchhi
10/22/2024, 6:40 PM
Stuart Millholland
10/22/2024, 7:30 PM
Vipin Rohilla
10/23/2024, 5:23 PM
Anish Nair
10/24/2024, 11:43 AM
Nayot Poolsappasit
10/25/2024, 7:36 PM
Nick Johnson
10/26/2024, 9:09 PM
Nani
10/28/2024, 1:18 AM
Nani
10/28/2024, 1:18 AM
raghav
10/28/2024, 6:17 AM
Praneeth G
10/28/2024, 11:18 AM
2024/10/25 12:17:22.939 INFO [PinotFSFactory] [main] Got scheme hdfs, initializing class org.apache.pinot.plugin.filesystem.HadoopPinotFS
2024/10/25 12:17:22.939 INFO [PinotFSFactory] [main] Initializing PinotFS for scheme hdfs, classname org.apache.pinot.plugin.filesystem.HadoopPinotFS
2024/10/25 12:17:23.125 WARN [NativeCodeLoader] [main] Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2024/10/25 12:17:23.198 ERROR [StartServiceManagerCommand] [main] Failed to start a Pinot [CONTROLLER] at 8.227 since launch
java.lang.NoClassDefFoundError: org/apache/htrace/core/Tracer$Builder
at org.apache.hadoop.fs.FsTracer.get(FsTracer.java:42) ~[hadoop-common-3.2.3.3.2.2.0-1.jar:?]
at org.apache.hadoop.fs.FileSystem.java:3370) ~[hadoop-common-3.2.3.3.2.2.0-1.jar:?]
at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:125) ~[hadoop-common-3.2.3.3.2.2.0-1.jar:?]
at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:3424) ~[hadoop-common-3.2.3.3.2.2.0-1.jar:?]
at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:3392) ~[hadoop-common-3.2.3.3.2.2.0-1.jar:?]
at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:485) ~[hadoop-common-3.2.3.3.2.2.0-1.jar:?]
at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:233) ~[hadoop-common-3.2.3.3.2.2.0-1.jar:?]
at org.apache.pinot.plugin.filesystem.HadoopPinotFS.init(HadoopPinotFS.java:67) ~[pinot-hdfs-1.2.0-shaded.jar:1.2.0-cc33ac502a02e2fe830fe21e556234ee99351a7a]
at org.apache.pinot.spi.filesystem.PinotFSFactory.register(PinotFSFactory.java:52) ~[pinot-all-1.2.0-jar-with-dependencies.jar:1.2.
In the executable file, the classpath is set as:
export HADOOP_HOME=/usr/odp/3.2.2.0-1
export CLASSPATH_PREFIX="${HADOOP_HOME}/share/hadoop/hdfs/hadoop-hdfs-${HADOOP_VERSION}.jar:${HADOOP_HOME}/share/hadoop/common/lib/hadoop-annotations-${HADOOP_VERSION}.jar:${HADOOP_HOME}/share/hadoop/common/lib/hadoop-auth-${HADOOP_VERSION}.jar:${HADOOP_HOME}/share/hadoop/common/hadoop-common-${HADOOP_VERSION}.jar:${HADOOP_HOME}/share/hadoop/common/lib/guava-${HADOOP_GUAVA_VERSION}.jar:${HADOOP_HOME}/share/hadoop/common/lib/gson-${HADOOP_GSON_VERSION}.jar"
On further debugging, I found that this is due to a version mismatch: the box has hadoop-common-3.2.3.3.2.2.0-1, but pinot-1.2 uses hadoop-common-3.3.6.
The Tracer class has moved from org.apache.htrace.core.Tracer to org.apache.hadoop.tracing.Tracer.
Since the Pinot fat/shaded jars (apache-pinot-1.2.0-bin/lib/pinot-all-1.2.0-jar-with-dependencies.jar and apache-pinot-1.2.0-bin/plugins/pinot-file-system/pinot-hdfs/pinot-hdfs-1.2.0-shaded.jar) are added to the classpath later on anyway, I removed my box's Hadoop libs from the classpath, and controller startup succeeded.
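One way to check which jars on a classpath actually provide a class like the one named in the NoClassDefFoundError is to scan them directly. The following is a minimal sketch; `jars_containing` is a hypothetical helper, not part of Pinot or Hadoop, and it only inspects plain `.jar` files listed explicitly on the classpath (no wildcards or directories).

```python
import os
import zipfile
from pathlib import Path


def jars_containing(class_name, classpath, sep=":"):
    """Return the classpath jar entries that contain the given class.

    class_name uses the slash-separated form printed in a
    NoClassDefFoundError, e.g. "org/apache/htrace/core/Tracer$Builder".
    """
    entry = class_name + ".class"
    hits = []
    for item in classpath.split(sep):
        path = Path(item)
        if path.suffix != ".jar" or not path.is_file():
            continue  # skip directories, wildcards, and missing files
        with zipfile.ZipFile(path) as jar:
            if entry in jar.namelist():
                hits.append(str(path))
    return hits


# Example: scan whatever CLASSPATH_PREFIX is currently exported.
print(jars_containing(
    "org/apache/htrace/core/Tracer$Builder",
    os.environ.get("CLASSPATH_PREFIX", ""),
))
```

Shaded jars may relocate packages under a different prefix, so an empty result for a shaded jar does not by itself prove the class is absent.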
Queries:
1. If the fat jars have all the needed libraries, do we still need to add the Hadoop libraries to the classpath in the executable file, as per the documentation?
2. We will not be able to update the Hadoop lib version, so will there be any issues with removing the conflicting versions and relying on the Pinot fat jars?
Please help with the above queries.
suraj sheshadri
10/28/2024, 7:27 PM
SELECT count(*)
FROM usersample_main
WHERE arraytomv(dmp_segments) IN (1132) AND arraytomv(dmp_segments) IN (1135)
cc: @Mayank @Jackie @Xiang Fu
ulagaraja j
10/29/2024, 5:00 AM
Anish Nair
10/29/2024, 10:21 AM
Jatin Kumar
10/30/2024, 3:43 AM
Alexander Maniates
10/30/2024, 12:57 PM
`properties` and the JSON looks like
{
"num_clicks": "5514400327644543899"
}
and we use `json_extract_scalar(properties, '$.num_clicks', 'LONG', 0)`, the actual resulting value is 5514400327644544000
Has anybody else run into this?
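For what it's worth, the reported result is exactly the value you get if the number passes through a 64-bit floating-point representation before becoming a long. The thread does not confirm that this is what happens inside `json_extract_scalar`; the sketch below only reproduces the arithmetic, using the JSON from this message.

```python
import json

raw = '{"num_clicks": "5514400327644543899"}'
text = json.loads(raw)["num_clicks"]

exact = int(text)              # parsed directly as an integer
via_double = int(float(text))  # routed through a 64-bit float first

print(exact)               # 5514400327644543899
print(via_double)          # 5514400327644544000
print(via_double - exact)  # 101
```

Between 2^62 and 2^63, adjacent doubles are 1024 apart, so any long in that range that is not a multiple of 1024 gets rounded to the nearest one.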
I have also reproduced a failing test here that shows the problem: https://github.com/apache/pinot/pull/14332/files
Nick Johnson
10/30/2024, 6:19 PM