Prakash Tirumalareddy
10/05/2020, 2:23 PMCaused by: java.lang.IllegalArgumentException: Parameter 'Bucket' must not be null
I am using 0.5.0
GenerationJobRunner,
segmentTarPushJobRunnerClassName: org.apache.pinot.plugin.ingestion.batch.standalone.SegmentTarPushJobRunner,
segmentUriPushJobRunnerClassName: org.apache.pinot.plugin.ingestion.batch.standalone.SegmentUriPushJobRunner}
includeFileNamePattern: glob:**/*.parquet
inputDirURI: s3://edp-pinot-data/nem13/
jobType: SegmentCreationAndUriPush
outputDirURI: s3://edp-pinot-segments/nem13/segments
overwriteOutput: true
pinotClusterSpecs:
- {controllerURI: 'http://localhost:9000'}
pinotFSSpecs:
- {className: org.apache.pinot.spi.filesystem.LocalPinotFS, configs: null, scheme: file}
- className: org.apache.pinot.plugin.filesystem.S3PinotFS
configs: {region: ap-southeast-2}
scheme: s3
pushJobSpec: {pushAttempts: 1, pushParallelism: 1, pushRetryIntervalMillis: 1000,
segmentUriPrefix: 's3://edp-pinot-segments', segmentUriSuffix: null}
recordReaderSpec: {className: org.apache.pinot.plugin.inputformat.parquet.ParquetRecordReader,
configClassName: null, configs: null, dataFormat: parquet}
segmentNameGeneratorSpec: null
tableSpec: {schemaURI: 'http://localhost:9000/tables/nem13/schema', tableConfigURI: 'http://localhost:9000/tables/nem13',
tableName: nem13}
Am I missing anything? Please help!!!Kishore G
Kartik Khare
10/05/2020, 3:38 PMPrakash Tirumalareddy
10/05/2020, 11:18 PMDaniel Lavoie
10/05/2020, 11:19 PMPrakash Tirumalareddy
10/05/2020, 11:19 PMPrakash Tirumalareddy
10/05/2020, 11:32 PMPrakash Tirumalareddy
10/05/2020, 11:46 PMPrakash Tirumalareddy
10/06/2020, 1:36 AMPrakash Tirumalareddy
10/06/2020, 2:44 AMNeha Pawar
Neha Pawar
Neha Pawar
Neha Pawar
Prakash Tirumalareddy
10/06/2020, 6:16 AMPrakash Tirumalareddy
10/06/2020, 6:17 AMKartik Khare
10/06/2020, 6:59 AMPrakash Tirumalareddy
10/06/2020, 6:59 AMKartik Khare
10/06/2020, 6:59 AMKartik Khare
10/06/2020, 6:59 AMPrakash Tirumalareddy
10/06/2020, 7:00 AMPrakash Tirumalareddy
10/06/2020, 7:00 AMpushJobSpec:
pushAttempts: 1
pushRetryIntervalMillis: 1000
segmentUriPrefix: "s3://"
segmentUriSuffix: ""
Kartik Khare
10/06/2020, 7:12 AMmvn clean package -DskipTests -Pbin-dist
Prakash Tirumalareddy
10/06/2020, 7:12 AMPrakash Tirumalareddy
10/06/2020, 1:47 PM2020/10/07 00:44:54.420 INFO [PinotFSFactory] [main] Initializing PinotFS for scheme s3, classname org.apache.pinot.plugin.filesystem.S3PinotFS
2020/10/07 00:44:54.891 INFO [S3PinotFS] [main] mkdir <s3://edp-pinot-segments/nem13/segments>
2020/10/07 00:44:55.598 INFO [S3PinotFS] [main] Listed 1 files from URI: <s3://edp-pinot-data/nem13/>, is recursive: true
2020/10/07 00:44:56.043 INFO [S3PinotFS] [main] Copy <s3://edp-pinot-data/nem13/currentregisterreaddate=2002-06-14/active_ind=Y/part-00000-2c2c776c-12f8-45a0-96fa-e402b13fdb57.c000.snappy.parquet> to local /var/folders/xs/bknv88ln05g5z3dgzss7whw80000gn/T/pinot-956cf81e-458b-45f3-9669-c24019eeacd3/input/part-00000-2c2c776c-12f8-45a0-96fa-e402b13fdb57.c000.snappy.parquet
2020/10/07 00:44:56.176 WARN [SegmentIndexCreationDriverImpl] [main] Using class: org.apache.pinot.plugin.inputformat.parquet.ParquetRecordReader to read segment, ignoring configured file format: AVRO
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/fs/Path
at org.apache.pinot.plugin.inputformat.parquet.ParquetRecordReader.init(ParquetRecordReader.java:46)
at org.apache.pinot.spi.data.readers.RecordReaderFactory.getRecordReaderByClass(RecordReaderFactory.java:133)
at org.apache.pinot.core.segment.creator.impl.SegmentIndexCreationDriverImpl.getRecordReader(SegmentIndexCreationDriverImpl.java:120)
at org.apache.pinot.core.segment.creator.impl.SegmentIndexCreationDriverImpl.init(SegmentIndexCreationDriverImpl.java:96)
at org.apache.pinot.plugin.ingestion.batch.common.SegmentGenerationTaskRunner.run(SegmentGenerationTaskRunner.java:104)
at org.apache.pinot.plugin.ingestion.batch.standalone.SegmentGenerationJobRunner.run(SegmentGenerationJobRunner.java:190)
at org.apache.pinot.spi.ingestion.batch.IngestionJobLauncher.kickoffIngestionJob(IngestionJobLauncher.java:142)
at org.apache.pinot.spi.ingestion.batch.IngestionJobLauncher.runIngestionJob(IngestionJobLauncher.java:117)
at org.apache.pinot.tools.admin.command.LaunchDataIngestionJobCommand.execute(LaunchDataIngestionJobCommand.java:123)
at org.apache.pinot.tools.admin.command.LaunchDataIngestionJobCommand.main(LaunchDataIngestionJobCommand.java:65)
Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.fs.Path
at java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(BuiltinClassLoader.java:602)
at java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(ClassLoaders.java:178)
at java.base/java.lang.ClassLoader.loadClass(ClassLoader.java:521)
... 10 more
Prakash Tirumalareddy
10/06/2020, 1:47 PMKartik Khare
10/06/2020, 1:48 PMPrakash Tirumalareddy
10/06/2020, 1:50 PMKartik Khare
10/06/2020, 1:51 PMsegmentUriPrefix: ""
and try againDaniel Lavoie
10/06/2020, 1:51 PMDaniel Lavoie
10/06/2020, 1:52 PMKartik Khare
10/06/2020, 1:52 PMDaniel Lavoie
10/06/2020, 1:54 PMjava.lang.NoClassDefFoundError: org/apache/hadoop/fs/Path
Prakash Tirumalareddy
10/06/2020, 1:54 PMPrakash Tirumalareddy
10/06/2020, 1:58 PMKartik Khare
10/06/2020, 2:00 PMPrakash Tirumalareddy
10/06/2020, 2:02 PMPrakash Tirumalareddy
10/06/2020, 2:08 PMDaniel Lavoie
10/06/2020, 2:09 PMCaused by: java.lang.IllegalArgumentException: INT96 not yet implemented.
Kartik Khare
10/06/2020, 2:11 PMDaniel Lavoie
10/06/2020, 2:12 PMorg.apache.parquet.avro.AvroSchemaConverter
from PinotPrakash Tirumalareddy
10/06/2020, 2:13 PMKartik Khare
10/06/2020, 2:15 PMPrakash Tirumalareddy
10/06/2020, 2:17 PMPrakash Tirumalareddy
10/06/2020, 2:21 PMKartik Khare
10/06/2020, 2:21 PMPrakash Tirumalareddy
10/06/2020, 2:23 PMKartik Khare
10/06/2020, 2:28 PMtimes
value to int64
and tryPrakash Tirumalareddy
10/06/2020, 2:32 PMKartik Khare
10/06/2020, 2:32 PMPrakash Tirumalareddy
10/06/2020, 2:39 PMNeha Pawar
Prakash Tirumalareddy
10/07/2020, 7:52 AMNeha Pawar
Neha Pawar
Prakash Tirumalareddy
10/07/2020, 10:29 PM