David Cyze
09/02/2021, 4:55 PMrealtime.segment.flush.threshold.rows=30
, and I ran my application to the point where I had pushed around 100k rows to a kafka topic before realizing that this was much too small a segment size.
I stopped my app, deleted the table, changed realtime.segment.flush.threshold.rows=100000
, and recreated it. Then, I ran my app to push 3mil rows to the kafka topic.
At some point in Pinot's ingestion process, the status of my table changed to BAD
. I looked in the controller logs and noticed this error:
2021/09/02 16:02:48.585 ERROR [SegmentCompletionFSM_simplejson__0__632__20210902T1602Z] [grizzly-http-server-21] Caught exception while committing segment metadata for segment: simplejson__0__632__20210902T1602Z
java.lang.IllegalStateException: Failed to find IdealState for table: simplejson_REALTIME
In the web UI for the Pinot Controller, under the Cluster Manager for the affected table, I sorted the SEGMENTS
list by Status and noticed that I had two Bad
segments. Inspecting the bad segments, I noticed that each had a total of 30 documents. I checked a handful of Good
segments, and each had 100k documents.
I'm not sure how to bring these segments into a Good
state, or why they entered into a Bad
state in the first place. I was unable to find anything in Pinot's documentation on what causes this error or how to resolve it.Jackie
09/02/2021, 5:51 PMJackie
09/02/2021, 5:52 PMJackie
09/02/2021, 5:53 PMDavid Cyze
09/02/2021, 6:07 PMDavid Cyze
09/02/2021, 6:09 PMJackie
09/02/2021, 6:18 PMJackie
09/02/2021, 6:21 PMDavid Cyze
09/02/2021, 6:28 PMPinotCluster/PropertyStore/Segments
is reported empty from the zookeeper browserJackie
09/02/2021, 6:30 PMJackie
09/02/2021, 6:32 PMsimplejson__0__0__20210902T...
)David Cyze
09/02/2021, 6:34 PMJackie
09/02/2021, 6:34 PMsimplejson__REALTIME
is removedDavid Cyze
09/02/2021, 6:35 PMDavid Cyze
09/02/2021, 6:35 PMJackie
09/02/2021, 6:35 PMDavid Cyze
09/02/2021, 6:38 PMDavid Cyze
09/02/2021, 6:42 PMCONSUMING
, but I would have expected at least one segment to have completed by nowJackie
09/02/2021, 7:08 PM"stream.kafka.consumer.prop.auto.offset.reset"
to be "smallest"
?David Cyze
09/02/2021, 7:09 PMlargest
David Cyze
09/02/2021, 7:11 PM