Lee Wei Hern Jason
11/07/2022, 7:08 AMCaught exception while fetching segment from: <http://ip-10-110-217-232.ap-southeast-1.compute.internal:8097/segments/transportSurgeMirrorMetric_REALTIME/transportSurgeMirrorMetric__0__151__20221027T0840Z> to: /mnt/data/pinot/index/transportSurgeMirrorMetric_REALTIME/transportSurgeMirrorMetric__0__151__20221027T0840Z.tar.gz
This issue arise when i tried to restart all my Pinot’s servers. The segments are present in both servers which im not too sure why is it trying to download from its peer. Thanks in advance 🙏Mayank
Lee Wei Hern Jason
11/07/2022, 3:12 PMMayank
Lee Wei Hern Jason
11/07/2022, 3:45 PMERROR [MessageGenerationPhase] [HelixController-pipeline-default-stg-mimic-pinot-(e22acb6a_DEFAULT)] Event e22acb6a_DEFAULT : Unable to find a next state for resource: transportSurgeMirrorMetric_REALTIME partition: transportSurgeMirrorMetric__0__151__20221027T0840Z from stateModelDefinitionclass org.apache.helix.model.StateModelDefinition from:ERROR to:ONLINE
Broker logs:
Failed to find servers hosting segment: transportSurgeMirrorMetric__0__151__20221027T0840Z for table: transportSurgeMirrorMetric_REALTIME (all ONLINE/CONSUMING instances: [Server_ip-10-110-217-232.ap-southeast-1.compute.internal_8098] and OFFLINE instances: [] are disabled, counting segment as unavailable)
Server logs below:
2022/11/07 04:12:33.383 INFO [HelixStateTransitionHandler] [HelixTaskExecutor-message_handle_thread_10] Instance Server_ip-10-110-222-230.ap-southeast-1.compute.internal_8098, partition transportSurgeMirrorMetric__0__151__20221027T0840Z received state transition from OFFLINE to ONLINE on session 2007a773b0d0022, message id: defaccf1-dd6a-4d89-9a1f-dcdb5340d2e5
2022/11/07 04:12:33.521 ERROR [transportSurgeMirrorMetric_REALTIME-RealtimeTableDataManager] [HelixTaskExecutor-message_handle_thread_10] Caught exception while loading segment: transportSurgeMirrorMetric__0__151__20221027T0840Z, downloading a new copy
java.lang.RuntimeException: java.io.FileNotFoundException: /mnt/data/pinot/index/transportSurgeMirrorMetric_REALTIME/transportSurgeMirrorMetric__0__151__20221027T0840Z/v3/metadata.properties (Too many open files)
2022/11/07 04:12:33.537 INFO [S3PinotFS] [HelixTaskExecutor-message_handle_thread_10] Copy <s3://stg-pinot-archive/stg-mimic-pinot/controller-data/transportSurgeMirrorMetric/transportSurgeMirrorMetric__0__151__20221027T0840Z> to local /mnt/data/pinot/index/transportSurgeMirrorMetric_REALTIME/transportSurgeMirrorMetric__0__151__20221027T0840Z.tar.gz
2022/11/07 04:12:34.077 WARN [PinotFSSegmentFetcher] [HelixTaskExecutor-message_handle_thread_10] Caught exception while fetching segment from: <s3://stg-pinot-archive/stg-mimic-pinot/controller-data/transportSurgeMirrorMetric/transportSurgeMirrorMetric__0__151__20221027T0840Z> to: /mnt/data/pinot/index/transportSurgeMirrorMetric_REALTIME/transportSurgeMirrorMetric__0__151__20221027T0840Z.tar.gzLee Wei Hern Jason
11/08/2022, 7:57 AMMayank
Lee Wei Hern Jason
11/09/2022, 12:13 AMLee Wei Hern Jason
11/09/2022, 9:43 AM2022/11/07 04:12:33.521 ERROR [transportSurgeMirrorMetric_REALTIME-RealtimeTableDataManager] [HelixTaskExecutor-message_handle_thread_10] Caught exception while loading segment: transportSurgeMirrorMetric__0__151__20221027T0840Z, downloading a new copy
java.lang.RuntimeException: java.io.FileNotFoundException: /mnt/data/pinot/index/transportSurgeMirrorMetric_REALTIME/transportSurgeMirrorMetric__0__151__20221027T0840Z/v3/metadata.properties (Too many open files)
And it is unable to dl from deep store or from peer cause the segment is available on local and replacing the file is not allowed ?
Resetting the segment solves the issue, as it brings it back to OFFLINE state and restart the process of finding the segment on local which now is successful in doing so.Xiang Fu
Lee Wei Hern Jason
11/10/2022, 9:07 AM65536 .
I have about 3453 segments, and assuming that all of the segment files are open, [columns.psf creation.meta index_map metadata.properties], it is around 14k open files which is far from the limit. Not too sure why it is hitting the limitLee Wei Hern Jason
11/10/2022, 9:41 AMXiang Fu