Join Slack
Powered by
Hi, another question about realtime2offlinesegment...
# troubleshooting
a
Alice
07/06/2022, 1:15 PM
Hi, another question about realtime2offlinesegmentstask. We found a small bucket time(1h) data would task more than 1 hour to process, especially in mapper phase. Is it normal? Is there some task configures I’m missing here?
Alice
07/06/2022, 1:18 PM
Here is my r2o task config: “taskTypeConfigsMap”: { “RealtimeToOfflineSegmentsTask”: { “bucketTimePeriod”: “1h”, “bufferTimePeriod”: “6h”, “schedule”: “0 0 0/1 * * ?“, “roundBucketTimePeriod”: “1m”, “mergeType”: “rollup”, “value.aggregationType”: “max”, “maxNumRecordsPerSegment”: “2000000" } }
l
Laxman Ch
07/06/2022, 4:24 PM
Yes. depends on your data though. r2o task is
single threaded
. for larger data sets this takes longer time.
m
Mayank
07/06/2022, 6:39 PM
What is the segment size being generated for 1h?
a
Alice
07/07/2022, 12:44 AM
about 10G
Alice
07/07/2022, 12:55 AM
Could it be configured to multiple threads?
Alice
07/07/2022, 2:42 AM
Is there some suggestion on resource assignment for minion instances?
m
Mayank
07/07/2022, 2:43 AM
@Sandeep Dabade
^^
Open in Slack
Previous
Next