Is there a quick way we can convert realtime segme...
# troubleshooting
e
Is there a quick way we can convert realtime segments to offline segments? Is there any benefit to doing that since the realtime segment is created with star tree, inverted, sorted and text indexes?
n
What’s the motivation for wanting that? We have this project ongoing to move realtime segments to offline table: https://docs.google.com/document/d/1-e_9aHQB4HXS38ONtofdxNvMsGmAoYfSnc2LP88MbIc/edit#
would be helpful if you could take a look at the doc and make sure this would cover what you are looking for
as of now, there’s no way. you’d have to download segments, convert to raw data, upload to offline
e
Sounds good. Thanks for the link!
What do you recommend is the optimal segment size or rowcount?
n
in realtime we recommend 150M as optimal segment size
if you’re uploading back to same table’s OFFLINE component, you need to be careful about the time boundary for the hybrid table
e
Do you mean be careful If there is overlap in the time intervals, it could delete records in one of the realtime or offline segments?
n
yes if there is overlap. be cognizant that the query will be split as such: https://docs.pinot.apache.org/basics/components/broker
what are you trying to achieve?
nothing will be deleted.
e
Just want to get a better understanding, we want to replace realtime segments with sanitized offline segments, or just upload segments. I think I have enough to go on now, thanks for the info!