Hi! Is there a way to have the data streamed from Kafka and then put into S3 in Parquet format?
Mayank
05/04/2021, 11:30 PM
You mean using Pinot? No
Mayank
05/04/2021, 11:30 PM
Using Pinot, you can consume from Kafka and store the data in S3, but it will be in Pinot's index format, not Parquet.
Mus
05/04/2021, 11:32 PM
Got it, thanks! Is there any other service you know of that can handle this at a very large scale?
Mayank
05/04/2021, 11:39 PM
This is usually done via ETL pipelines (as you may need to transform the Kafka topic data before storing it). Depending on your specific requirements, there may be standard solutions out there that you can try.
👍 1
Kishore G
05/05/2021, 12:36 AM
@User Gobblin can do that
Kishore G
05/05/2021, 12:37 AM
There are many projects that can move data from Kafka to S3; Kafka Connect is another option.
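For the Kafka Connect route, a rough sketch of what a Parquet-to-S3 sink could look like, assuming Confluent's S3 sink connector is installed on the Connect workers. The connector name, topic, bucket, and region below are hypothetical placeholders, and the flush size is an arbitrary illustration, not a recommendation. The script only builds the JSON payload that would be sent to the Kafka Connect REST API (`POST /connectors`); it does not contact any cluster.

```python
import json


def build_s3_parquet_sink(name, topic, bucket, region, flush_size=10000):
    """Build a Kafka Connect payload for an S3 sink that writes Parquet files.

    Assumes Confluent's S3 sink connector is available on the Connect workers.
    All argument values are supplied by the caller; nothing here is specific
    to any real deployment.
    """
    return {
        "name": name,
        "config": {
            "connector.class": "io.confluent.connect.s3.S3SinkConnector",
            "storage.class": "io.confluent.connect.s3.storage.S3Storage",
            # Write objects in Parquet rather than the default format.
            "format.class": "io.confluent.connect.s3.format.parquet.ParquetFormat",
            "parquet.codec": "snappy",
            "topics": topic,
            "s3.bucket.name": bucket,
            "s3.region": region,
            # Number of records written per S3 object before it is committed.
            "flush.size": str(flush_size),
            "tasks.max": "1",
            # Note: Parquet output needs records that carry a schema, so the
            # worker or connector must use a schema-aware converter (for
            # example Avro). Those converter settings are omitted here.
        },
    }


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    payload = build_s3_parquet_sink(
        name="example-s3-parquet-sink",
        topic="example-topic",
        bucket="example-bucket",
        region="us-east-1",
    )
    print(json.dumps(payload, indent=2))
```

The printed JSON is what would be submitted to a Connect worker's REST endpoint. Any transforms needed before storage, as mentioned earlier in the thread, would have to be configured separately or handled in an upstream ETL step.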