Hello everyone!
I'm writing an article for medium of a real case implementation of Apache Pinot as a way of studying. For that, I'm using a Strimzi cluster and the Twitter Api Kafka Connector (that you can find at https://www.confluent.io/hub/jcustenborder/kafka-connect-twitter) running on Minikube, to get data from twitter's api and ingest it into Pinot.
I followed the steps explained in this video
https://www.youtube.com/watch?v=Jc03u8rXc2w▾
making some adaptations to run on kubernetes. That way I was able to infer the schema of the "twitter-sample.json" file attached to this message by generating the schema file "twitter-old-schema.json", after that I had to remove some fields:
"schema.type", "schema.fields", "schema.optional", "schema.name",
and remove the prefix "payload." of every column to generate the file "twitter-schema.json". Then with this schema file and with the table config file "twitter-config.json" I created the REALTIME table "twitter-status-events" (using the column "CreatedAt" as datetime column) using pinot-admin.sh inside pinot controller's pod. But for some reason that I don't know I'm not getting any record in this table, do you have any idea what I'm doing wrong ?
(more information replied to this comment)
I also made some tests by only adding some columns instead of adding every column in "twitter-schema.json". By doing that I was able to get every record that was on kafka, but the only column filled with not null values was "CreatedAt".
m
Mayank
07/11/2022, 5:25 PM
Hi @Marlon Félix, let me take a look at your setup, will get back.
👍 1
m
Marlon Félix
07/11/2022, 5:28 PM
thank you!
m
Mayank
07/11/2022, 5:29 PM
Can you share some sample input records, or is that
twitter-sample.json
?
m
Marlon Félix
07/11/2022, 5:30 PM
"twitter-sample.json" is a sample record
👍 1
m
Mayank
07/11/2022, 5:43 PM
Hey @Marlon Félix, seems like you are assuming that you can ingest complex structs using
dot
notation. This is not the case at the moment. For example, you need to extract the nested
User.id
from input to write to a column in pinot.
m
Marlon Félix
07/11/2022, 5:49 PM
I understood. I saw in this video
https://www.youtube.com/watch?v=Jc03u8rXc2w▾
there was this possibility when adding the parameter