Hello all, I’ve started to ingest data from Kafka ...
# ingestion
p
Hello all, I’ve started ingesting data from Kafka and Hive. Every dataset's origin is set to “PROD”. Is it possible to adjust that in the ingestion recipes, and how can I change it after ingestion? Thank you
b
I assume you mean removing "prod" from the UI browsing hierarchy that leads to the dataset? For instance: prod > hive > your-DB > your_table? There is a transformer you can apply in the ingestion recipe to remove it, so that the path becomes hive > your-DB > your_table.
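A rough sketch of what that could look like, written as a Python dict that mirrors the recipe layout. It assumes the `set_dataset_browse_path` transformer with its `path_templates` and `replace_existing` options; the helper name, host, and server URL are made up for illustration, so check the transformer docs for your DataHub version before relying on it.

```python
import json


def recipe_without_env_in_browse_path(host_port: str, server: str) -> dict:
    """Build a recipe dict whose browse path omits the env segment.

    Hypothetical helper for illustration; the keys mirror a YAML recipe.
    """
    return {
        "source": {"type": "hive", "config": {"host_port": host_port}},
        "transformers": [
            {
                # Assumed transformer name and options; verify against your version.
                "type": "set_dataset_browse_path",
                "config": {
                    "replace_existing": True,
                    # No ENV variable in the template, so "prod" drops out of the path.
                    "path_templates": ["/PLATFORM/DATASET_PARTS"],
                },
            }
        ],
        "sink": {"type": "datahub-rest", "config": {"server": server}},
    }


if __name__ == "__main__":
    # Placeholder host and server values.
    recipe = recipe_without_env_in_browse_path("localhost:10000", "http://localhost:8080")
    print(json.dumps(recipe, indent=2))
```

This only changes how you browse to the dataset. The dataset itself still carries whatever env it was ingested with.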
p
Thank you for the answers! I will do that and let you know the results. Thank you @better-orange-49102 @square-activity-64562
Is it possible to update the dataset after ingestion? @mammoth-bear-12532 @big-carpet-38439
b
Yes, absolutely. Ingesting the data again from the same dataset will create a new "version". You can see the versions in the UI or by querying the MySQL database that DataHub uses.
p
Thank you @better-orange-49102!
m
@polite-flower-25924 what is the issue with the env that you previously set? Is it incorrect?
The dataset urn includes the origin (env), so you can’t really change the dataset’s identifier, however you CAN change how you navigate to it by customizing the browse path aspect.
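To make that concrete, here is a small sketch of how the env ends up inside the dataset urn. The `dataset_urn` helper is a hypothetical stand-in written for this example, not the library's own function, and the dataset name is a placeholder.

```python
def dataset_urn(platform: str, name: str, env: str = "PROD") -> str:
    """Format a dataset urn; the env is the last field of the identifier."""
    return f"urn:li:dataset:(urn:li:dataPlatform:{platform},{name},{env})"


prod_urn = dataset_urn("hive", "your-DB.your_table")
dev_urn = dataset_urn("hive", "your-DB.your_table", env="DEV")

print(prod_urn)
print(dev_urn)
# The same table under two envs yields two different identifiers.
print(prod_urn != dev_urn)
```

Since the two urns differ, re-ingesting with a different env creates a separate dataset entity instead of updating the one that already exists under PROD.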
p
@mammoth-bear-12532 actually, I didn’t set anything. It’s automatically set to PROD.
I'm just ingesting the metadata from a dev environment, but it’s populated with “PROD” even though I didn’t specify that.
m
Right, the default env for the connectors is PROD.
You will have to specify
env: DEV
in the source config of the recipe to get the DEV env.
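As a sketch of where that setting lives, the snippet below models a recipe as a Python dict and sets `env` under the source config. The `with_env` helper and the host and server values are hypothetical, for illustration only; in a YAML recipe the same key sits under `source.config`.

```python
import copy


def with_env(recipe: dict, env: str) -> dict:
    """Return a copy of the recipe with env set in the source config."""
    updated = copy.deepcopy(recipe)
    updated["source"].setdefault("config", {})["env"] = env
    return updated


# Placeholder recipe; without an explicit env the connector defaults to PROD.
hive_recipe = {
    "source": {"type": "hive", "config": {"host_port": "localhost:10000"}},
    "sink": {"type": "datahub-rest", "config": {"server": "http://localhost:8080"}},
}

dev_recipe = with_env(hive_recipe, "DEV")
print(dev_recipe["source"]["config"])
```

The original recipe is left untouched, so the same base dict can be reused for the Kafka and Hive sources with different env values.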