Hello Everyone, we have an internal implementation...
# getting-started
g
Hello Everyone, we have an internal implementation of DataHub working on ingesting data from MSSQL and BigQuery, so far everything has been done through the UI. However, we are expanding our use of the tool a little bit more and are planning on using the Python Emitter SDK. I was wondering if someone can just confirm my understanding of the below points: • SDK Can be used to create Datasets, Fields for those data sets and set various attributes inside the dataset • SDK Can be used to create lineage between datasets both upstream and downstream Thanks in advance and I appreciate you help šŸ™‚
šŸ” 1
šŸ“– 1
l
Hey there šŸ‘‹ I'm The DataHub Community Support bot. I'm here to help make sure the community can best support you with your request. Let's double check a few things first: āœ… There's a lot of good information on our docs site: www.datahubproject.io/docs, Have you searched there for a solution? āœ… button āœ… It's not uncommon that someone has run into your exact problem before in the community. Have you searched Slack for similar issues? āœ… button Did you find a solution to your issue? āŒ Sorry you weren't able to find a solution. I'm sending you some tips on info you can provide to help the community troubleshoot. Whenever you feel your issue is solved, please react āœ… to your original message to let us know!
d
Hello Omar, that is true! you can check related guides here: • create dataset : https://datahubproject.io/docs/api/tutorials/modifying-datasets#prerequisites • add lineage between dataset : https://datahubproject.io/docs/api/tutorials/modifying-dataset-lineage#add-lineage
g
Thank you so much