Hi all. I'm new to DataHub and have been exploring...
# advice-data-governance
b
Hi all. I'm new to DataHub and have been exploring the lineage capability, around which I have a couple of questions. 1. Is there any way we could automatically discover the lineage when metadata is ingested? 2. Is there any way we can achieve automation using Java SDK. I took a look at the documentation but there's only the Python way. Any help would be appreciated. Thanks
m
@best-umbrella-88325 which data sources are you looking to ingest lineage from?
for Java SDK, have you looked at this doc already? https://datahubproject.io/docs/metadata-integration/java/as-a-library
b
I'm trying to ingest lineage from AWS Glue , MySQL for the majority part.
Also, I did see this documentation, however it doesn't contain anything around lineage. It's more into entity management.
g
It's more into entity management.
That's effectively how you emit any metadata, python has a few more examples for how different aspects of an entity are emitted if you want more examples: https://github.com/datahub-project/datahub/tree/master/metadata-ingestion/examples/library The same set of aspects can be emitted through java