When we ingest metadata from different source syst...
# getting-started
w
When we ingest metadata from different source systems in an organization, it's possible that at the origin a field is called "score", while downstream system may name the same field as "marks". How do we handle it in DataHub to convey to a user that both are same fields? Is there any way to capture semantics in DataHub?
b
@witty-keyboard-20400 This is precisely what the Business Glossary is meant to address. It allows you to declare a controlled vocabulary of terms and relationships which can subsequently be associated with real-world data in a manner similar to tagging
w
@big-carpet-38439 Thanks for the input. Could you please point me to any example .yml and .json where I could see the structure of Business Glossary ingestion to DataHub?
b
Yeah of course
Here is a sample business glossary showing how you can define a Business Glossary in YML: https://gist.github.com/gabe-lyons/a9cbfd03fd4b30c5e1ac8d69ea0f7463 Here is a short video demoing what that glossary would look like once it was ingested in datahub: https://www.loom.com/share/3ddbcf86975c468d9a0c55d2428eab81
🙌 1
1