Hello,
As part of the Datahub adoption in our enterprise, we have a requirement to ingest and display a '_*business name*_' for datasets and schema fields, so that our consumers can understand the business intent behind a given dataset or schema field.
As we look into the current Datahub metadata model and documentation, we wanted to reach out with the options below and seek your thoughts on which would be most appropriate, or whether you have another solution that would meet our business requirement. Please review and advise. If this is not the right channel/thread for this question, please let me know as well. [ Tagging a few others from our team - @aloof-author-52810 @dry-zoo-35797 @nice-window-93693 @aloof-ram-72401 @creamy-postman-32779 @rhythmic-agent-54913 @swift-nail-32514 ]
a. Utilize "*jsonProps*" at the schema field level to represent the business name of a given field. Per the definition in Datahub: "For schema fields that have other properties that are not modeled explicitly. Use this field to serialize those properties into a JSON string". Today, "jsonProps" is not searchable, but we could make it searchable and submit an upstream PR to Datahub. We would also need to change the UI to display "jsonProps" against each field in the schema tab of a given dataset. For the dataset itself, use "*customProperties*" at the dataset level to represent its business name.
b. Extend the metadata model by adding a "*business name*" attribute to both the schema field definition and the dataset definition, so that we can ingest it and display it in the UI. We would have to change the metadata models and the UI, and would make an upstream PR to Datahub if this would provide value to other enterprises as well.
c. Utilize glossaryTerms to represent the "*business name*" of a given schema field and dataset.
d. Utilize Tags to represent the "*business name*" of a given schema field and dataset.
e. Any other proposal as advised by Datahub.
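To make option (a) concrete, here is a rough sketch of the payload shape we have in mind. It uses plain Python dicts rather than the Datahub SDK, and the "businessName" key and both helper functions are our own hypothetical names, not part of the Datahub model. Only "fieldPath", "nativeDataType", "jsonProps" and "customProperties" are meant to mirror existing model attributes.

```python
import json


def field_with_business_name(field_path, native_type, business_name):
    """Build a plain-dict stand-in for a schema field (not an SDK object).

    The business name is serialized into the jsonProps string under the
    hypothetical key "businessName".
    """
    return {
        "fieldPath": field_path,
        "nativeDataType": native_type,
        "jsonProps": json.dumps({"businessName": business_name}),
    }


def dataset_custom_properties(business_name):
    """Build the dataset-level customProperties map (string to string)."""
    return {"businessName": business_name}


# Illustrative values only.
field = field_with_business_name("cust_id", "VARCHAR(32)", "Customer Identifier")
props = dataset_custom_properties("Customer Master")

# A UI or search indexer would need to parse jsonProps to get the name back.
print(json.loads(field["jsonProps"])["businessName"])
print(props["businessName"])
```

The sketch also shows the trade-off in option (a): because "jsonProps" is an opaque JSON string, anything that searches or displays the business name has to parse it first.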