Hi everyone - I'm looking for a description of the...
# ingestion
s
Hi everyone - I'm looking for a description of the business glossary file format. In particular, I would like to know which relationships types I can specify among business terms. The example yml file provided only contains the relationship elements "contains" and "inherits" (https://github.com/linkedin/datahub/blob/master/metadata-ingestion/examples/bootstrap_data/business_glossary.yml). My goal is to specify e.g. that "Glossary Term A" RELATES TO "Glossary Term B".
l
@stocky-noon-61140 Currently, these are the only modeled relationships. We can add another relationship type too to the model. Would love to understand the use-case and an example that you had in mind
s
@loud-island-88694 I cannot find the "inherits" and "contains" relationship in the file you referenced.
l
They are named "IsA" for inherits and "HasA" for contains
@green-football-43791 is
isRelatedTerms
a synonym for
IsA
?
s
Ah, I just had the impression that I figured out that: hasRelatedTerms = 'Contains', isRelatedTerms = 'Inherits',
l
So back to your use-case, it looks like we need to add a different relationship type
g
@stocky-noon-61140 exactly- hasRelatedTerms = Contains, isRelatedTerms = Inherits. Do the current relationship types support your use cases?
s
@green-football-43791 @loud-island-88694 I don't really see the difference between "has related terms" and "is related". If GLOSSARY TERM A has a related GLOSSARY TERM B; then GLOSSARY TERM A is also a related term of GLOSSARY TERM B and vice versa. So I don't see the distinction between 'contains' and 'inherits'. Also, from a technical perspective, are there any consequences of having contained / inherited glossary terms? To me it appears that linking glossary terms in this way (either by having "contained" or "inherited" glossary terms) has no effect. Looking forward to your feedback!
g
@stocky-noon-61140 - you are correct- there is no technical effect of these linkings, aside from the related terms appearing in the glossary term's page.
The two related terms lists are for indicating different connections between terms for the benefit of the end user.
The
isA
vs
hasA
of relationships indicate different types of connections.
IsA
indicates that one term is a specification of another term. For example: a
GeoLocation
may have an
isA
relationship with the broader term
Location
. Meanwhile,
GeoLocation
may have a
hasA
relationship with
Latitude
and
Longitude
, since any GeoLocation type will include a latitude and longitude field.