# advice-metadata-modeling
g
Hi colleagues, we would like to use the DataHub lineage feature to build a pipeline dependency graph. E.g., if some dataset is updated, we want to understand which upstream and downstream pipelines need to be re-run to fix the dependent datasets. So we think GraphQL and lineage would serve our needs. Our question is about which query operation to use. Inspecting the requests the UI sends when you press the Lineage button (in the upper right corner), we see a request to api/v2/graphql with a query whose operation is getEntityLineage. This operation is also present in master: https://github.com/datahub-project/datahub/blob/master/datahub-web-react/src/graphql/lineage.graphql#L308 But the docs say nothing about it: https://datahubproject.io/docs/graphql/queries So could you please advise whether we can rely on this API to get what we want?
e
Hi @gorgeous-author-98854, the `getEntityLineage` call that you're referring to is just an alias that retrieves an entity and then resolves its relationships (both upstream and downstream).
I would recommend you just make a call to get the dataset you want and then resolve its relationships. Please look at the API here: https://datahubproject.io/docs/graphql/objects#dataset
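For illustration, here is a rough Python sketch of that approach: fetch one dataset by URN and ask for its relationships in one direction. Treat it as a starting point under stated assumptions, not a confirmed recipe. The host and port in `GRAPHQL_URL`, the `DownstreamOf` relationship type, the page size of 100, and the helper names (`build_payload`, `related_urns`, `fetch_relationships`) are all assumptions of this sketch, and the field names should be checked against the Dataset object docs linked above for your DataHub version.

```python
import json
import urllib.request

# Assumed endpoint: the thread only mentions the path api/v2/graphql,
# the host and port here are placeholders for your own deployment.
GRAPHQL_URL = "http://localhost:9002/api/v2/graphql"

# Assumed query shape: fetch a dataset, then resolve its relationships.
# "DownstreamOf" is an assumed lineage relationship type; verify it
# (and the field names) against your DataHub GraphQL schema.
QUERY = """
query datasetRelationships($urn: String!, $direction: RelationshipDirection!) {
  dataset(urn: $urn) {
    urn
    relationships(
      input: {types: ["DownstreamOf"], direction: $direction, start: 0, count: 100}
    ) {
      total
      relationships {
        type
        direction
        entity { urn type }
      }
    }
  }
}
"""


def build_payload(urn, direction):
    """Build the JSON body for one direction (INCOMING or OUTGOING)."""
    if direction not in ("INCOMING", "OUTGOING"):
        raise ValueError("direction must be INCOMING or OUTGOING")
    return {"query": QUERY, "variables": {"urn": urn, "direction": direction}}


def related_urns(response):
    """Extract related entity URNs from a decoded GraphQL response."""
    dataset = (response.get("data") or {}).get("dataset") or {}
    result = dataset.get("relationships") or {}
    return [rel["entity"]["urn"] for rel in result.get("relationships") or []]


def fetch_relationships(urn, direction, token=None):
    """POST the query and return the decoded JSON response."""
    headers = {"Content-Type": "application/json"}
    if token:
        # Only needed if your deployment requires an access token.
        headers["Authorization"] = f"Bearer {token}"
    body = json.dumps(build_payload(urn, direction)).encode("utf-8")
    request = urllib.request.Request(GRAPHQL_URL, data=body, headers=headers)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `related_urns(fetch_relationships(urn, "INCOMING"))` and then the same with `"OUTGOING"` would give both sides of the graph for one dataset. Which direction corresponds to upstream and which to downstream depends on how the relationship edge is defined, so it is worth checking against a dataset whose lineage you already know.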
g
@echoing-airport-49548 makes sense. Thanks a lot!
e
No problem, happy to help!