Hi All! On a non Neo4j deployment, I saw datahub stored lineage in MySQL tables as upstreamLineage aspect.. Are there there other places where the lineage is also stored? I deleted all upstreamlineage rows from the backend database, but i still see Lineage in DataHub ui...
g
green-football-43791
10/22/2021, 7:36 PM
hey @kind-dawn-17532 - in non-neo4j deployments lineage data is indexed in elasticsearch
green-football-43791
10/22/2021, 7:36 PM
this allows datahub to look up lineage in both direction performantly.
green-football-43791
10/22/2021, 7:38 PM
if you run the restore indices command, you'll see elastic catch up with mysql
l
loud-island-88694
10/22/2021, 9:08 PM
In general, deleting rows directly from MySQL will lead to unpredictable results. It is better to use rollbacks if you want to undo certain ingestion runs.
➕ 1
loud-island-88694
10/22/2021, 9:09 PM
We will keep evolving the storage layer and you may break your installation if you delete the wrong rows
k
kind-dawn-17532
10/26/2021, 4:11 PM
Thanks! @green-football-43791 & @loud-island-88694! I had my upstream lineage created by posting multiple MCEs which were not a part of ingestion job.. I guess i need to wrap my MCEs in ingestion pipeline..