Hello,
Our reporting setup is Tableau on top of Redshift, and we are ingesting metadata from both these systems. I noticed that empty duplicate redshift datasets got created in Datahub from the Tableau ingestion process. Because of this our lineage is fragmented. Looking for your help on the following points:
• What must be causing this?
• What should we do to fix it?
• What needs to be done, so that this does not happen again?
Example of the situation:
1. urn
lidataPlatform:redshift,_
redshift_database_instance.hr_
schema.hr_table_,PROD
2. urn
lidataPlatform:redshift,_some_other_schema_._
hr_schema.hr_table_,PROD
3. urn
lidataPlatform:redshift,_some_other_schema_._
hr_schema.hr_table_,PROD
# 1 is the correct one that got ingested from the redshift recipe. #2 & #3 are the empty ones that got created via Tableau ingestion.
Please let me know if you need any further details. Thank you for your help.
CC:
@swift-plastic-79414