witty-butcher-82399
06/22/2022, 12:28 PM
Status(removed=True) and the second event for the Ownership aspect (nothing about the status in the second upsert).
• The dataset is wrongly shown in the UI as a valid dataset (not soft-deleted). We have also checked the backend, and the dataset has Status(removed=False).
So if the issue is not in the ingestion, it must be the backend that decides to re-enable the dataset for some reason.
Looking for something to support our assumption, we found this in the source code: https://github.com/datahub-project/datahub/blob/8c8f1b987a0c9fc29f4005aa8d132ad2550f3f05/metadata-io/src/main/java/com/linkedin/metadata/entity/EntityService.java#L1097 I could be wrong, but it looks like in some cases the backend decides to set the removal flag to false. It’s as if it re-enables the dataset because other aspects are being updated. If that’s true, then while it could make sense in some cases, it causes our simple use case to misbehave. WDYT? Could that be the root cause of the issue?
witty-butcher-82399
06/22/2022, 12:29 PM
witty-butcher-82399
06/23/2022, 7:15 AM
dazzling-judge-80093
06/23/2022, 7:17 AM
worried-branch-76677
06/25/2022, 4:35 AM
worried-branch-76677
06/25/2022, 4:37 AM
witty-butcher-82399
06/28/2022, 7:06 AM
witty-butcher-82399
06/28/2022, 12:57 PM
/usr/local/lib/python3.8/site-packages/datahub/ingestion/transformer/add_dataset_browse_path.py:33: DeprecationWarning: Call to deprecated class DatasetTransformer. (Legacy transformer that supports transforming MCE-s using transform_one method. Use BaseTransformer directly and implement the transform_aspect method)
  return cls(config, ctx)
/usr/local/lib/python3.8/site-packages/datahub/ingestion/transformer/add_dataset_ownership.py:174: DeprecationWarning: Call to deprecated class DatasetTransformer. (Legacy transformer that supports transforming MCE-s using transform_one method. Use BaseTransformer directly and implement the transform_aspect method)
However, there are other connectors that don’t complain about deprecated classes, such as pattern_add_dataset_schema_tags.
Our experience is that stateful ingestion does not work properly with connectors that show the deprecation warning, while other connectors (we have only tested the one adding tags to the schema) do work correctly.
However, this somewhat invalidates our previous confirmation that the problem was in the backend and not in the ingestion. So it is quite confusing.
WDYT?
dazzling-judge-80093
06/28/2022, 2:16 PM
witty-butcher-82399
06/28/2022, 2:30 PM
mammoth-bear-12532
dazzling-judge-80093
06/30/2022, 10:41 AM
dazzling-judge-80093
06/30/2022, 10:42 AM
witty-butcher-82399
06/30/2022, 10:47 AM
witty-butcher-82399
06/30/2022, 10:49 AM
dazzling-judge-80093
06/30/2022, 10:49 AM
dazzling-judge-80093
06/30/2022, 10:50 AM
witty-butcher-82399
06/30/2022, 10:51 AM
dazzling-judge-80093
06/30/2022, 10:53 AM
{
  "auditHeader": null,
  "entityType": "dataset",
  "entityUrn": "urn:li:dataset:(urn:li:dataPlatform:bigquery,myproject.partition_test.users,PROD)",
  "entityKeyAspect": null,
  "changeType": "UPSERT",
  "aspectName": "status",
  "aspect": {
    "value": "{\"removed\": true}",
    "contentType": "application/json"
  },
  "systemMetadata": {
    "lastObserved": 1656583428073,
    "runId": "bigquery-2022_06_30-12_03_31",
    "registryName": null,
    "registryVersion": null,
    "properties": null
  }
},
{
  "auditHeader": null,
  "entityType": "dataset",
  "entityUrn": "urn:li:dataset:(urn:li:dataPlatform:bigquery,myproject.partition_test.users,PROD)",
  "entityKeyAspect": null,
  "changeType": "UPSERT",
  "aspectName": "ownership",
  "aspect": {
    "value": "{\"owners\": [{\"owner\": \"urn:li:corpuser:username1\", \"type\": \"PRODUCER\"}, {\"owner\": \"urn:li:corpGroup:groupname\", \"type\": \"PRODUCER\"}], \"lastModified\": {\"time\": 0, \"actor\": \"urn:li:corpuser:unknown\"}}",
    "contentType": "application/json"
  },
  "systemMetadata": {
    "lastObserved": 1656583428073,
    "runId": "bigquery-2022_06_30-12_03_31",
    "registryName": null,
    "registryVersion": null,
    "properties": null
  }
},
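To make the hypothesis from earlier in the thread concrete, here is a minimal sketch that replays the two events above (a status upsert with removed=true, then an ownership upsert) against a toy in-memory store. ToyBackend, reenable_on_other_aspects and replay are hypothetical names invented for this illustration, and the ownership payload is shortened. This is not the real EntityService logic, only a model of the suspected "re-enable on any other aspect write" behaviour.

```python
import json
from typing import Dict, List

URN = "urn:li:dataset:(urn:li:dataPlatform:bigquery,myproject.partition_test.users,PROD)"

# The two events above, reduced to the fields this sketch reads.
# The ownership payload is shortened for brevity.
EVENTS: List[dict] = [
    {"entityUrn": URN, "aspectName": "status",
     "aspect": {"value": '{"removed": true}'}},
    {"entityUrn": URN, "aspectName": "ownership",
     "aspect": {"value": '{"owners": []}'}},
]


class ToyBackend:
    """Hypothetical in-memory aspect store; NOT the real DataHub backend."""

    def __init__(self, reenable_on_other_aspects: bool) -> None:
        # Assumption under test: writing a non-status aspect resets removed to False.
        self.reenable_on_other_aspects = reenable_on_other_aspects
        self.store: Dict[str, Dict[str, dict]] = {}

    def upsert(self, event: dict) -> None:
        aspects = self.store.setdefault(event["entityUrn"], {})
        aspects[event["aspectName"]] = json.loads(event["aspect"]["value"])
        if self.reenable_on_other_aspects and event["aspectName"] != "status":
            aspects["status"] = {"removed": False}

    def is_removed(self, urn: str) -> bool:
        status = self.store.get(urn, {}).get("status", {})
        return bool(status.get("removed", False))


def replay(events: List[dict], reenable_on_other_aspects: bool) -> bool:
    """Apply events in order and report whether the first entity ends up soft-deleted."""
    backend = ToyBackend(reenable_on_other_aspects)
    for event in events:
        backend.upsert(event)
    return backend.is_removed(events[0]["entityUrn"])


if __name__ == "__main__":
    print("with re-enable rule:", replay(EVENTS, True))
    print("without re-enable rule:", replay(EVENTS, False))
```

Under the assumed rule the dataset ends up with removed=False even though the ingestion emitted removed=true first, which would match what is seen in the UI. Without the rule it stays soft-deleted, so the order of the two upserts would matter only if the backend behaves as suspected.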
dazzling-judge-80093
06/30/2022, 10:53 AM
witty-butcher-82399
06/30/2022, 12:56 PM
dazzling-judge-80093
06/30/2022, 12:58 PM
witty-butcher-82399
06/30/2022, 12:59 PM
witty-butcher-82399
07/01/2022, 1:20 PM
dazzling-judge-80093
07/01/2022, 1:21 PM
dazzling-judge-80093
07/01/2022, 1:22 PM
witty-butcher-82399
07/01/2022, 1:25 PM
witty-butcher-82399
07/04/2022, 11:47 AM
witty-butcher-82399
07/06/2022, 3:34 PM
little-megabyte-1074
witty-butcher-82399
07/07/2022, 7:09 AM