witty-butcher-82399
09/09/2022, 2:06 PMprocess_commit function in pipeline.py. It checks if there are errors or not, and depending on that and the commit policy, it will commit or not the checkpoint.
https://github.com/datahub-project/datahub/blob/23b929ea10daded7447f806f8860447626[…]e573a6/metadata-ingestion/src/datahub/ingestion/run/pipeline.py
However, I don’t see such a behaviour with the ingestion events themselves. Which means that ingestion pipeline could be publishing some events via the Sink and not committing the checkpoint.
In my opinion, publishing policy in the Sink should be aligned with committing policy. WDYT?helpful-optician-78938
09/09/2022, 5:31 PMwitty-butcher-82399
11/09/2022, 10:51 AM