Anyone know what this error means? I am using the...
# troubleshoot
c
Anyone know what this error means? I am using the Snowflake operator
Copy code
'failures': {'Stateful Ingestion': ['Fail safe mode triggered, entity difference percent:100.0 > fail_safe_threshold:{self.stateful_ingestion_config.fail_safe_threshold}']},
and I have this in my yaml:
Copy code
stateful_ingestion:
            enabled: false
b
@gray-shoe-75895 Can assist you here! Which version of the CLi are you on? We recently submitted a PR to address this exact issue!
l
I’m seeing the same error. I am running a bigquery ingestion with stateful_ingestion enabled. I'm running 0.9.2
Copy code
" 'failures': {'Stateful Ingestion': ['Fail safe mode triggered, entity difference percent:99.4413407821229 > "
g
I believe these issues are resolved in acryl-datahub 0.9.3
l
Got it. I will try 0.9.3 Thanks!
c
@gray-shoe-75895 I am facing this error right now in DataHub
v0.10.0
Some datasets are failing due to
difference percent
😞
To test if the issue was related to drastic changes in the datasets I did the following test: 1. Disabled stateful ingestion and run the ingestion 2. Turn the stateful ingestion on after a successful full ingestion and run the ingestion again After disabling the stateful ingestion no errors occur, as expected. Afterwards enabling the stateful ingestion again make the ingestion job fail again. 😞 I would expect the next run after a full ingestion to succeed. Since the previous ingestion was a full one, doesn’t it mean no drastic changes occurred in between? Or is my assumption wrong?
g
We disabled that whole fail_safe_threshold thing because it was causing a number of issues. Can you check if the specific ingestion is running with a different cli version (this is different than the main datahub app version, and will usually have a 4 digit version number)
c
I will check this and let you know
Indeed we are using an older version. 😮
Copy code
Cli report:
{'cli_version': '0.9.0',
 'cli_entry_location': '/tmp/datahub/ingest/venv-glue-0.9.0/lib/python3.10/site-packages/datahub/__init__.py',
 'py_version': '3.10.9 (main, Jan 23 2023, 22:32:48) [GCC 10.2.1 20210110]',
 'py_exec_path': '/tmp/datahub/ingest/venv-glue-0.9.0/bin/python3',
 'os_details': 'Linux-5.10.167-147.601.amzn2.x86_64-x86_64-with-glibc2.31',
 'mem_info': '105.59 MB'}
We recently upgraded DataHub to 0.10.0. It seems that in the ingestion pipeline setup, in the advanced tab, the version of the CLI still 0.9.0 😮 Looks like we need to change it and enforce the correct version we want in the CLI after upgrading DataHub
g
If you manually set a version in the UI (https://datahubproject.io/docs/ui-ingestion/#advanced-running-with-a-specific-cli-version), we’ll respect that. If you leave that field blank, it will get automatically updated when you upgrade the rest of the app