wonderful-notebook-20086
09/27/2022, 7:18 PM
getting-started
docker container images based on the Quickstart guide
I tried setting up a connection to our RS cluster and ran into this error:
'2022-09-27 18:30:45.362431 [exec_id=fec3ab48-c33b-4403-abfc-f61720c609ae] INFO: Starting execution for task with name=RUN_INGEST',
'2022-09-27 18:47:03.670827 [exec_id=fec3ab48-c33b-4403-abfc-f61720c609ae] INFO: Caught exception EXECUTING '
'task_id=fec3ab48-c33b-4403-abfc-f61720c609ae, name=RUN_INGEST, stacktrace=Traceback (most recent call last):\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 182, in execute\n'
' await tasks.gather(_read_output_lines(), _report_progress(), _process_waiter())\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 126, in '
'_read_output_lines\n'
' full_log_file.write(line)\n'
'OSError: [Errno 28] No space left on device\n'
'\n'
'During handling of the above exception, another exception occurred:\n'
'\n'
'OSError: [Errno 28] No space left on device\n'
'\n'
'During handling of the above exception, another exception occurred:\n'
'\n'
'Traceback (most recent call last):\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/default_executor.py", line 123, in execute_task\n'
' task_event_loop.run_until_complete(task_future)\n'
' File "/usr/local/lib/python3.10/asyncio/base_events.py", line 646, in run_until_complete\n'
' return future.result()\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 188, in execute\n'
' full_log_file.close()\n'
'OSError: [Errno 28] No space left on device\n'
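The traceback shows the executor failing while writing its log file, so a first check is how much free space the volume holding that file has. Below is a minimal sketch using only the Python standard library. The function name `free_space_report`, the 1 GiB threshold, and the `/tmp` path are assumptions for illustration; the directory the executor actually writes to is not stated in this thread.

```python
import shutil


def free_space_report(path="/", min_free_bytes=1024**3):
    """Return total/free space for the filesystem containing `path`.

    `min_free_bytes` is an arbitrary illustrative threshold (1 GiB by default).
    """
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    gib = 1024**3
    return {
        "path": path,
        "total_gib": round(usage.total / gib, 2),
        "free_gib": round(usage.free / gib, 2),
        "low": usage.free < min_free_bytes,
    }


if __name__ == "__main__":
    # "/tmp" is a guess at where the log file lives; adjust as needed.
    print(free_space_report("/tmp"))
```

Running this inside the container that executes the ingestion (rather than on the host) is what matters, since the container may have its own, smaller writable layer.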
Ingestion recipe yml looks something like this:
source:
    type: redshift
    config:
        start_time: '2022-09-26 00:00:00Z'
        end_time: '2022-09-26 12:00:00Z'
        table_lineage_mode: mixed
        include_table_lineage: true
        database: insightsetl
        password: '${etl2_test_datahub_creds}'
        profiling:
            enabled: true
        host_port: 'pi-redshift-etl-2-test.ccvpgkqogsrc.us-east-1.redshift.amazonaws.com:8192'
        stateful_ingestion:
            enabled: true
        username: datahub_ingestion
pipeline_name: 'urn:li:dataHubIngestionSource:93b5640d-8ed3-456e-89f9-0ec3def38733'
I'm not sure if it's a container issue or config or something else.
gray-shoe-75895
09/28/2022, 12:51 AM
gray-shoe-75895
09/28/2022, 12:52 AM
wonderful-notebook-20086
09/28/2022, 5:47 PM
wonderful-notebook-20086
09/28/2022, 8:48 PM
wonderful-notebook-20086
09/28/2022, 9:05 PM
I set include_copy_lineage to false and I was finally able to get a successful ingestion run.
source:
    type: redshift
    config:
        table_lineage_mode: stl_scan_based
        include_table_lineage: true
        include_copy_lineage: false
        database: insightsetl
        password: '${etl2test}'
        profiling:
            enabled: false
        host_port: 'pi-redshift-etl-2-test.ccvpgkqogsrc.us-east-1.redshift.amazonaws.com:8192'
        stateful_ingestion:
            enabled: false
        username: datahub_ingestion
pipeline_name: 'urn:li:dataHubIngestionSource:ef3016df-5b79-48fa-be92-885b2eba0ff0'
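Several settings differ between the failing recipe and this one, so it can help to list them side by side. The sketch below compares the two `config` sections as plain Python dicts; the helper names `flatten` and `diff_configs` and the `UNSET` marker are made up for illustration, and connection and credential fields are left out. The values are copied from the two recipes in this thread.

```python
UNSET = "<unset>"  # marker for a key that appears in only one recipe


def flatten(d, prefix=""):
    """Flatten nested dicts into dotted keys, e.g. 'profiling.enabled'."""
    flat = {}
    for key, value in d.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat


def diff_configs(old, new):
    """Return {key: (old_value, new_value)} for every key whose value differs."""
    old_flat, new_flat = flatten(old), flatten(new)
    changes = {}
    for key in sorted(old_flat.keys() | new_flat.keys()):
        before = old_flat.get(key, UNSET)
        after = new_flat.get(key, UNSET)
        if before != after:
            changes[key] = (before, after)
    return changes


failing = {
    "start_time": "2022-09-26 00:00:00Z",
    "end_time": "2022-09-26 12:00:00Z",
    "table_lineage_mode": "mixed",
    "include_table_lineage": True,
    "profiling": {"enabled": True},
    "stateful_ingestion": {"enabled": True},
}
working = {
    "table_lineage_mode": "stl_scan_based",
    "include_table_lineage": True,
    "include_copy_lineage": False,
    "profiling": {"enabled": False},
    "stateful_ingestion": {"enabled": False},
}

if __name__ == "__main__":
    for key, (before, after) in diff_configs(failing, working).items():
        print(f"{key}: {before!r} -> {after!r}")
```

Because several options changed at once, the diff alone does not show which change made the run succeed; re-enabling them one at a time would be needed to isolate that.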
This is a cluster in our Beta stage into which we copy some sample data from upstream stages, so there are lots of copies happening all the time.
wonderful-notebook-20086
09/28/2022, 9:06 PM
gray-shoe-75895
09/28/2022, 9:08 PM
With include_copy_lineage enabled, did it produce any warnings or errors in the logs?
wonderful-notebook-20086
09/28/2022, 9:09 PM
wonderful-notebook-20086
09/28/2022, 9:09 PM
wonderful-notebook-20086
09/28/2022, 9:10 PM
gray-shoe-75895
09/28/2022, 9:12 PM
wonderful-notebook-20086
09/28/2022, 9:14 PM
wonderful-notebook-20086
09/28/2022, 9:17 PM
gray-shoe-75895
09/28/2022, 9:20 PM
gray-shoe-75895
09/28/2022, 9:26 PM
wonderful-notebook-20086
09/28/2022, 9:31 PM
Here new means “lineage that wasn’t present in the last run, but showed up now” right?
Correct
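The definition of "new" agreed on above amounts to a set difference between two runs. A minimal sketch, assuming lineage can be represented as (upstream, downstream) pairs; the function name `new_lineage` and the table names are hypothetical and are not taken from DataHub.

```python
def new_lineage(previous_run, current_run):
    """Edges present in the current run that were absent from the previous one."""
    return set(current_run) - set(previous_run)


# Hypothetical lineage edges as (upstream, downstream) pairs.
previous = {("stage.orders", "beta.orders")}
current = {("stage.orders", "beta.orders"), ("stage.users", "beta.users")}

if __name__ == "__main__":
    print(new_lineage(previous, current))
```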