Shah Newaz Khan
02/04/2022, 6:22 AM
_airbyte_tmp tables show up in the target dataset, and it looks like the connection from the source is running. However, I don't see any of the .avro files accumulating in GCS, and the _airbyte_tmp tables are empty. I have set the GCS staging to not delete the tmp files. How can I tell if data is being lifted and shifted?
sync history logs
Harshith (Airbyte)
02/04/2022, 7:22 AM
Shah Newaz Khan
02/04/2022, 4:15 PM
In the airbyte server I see logs such as:
Collecting content into /tmp/toBePublished15261227433470886897.tmp before uploading.
Collecting content into /tmp/toBePublished6742600251625951330.tmp before uploading.
Publishing to S3 (bucket=airbyte-dev-logs; key=job-logging/workspace/2/0/logs.log/20220204161420_airbyte-worker-79f96f596f-k2w7w_f41302792c264c988e4b2dda04be8f09):
Publishing /tmp/toBePublished6742600251625951330.tmp to GCS blob (bucket=production_ddp_airbyte_logs; blob=job-logging/workspace/2/0/logs.log/20220204161420_airbyte-worker-79f96f596f-k2w7w_f41302792c264c988e4b2dda04be8f09):
2022-02-04 16:14:27 INFO i.a.w.DefaultReplicationWorker(lambda$getReplicationRunnable$3):251 - Records read: 4874000
2022-02-04 16:46:57 destination > 2022-02-04 16:46:57 INFO a.m.s.StreamTransferManager(uploadStreamPart):558 - [Manager uploading to mix_panel_landing/bq_staging/cohort_members/2022_02_04_1643939039719_0.avro with id ABPnzm4SM...6BefCfix0]: Finished uploading [Part number 205 containing 50.04 MB]
However, I don't see those .avro files in GCS.
2022-02-04 23:56:22 INFO i.a.w.p.KubePodProcess(exitValue):710 - Closed all resources for pod source-zendesk-support-sync-2-0-wenpm
2022-02-04 23:56:22 ERROR i.a.w.DefaultReplicationWorker(run):141 - Sync worker failed.
java.util.concurrent.ExecutionException: java.lang.RuntimeException: java.lang.RuntimeException: Cannot find pod while trying to retrieve exit code. This probably means the Pod was not correctly created.
mixpanel sync pod:
kubectl logs pod/source-mixpanel-sync-3-1-dcuis -f -n airbyte main
Using existing AIRBYTE_ENTRYPOINT: python /airbyte/integration_code/main.py
Waiting on CHILD_PID 7
PARENT_PID: 1
Heartbeat to worker failed, exiting...
received ABRT
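One rough way to tell from the logs alone whether data is moving is to tally the "Records read" and "Finished uploading" lines quoted above. The following is a minimal sketch, not an Airbyte tool: the function name summarize_sync_log is hypothetical, and the regular expressions are modeled only on the two log lines shown in this thread, so other Airbyte versions may format them differently.

```python
import re

# Patterns modeled on the two log lines quoted in this thread (assumption:
# other Airbyte versions may word these lines differently).
RECORDS_READ = re.compile(r"Records read: (\d+)")
PART_UPLOADED = re.compile(
    r"uploading to (\S+) with id .*"
    r"Finished uploading \[Part number (\d+) containing ([\d.]+) MB\]"
)


def summarize_sync_log(lines):
    """Return (highest record count seen, per-object upload totals).

    `summarize_sync_log` is a hypothetical helper, not part of Airbyte.
    """
    records_read = 0
    uploads = {}  # staging object key -> {"parts": highest part no., "mb": MB seen}
    for line in lines:
        match = RECORDS_READ.search(line)
        if match:
            records_read = max(records_read, int(match.group(1)))
        match = PART_UPLOADED.search(line)
        if match:
            key = match.group(1)
            entry = uploads.setdefault(key, {"parts": 0, "mb": 0.0})
            entry["parts"] = max(entry["parts"], int(match.group(2)))
            entry["mb"] += float(match.group(3))
    return records_read, uploads
```

Feeding it the lines of a sync log (for example, a saved copy of the job log) gives the latest record count and, per staging object, the highest part number and the total size of the parts whose upload lines appear in the log. If both numbers keep growing between runs of the script, data is being read and uploaded even when the final .avro object is not yet listed in the bucket.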
Harshith (Airbyte)
02/07/2022, 8:12 AM
Shah Newaz Khan
02/07/2022, 4:36 PM
Harshith (Airbyte)
02/09/2022, 10:34 AM
Shah Newaz Khan
02/09/2022, 9:52 PM
full refresh instead of incremental sync), and did mark itself as succeeded (after extending the pod sweeper pod to clean up successful worker sync pods at 48h instead of 2h); however, each subsequent sync does the entire backfill
Harshith (Airbyte)
02/11/2022, 6:44 AM
Shah Newaz Khan
02/18/2022, 12:04 AM
48h helped