Anton Podviaznikov
05/21/2022, 8:47 AM
49.25 GB | 32,773,497 emitted records | 32,773,497 committed records | 2h 32m 18s | Sync
So the speed is around 5MB/s.
How would I make it faster?
I increased
SUBMITTER_NUM_THREADS=40
MAX_SYNC_WORKERS=20
as was described here https://discuss.airbyte.io/t/scaling-airbyte-on-k8s-increased-job-parallelism/826
But I wasn't sure how to increase the number of workers.
Also what else can I tune to make jobs go faster?
Anton Podviaznikov
05/21/2022, 2:37 PM
airbyte-worker?
Marcos Marx (Airbyte)
05/23/2022, 5:26 PM
Anton Podviaznikov
05/23/2022, 9:07 PM
0.38.4-alpha. So this one should have dynamic rows fetching?
Davin Chia (Airbyte)
05/25/2022, 9:22 AM
Liren Tu (Airbyte)
05/25/2022, 9:42 PM
0.4.12 may be helpful for performance, but it is unlikely that it will have more than 2x impact.
32,773,497 committed records in 2h 32m 18s means about 3.6K rows per second.
Based on our internal benchmark for Postgres, this velocity falls within the normal range. So right now we don’t have any magic wand to make it much faster than that.
We are working on improving the performance of our Postgres connector. But this is not a trivial task. So it may take a while.
This issue is tracked here: https://github.com/airbytehq/airbyte/issues/12532
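For reference, the throughput figures discussed in this thread can be re-derived from the sync summary. Below is a minimal Python sketch, assuming 1 GB = 1000 MB; the variable names are illustrative only. With the full duration of 2h 32m 18s (9,138 seconds) it works out to roughly 3.6K rows per second and about 5.4 MB/s.

```python
# Figures copied from the sync summary posted at the top of the thread.
records = 32_773_497                   # committed records
size_gb = 49.25                        # data volume reported by the sync
elapsed_s = 2 * 3600 + 32 * 60 + 18    # 2h 32m 18s expressed in seconds

rows_per_s = records / elapsed_s
mb_per_s = size_gb * 1000 / elapsed_s  # assumption: 1 GB = 1000 MB

print(f"elapsed: {elapsed_s} s")
print(f"rows/s:  {rows_per_s:.0f}")
print(f"MB/s:    {mb_per_s:.2f}")
```

Using 1 GB = 1024 MB instead raises the MB/s figure slightly but does not change the rows-per-second result.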