Syed Farhan Ahmed
09/14/2021, 9:35 AM[DEPRECATED] Marcos Marx
How reliable is this approach of moving data through unix pipes?Do want to compare with what other method of moving data?
Do we have any documented benchmarks or performance/throughput stats of Airbyte?Not yet! There is an issue on Github to create some benchmarks
What is the maximum limit to the data being moved from source to destination? How does Airbyte perform if the data is very large (in GBs lets say)?There is no limit. You can control your resources and transfer the amount of data you want. I know some users who transfer +100Gb in each sync, other who transfer 17Tb of data using Airbyte. For large cases maybe you need to plan and size your instance differently.
Oleksandr Shevchenko
09/15/2021, 7:55 AMSyed Farhan Ahmed
09/15/2021, 10:14 AMDo want to compare with what other method of moving data?using something like a message queue or streaming?
Syed Farhan Ahmed
09/15/2021, 1:45 PMThere is no limit. You can control your resources and transfer the amount of data you want. I know some users who transfer +100Gb in each sync, other who transfer 17Tb of data using Airbyte. For large cases maybe you need to plan and size your instance differently.This happens in batch right? What is the maximum batch size? Because I am assuming that for large data sizes, small batch jobs will slow down the process.
Andrey Morskoy
10/01/2021, 7:10 AMsource-s3
(using minio, patched connector to support endpoint) is 168 KB/s (e2e: from s3 minio to stdout). At the same time, wget
for th same file from minio produces 850 MB/s. Also, I have profiling, so I see hotspots in fact. Could you please share your benchmark ?Andrey Morskoy
10/01/2021, 7:13 AMAndrey Morskoy
10/01/2021, 7:14 AM