Hello, we are evaluating Airbyte and I have a few questions before I start running a full-fledged POC. Thank you in advance for taking the time to share your thoughts and knowledge:
• Our main use case for Airbyte is CDC from MySQL and Postgres. I see that Airbyte is using Debezium 1.4.2, and it seems like an upgrade to the latest version is not on the radar and would potentially be a big lift. For context, we have multiple databases, with more on the way, and process on the order of 500 million rows every day.
◦ I would love some feedback on the community's experience with the CDC sources in general.
◦ Most of the Debezium issues I have seen reported that affect versions >= 1.4.2 and are likely to impact us involve failures to parse the logs when they contain DDL statements. Have people run into similar issues?
• I see that the PR to use a configurable backend for secrets has been merged. Is the functionality available in the open-source version?
• I have not been able to confirm this, but it seems that CDC works by having the first connection to the source take a full snapshot of the data. This is a big no-no for us on the production DB, which is huge. Can the source connector be configured to read only from a given position in the transaction log (binlog position/LSN)?