Suthira (Praer) Owlarn
05/29/2024, 9:24 AMkapa.ai
05/29/2024, 9:24 AMSuthira (Praer) Owlarn
05/29/2024, 9:32 AMbytesSynced
and rowsSynced
here: https://reference.airbyte.com/reference/getjobSuthira (Praer) Owlarn
05/29/2024, 9:33 AMkapa.ai
05/29/2024, 9:34 AMSELECT
statement, which is effectively a Full Refresh. - For subsequent syncs, Airbyte reads data from the previously synced position in the logs up to the start time of the sync. This means that Airbyte does not treat CDC sources as infinite streaming sources but rather processes the logs incrementally. - Airbyte keeps track of the current log position between syncs, ensuring that it only processes new changes since the last sync. 4. Updating Data for Each Stream: - Airbyte uses the Debezium core engine and Debezium MySQL connector to connect to the database and parse the binlog in JSON format. The JSON records are then passed onto the destination. - The Airbyte Protocol outputs records from sources, and records from UPDATE
statements appear the same way as records from INSERT
statements. This allows for flexible handling of data updates in the destination. For more detailed information, you can refer to the following sources: - [Consuming the MySQL binary log with Debezium](https://airbyte.com/tutorials/mysql-change-data-capture-cdc) - [Syncing Part 1](https://docs.airbyte.com/understanding-airbyte/cdc#syncing) - [Forum Post: Source MySQL CDC binlog files](https://discuss.airbyte.io/t/128) If you have any further questions or need more specific details, feel free to ask!