jorge-gt3
07/23/2025, 8:56 PMkapa.ai
07/23/2025, 8:57 PMINSERT
, UPDATE
, and DELETE
. Changes from operations like TRUNCATE
or ALTER
are not captured. - Data must be in tables, not views, and CDC must be enabled on each table you want to sync. If new tables or columns were added, ensure CDC is enabled for them and that Airbyte is aware of the schema changes. For new columns, you may need to disable and re-enable CDC on the table to ensure the new column is tracked and included in the sync ([MSSQL CDC docs](https://docs.airbyte.com/integrations/sources/mssql#change-data-capture-cdc), [CDC limitations](https://docs.airbyte.com/platform/next/understanding-airbyte/cdc#limitations), [Adding columns to CDC](https://docs.airbyte.com/integrations/sources/mssql/mssql-troubleshooting#adding-columns-to-existing-tables-with-cdc)). 2. Review Retention Period of CDC Data - By default, SQL Server retains CDC data for only three days. If your initial or incremental sync takes longer, you may lose change data before it is synced, resulting in missing records. Increase the retention period using the sp_cdc_change_job
stored procedure and restart the cleanup job to avoid data loss ([Extend CDC retention](https://docs.airbyte.com/integrations/sources/mssql#4-extend-the-retention-period-of-cdc-data)). 3. Check for Schema Changes - If columns were added to tables after CDC was enabled, those columns may not be tracked unless CDC is re-enabled for the table. This can result in missing data for those columns or even entire records if the schema is not aligned ([Adding columns to CDC](https://docs.airbyte.com/integrations/sources/mssql/mssql-troubleshooting#adding-columns-to-existing-tables-with-cdc)). 4. Investigate Sync Mode Limitations - The Incremental Sync - Append mode relies on a cursor column. If records are updated without updating the cursor field, those changes will not be picked up. Also, if multiple changes occur between syncs and the sync frequency is not granular enough, some changes may be missed ([Incremental sync limitations](https://docs.airbyte.com/platform/using-airbyte/core-concepts/sync-modes/incremental-append#known-limitations)). 5. Review Logs and Airbyte Job Output - Check the Airbyte job logs for any warnings or errors that might indicate records were skipped or filtered out. Look for any mention of records being filtered, connection issues, or schema mismatches. 6. Consider Initial Snapshot and CDC Overlap - During the initial sync, Airbyte takes a snapshot of the current state and then starts reading CDC logs. If there is a gap between the snapshot and CDC log reading, changes made during that window may be missed. This is a known challenge in CDC-based syncs and can lead to missing records if not handled carefully ([Forum discussion on missing rows](https://github.com/airbytehq/airbyte/issues/53614)). 7. Resource and Performance Issues - For very large tables, performance bottlenecks or timeouts can cause incomplete syncs. Ensure your Airbyte deployment has sufficient resources and consider increasing parallelism if using Kubernetes ([Forum post on slow initial load](https://discuss.airbyte.io/t/1686)). Summary of Actions: - Verify CDC is enabled and configured correctly on all relevant tables and columns. - Increase CDC retention period to cover the full duration of your sync. - Check for schema changes and re-enable CDC if needed. - Review Airbyte logs for errors or skipped records. - Ensure your sync frequency and resources are sufficient for your data volume. If after these checks the issue persists, consider running a Full Refresh sync to reconcile the record counts, and open a support ticket with Airbyte including your logs and configuration details for further assistance.jorge-gt3
07/23/2025, 8:58 PMkapa.ai
07/23/2025, 9:01 PMkapa.ai
07/23/2025, 9:01 PM