Chris Seber
07/18/2024, 4:15 PM
kapa.ai
07/18/2024, 4:16 PM
`Detect and propagate schema changes` is set to `Propagate all changes` or `Propagate columns changes only` and Airbyte detects the schema change as part of a sync. Refreshing the schema manually and applying schema changes will not allow the backfill to occur [[source](https://docs.airbyte.com/using-airbyte/schema-change-management#backfill-new-or-renamed-columns)].

However, if you are experiencing missing records that are not being backfilled, it could be due to a variety of reasons. For instance, if the source API is unreliable, you might want to use checkpoint state to ensure data is not lost between syncs [[source](https://discuss.airbyte.io/t/1420)].

If you are using Salesforce as a source and the formula of a field changes while no other field on the record is updated, you might need to reset the stream and sync a historical backfill to pull in all the updated values of that field [[source](https://docs.airbyte.com/integrations/sources/salesforce#syncing-formula-fields)].

If you are still experiencing issues, it would be best to open a new topic on the Airbyte forum or GitHub issues to get further assistance from the community or the Airbyte team [[source](https://discuss.airbyte.io/t/418)].

Chris Seber
07/18/2024, 4:18 PM
kapa.ai
07/18/2024, 4:18 PM
`Stream.state_checkpoint_interval` property. Here's an example:

```python
from airbyte_cdk.sources.streams import Stream


class MyAmazingStream(Stream):
    # Save the state every 100 records
    state_checkpoint_interval = 100
```

(Source: [Airbyte Documentation](https://docs.airbyte.com/connector-development/cdk-python/incremental-stream#checkpointing-state))

2. Stream Slices: This method is not explicitly mentioned in the provided sources, but it's another way to handle checkpointing in Airbyte.

Remember, the state is checkpointed after a successful read of a stream slice or after reading a batch of records corresponding to `state_checkpoint_interval`. This helps avoid re-reading already successfully synced data. For more detailed information, you can refer to the [Airbyte Protocol Documentation](https://docs.airbyte.com/understanding-airbyte/airbyte-protocol#state--source).

Chris Seber
07/18/2024, 4:27 PM
kapa.ai
07/18/2024, 4:27 PM
kapa.ai
07/18/2024, 4:27 PM
Chris Seber
07/18/2024, 4:40 PM
kapa.ai
07/18/2024, 4:40 PM
kapa.ai
07/18/2024, 4:41 PM
`java.lang.RuntimeException: Unknown state type: xmin` suggests that there's an issue with the xmin state type during the sync process. This error might occur if the system is unable to retrieve the raw state due to some unsupported operations or configurations.

In the context of Airbyte and Postgres, the xmin system column is used for tracking inserts and updates to your source data. However, there are some notable limitations associated with the xmin replication method, such as unsupported DDL operations, performance issues due to full table scans, and issues with Transaction ID (XID) wraparound ([source](https://docs.airbyte.com/integrations/sources/postgres/postgres-troubleshooting#xmin-limitations)).

If you're encountering this error after an Airbyte upgrade, it could be due to some incompatibility or issue with the Postgres connector version or the configuration of the sync process ([source](https://github.com/airbytehq/airbyte/issues/30998)).

To troubleshoot this issue, you might want to:
1. Check the version of your Postgres connector and consider updating it if necessary.
2. Review your sync configurations, particularly those related to the xmin system column and incremental sync.
3. If the issue persists, consider switching to a different replication method, such as CDC replication, especially if your database is experiencing Xmin wraparound ([source](https://docs.airbyte.com/integrations/sources/postgres/postgres-troubleshooting#xmin-wraparound)).

Remember to always back up your data and test any changes in a controlled environment before applying them to your production system.

Chris Seber
07/18/2024, 4:43 PM
kapa.ai
07/18/2024, 4:43 PM
kapa.ai
07/18/2024, 4:43 PM
`java.lang.RuntimeException: Unknown state type: xmin` suggests that there's an issue with the xmin state type during the sync process. This error might occur if the system is unable to retrieve the raw state due to some unsupported operations or configurations.

In the context of Airbyte and Postgres, the xmin system column is used for tracking inserts and updates to your source data. However, there are some notable limitations associated with the xmin replication method, such as unsupported DDL operations, performance issues due to full table scans, and issues with Transaction ID (XID) wraparound ([source](https://docs.airbyte.com/integrations/sources/postgres/postgres-troubleshooting#xmin-limitations)).

If you're encountering this error after an Airbyte upgrade, it could be due to some incompatibility or issue with the Postgres connector version or the configuration of the sync process ([source](https://github.com/airbytehq/airbyte/issues/30998)).

To troubleshoot this issue, you might want to:
1. Check the version of your Postgres connector and consider updating it if necessary.
2. Review your sync configurations, particularly those related to the xmin system column and incremental sync.
3. If the issue persists, consider switching to a different replication method, such as CDC replication, especially if your database is experiencing Xmin wraparound ([source](https://docs.airbyte.com/integrations/sources/postgres/postgres-troubleshooting#xmin-wraparound)).

Remember to always back up your data and test any changes in a controlled environment before applying them to your production system.
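
To make the XID wraparound limitation a bit more concrete, here is a minimal sketch in plain Python. It assumes you have already fetched a 64-bit, epoch-extended transaction ID from Postgres (for example the value returned by `txid_current()`); the helper names `split_txid` and `wrapped_since` are hypothetical and are not part of Airbyte or the Postgres connector. It only illustrates why a 32-bit xmin value can wrap between two syncs, not how the connector tracks its state.

```python
from typing import Tuple

XID_BITS = 32                    # Postgres transaction IDs are 32 bits wide
XID_MASK = (1 << XID_BITS) - 1


def split_txid(raw_txid: int) -> Tuple[int, int]:
    """Split an epoch-extended transaction ID into (epoch, 32-bit xid).

    The epoch counts how many times the 32-bit xid counter has wrapped.
    """
    return raw_txid >> XID_BITS, raw_txid & XID_MASK


def wrapped_since(prev_raw: int, curr_raw: int) -> bool:
    """Return True if the xid counter wrapped between two observations."""
    prev_epoch, _ = split_txid(prev_raw)
    curr_epoch, _ = split_txid(curr_raw)
    return curr_epoch > prev_epoch


# Hypothetical values: one taken just before a wraparound, one just after.
before = 4294967290              # epoch 0, xid close to 2**32
after = (1 << 32) + 5            # epoch 1, xid 5
print(split_txid(after))
print(wrapped_since(before, after))
```

If the second value reports a higher epoch than the first, comparing the raw 32-bit xmin values from the two syncs is no longer meaningful, which is the situation where switching to CDC replication is suggested above.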