Description
Topic
I have enabled Change Data Capture (CDC) on my PostgreSQL source following the instructions provided in the Airbyte PostgreSQL source documentation. However, during each sync, Airbyte is loading the entire dataset instead of only the changes. This issue persists even when there are no updates in the source database.
Relevant information
Steps Taken:
- Enabled logical replication in PostgreSQL.
- Created a replication slot and a publication as per the documentation.
- Set max_slot_wal_keep_size, max_wal_size, and wal_keep_size to 10GB.
- Verified that CDC is correctly configured on the PostgreSQL side.
Logs:This 4th time ran sync logdefault_workspace_logs_1388_txt.txt
Expected Behavior:
Airbyte should only sync the changed records instead of reloading the entire dataset on every sync.
Actual Behavior:
Each sync loads the whole dataset, resulting in increased load times and performance issues.
Additional Information:
PostgreSQL Version: 16
Airbyte Version: 0.58.0 >
Questions:
- What could be the underlying cause of the incremental changes not being detected?
- Are there specific logs or configurations I should examine to troubleshoot this issue?
I would appreciate any assistance in diagnosing and resolving this problem.