Closed
Description
Environment
- Airbyte version: v0.35.35-alpha
- OS Version / Instance: AWS EC2 t2.xlarge
- Deployment: Docker
- Source Connector and version: Postgres 0.4.8
- Destination Connector and version: Snowflake 0.4.14
- Severity: Critical?
- Step where error happened: Sync job
Current Behavior
I have a Postgres to Snowflake sync setup with incremental deduped sync (cursor = updated_at timestamp) to run hourly and is sometime failing to sync all the records that were created at or around the timestamp stored in the cursor.
Ex for a sync with missing records:
- From the Airbyte log:
Found matching cursor in state. Stream: AirbyteStreamNameNamespacePair{name='xxx', namespace='public'}. Cursor Field: updated_at Value: 2022-03-01T23:03:07.302674Z
- Postgres has 60 records where
date_trunc(ms, updated_at::timestamp_ntz) = '2022-03-01T23:03:07.302'::timestamp_ntz
- After the sync has only 21 record where where
date_trunc(ms, updated_at::timestamp_ntz) = '2022-03-01T23:03:07.302'::timestamp_ntz
Expected Behavior
All records should be synced to the destination.
Logs
Steps to Reproduce
- Postgres to Snowflake sync setup with incremental deduped sync
- Create record in Postgres with a timestamp equal (to the ms?) to the cursor in Airbyte
- Check for missing records in Snowflake
Are you willing to submit a PR?
No.