Example of a fully concurrent source with GNews #37386

Draft: wants to merge 2 commits into base: master

maxi297 (Contributor) commented Apr 17, 2024

What

This PR serves as documentation only. I tried to move source-gnews to the Concurrent CDK using the new API. GNews seems to have a very low API rate limit, which made it a very bad pick for concurrency.

How

By using the airbyte_cdk.sources.streams.concurrent package without relying on the adapters.

Identification of missing parts

Must

  • Do we need some kind of entrypoint?
    • ConcurrentSource does not support spec, check and discover
  • We have specific implementations of DateTimeStreamStateConverter to ease the migration of already existing sources, but for brand new sources it would be nice to have one where the state datetime format does not need to be provided
  • Clean up Partition.is_closed: it is not used, so it is confusing as to how it should be implemented
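
To illustrate the second point, here is a minimal sketch of a converter that needs no datetime format because it always reads and writes ISO 8601. The class name `IsoDateTimeStateConverter` and its `parse`/`serialize` methods are hypothetical and are not part of airbyte_cdk; treating naive timestamps as UTC is also an assumption.

```python
from datetime import datetime, timezone


class IsoDateTimeStateConverter:
    """Hypothetical converter (not an airbyte_cdk class): state values are always ISO 8601."""

    def parse(self, value: str) -> datetime:
        # fromisoformat accepts the strings produced by serialize() below
        parsed = datetime.fromisoformat(value)
        # Assumption: a timestamp without an offset is interpreted as UTC
        if parsed.tzinfo is None:
            parsed = parsed.replace(tzinfo=timezone.utc)
        return parsed.astimezone(timezone.utc)

    def serialize(self, value: datetime) -> str:
        # Normalize to UTC so every state value has the same shape
        if value.tzinfo is None:
            value = value.replace(tzinfo=timezone.utc)
        return value.astimezone(timezone.utc).isoformat()
```

Because the format is fixed, a brand new source would have nothing to configure; existing sources with custom state formats would still need the current converters.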

Nice to have

  • HttpRequester with error handling. Maybe we can use the declarative one for now but I assume Brian Lai will prioritize the work soonish.
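
As a rough idea of what error handling in such a requester could look like, the sketch below retries a request callable with exponential backoff. Everything here is hypothetical: `TransientHttpError` and `call_with_retries` are illustrative names, not CDK APIs, and the assumption is that the requester raises a dedicated exception for retryable responses such as rate limiting.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientHttpError(Exception):
    """Hypothetical error raised for retryable responses (for example, rate limiting)."""


def call_with_retries(
    send: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call send() until it succeeds or max_attempts is reached."""
    for attempt in range(1, max_attempts + 1):
        try:
            return send()
        except TransientHttpError:
            # Give up on the last attempt and let the caller see the error
            if attempt == max_attempts:
                raise
            # Exponential backoff: base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
    raise ValueError("max_attempts must be at least 1")
```

The `sleep` parameter is injectable so that the backoff can be tested without waiting.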

Can this PR be safely reverted and rolled back?

This PR is not meant to be merged.

octavia-squidington-iii added the labels area/connectors (Connector related issues), CDK (Connector Development Kit), and connectors/source/gnews on Apr 17, 2024

def close(self) -> None:
    self._cursor.close_partition(self)

def __hash__(self) -> int:
Contributor Author

We can probably have a default implementation in the CDK that relies on to_slice.
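
A minimal sketch of what such a default could look like, assuming the slice is a JSON-serializable mapping. `SlicePartition` is a hypothetical stand-in written for illustration, not the CDK's Partition class, and its constructor arguments are assumptions.

```python
import json
from typing import Any, Mapping, Optional


class SlicePartition:
    """Hypothetical stand-in for a partition; only the hashing logic matters here."""

    def __init__(self, stream_name: str, _slice: Optional[Mapping[str, Any]] = None) -> None:
        self._stream_name = stream_name
        self._slice = _slice

    def stream_name(self) -> str:
        return self._stream_name

    def to_slice(self) -> Optional[Mapping[str, Any]]:
        return self._slice

    def __hash__(self) -> int:
        _slice = self.to_slice()
        if _slice:
            # Sort keys so that key order in the slice does not change the hash
            return hash((self.stream_name(), json.dumps(_slice, sort_keys=True)))
        # Without a slice, the stream name alone identifies the partition
        return hash(self.stream_name())
```

With a default like this, a concrete partition would only need to implement to_slice and the stream name accessor.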
