-
Notifications
You must be signed in to change notification settings - Fork 4.6k
Source Mixpanel: "export" stream make line parsing more robust #18846
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Sergey Chvalyuk <[email protected]>
/test connector=connectors/source-mixpanel
Build FailedTest summary info:
|
Signed-off-by: Sergey Chvalyuk <[email protected]>
Signed-off-by: Sergey Chvalyuk <[email protected]>
Signed-off-by: Sergey Chvalyuk <[email protected]>
Signed-off-by: Sergey Chvalyuk <[email protected]>
/test connector=connectors/source-mixpanel
Build PassedTest summary info:
|
Signed-off-by: Sergey Chvalyuk <[email protected]>
export
stream improve line parsing# combine record from 2 standing nearby parts | ||
assert list(stream.iter_dicts([record_string, record_string[:2], record_string[2:], record_string])) == [record, record, record] | ||
# drop record parts because they are not standing nearby | ||
assert list(stream.iter_dicts([record_string, record_string[:2], record_string, record_string[2:]])) == [record, record] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
could you do it using pytest.mark.parametrize
so that the parameters would be more human readable?
/publish connector=connectors/source-mixpanel
if you have connectors that successfully published but failed definition generation, follow step 4 here |
/publish connector=connectors/source-mixpanel
if you have connectors that successfully published but failed definition generation, follow step 4 here |
Signed-off-by: Sergey Chvalyuk [email protected]
What
Try to fix https://github.com/airbytehq/oncall/issues/945
For "export" stream make line parsing more robust:
The incoming stream of records has to be JSON lines format.
From time to time for some reason, the one record can be split into multiple lines.
We try to combine such split parts into one record only if parts go nearby.
How
Describe the solution
Recommended reading order
x.java
y.python
🚨 User Impact 🚨
Are there any breaking changes? What is the end result perceived by the user? If yes, please merge this PR with the 🚨🚨 emoji so changelog authors can further highlight this if needed.
Pre-merge Checklist
Expand the relevant checklist and delete the others.
Updating a connector
Community member or Airbyter
airbyte_secret
./gradlew :airbyte-integrations:connectors:<name>:integrationTest
.README.md
bootstrap.md
. See description and examplesdocs/integrations/<source or destination>/<name>.md
including changelog. See changelog exampleAirbyter
If this is a community PR, the Airbyte engineer reviewing this PR is responsible for the below items.
/test connector=connectors/<name>
command is passing/publish
command described hereTests
Unit
Put your unit tests output here.
Integration
Put your integration tests output here.
Acceptance
Put your acceptance tests output here.