| epidemiology | HTTPS |[https://storage.googleapis.com/covid19-open-data/v2/latest/epidemiology.csv](https://storage.googleapis.com/covid19-open-data/v2/latest/epidemiology.csv)|||[COVID-19 Public dataset](https://console.cloud.google.com/marketplace/product/bigquery-public-datasets/covid19-public-data-program?filter=solution-type:dataset&id=7d6cc408-53c8-4485-a187-b8cb9a5c0b56) on BigQuery |
+
| Dataset Name | Storage | URL | Reader Impl | Service Account | Description |
| epidemiology | HTTPS |[https://storage.googleapis.com/covid19-open-data/v2/latest/epidemiology.csv](https://storage.googleapis.com/covid19-open-data/v2/latest/epidemiology.csv)|||[COVID-19 Public dataset](https://console.cloud.google.com/marketplace/product/bigquery-public-datasets/covid19-public-data-program?filter=solution-type:dataset&id=7d6cc408-53c8-4485-a187-b8cb9a5c0b56) on BigQuery |
| hr_and_financials | GCS | gs://airbyte-vault/financial.csv | smart_open or gcfs |`{"type": "service_account", "private_key_id": "XXXXXXXX", ...}`| Data from a private bucket; a service account is necessary |
-
| landsat_index | GCS | gcp-public-data-landsat/index.csv.gz | smart_open || Using smart_open, we don't need to specify the compression (note the gs:// is optional too, same for other providers) |
+
| landsat_index | GCS | gcp-public-data-landsat/index.csv.gz | smart_open || Using smart_open, we don't need to specify the compression (note the gs:// is optional too, same for other providers) |
| landsat_index | GCS | gs://gcp-public-data-landsat/index.csv.gz | GCFS |`{"compression": "gzip"}`| Additional reader options to specify a compression option to `read_csv`|
| GDELT | S3 | s3://gdelt-open-data/events/20190914.export.csv ||`{"sep": "\t", "header": null}`| Tab-separated (TSV) data without a header row, from [AWS Open Data](https://registry.opendata.aws/gdelt/)|
| server_logs | local | /local/logs.log ||`{"sep": ";"}`| Server logs delimited by `;`, read after making sure the local text file exists at `/tmp/airbyte_local/logs.log` |
Example for SFTP:
-
| Dataset Name | Storage | User | Password | Host | URL | Reader Options | Description |
| Test Rebext | SFTP | demo | password | test.rebext.net | /pub/example/readme.txt |`{"sep": "\r\n", "header": null, "names": ["text"], "engine": "python"}`| We use the `python` engine for `read_csv` to handle a delimiter longer than one character while providing our own column names. |
Please see (or add) more at `airbyte-integrations/connectors/source-file/integration_tests/integration_source_test.py` for further usage examples.
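
To make the reader options column more concrete, here is a minimal sketch of how a JSON options string such as the one in the GDELT row could map onto `read_csv` keyword arguments. It assumes the options are forwarded unchanged to pandas' `read_csv`; the in-memory sample data is hypothetical and only stands in for the real tab-separated file, and this is not the connector's actual code.

```python
import io
import json

import pandas as pd

# Reader options as written in the GDELT row of the table above.
# In JSON, "\t" is a tab character and null becomes Python's None.
reader_options = json.loads('{"sep": "\\t", "header": null}')

# Hypothetical two-row sample standing in for the real TSV file.
sample = io.StringIO("1\tUSA\t20190914\n2\tFRA\t20190914\n")

# Assumption: the options are passed straight through as keyword arguments.
# With header=None, pandas labels the columns 0, 1, 2 instead of using row one.
df = pd.read_csv(sample, **reader_options)

print(df)
```

Because the file has no header row, the `"header": null` option matters here: without it, the first data row would be consumed as column names.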
@@ -217,6 +217,7 @@ In order to read large files from a remote location, this connector uses the [sm