-
Notifications
You must be signed in to change notification settings - Fork 4.6k
clarify staging setup guide for bq destination #9255
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 1 commit
6eb6126
220afe2
cad0eed
be412fb
5feef4e
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -111,15 +111,13 @@ This is the recommended configuration for uploading data to BigQuery. It works b | |
* **GCS Bucket Path** | ||
* **Block Size (MB) for GCS multipart upload** | ||
* **GCS Bucket Keep files after migration** | ||
* See [this](https://cloud.google.com/storage/docs/creating-buckets) for instructions on how to create a GCS bucket. | ||
* See [this](https://cloud.google.com/storage/docs/creating-buckets) for instructions on how to create a GCS bucket. The bucket cannot have a retention policy. Set Protection Tools to none or Object versioning. | ||
* **HMAC Key Access ID** | ||
* See [this](https://cloud.google.com/storage/docs/authentication/hmackeys) on how to generate an access key. | ||
* We recommend creating an Airbyte-specific user or service account. This user or account will require read and write permissions to objects in the bucket. | ||
* See [this](https://cloud.google.com/storage/docs/authentication/managing-hmackeys) on how to generate an access key. For more information on hmac keys please reference the [GCP docs](https://cloud.google.com/storage/docs/authentication/hmackeys) | ||
* We recommend creating an Airbyte-specific user or service account. This user or account will require the following permissions for the bucket: `Storage Object Admin`. You can set those by going to the permissions tab in the GCS bucket and adding the appropriate the email address of the service account or user and adding the aforementioned permissions. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. is there a reason we didn't originally specify the the exact permissions needed? the GCP permission model has so many permissions here that seem vaguely related. it took me a few tries to get the right one. maybe there's another combination of |
||
* **Secret Access Key** | ||
* Corresponding key to the above access ID. | ||
* Make sure your GCS bucket is accessible from the machine running Airbyte. | ||
* This depends on your networking setup. | ||
* The easiest way to verify if Airbyte is able to connect to your GCS bucket is via the check connection tool in the UI. | ||
* Make sure your GCS bucket is accessible from the machine running Airbyte. This depends on your networking setup. The easiest way to verify if Airbyte is able to connect to your GCS bucket is via the check connection tool in the UI. | ||
|
||
### `Standard` uploads | ||
This uploads data directly from your source to BigQuery. While this is faster to setup initially, **we strongly recommend that you do not use this option for anything other than a quick demo**. It is more than 10x slower than the GCS uploading option and will fail for many datasets. Please be aware you may see some failures for big datasets and slow sources, e.g. if reading from source takes more than 10-12 hours. This is caused by the Google BigQuery SDK client limitations. For more details please check [https://github.com/airbytehq/airbyte/issues/3549](https://github.com/airbytehq/airbyte/issues/3549) | ||
|
Uh oh!
There was an error while loading. Please reload this page.