You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/integrations/sources/google-sheets.md
+13-2
Original file line number
Diff line number
Diff line change
@@ -97,8 +97,18 @@ If your spreadsheet is viewable by anyone with its link, no further action is ne
97
97
- To authenticate your Google account via OAuth, select **Authenticate via Google (OAuth)** from the dropdown and enter your Google application's client ID, client secret, and refresh token.
98
98
<!-- /env:oss -->
99
99
6. For **Spreadsheet Link**, enter the link to the Google spreadsheet. To get the link, go to the Google spreadsheet you want to sync, click **Share** in the top right corner, and click **Copy Link**.
100
-
7. (Optional) You may enable the option to **Convert Column Names to SQL-Compliant Format**. Enabling this option will allow the connector to convert column names to a standardized, SQL-friendly format. For example, a column name of `Café Earnings 2022` will be converted to `cafe_earnings_2022`. We recommend enabling this option if your target destination is SQL-based (ie Postgres, MySQL). Set to false by default.
101
-
8. Click **Set up source** and wait for the tests to complete.
100
+
7. For **Batch Size**, enter an integer which represents batch size when processing a Google Sheet. Default value is 200.
101
+
Batch size is an integer representing row batch size for each sent request to Google Sheets API.
102
+
Row batch size means how many rows are processed from the google sheet, for example default value 200
103
+
would process rows 1-201, then 201-401 and so on.
104
+
Based on [Google Sheets API limits documentation](https://developers.google.com/sheets/api/limits),
105
+
it is possible to send up to 300 requests per minute, but each individual request has to be processed under 180 seconds,
106
+
otherwise the request returns a timeout error. In regards to this information, consider network speed and
107
+
number of columns of the google sheet when deciding a batch_size value.
108
+
Default value should cover most of the cases, but if a google sheet has over 100,000 records or more,
109
+
consider increasing batch_size value.
110
+
8. (Optional) You may enable the option to **Convert Column Names to SQL-Compliant Format**. Enabling this option will allow the connector to convert column names to a standardized, SQL-friendly format. For example, a column name of `Café Earnings 2022` will be converted to `cafe_earnings_2022`. We recommend enabling this option if your target destination is SQL-based (ie Postgres, MySQL). Set to false by default.
111
+
9. Click **Set up source** and wait for the tests to complete.
102
112
103
113
<HideInUI>
104
114
@@ -151,6 +161,7 @@ Airbyte batches requests to the API in order to efficiently pull data and respec
| 0.5.1 | 2024-04-11 |[35404](https://github.com/airbytehq/airbyte/pull/35404)| Add `row_batch_size` parameter more granular control read records |
154
165
| 0.5.0 | 2024-03-26 |[36515](https://github.com/airbytehq/airbyte/pull/36515)| Resolve poetry dependency conflict, add record counts to state messages |
155
166
| 0.4.0 | 2024-03-19 |[36267](https://github.com/airbytehq/airbyte/pull/36267)| Pin airbyte-cdk version to `^0`|
0 commit comments