Select your cookie preferences

We use essential cookies and similar tools that are necessary to provide our site and services. We use performance cookies to collect anonymous statistics, so we can understand how customers use our site and make improvements. Essential cookies cannot be deactivated, but you can choose “Customize” or “Decline” to decline performance cookies.

If you agree, AWS and approved third parties will also use cookies to provide useful site features, remember your preferences, and display relevant content, including relevant advertising. To accept or decline all non-essential cookies, choose “Accept” or “Decline.” To make more detailed choices, choose “Customize.”

Reading from Google Sheets entities

Focus mode
Reading from Google Sheets entities - AWS Glue

Prerequisites

  • A Google SpreadSheet that you would like to read from. You will need the SpreadSheet ID and tabName of the spreadsheet.

Google Sheets Entity and Field Details:

Entity Data Type Supported Operators
Spreadsheet String N/A (filter is not supported)

Example

googleSheets_read = glueContext.create_dynamic_frame.from_options( connection_type="googlesheets", connection_options={ "connectionName": "connectionName", "ENTITY_NAME": "{SpreadSheetID}#{SheetTabName}", "API_VERSION": "v4" }

Partitioning queries

For Record Base Partitioning only, NUM_PARTITIONS can be provided as additional spark options if you want to utilize concurrency in Spark. With this parameter, the original query would be split into NUM_PARTITIONS number of sub-queries that can be executed by spark tasks concurrently.

Example with NUM_PARTITIONS

googlesheets_read = glueContext.create_dynamic_frame.from_options( connection_type="googlesheets", connection_options={ "connectionName": "connectionName", "ENTITY_NAME": "{SpreadSheetID}#{SheetTabName}", "API_VERSION": "v4", "NUM_PARTITIONS": "10" }
PrivacySite termsCookie preferences
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.