Querying metadata tables with AWS analytics services - Amazon Simple Storage Service

Querying metadata tables with AWS analytics services

Note

The S3 Metadata feature is in preview release for Amazon S3 and is subject to change.

You can query your S3 managed metadata tables with AWS analytics services such as Amazon Athena, Amazon Redshift, and Amazon EMR.

Before you can run queries, you must first integrate the S3 table buckets in your AWS account and Region with AWS analytics services.

Note

Integrating S3 table buckets with AWS analytics services is in preview release and subject to change.

Querying metadata tables with Amazon Athena

After you integrate your S3 table buckets with AWS analytics services, you can start querying your metadata tables in Athena. In your queries, specify your catalog as s3tablescatalog and your database as aws_s3_metadata (which is the namespace for your metadata tables). For more information, see Querying Amazon S3 tables with Athena.

Querying metadata tables with Amazon Redshift

After you integrate your S3 table buckets with AWS analytics services, you create a resource link to your metadata table namespace (aws_s3_metadata). Once that's done, you can start querying your metadata tables in the Amazon Redshift console. For more information, see Accessing Amazon S3 tables with Amazon Redshift.

Querying metadata tables with Amazon EMR

To query your metadata tables by using Amazon EMR, you create an Amazon EMR cluster configured for Apache Iceberg and connect to your metadata tables using Apache Spark. You can set this up by integrating your S3 table buckets with AWS analytics services or using the open-source Amazon S3 Tables Catalog for Iceberg client catalog.

For more information, see Accessing Amazon S3 tables with Amazon EMR.