Configuring Trino on Amazon EMR

Topics

Configuring connectors for Trino
Monitoring

Configuring connectors for Trino

Connecting to AWS Glue as your Hive metastore

It's important and useful to understand that you can configure AWS Glue Data Catalog as your Hive metastore when running queries with Trino. When you do this, you can add fine-grained permissions on the data sources using AWS Lake Formation. For additional information, including steps to set up a cluster with a Hive metastore, see Using the AWS Glue Data Catalog as the metastore for Hive.

For information on integrating EMR on EKS with AWS Glue, see the following best practices, EMR Containers integration with AWS Glue.

Connecting to Iceberg tables when using Trino with Amazon EMR

Iceberg is an open table format for analytic tables. It was created for engines like Spark and Trino to query big data from the same tables, using SQL queries. It includes features like isolating data reads and writes, so a reader can avoid querying data that's partially updated, for instance. It also supports state features, like snapshots. It provides an abstraction layer through the use of metadata and manifest files. These describe table schema and make it easy to query data without having to know a lot of details about how it's formatted or organized. When you're connected you can both read data from the tables update data, or write new data to the underlying files.

There's a workshop available that shows you how to configure Iceberg tables with Amazon EMR and AWS Glue. For more information, see Analytics Workshop - Set Up and Use Apache Iceberg Tables on Your Data Lake.

Connecting with Clients

You can connect with Trino using an available JDBC driver. For more information, see JDBC driver in the Trino Documentation.

Monitoring

You can monitor Amazon EMR clusters through the AWS Management Console. For more information, see View and monitor an Amazon EMR cluster as it performs work. Amazon EMR also sends its monitoring metrics to Amazon CloudWatch. For more information about monitoring an Amazon EMR cluster, see Amazon CloudWatch events and metrics from Amazon EMR.

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Connect to the primary node and run queries

Best practices for Trino on Amazon EMR

Select your cookie preferences

Customize cookie preferences

Essential

Performance

Functional

Advertising

Unable to save cookie preferences

Configuring Trino on Amazon EMR

Topics

Configuring connectors for Trino

Connecting to AWS Glue as your Hive metastore

Connecting to Iceberg tables when using Trino with Amazon EMR

Connecting with Clients

Monitoring

Did this page help you?

Next topic:

Previous topic:

Need help?