Best practices for Trino on Amazon EMR - Amazon EMR

Best practices for Trino on Amazon EMR

Trino’s architecture is designed for fast, distributed SQL queries on large datasets across multiple data sources, following a coordinator-worker model, where each component has a specialized role in query execution. There are a few areas or categories you can focus on in order to configure your Amazon EMR cluster running Trino for its best performance. These include the following:

Adjusting cluster configuration settings for memory optimization.
Optimizing settings for data partitioning and data distribution.
Using dynamic filtering to reduce query-result counts.

Some of these settings are tuned automatically when you use Trino with Amazon EMR. Others can be set manually through the console or through CLI commands. The topics in this section help you configure your data and your cluster optimally.

Topics

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Configuring Trino on Amazon EMR

Key areas of focus for performance improvement

Select your cookie preferences

Customize cookie preferences

Essential

Performance

Functional

Advertising

Unable to save cookie preferences

Best practices for Trino on Amazon EMR

Topics

Did this page help you?

Next topic:

Previous topic:

Need help?