AI Service

Get Started with Amazon SageMaker

The next generation of Amazon SageMaker is the center for all your data, analytics, and AI.

Overview of Amazon SageMaker

Bringing together widely adopted AWS artificial intelligence and machine learning (AI/ML) and analytics capabilities, the next generation of Amazon SageMaker delivers an integrated experience for analytics and AI with unified access to all your data. The next generation of Amazon SageMaker consists of two primary components:

  • Amazon SageMaker Unified Studio, which provides an integrated experience to use all your data and tools for analytics and AI
  • Amazon SageMaker Catalog, which enables secure discovery and access to approved data and models.

Additionally, SageMaker is built upon an open lakehouse architecture that unifies access to all your data across Amazon Simple Storage Service (Amazon S3) data lakes, Amazon Redshift data warehouses, and other external sources.

Benefits
  • Build with all your tools for analytics and AI in SageMaker Unified Studio
  • Develop and scale AI with a comprehensive set of AI capabilities
  • Reduce data silos and unify all your data with SageMaker lakehouse architecture
  • Meet your enterprise security needs with end-to-end data and AI governance

To get started, go to the Amazon SageMaker user guide.

SageMaker diagram

Capabilities of Amazon SageMaker Unified Studio

Amazon SageMaker Unified Studio is a single data and AI development environment where you can find and access all of the data in your organization and act on it using the best tools across any use case.

  • Streamline access to familiar tools and functionality from purpose-built AWS analytics and artificial intelligence and machine learning (AI/ML) services like Amazon EMR, AWS Glue, Amazon Athena, Amazon Redshift, Amazon Bedrock, and Amazon SageMaker AI.
  • Develop ML and foundation models (FMs) using the fully managed infrastructure, tools, and workflows of SageMaker AI.
  • Efficiently build generative AI applications in a trusted and secure environment using Amazon Bedrock.
  • Analyze, prepare, and integrate data for analytics and AI using open source frameworks on Amazon Athena, Amazon EMR, and AWS Glue
  • Streamline your data journey with Amazon Q Developer to author code, generate SQL, integrate data, and troubleshoot.

Key concepts

To learn more about Amazon SageMaker Unified Studio, Catalog, lakehouse architecture, and SageMaker AI, explore the following guides:

Use cases

Learn more about how to apply your use case to SageMaker Unified Studio and which underlying services are being used.

Import and query data

You can use the SageMaker Unified Studio query editor to perform analysis using SQL. It provides a place to write and run queries, view results, and share your work with your team. You can also import and query data sets in your existing Glue Data Catalog resources.

Scale data infrastructure and integrate access controls

SageMaker lakehouse architecture provides a unified environment for accessing, discovering, preparing, and analyzing data from various sources for machine learning (ML) and analytics workloads. Use lakehouse access controls to:

  • Streamlining the creation of connections to diverse data sources through a unified interface
  • Centralizing access control management through AWS Lake Formation
  • Enabling in-place querying through federated catalogs without data movement
  • Providing fine-grained permissions at the catalog, database, table, and column levels
  • Exploring data for ad hoc reporting and proof of concept before setting up new zero-ETL pipelines
Fine-tune foundation models

Amazon SageMaker Unified Studio provides a large collection of state-of-the-art foundation models. These models support use cases such as content writing, code generation, question answering, copywriting, summarization, classification, information retrieval, and more. You can find, customize, and deploy these foundation models in the JumpStart model catalog. You can use the foundation models to build your own generative AI solutions for a wide range of applications.

Create a chat agent application

Amazon Bedrock in SageMaker Unified Studio offers multiple playgrounds that allow you to easily access and experiment with Amazon Bedrock models. With the chat playground, you can chat with a model through text and image prompts. With the image and video playground, you can use a compatible model to generate and edit images and videos. In addition to the playgrounds, you can also use Amazon Bedrock in SageMaker Unified Studio to create chat agent apps and flows apps.

Start building

Now that we've covered what Amazon SageMaker is and its benefits, you can get started by selecting one of the following workflows:
Setup
Setting up Amazon SageMaker

Learn how to set up Amazon SageMaker.

Upload and query data
Get started with uploading and querying data

Learn how to write and run queries, view results, and share your work with your team.

Bring existing resources
Get started with importing and querying data sets for AWS Glue Data Catalog and Amazon S3 in Amazon SageMaker Unified Studio

Learn how to access and leverage your existing AWS Glue Data Catalog resources within Amazon SageMaker Unified Studio, allowing you to query and analyze your data without moving or duplicating it.

Get started with compute
Get started using EMR Serverless in Amazon SageMaker Unified Studio

Use a single EMR Serverless application on multiple clusters and run clusters on demand as it fits your use case and needs.

Model Development
Get started fine-tuning foundation models

Learn how to fine-tune foundation models, Amazon SageMaker Unified Studio provides an example training dataset for each model that's eligible for training.

Get started with lakehouse architecture
Get started with lakehouse access controls for Athena federated queries in Amazon SageMaker Unified Studio

This guide shows you how to use SageMaker lakehouse architecture with integrated access controls for Athena federated queries.

Resources