Select your cookie preferences

We use essential cookies and similar tools that are necessary to provide our site and services. We use performance cookies to collect anonymous statistics, so we can understand how customers use our site and make improvements. Essential cookies cannot be deactivated, but you can choose “Customize” or “Decline” to decline performance cookies.

If you agree, AWS and approved third parties will also use cookies to provide useful site features, remember your preferences, and display relevant content, including relevant advertising. To accept or decline all non-essential cookies, choose “Accept” or “Decline.” To make more detailed choices, choose “Customize.”

Introduction to Delta Lake - Amazon EMR

Introduction to Delta Lake

Delta Lake is an open-source project that helps implement modern data lake architectures commonly built on Amazon S3. Delta Lake offers the following capabilities:

  • Atomic, consistent, isolated, durable (ACID) transactions on Spark. Readers see a consistent view of the table during a Spark job.

  • Scalable metadata handling with distributed processing by Spark.

  • Combines streaming and batch uses cases with the same Delta table.

  • Automatic schema enforcement to avoid bad records during data ingestion.

  • Time travel with data versioning.

  • Supports merge, update, and delete operations for complex use cases like change data capture (CDC), streaming upserts, and more.

PrivacySite termsCookie preferences
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.