AWS Data Pipeline is no longer available to new customers. Existing customers of AWS Data Pipeline can continue to use the service as normal.
Related services
AWS Data Pipeline works with the following services to store data.
- Amazon DynamoDB: Provides a fully managed NoSQL database with fast performance at a low cost. For more information, see the Amazon DynamoDB Developer Guide.
- Amazon RDS: Provides a fully managed relational database that scales to large datasets. For more information, see the Amazon Relational Database Service Developer Guide.
- Amazon Redshift: Provides a fast, fully managed, petabyte-scale data warehouse that makes it easy and cost-effective to analyze a vast amount of data. For more information, see the Amazon Redshift Database Developer Guide.
- Amazon S3: Provides secure, durable, and highly scalable object storage. For more information, see the Amazon Simple Storage Service User Guide.
AWS Data Pipeline works with the following compute services to transform data.
- Amazon EC2: Provides resizable computing capacity (literally, servers in Amazon's data centers) that you use to build and host your software systems. For more information, see the Amazon EC2 User Guide.
- Amazon EMR: Makes it easy, fast, and cost-effective for you to distribute and process vast amounts of data across Amazon EC2 servers, using a framework such as Apache Hadoop or Apache Spark. For more information, see the Amazon EMR Developer Guide.
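To make the split between storage and compute services concrete, the following Python sketch maps each service listed above to the pipeline object type that usually represents it in a pipeline definition, then builds a small definition fragment that copies one Amazon S3 file to another on an Amazon EC2 instance. It is an illustration only, not a complete definition: the bucket paths, the object IDs, and the helper name build_copy_definition are hypothetical, and a real pipeline would also need a schedule, IAM roles, and other settings that are omitted here.

```python
import json

# Pipeline object types commonly used for the storage services listed above.
STORAGE_NODE_TYPES = {
    "Amazon DynamoDB": "DynamoDBDataNode",
    "Amazon RDS": "SqlDataNode",
    "Amazon Redshift": "RedshiftDataNode",
    "Amazon S3": "S3DataNode",
}

# Pipeline object types commonly used for the compute services listed above.
COMPUTE_RESOURCE_TYPES = {
    "Amazon EC2": "Ec2Resource",
    "Amazon EMR": "EmrCluster",
}


def build_copy_definition(input_path, output_path):
    """Build a definition fragment: copy one S3 file to another on EC2.

    Hypothetical helper for illustration. Schedule, roles, and logging
    settings are intentionally left out.
    """
    return {
        "objects": [
            {
                "id": "InputData",
                "name": "InputData",
                "type": STORAGE_NODE_TYPES["Amazon S3"],
                "filePath": input_path,
            },
            {
                "id": "OutputData",
                "name": "OutputData",
                "type": STORAGE_NODE_TYPES["Amazon S3"],
                "filePath": output_path,
            },
            {
                "id": "CopyHost",
                "name": "CopyHost",
                "type": COMPUTE_RESOURCE_TYPES["Amazon EC2"],
                "terminateAfter": "1 Hour",
            },
            {
                "id": "CopyStep",
                "name": "CopyStep",
                "type": "CopyActivity",
                # Other objects are referenced by ID with {"ref": ...}.
                "input": {"ref": "InputData"},
                "output": {"ref": "OutputData"},
                "runsOn": {"ref": "CopyHost"},
            },
        ]
    }


# Placeholder bucket and keys; replace with real locations.
definition = build_copy_definition(
    "s3://example-bucket/input/data.csv",
    "s3://example-bucket/output/data.csv",
)
print(json.dumps(definition, indent=2))
```

The data nodes describe where data lives and the resource describes where the work runs; the activity ties them together by reference. Swapping Ec2Resource for EmrCluster, or S3DataNode for one of the other data node types, follows the same pattern, although each type has its own required fields.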