AWS Data Pipeline is no longer available to new customers. Existing customers of AWS Data Pipeline can continue to use the service as normal. Learn more
Before You Begin
Be sure you've completed the following steps.
-
Complete the tasks in Setting up for AWS Data Pipeline.
-
(Optional) Set up a VPC for the instance and a security group for the VPC.
-
Create an Amazon S3 bucket as a data output.
For more information, see Create a Bucket in Amazon Simple Storage Service User Guide.
-
Create and launch a MySQL database instance as your data source.
For more information, see Launch a DB Instance in the Amazon RDS Getting Started Guide. After you have an Amazon RDS instance, see Create a Table
in the MySQL documentation. Note
Make a note of the user name and the password you used for creating the MySQL instance. After you've launched your MySQL database instance, make a note of the instance's endpoint. You'll need this information later.
-
Connect to your MySQL database instance, create a table, and then add test data values to the newly created table.
For illustration purposes, we created this tutorial using a MySQL table with the following configuration and sample data. The following screen shot is from MySQL Workbench 5.2 CE:
For more information, see Create a Table
in the MySQL documentation and the MySQL Workbench product page . -
Create a topic for sending email notification and make a note of the topic Amazon Resource Name (ARN). For more information, see Create a Topic in Amazon Simple Notification Service Getting Started Guide.
-
(Optional) This tutorial uses the default IAM role policies created by AWS Data Pipeline. If you would rather create and configure your IAM role policy and trust relationships, follow the instructions described in IAM Roles for AWS Data Pipeline.