AWS DMS Serverless components

Focus mode

AWS DMS Serverless components - AWS Database Migration Service

Supported Endpoints Creating a serverless replication Modifying AWS DMS serverless replications Compute Config Understanding autoscaling in AWS DMS serverless Monitoring AWS DMS serverless replications Extended Full-Load Throughput Amazon Redshift Extended Full Load Resumability

To manage the resources needed to perform a replication, AWS DMS Serverless has granular states that reveal different internal actions taken by the service. When you start the replication, AWS DMS Serverless calculates the capacity load, provisions the calculated capacity, and starts the data replication according to the following replication states.

The following diagram shows the state transitions for an AWS DMS Serverless replication.

The first state after you start the replication is Initializing. In this state, all the required parameters are initialized.
The states immediately following include Preparing Metadata Resources, Testing Connection, and Fetching Metadata. In these states, AWS DMS Serverless connects to your source database to obtain the information needed to predict the capacity needed.
- When the replication state is Testing Connection, AWS DMS Serverless verifies that the connection to your source and target databases are set up successfully.
- The replication state following Testing Connection is Fetching Metadata. Here, AWS DMS retrieves the information needed to calculate capacity.
- Once AWS DMS retrieves the necessary information, the next state is Calculating Capacity. Here, the system calculates the size of underlying resources required to perform the replication.
The state transition following Calculating Capacity is Provisioning Capacity. While the replication is in this state, AWS DMS Serverless initializes the underlying compute resources.
The replication state after all resources are successfully provisioned is Replication Starting. In this state, AWS DMS Serverless begins the replication of data. The phases of a replication include the following:
- Full load: In this phase, DMS replicates the source data store as it was when the replication started.
- CDC (initial): In this phase, DMS replicates the changes to the source data store that occurred during the Full Load phase. DMS only runs this phase if the StopTaskCachedChangesNotApplied task setting is false.
- CDC (ongoing): After the initial CDC phase, DMS replicates changes on the source database as they occur. DMS only continues to run replication after the initial CDC phase if the StopTaskCachedChangesApplied task setting is false.
The final state is Running. In the Running state, the replication of data is ongoing.
A replication that you stop enters the Stopped state. You can restart a stopped replication under the following circumstances:
- You can't restart a replication that DMS has deprovisioned.
- You can restart a stopped CDC-only or full-load and CDC replication using the StartReplication action. You can't restart a stopped replication using the console.
- You can't restart a stopped replication that uses PostgreSQL as an engine.

This topic contains the following sections.

Supported Endpoints
Creating a serverless replication
Modifying AWS DMS serverless replications
Compute Config
Understanding autoscaling in AWS DMS serverless
Monitoring AWS DMS serverless replications
Enhanced Throughput for Full-Load Oracle to Amazon Redshift and Amazon S3 Migrations
Oracle to Redshift : Full Load Resumability

For AWS DMS Serverless, the left-hand navigation panel of the AWS DMS console has a new option, Serverless replications. For Serverless Replications, you specify Replications instead of replication instance types or tasks to define a replication. In addition, you specify the maximum and minimum DMS capacity units (DCUs) that you want DMS to provision for the replication. A DCU is 2GB of RAM. AWS DMS bills your account for each DCU that your replication is currently using. For information about AWS DMS pricing, see AWS Database Migration Service pricing.

AWS DMS then automatically provisions replication resources based on your table mappings and the predicted size of your workload. This capacity unit is a value in the range of the minimum and maximum capacity unit values that you specify.

Supported Endpoints

With AWS DMS Serverless, you do not need to choose and manage engine versions, as the service handles that setting. AWS DMS Serverless supports the following sources:

MongoDB
Amazon DocumentDB (with MongoDB compatibility)
Microsoft SQL Server
PostgreSQL-compatible databases
MySQL-compatible databases
MariaDB
Oracle
Amazon S3
IBM Db2

AWS DMS Serverless supports the following targets:

Microsoft SQL Server
PostgreSQL
MySQL-compatible databases
Oracle
Amazon S3
Amazon Redshift
Amazon DynamoDB
Amazon Kinesis Data Streams
Amazon Managed Streaming for Apache Kafka
Amazon OpenSearch Service
Amazon DocumentDB (with MongoDB compatibility)
Amazon Neptune

As part of AWS DMS Serverless, you have access to console commands that allow you to create, configure, start, and manage AWS DMS serverless replications. To run these commands using the Serverless replications section of the console, you need to do one of the following:

Set up a new AWS Identity and Access Management (IAM) policy and IAM role to attach that policy to.
Use an AWS CloudFormation template to provide the access that you need.

AWS DMS Serverless requires a service linked role (SLR) to exist in your account. AWS DMS manages the creation and usage of this role. For more information about making sure that you have the necessary SLR, see Service-linked role for AWS DMS Serverless.

Creating a serverless replication

To create a serverless replication between two existing AWS DMS endpoints, do the following. For information about creating AWS DMS endpoints, see Creating source and target endpoints.

Creating a serverless replication

Sign in to the AWS Management Console and open the AWS DMS console at https://console.aws.amazon.com/dms/v2/.
On the navigation pane, choose Serverless replications, and then choose Create replication.

On the Create replication page, specify your serverless replication configuration:

Option	Action
Name	Enter a name to identify the replication, such as `DMS-replication`.
Descriptive Amazon Resource Name (ARN)- Optional	You can use this optional parameter to provide a description of the replication.
Source database endpoint	Choose existing endpoints in your account. Note that AWS DMS Serverless only supports a subset of the endpoint types that AWS DMS standard supports.
Target database endpoint	Choose existing endpoints in your account. Note that AWS DMS Serverless only supports a subset of the endpoint types that AWS DMS standard supports.
Replication type	Choose a replication type based on your requirements: Full load: AWS DMS migrates existing data only. Full load and change data capture (CDC): AWS DMS migrates existing data and changes that occur during replication. Change data capture (CDC): AWS DMS only migrates changes that occur after you start replication.

In the Settings section, set the settings that your replication requires.

In the Table mappings section, set up table mapping to define rules to select and filter data that you are replicating. Before you specify your mapping, make sure that you review the documentation section on data type mapping for your source and your target database. For information about data type mapping for your source and target databases, see the data types section for your source and target endpoint types in the Working with AWS DMS endpoints topic.

In the Compute settings section, set the following settings. For information about Compute Config settings, see Compute Config.

Option	Action
VPC	Choose an existing VPC.
Subnet group	Choose an existing subnet group.
VPC security group(s)	Choose default if it isn't already chosen.
AWS KMS key	Choose an appropriate KMS key. For information about KMS keys, see Creating keys in the AWS Key Management Service API Reference.
Deployment	Leave as is.
Availability Zone	Leave as is.
Minimum DMS capacity units (DCU) - (Optional)	Leave blank to use the default value of 1 DCU.
Maximum DMS capacity units (DCU)	Choose 16 DCU.

Leave the Maintenance settings as they are.

Choose Create replication.

AWS DMS creates a serverless replication to perform your migration.

Modifying AWS DMS serverless replications

To modify your replication configuration, use the modify-replication-config action. You can only modify an AWS DMS replication configuration that is in the CREATED, STOPPED, or FAILED states. For information about the modify-replication-config action, see ModifyReplicationConfig in the AWS Database Migration Service API Reference.

To modify a serverless replication configuration by using the AWS Management Console

Sign in to the AWS Management Console and open the AWS DMS console at https://console.aws.amazon.com/dms/v2/.
In the navigation pane, choose Serverless replications.

Choose the replication you want to modify. The following table describes the modifications you can make based on the current state of the replication.

Setting	Description	Allowed States
Name	You can change the name of the replication. Enter a name for the replication that contains from 8 to 16 printable ASCII characters (excluding /,", and @). The name should be unique for your account for the AWS Region you selected. You can choose to add some details to the name, such as including the AWS Region and task you are performing, for example: `west2-mysql2mysql-config1`.	`ReplicationState` is `CREATED`, `STOPPED`, or `FAILED`.
Source database endpoint	Choose a new existing source endpoint as the source for the replication.	`ReplicationState` is `CREATED`, or `FAILED` when `ProvisionState` is `null`.
Target database endpoint	Choose a new existing target endpoint as the target for the replication.	`ReplicationState` is `CREATED`, or `FAILED` when `ProvisionState` is `null`.
Replication type	You can modify the type of a serverless replication.	`ReplicationState` is `CREATED`, or `FAILED` when `ProvisionState` is `null`.
Replication Settings	You can modify the replication settings, including the target table preparation mode, whether to include LOB columns in replication, maximum LOB size, validation, and logging. For more information, see Task settings.	`ReplicationState` is `CREATED`, `STOPPED`, or `FAILED`.
Table mappings	You can modify the table mapping settings for a serverless replication, including the selection rules and the transformation rules. For more information, see Table mapping.	`ReplicationState` is `CREATED`, `STOPPED`, or `FAILED`.
Compute config	You can modify the compute configuration settings for a serverless replication, including the networking settings, scaling settings, and maintenance settings. For information about Compute Config settings, see Compute Config.	You can modify the following scaling, maintenance, and network settings when the `ReplicationState` is `CREATED`, `STOPPED`, or `FAILED`: `MinCapacityUnits` `MaxCapacityUnits` `MultiAZ` `PreferredMaintenanceWindow` `VpcSecurityGroupIds` You can modify the following networking and security settings when the `ReplicationState` is `CREATED`, or `FAILED` when `ProvisionState` is `null`: `AvailabilityZone` `DnsNameServers` `KmsKeyId` `ReplicationSubnetGroupId`

Compute Config

You configure your replication provisioning using the Compute Config parameter or console section. The fields in the Compute Config object include the following:

Option	Description
MinCapacityUnits	This is the minimum number of DMS Capacity Units (DCU) that AWS DMS will provision. This is also the minimum DCU that autoscaling could scale down to.
MaxCapacityUnits	This is the the maximum DMS Capacity Units (DCU) that AWS DMS can provision, depending on your replication's capacity prediction. This is also the maximum DCU that autoscaling could scale up to.
KmsKeyId	The encryption key to use to encrypt replication storage and connection information. If you choose (Default) aws/dms, AWS DMS uses the default KMS key associated with your account and AWS Region. A description and your account number are shown, along with the key's ARN. For more information about using the encryption key, see Setting an encryption key and specifying AWS KMS permissions. For this tutorial, leave (Default) aws/dms chosen.
ReplicationSubnetGroupId	The replication subnet group in your selected VPC where you want the replication to be created. If your source database is in a VPC, choose the subnet group that contains the source database as the location for your replication. For more information about replication subnet groups, see Creating a replication subnet group.
VpcSecurityGroupIds	The replication instance is created in a VPC. If your source database is in a VPC, choose the VPC security group that provides access to the DB instance where the database is located.
PreferredMaintenanceWindow	This parameter defines a weekly time range during which system maintenance can occur, in Universal Coordinated Time (UTC). The default is a 30-minute window selected at random from an 8-hour block of time per AWS Region, occurring on a random day of the week.
MultiAZ	Setting this optional parameter creates a standby replica of your replication in another Availability Zone for failover support. If you intend to use change data capture (CDC) or ongoing replication, we recommend that you turn on this option.

Understanding autoscaling in AWS DMS serverless

After you provision a replication and it is in the RUNNING state, the AWS DMS service manages the capacity of the underlying resources to adapt to changing workloads. This management scales replication resources based on the following replication settings:

MinCapacityUnits
MaxCapacityUnits

Replications scale up after a period of exceeding an upper utilization threshold, and down when capacity utilization is below a minimum capacity utilization threshold for a longer period.

Note

Serverless replications can't autoscale down while a full load is in progress.

Tuning autoscaling in AWS DMS serverless

To tune your replication autoscaling parameters, we recommend that you set MaxCapacityUnits to the maximum value, and let AWS DMS manage provisioning of resources. It is recommended that you choose the largest DCU maximum capacity setting to allow the greatest benefit from auto-scaling, to accommodate spikes in transaction volume. The pricing calculator shows the maximum monthly cost if your replication continuously uses the maximum DCU. The maximum DCU does not represent the actual cost, as you only pay for the capacity used.

If your replication is not using its resources at full capacity, AWS DMS will gradually deprovision resources to save you costs. However, since provisioning and deprovisioning resources takes time, we recommend that you set your MinCapacityUnits setting to a value that can handle any sudden spikes you expect in your replication workload. This will keep your replication from being under-provisioned while AWS DMS provisions resources for the higher workload level.

If you under-provision your replication with a maximum capacity setting that is too low for data requirements, or a minimum capacity that is too low to handle sudden spikes in your replication workload, you might see your CapacityUtilization metric consistently at its maximum value. This can cause your replication to fail. If your replication fails due to under-provisioned resources, AWS DMS creates an out-of-memory event in your replication logs. If the out-of-memory condition happened due to a sudden spike in your replication workload, the replication will auto-scale and restart.

Monitoring AWS DMS serverless replications

AWS provides several tools for monitoring your AWS DMS serverless replications, and responding to potential incidents:

AWS DMS serverless replication metrics
AWS DMS serverless replication logs

AWS DMS serverless replication metrics

Serverless replication monitoring includes Amazon CloudWatch metrics for the following statistics. These statistics are grouped by each serverless replication.

Metric	Units	Description
CapacityUtilization	Percent	The percentage of memory used by the serverless replication
CDCIncomingChanges	Percent	The total number of change events at a point-in-time that are waiting to be applied to the target. Note that this is not the same as a measure of the transaction change rate of the source endpoint. A large number for this metric usually indicates AWS DMS is unable to apply captured changes in a timely manner, thus causing high target latency.
CDCLatencySource	Seconds	The gap, in seconds, between the last event captured from the source endpoint and current system time stamp of the AWS DMS instance. CDCLatencySource represents the latency between source and replication instance. High CDCLatencySource means the process of capturing changes from source is delayed. To identify latency in an ongoing replication, you can view this metric together with CDCLatencyTarget. If both CDCLatencySource and CDCLatencyTarget are high, investigate CDCLatencySource first. CDCLatencySource can be 0 when there is no replication lag between the source and the replication. CDCLatencySource can also become zero when the replication attempts to read the next event in the source's transaction log and there are no new events compared to the last time it read from the source. When this happens, the replication resets the CDCLatencySource to 0.
CDCLatencyTarget	Seconds	The gap, in seconds, between the first event timestamp waiting to commit on the target and the current timestamp of the AWS DMS instance. Target latency is the difference between the replication instance server time and the oldest unconfirmed event id forwarded to a target component. In other words, target latency is the timestamp difference between the replication instance and the oldest event applied but unconfirmed by TRG endpoint (99%). When CDCLatencyTarget is high, it indicates the process of applying change events to the target is delayed. To identify latency in an ongoing replication, you can view this metric together with CDCLatencySource. If CDCLatencyTarget is high but CDCLatencySource isn’t high, investigate if: No primary keys or indexes are in the target Resource bottlenecks occur in the target or replication instance Network issues reside between replication and target
CDCThroughputBandwidthTarget	KB/ second	Outgoing data transmitted for the target in KB per second. CDCThroughputBandwidth records outgoing data transmitted on sampling points. If no network traffic is found, the value is zero. Because CDC does not issue long-running transactions, network traffic may not be recorded.
CDCThroughputRowsSource	Rows/ second	Incoming changes from the source in rows per second.
CDCThroughputRowsTarget	Rows/ second	Outgoing changes for the target in rows per second.
FullLoadThroughputBandwidthTarget	KB/ second	Outgoing data transmitted from a full load for the target in KB per second.
FullLoadThroughputRowsTarget	Rows/ second	Outgoing changes from a full load for the target in rows per second.

AWS DMS serverless replication logs

You can use Amazon CloudWatch to log replication information during an AWS DMS migration process. You enable logging when you select replication settings.

Serverless replications upload status logs to your CloudWatch account to provide increased visibility into the progress of the replication, and to assist with troubleshooting.

AWS DMS uploads serverless-linked logs to a dedicated log group with the prefix dms-serverless-replication-<your replication config resource ID>. Within this log group, there is a log stream called dms-serverless-replication-orchestrator-<your replication config resource ID>. This log stream reports the replication state of your replication, and an associated message providing further details on the work it is doing in this stage. For examples of log entries, see Serverless replication log examples following.

Note

AWS DMS doesn't create either the log group or stream until you run the replication. AWS DMS doesn't create the log group or stream if you only create the replication.

To view logs of a replication that ran, follow these steps:

Open the AWS DMS console, and choose Serverless replications from the navigation pane. The Serverless replications dialog appears.
Go to the Configuration section and choose View serverless logs in the General column. The CloudWatch log group opens.
Locate the Migration task logs section and choose View CloudWatch Logs.

If your replication fails, AWS DMS creates a log entry with a replication state of failed, and a message describing the reason for the failure. You should check your CloudWatch logs as the first step in troubleshooting a failed replication.

Note

As with AWS DMS Classic, you have the option to enable more granular logging on the progress of the data migration itself; that is, the logs emitted by the underlying replication task. You can enable these logs in your replication settings by setting EnableLogging in the Logging field to true , as in the following JSON example:


{
  "Logging": {
    "EnableLogging": true
  }
}

If you enable these logs, they only begin appearing during the running stage of your serverless replication. They will appear under the same log group as the previous log stream, but will be under the new log stream dms-serverless-serv-res-id-{unique identifier}. See the following section for information about how to interpret serverless replication logs.

Serverless replication log examples

This section includes examples of log entries for serverless replications.

Example: Replication start

When you run a serverless replication, AWS DMS creates a log entry similar to the following:


{'replication_state':'initializing', 'message': 'Initializing the replication workflow.'}

Example: Replication failure

If one of the endpoints of the replication is not configured correctly, AWS DMS creates a log entry similar to the following:


{'replication_state':'failed', 'message': 'Test connection failed for endpoint X.', 'failure_message': 'X'}

If you see this message in your log after a failure, make sure that the specified endpoint is healthy and configured correctly.

Enhanced Throughput for Full-Load Oracle to Amazon Redshift and Amazon S3 Migrations

AWS DMS provides significantly improved throughput performance for full-load migrations from Oracle to Amazon Redshift and Amazon S3. DMS automatically enables this feature for tables without the custom parallel-load option in its table mappings. For tables with customized parallel-load options, DMS serverless distributes the table load based on the given table mapping configurations. To use enhanced throughput, do the following:

Provide selection rules that don't reference partitions or boundaries. For example, if the table settings in the table mappings contains parallel-load, DMS Serverless won't use the enhanced throughput feature. For more information, see Selection rules and actions.
Set MaxFileSize and WriteBufferSize to 64 MB. For more information, see Endpoint settings when using Amazon Redshift as a target for AWS DMS.
We recommend setting CompressCsvFiles to true for a data store with sparse data, and false for a data store with dense data.
Set the following task settings to 0:
- ParallelLoadThreads
- ParallelLoadQueuesPerThread
- ParallelApplyThreads
- ParallelApplyQueuesPerThread
- ParallelLoadBufferSize
Set MaxFullLoadSubTasks to 49 to support parallel data migration.
Set LOB mode to inline. For more information, see Setting LOB support for source databases in an AWS DMS task.

AWS DMS doesn't provide enhanced throughput performance for the replications with tables using parallel-load. For more information, see Using parallel load for selected tables, views, and collections.

Oracle to Redshift : Full Load Resumability

AWS DMS provides significantl enhancement for full-load resumability from Oracle to Amazon Redshift. When a table contains partitions/segments in the source database, the full load process replicates the data by the segment or the partition. In the event of an error, the segment containing the problematic record is skipped, and the process continues with the replication of the remaining segments by marking the problem segment as failed.

The full-load process does not replicate the data into target database for those segments that had error records. All the other segments/partitions are replicated to the target database. With the full-load resumability enhancement, you can initiate the replication process again after rectifying the errors. During the resume action, the process retrieves the replication status of segments and selects only those segments that previously encountered errors.

By default, the FullLoadResumability feature is disabled for the replication. To enable the feature, add "PersistPartitionProgressForResumption": true under FullLoadSettings:


"FullLoadSettings": {
    "PersistPartitionProgressForResumption": true,
    "TargetTablePrepMode": "DO_NOTHING",
    "CreatePkAfterFullLoad": false,
    "StopTaskCachedChangesApplied": false,
    "StopTaskCachedChangesNotApplied": false,
    "MaxFullLoadSubTasks": 8,
    "TransactionConsistencyTimeout": 600,
    "CommitRate": 10000
  }

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Working with AWS DMS Serverless

Serverless limitations

Select your cookie preferences

Customize cookie preferences

Essential

Performance

Functional

Advertising

Unable to save cookie preferences

AWS DMS Serverless components

This topic contains the following sections.

Supported Endpoints

Creating a serverless replication

Creating a serverless replication

Modifying AWS DMS serverless replications

To modify a serverless replication configuration by using the AWS Management Console

Compute Config

Understanding autoscaling in AWS DMS serverless

Note

Tuning autoscaling in AWS DMS serverless

Monitoring AWS DMS serverless replications

AWS DMS serverless replication metrics

AWS DMS serverless replication logs

Note

Note

Serverless replication log examples

Example: Replication start

Example: Replication failure

Enhanced Throughput for Full-Load Oracle to Amazon Redshift and Amazon S3 Migrations

Oracle to Redshift : Full Load Resumability

On this page

Did this page help you?

Next topic:

Previous topic:

Need help?

AWS DMS Serverless replication states