Deploying into production - Migrating to Apache HBase on Amazon S3 on Amazon EMR

Deploying into production

After you complete the steps in this section, you are ready to migrate the full dataset from your HDFS-backed cluster to HBase on Amazon S3 and restore it to an HBase on Amazon S3 cluster running in your AWS production account.

Preparing Amazon S3 for production load

Analyze the Amazon CloudWatch metrics for Amazon S3 captured for the HBase root directory in the development account and confirm the number of requests per Amazon S3 API as noted in the Preparing the test environment section of this document.

If you expect a rapid increase in the request rate for the HBase on Amazon S3 root directory bucket in the production account to more than the rates in the Preparing the test environment section of this document, open a support case to prepare for the workload and to avoid any temporary limits on your request rate. You do not need to open a support case for request rates lower than those in the Preparing the test environment section.

Preparing the production environment

Follow all the steps in Preparing the test envvironment to prepare your production environment with the configuration settings you have found during the testing phase.

To migrate and restore the full dataset into the production environment, follow the steps in the Migrating and restoring Apache HBase tables on Apache HBase on Amazon S3 section of this document.