Amazon SageMaker model parallelism library v1 examples

This page provides a list of blogs and Jupyter notebooks that present practical examples of implementing the SageMaker model parallelism (SMP) library v1 to run distributed training jobs on SageMaker.

Blogs and Case Studies

The following blogs discuss case studies about using SMP v1.

Example notebooks

Example notebooks are provided in the SageMaker examples GitHub repository. To download the examples, run the following commands to clone the repository and change to the training/distributed_training/pytorch/model_parallel directory.

Note

Clone and run the example notebooks in the following SageMaker ML IDEs.

git clone https://github.com/aws/amazon-sagemaker-examples.git
cd amazon-sagemaker-examples/training/distributed_training/pytorch/model_parallel
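After cloning, it can help to see which notebooks are actually present before opening one. The following is a minimal sketch, not part of the official instructions: `list_notebooks` is a hypothetical helper name, and it makes no assumption about which notebooks the repository contains. It prints every `.ipynb` file under a directory and skips Jupyter checkpoint copies.

```shell
# Hypothetical helper (not part of the SageMaker examples repository):
# list all Jupyter notebooks under a directory, defaulting to the current one.
list_notebooks() {
  # Skip autosaved copies that Jupyter keeps in .ipynb_checkpoints directories.
  find "${1:-.}" -type f -name '*.ipynb' ! -path '*/.ipynb_checkpoints/*' | sort
}
```

Running `list_notebooks` with no argument from inside the model_parallel directory lists the notebooks in that directory and its subdirectories.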

SMP v1 example notebooks for PyTorch

SMP v1 example notebooks for TensorFlow