Amazon SageMaker AI model parallelism library v2 examples
This page lists blog posts and Jupyter notebooks that give practical examples of using the SageMaker model parallelism (SMP) library v2 to run distributed training jobs on SageMaker AI.
Blogs and Case Studies
The following blog posts present case studies of using SMP v2.
PyTorch example notebooks
Example notebooks are provided in the SageMaker AI examples GitHub repository, under training/distributed_training/pytorch/model_parallel_v2.
Note
Clone and run the example notebooks in the following SageMaker AI ML IDEs.
- SageMaker JupyterLab (available in Studio created after December 2023)
- SageMaker Code Editor (available in Studio created after December 2023)
- Studio Classic (available as an application in Studio created after December 2023)
git clone https://github.com/aws/amazon-sagemaker-examples.git
cd amazon-sagemaker-examples/training/distributed_training/pytorch/model_parallel_v2
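After cloning, it can help to see which notebooks the directory actually contains. The following is a minimal sketch, not part of the SMP library or the examples repository: the helper name `list_notebooks` is hypothetical, and it assumes only that the notebooks are `.ipynb` files somewhere under the current directory.

```python
from pathlib import Path


def list_notebooks(root):
    """Return sorted paths of all .ipynb files under root.

    Hypothetical helper for illustration only. Autosaved copies that
    Jupyter keeps in .ipynb_checkpoints directories are skipped.
    """
    root = Path(root)
    return sorted(
        path
        for path in root.rglob("*.ipynb")
        if ".ipynb_checkpoints" not in path.parts
    )


if __name__ == "__main__":
    # Assumes the script is run from inside the model_parallel_v2 directory.
    for notebook in list_notebooks("."):
        print(notebook)
```

Run from the model_parallel_v2 directory, this prints one notebook path per line. The file names depend on the current state of the repository.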
SMP v2 example notebooks