Create a SageMaker HyperPod cluster on training plans using the SageMaker AI console
To create an SageMaker HyperPod cluster using training plans from the SageMaker AI console UI, follow these steps:
-
Navigate to the SageMaker AI console at https://console.aws.amazon.com/sagemaker/
. -
In the left navigation pane, choose Hyperpod, and then Create cluster.
-
When configuring an instance group, you can select a plan that aligns with your compute capacity needs.
Review and create your cluster. Instance groups using a training plan scale up to the
specified target instance count when the training plan becomes Active
, subject to
available capacity. Thirty minutes before each Reserved Capacity period ends, the instance
group begins scaling down to zero instances. This scaled-down state persists until the next
Reserved Capacity period begins or the plan ends. Throughout this process, an healthy
instance group maintains an InService
status after its initial creation,
regardless of the current instance count.