Connect to HyperPod clusters and submit tasks to clusters
You can launch machine learning workloads on HyperPod clusters within Amazon SageMaker Studio IDEs. When you launch Studio IDEs on a HyperPod cluster, a set of commands are available to help you get started. You can work on your training scripts, use Docker containers for the training scripts, and submit jobs to the cluster, all from within the Studio IDEs. The following section provides information on how to connect your cluster to Studio IDEs.
In Amazon SageMaker Studio you can navigate to one of your clusters in HyperPod clusters (under Compute) and view your list of clusters. You can connect your cluster to an IDE listed under Actions.
You can also choose your custom file system from the list of options. For information on how to get this set up, see Set up HyperPod in Studio.
Alternatively, you can create a space and launch an IDE using the AWS CLI. Use the following
commands to do so. The following example creates a Private
JupyterLab
space for
with
the user-profile-name
FSx for Lustre file system attached.fs-id
-
Create a space using the
create-space
AWS CLI. aws sagemaker create-space \ --region
your-region
\ --ownership-settings "OwnerUserProfileName=user-profile-name
" \ --space-sharing-settings "SharingType=Private" \ --space-settings "AppType=JupyterLab,CustomFileSystems=[{FSxLustreFileSystem={FileSystemId=fs-id
}}]" -
Create the app using the
create-app
AWS CLI. aws sagemaker create-app \ --region
your-region
\ --space-namespace-name
\ --resource-spec '{"ec2InstanceType":"'"instance-type
"'","appEnvironmentArn":"'"image-arn
"'"}'
Once you have your applications open, you can submit tasks directly to the clusters you are connected to.