Managed Spot Training Lifecycle
You can monitor a training job using TrainingJobStatus
and
SecondaryStatus
returned by DescribeTrainingJob.
The list below shows how TrainingJobStatus
and SecondaryStatus
values
change depending on the training scenario:
-
Spot instances acquired with no interruption during training
-
InProgress
:Starting
↠Downloading
↠Training
↠Uploading
-
-
Spot instances interrupted once. Later, enough spot instances were acquired to finish the training job.
-
InProgress
:Starting
↠Downloading
↠Training
↠Interrupted
↠Starting
↠Downloading
↠Training
↠Uploading
-
-
Spot instances interrupted twice and
MaxWaitTimeInSeconds
exceeded.-
InProgress
:Starting
↠Downloading
↠Training
↠Interrupted
↠Starting
↠Downloading
↠Training
↠Interrupted
↠Downloading
↠Training
-
Stopping
:Stopping
-
Stopped
:MaxWaitTimeExceeded
-
-
Spot instances were never launched.
-
InProgress
:Starting
-
Stopping
:Stopping
-
Stopped
:MaxWaitTimeExceeded
-