Videos and video frame labeling


You can use Ground Truth to classify videos and annotate video frames (still images extracted from videos) using one of the three built-in video task types. These task types streamline the process of creating video and video frame labeling jobs using the Amazon SageMaker AI console, API, and language-specific SDKs.

  • Video clip classification – Enable workers to classify videos into categories you specify. For example, you can use this task type to have workers categorize videos into topics like sports, comedy, music, and education. To learn more, see Classify videos.

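  • Video frame labeling jobs – Enable workers to annotate video frames extracted from a video using bounding boxes, polylines, polygons, or keypoint annotation tools. Ground Truth offers two built-in task types to label video frames, object detection and object tracking, and you can use adjustment and verification jobs to revise or review the labels those jobs produce: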

    • Video frame object detection: Enable workers to identify and locate objects in video frames.

    • Video frame object tracking: Enable workers to track the movement of objects across video frames.

    • Video frame adjustment jobs: Have workers adjust labels, label category attributes, and frame attributes from a previous video frame object detection or object tracking labeling job.

    • Video frame verification jobs: Have workers verify labels, label category attributes, and frame attributes from a previous video frame object detection or object tracking labeling job.

    If you have video files, you can use the Ground Truth automatic frame extraction tool to extract video frames from your videos. To learn more, see Video Frame Input Data.
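
As a rough illustration of the API route mentioned above, the following Python sketch assembles the request for the SageMaker `CreateLabelingJob` operation as a plain dictionary. The helper name `build_labeling_job_request` and every argument value in the usage note are hypothetical placeholders, not values from this page. In particular, the UI, pre-annotation, and consolidation ARNs differ by task type and Region, so look them up in the Ground Truth documentation rather than guessing them.

```python
from typing import Any, Dict


def build_labeling_job_request(
    job_name: str,
    label_attribute_name: str,
    manifest_s3_uri: str,
    output_s3_uri: str,
    label_category_config_s3_uri: str,
    role_arn: str,
    workteam_arn: str,
    human_task_ui_arn: str,
    pre_lambda_arn: str,
    consolidation_lambda_arn: str,
) -> Dict[str, Any]:
    """Build keyword arguments for a CreateLabelingJob call.

    Hypothetical helper: it only assembles the request and does not
    contact AWS. All ARNs and S3 URIs are supplied by the caller.
    """
    return {
        "LabelingJobName": job_name,
        "LabelAttributeName": label_attribute_name,
        "InputConfig": {
            "DataSource": {"S3DataSource": {"ManifestS3Uri": manifest_s3_uri}}
        },
        "OutputConfig": {"S3OutputPath": output_s3_uri},
        "RoleArn": role_arn,
        "LabelCategoryConfigS3Uri": label_category_config_s3_uri,
        "HumanTaskConfig": {
            "WorkteamArn": workteam_arn,
            # The built-in video task types use a task-type-specific UI ARN.
            "UiConfig": {"HumanTaskUiArn": human_task_ui_arn},
            "PreHumanTaskLambdaArn": pre_lambda_arn,
            "AnnotationConsolidationConfig": {
                "AnnotationConsolidationLambdaArn": consolidation_lambda_arn
            },
            # Illustrative task settings; adjust them for your job.
            "TaskTitle": "Label objects in video frames",
            "TaskDescription": "Draw a box around each object in every frame",
            "NumberOfHumanWorkersPerDataObject": 1,
            "TaskTimeLimitInSeconds": 3600,
        },
    }
```

With boto3 installed and credentials configured, the resulting dictionary could be passed along as `boto3.client("sagemaker").create_labeling_job(**request)`. Building the request separately from the call makes it easy to inspect or log before any job is started.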

Tip

To learn more about supported file types and input data quotas, see Input data.
