3D point cloud labeling jobs overview
This topic provides an overview of the unique features of a Ground Truth 3D point cloud labeling job. You can use the 3D point cloud labeling jobs to have workers label objects in a 3D point cloud generated from a 3D sensors like LiDAR and depth cameras or generated from 3D reconstruction by stitching images captured by an agent like a drone.
Job pre-processing time
When you create a 3D point cloud labeling job, you need to provide an input manifest file. The input manifest file can be:
-
A frame input manifest file that has a single point cloud frame on each line.
-
A sequence input manifest file that has a single sequence on each line. A sequence is defined as a temporal series of point cloud frames.
For both types of manifest files, job pre-processing time (that is, the time before Ground Truth starts sending tasks to your workers) depends on the total number and size of point cloud frames you provide in your input manifest file. For frame input manifest files, this is the number of lines in your manifest file. For sequence manifest files, this is the number of frames in each sequence multiplied by the total number of sequences, or lines, in your manifest file.
Additionally, the number of points per point cloud and the number of fused sensor data
objects (like images) factor into job pre-processing times. On average, Ground Truth can
pre-process 200 point cloud frames in approximately 5 minutes. If you create a 3D point
cloud labeling job with a large number of point cloud frames, you might experience
longer job pre-processing times. For example, if you create a sequence input manifest
file with 4 point cloud sequences, and each sequence contains 200 point clouds, Ground Truth
pre-processes 800 point clouds and so your job pre-processing time might be around 20
minutes. During this time, your labeling job status is InProgress
.
While your 3D point cloud labeling job is pre-processing, you receive CloudWatch messages
notifying you of the status of your job. To identify these messages, search for
3D_POINT_CLOUD_PROCESSING_STATUS
in your labeling job logs.
For frame input manifest files, your CloudWatch logs will have a message similar to the following:
{ "labeling-job-name": "example-point-cloud-labeling-job", "event-name": "3D_POINT_CLOUD_PROCESSING_STATUS", "event-log-message": "datasetObjectId from: 0 to 10, status: IN_PROGRESS" }
The event log message, datasetObjectId from: 0 to 10, status: IN_PROGRESS
identifies the number of frames from your input manifest that have been processed. You
receive a new message every time a frame has been processed. For example, after a single
frame has processed, you receive another message that says datasetObjectId from: 1
to 10, status: IN_PROGRESS
.
For sequence input manifest files, your CloudWatch logs will have a message similar to the following:
{ "labeling-job-name": "example-point-cloud-labeling-job", "event-name": "3D_POINT_CLOUD_PROCESSING_STATUS", "event-log-message": "datasetObjectId: 0, status: IN_PROGRESS" }
The event log message, datasetObjectId from: 0, status: IN_PROGRESS
identifies the number of sequences from your input manifest that have been processed.
You receive a new message every time a sequence has been processed. For example, after a
single sequence has processed, you receive a message that says datasetObjectId
from: 1, status: IN_PROGRESS
as the next sequence begins processing.
Job completion times
3D point cloud labeling jobs can take workers hours to complete. You can set the total amount of time that workers can work on each task when you create a labeling job. The maximum time you can set for workers to work on tasks is 7 days. The default value is 3 days.
It is strongly recommended that you create tasks that workers can complete within 12 hours. Workers must keep the worker UI open while working on a task. They can save work as they go and Ground Truth will save their work every 15 minutes.
When using the SageMaker AI CreateLabelingJob
API operation, set the total time a
task is available to workers in the TaskTimeLimitInSeconds
parameter of
HumanTaskConfig
.
When you create a labeling job in the console, you can specify this time limit when you select your workforce type and your work team.
Workforces
When you create a 3D point cloud labeling job, you need to specify a work team that will complete your point cloud annotation tasks. You can choose a work team from a private workforce of your own workers, or from a vendor workforce that you select in the AWS Marketplace. You cannot use the Amazon Mechanical Turk workforce for 3D point cloud labeling jobs.
To learn more about vendor workforce, see Subscribe to vendor workforces.
To learn how to create and manage a private workforce, see Private workforce.
Worker user interface (UI)
Ground Truth provides a worker user interface (UI), tools, and assistive labeling features to help workers complete your 3D point cloud labeling tasks.
You can preview the worker UI when you create a labeling job in the console.
When you create a labeling job using the API operation CreateLabelingJob
,
you must provide an ARN provided by Ground Truth in the parameter HumanTaskUiArn
to specify the worker UI for your task
type. You can use HumanTaskUiArn
with the SageMaker AI RenderUiTemplate
API operation to preview the worker UI.
You provide worker instructions, labels, and optionally, label category attributes that are displayed in the worker UI.
Label category attributes
When you create a 3D point cloud object tracking or object detection labeling job, you can add one or more label category attributes. You can add frame attributes to all 3D point cloud task types:
-
Label category attribute – A list of options (strings), a free form text box, or a numeric field associated with one or more labels. It is used by workers to to provide metadata about a label.
-
Frame attribute – A list of options (strings), a free form text box, or a numeric field that appears on each point cloud frame a worker is sent to annotate. It is used by workers to provide metadata about frames.
Additionally, you can use label and frame attributes to have workers verify labels in a 3D point cloud label verification job.
Use the following sections to learn more about these attributes. To learn how to add label category and frame attributes to a labeling job, use the Create Labeling Job section on the task type page of your choice.
Label category attributes
Add label category attributes to labels to give workers the ability to provide more information about the annotations they create. A label category attribute is added to an individual label, or to all labels. When a label category attribute is applied to all labels it is referred to as a global label category attribute.
For example, if you add the label category car, you might also want to capture additional data about your labeled cars, such as if they are occluded or the size of the car. You can capture this metadata using label category attributes. In this example, if you added the attribute occluded to the car label category, you can assign partial, completely, no to the occluded attribute and enable workers to select one of these options.
When you create a label verification job, you add labels category attributes to each label you want workers to verify.
Frame attributes
Add frame attributes to give workers the ability to provide more information about individual point cloud frames. You can specify up to 10 frame attributes, and these attributes will appear on all frames.
For example, you can add a frame attribute that allows workers to enter a number. You may want to use this attribute to have workers identify the number of objects they see in a particular frame.
In another example, you may want to provide a free-form text box to give workers the ability to provide a free form answer to a question.
When you create a label verification job, you can add one or more frame attributes to ask workers to provide feedback on all labels in a point cloud frame.
Worker instructions
You can provide worker instructions to help your workers complete your point cloud labeling tasks. You might want to use these instructions to do the following:
-
Best practices and things to avoid when annotating objects.
-
Explanation of the label category attributes provided (for object detection and object tracking tasks), and how to use them.
-
Advice on how to save time while labeling by using keyboard shortcuts.
You can add your worker instructions using the SageMaker AI console while creating a
labeling job. If you create a labeling job using the API operation
CreateLabelingJob
, you specify worker instructions in your label
category configuration file.
In addition to your instructions, Ground Truth provides a link to help workers navigate and use the worker portal. View these instructions by selecting the task type on Worker instructions.
Declining tasks
Workers are able to decline tasks.
Workers decline a task if the instructions are not clear, input data is not
displaying correctly, or if they encounter some other issue with the task. If the
number of workers per dataset object (NumberOfHumanWorkersPerDataObject
) decline the task,
the data object is marked as expired and will not be sent to additional
workers.