Find deduplication key and ID in your output data
You can see the deduplication key and ID in your output data. The
deduplication key is identified by
dataset-objectid-attribute-name
. When you use your own
custom deduplication key, your output contains something similar to the
following:
"dataset-objectid-attribute-name": "
byo-key
", "byo-key
": "UniqueId
",
When you do not specify a key, you can find the deduplication ID that Ground Truth
assigned to your data object as follows. The
$
parameter identifies your deduplication ID. label-attribute-name
-object-id
{ "source-ref":"
s3://bucket/prefix/object1
", "dataset-objectid-attribute-name":"$label-attribute-name
-object-id" "label-attribute-name
" :0, "label-attribute-name
-metadata": {...
}, "$label-attribute-name
-object-id":"<service-generated-key>
" }
For
, if
the data object came through an Amazon S3 configuration, Ground Truth adds a unique
value used by the service and emits a new field keyed by
<service-generated-key>
$
which shows
the Amazon S3 sequencer used. If object was fed to SNS directly, Ground Truth use the
SNS message ID.sequencer