Images - Amazon Bedrock

Images

The Amazon Bedrock Data Automation (BDA) feature offers a comprehensive set of standard outputs for image processing to generate insights from your images. You can use these insights to enable a wide range of applications and use cases, such as content discovery, contextual ad placement, and brand safety. Here's an overview of each operation type available as part of standard outputs for images:

Image Summary

Image summary generates a descriptive caption for an image. This feature is enabled within the standard output configuration by default.

IAB Taxonomy

The Interactive Advertising Bureau (IAB) classification applies a standard advertising taxonomy to classify image content. For Preview, BDA will support 24 top-level (L1) categories and 85 second-level (L2) categories. To download the list of IAB categories supported by BDA, click here.

Image Text Detection

This feature detects and extracts text that appears visually in an image and provides bounding box information, indicating the coordinates of each detected text element within the image, and confidence scores. This feature is enabled within the standard output configuration by default.

Content Moderation

Content moderation detects inappropriate, unwanted, or offensive content in an image. For Preview, BDA will support 7 moderation categories: Explicit, Non-Explicit Nudity of Intimate parts and Kissing, Swimwear or Underwear, Violence, Drugs & Tobacco, Alcohol, Hate symbols. Explicit text in images is not flagged.

Bounding boxes and the associated confidence scores can be enabled or disabled for relevant features like text detection to provide location coordinates in the image. By default, image summary and image text detection are enabled.

Image Standard Output

The following is an example of a standard output for an image processed through BDA:

{ "metadata": { "id": "image_123", "semantic_modality": "IMAGE", "s3_bucket": "my-s3-bucket", "s3_prefix": "images/", "image_width_pixels": 1920, "image_height_pixels": 1080, "color_depth": 24, "image_encoding": "JPEG" }, "image": { "summary": "Lively party scene with decorations and supplies", "iab_categories": [ { "id": "iab_12345", "type": "IAB", "category": "Party Supplies", "confidence": 0.9, "parent_name": "Events & Attractions", "taxonomy_level": 2 }, { "id": "iab_67890", "type": "IAB", "category": "Decorations", "confidence": 0.8, "parent_name": "Events & Attractions", "taxonomy_level": 1 } ], "content_moderation": [ { "id": "mod_12345", "type": "MODERATION", "category": "Drugs & Tobacco Paraphernalia & Use", "confidence": 0.7, "parent_name": "Drugs & Tobacco", "taxonomy_level": 2 } ], "text_words": [ { "id": "word_1", "text": "lively", "confidence": 0.9, "line_id": "line_1", "locations": [ { "bounding_box": { "left": 100, "top": 200, "width": 50, "height": 20 }, "polygon": [ {"x": 100, "y": 200}, {"x": 150, "y": 200}, {"x": 150, "y": 220}, {"x": 100, "y": 220} ] } ] }, { "id": "word_2", "text": "party", "confidence": 0.85, "line_id": "line_1", "locations": [ { "bounding_box": { "left": 160, "top": 200, "width": 70, "height": 20 }, "polygon": [ {"x": 160, "y": 200}, {"x": 230, "y": 200}, {"x": 230, "y": 220}, {"x": 160, "y": 220} ] } ] } ], "text_lines": [ { "id": "line_1", "text": "lively party", "confidence": 0.9, "locations": [ { "bounding_box": { "left": 100, "top": 200, "width": 200, "height": 20 }, "polygon": [ {"x": 100, "y": 200}, {"x": 300, "y": 200}, {"x": 300, "y": 220}, {"x": 100, "y": 220} ] } ] } ] }, "statistics": { "entity_count": 7, "object_count": 3, "line_count": 2, "word_count": 9 } }

This output includes:

  • Image metadata

  • Image summarization

  • IAB categorization

  • Content moderation results

  • Detected text with word and line-level information

  • Bounding boxes and polygons for text locations

  • Statistics about the analyzed content