AWS::Bedrock::DataSource CustomTransformationConfiguration

Settings for customizing steps in the data source content ingestion pipeline.

You can configure the data source to process documents with a Lambda function after they are parsed and converted into chunks. When you add a post-chunking transformation, the service stores chunked documents in an S3 bucket and invokes a Lambda function to process them.

To process chunked documents with a Lambda function, define an S3 bucket path for input and output objects, and a transformation that specifies the Lambda function to invoke. You can use the Lambda function to customize how chunks are split, and the metadata for each chunk.

Syntax

To declare this entity in your CloudFormation template, use the following syntax:

JSON


{
  "IntermediateStorage" : IntermediateStorage,
  "Transformations" : [ Transformation, ... ]
}

YAML


  IntermediateStorage: 
    IntermediateStorage
  Transformations: 
    - Transformation

Properties

IntermediateStorage

An S3 bucket path for input and output objects.

Required: Yes

Type: IntermediateStorage

Update requires: No interruption

Transformations

A Lambda function that processes documents.

Required: Yes

Type: Array of Transformation

Minimum: 1

Maximum: 1

Update requires: No interruption

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

CrawlFilterConfiguration

DataSourceConfiguration