ParsingConfiguration
Settings for parsing document contents. By default, the service converts the contents of each document into text before splitting it into chunks. To improve processing of PDF files with tables and images, you can configure the data source to convert the pages of text into images and use a model to describe the contents of each page.
Contents
- parsingStrategy
-
The parsing strategy for the data source.
Type: String
Valid Values:
BEDROCK_FOUNDATION_MODEL
Required: Yes
- bedrockFoundationModelConfiguration
-
Settings for a foundation model used to parse documents for a data source.
Type: BedrockFoundationModelConfigurationForParsing object
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: