ChunkingConfiguration
Details about how to chunk the documents in the data source. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried.
Contents
- chunkingStrategy
-
Knowledge base can split your source data into chunks. A chunk refers to an excerpt from a data source that is returned when the knowledge base that it belongs to is queried. You have the following options for chunking your data. If you opt for
NONE
, then you may want to pre-process your files by splitting them up such that each file corresponds to a chunk.Type: String
Valid Values:
FIXED_SIZE | NONE | HIERARCHICAL | SEMANTIC
Required: Yes
- fixedSizeChunkingConfiguration
-
Configurations for when you choose fixed-size chunking. If you set the
chunkingStrategy
asNONE
, exclude this field.Type: FixedSizeChunkingConfiguration object
Required: No
- hierarchicalChunkingConfiguration
-
Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.
Type: HierarchicalChunkingConfiguration object
Required: No
- semanticChunkingConfiguration
-
Settings for semantic document chunking for a data source. Semantic chunking splits a document into smaller documents based on groups of similar content derived from the text with natural language processing.
Type: SemanticChunkingConfiguration object
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: