HierarchicalChunkingConfiguration
Settings for hierarchical document chunking for a data source. Hierarchical chunking splits documents into layers of chunks where the first layer contains large chunks, and the second layer contains smaller chunks derived from the first layer.
You configure the number of tokens to overlap, or repeat, across adjacent chunks. For example, if you set overlap tokens to 60, the last 60 tokens in the first chunk are also included at the beginning of the second chunk. For each layer, you must also configure the maximum number of tokens in a chunk.
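The overlap behavior described above can be illustrated with a minimal sketch. It is not the service's actual chunking algorithm; the function name `chunk_with_overlap` and its parameters are hypothetical, and tokens are modeled as a plain Python list.

```python
def chunk_with_overlap(tokens, max_tokens, overlap_tokens):
    """Split a token list into chunks of at most max_tokens, where each chunk
    starts with the last overlap_tokens tokens of the previous chunk.

    Hypothetical helper for illustration only.
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    if not 0 <= overlap_tokens < max_tokens:
        raise ValueError("overlap_tokens must be in [0, max_tokens)")

    step = max_tokens - overlap_tokens  # how far each new chunk advances
    chunks = []
    start = 0
    while start < len(tokens):
        chunks.append(tokens[start:start + max_tokens])
        if start + max_tokens >= len(tokens):
            break  # this chunk already reaches the end of the input
        start += step
    return chunks


# Ten tokens, chunks of 4 with an overlap of 2.
chunks = chunk_with_overlap(list(range(10)), max_tokens=4, overlap_tokens=2)
for chunk in chunks:
    print(chunk)
```

In this sketch, the last two tokens of each chunk reappear as the first two tokens of the next one, which mirrors the 60-token example in the description.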
Contents
- levelConfigurations
Token settings for each layer.
Type: Array of HierarchicalChunkingLevelConfiguration objects
Array Members: Fixed number of 2 items.
Required: Yes
- overlapTokens
The number of tokens to repeat across chunks in the same layer.
Type: Integer
Valid Range: Minimum value of 1.
Required: Yes
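As a rough illustration of how the two fields fit together, the following sketch builds the structure as a Python dictionary and checks the constraints listed above. The helper name `build_hierarchical_config` and the sample values are hypothetical. The `maxTokens` key is assumed to be the field name on HierarchicalChunkingLevelConfiguration, which this section does not show.

```python
def build_hierarchical_config(parent_max_tokens, child_max_tokens, overlap_tokens):
    """Build a dict shaped like HierarchicalChunkingConfiguration.

    Hypothetical helper; `maxTokens` is an assumed field name for the
    per-level settings, which are documented on a separate type.
    """
    if overlap_tokens < 1:
        raise ValueError("overlapTokens has a minimum value of 1")

    config = {
        # Exactly two entries: the first (large) layer, then the second (small) layer.
        "levelConfigurations": [
            {"maxTokens": parent_max_tokens},
            {"maxTokens": child_max_tokens},
        ],
        "overlapTokens": overlap_tokens,
    }
    # levelConfigurations is a fixed-size array of 2 items.
    assert len(config["levelConfigurations"]) == 2
    return config


# Sample values chosen for illustration only.
config = build_hierarchical_config(
    parent_max_tokens=1500, child_max_tokens=300, overlap_tokens=60
)
print(config)
```

Keeping the first layer's limit larger than the second's matches the description of large first-layer chunks and smaller second-layer chunks.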