Interface InferenceConfiguration.Builder
- All Superinterfaces:
Buildable,CopyableBuilder<InferenceConfiguration.Builder,,InferenceConfiguration> SdkBuilder<InferenceConfiguration.Builder,,InferenceConfiguration> SdkPojo
- Enclosing class:
InferenceConfiguration
-
Method Summary
Modifier and TypeMethodDescriptionThe maximum number of tokens to allow in the generated response.stopSequences(String... stopSequences) A list of stop sequences.stopSequences(Collection<String> stopSequences) A list of stop sequences.temperature(Float temperature) The likelihood of the model selecting higher-probability options while generating a response.The percentage of most-likely candidates that the model considers for the next token.Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copyMethods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, buildMethods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFieldNameToField, sdkFields
-
Method Details
-
maxTokens
The maximum number of tokens to allow in the generated response. The default value is the maximum allowed value for the model that you are using.
- Parameters:
maxTokens- The maximum number of tokens to allow in the generated response. The default value is the maximum allowed value for the model that you are using.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
temperature
The likelihood of the model selecting higher-probability options while generating a response. A lower value makes the model more likely to choose higher-probability options, while a higher value makes the model more likely to choose lower-probability options.
- Parameters:
temperature- The likelihood of the model selecting higher-probability options while generating a response. A lower value makes the model more likely to choose higher-probability options, while a higher value makes the model more likely to choose lower-probability options.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
topP
The percentage of most-likely candidates that the model considers for the next token. For example, if you choose a value of 0.8 for
topP, the model selects from the top 80% of the probability distribution of tokens that could be next in the sequence.- Parameters:
topP- The percentage of most-likely candidates that the model considers for the next token. For example, if you choose a value of 0.8 fortopP, the model selects from the top 80% of the probability distribution of tokens that could be next in the sequence.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
stopSequences
A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.
- Parameters:
stopSequences- A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
stopSequences
A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.
- Parameters:
stopSequences- A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-