Interface InferenceConfiguration.Builder

  • Method Details

    • maxTokens

      The maximum number of tokens to allow in the generated response. The default value is the maximum allowed value for the model that you are using.

      Parameters:
      maxTokens - The maximum number of tokens to allow in the generated response. The default value is the maximum allowed value for the model that you are using.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • temperature

      InferenceConfiguration.Builder temperature(Float temperature)

      The likelihood of the model selecting higher-probability options while generating a response. A lower value makes the model more likely to choose higher-probability options, while a higher value makes the model more likely to choose lower-probability options.

      Parameters:
      temperature - The likelihood of the model selecting higher-probability options while generating a response. A lower value makes the model more likely to choose higher-probability options, while a higher value makes the model more likely to choose lower-probability options.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • topP

      The percentage of most-likely candidates that the model considers for the next token. For example, if you choose a value of 0.8 for topP, the model selects from the top 80% of the probability distribution of tokens that could be next in the sequence.

      Parameters:
      topP - The percentage of most-likely candidates that the model considers for the next token. For example, if you choose a value of 0.8 for topP, the model selects from the top 80% of the probability distribution of tokens that could be next in the sequence.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • stopSequences

      InferenceConfiguration.Builder stopSequences(Collection<String> stopSequences)

      A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.

      Parameters:
      stopSequences - A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • stopSequences

      InferenceConfiguration.Builder stopSequences(String... stopSequences)

      A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.

      Parameters:
      stopSequences - A list of stop sequences. A stop sequence is a sequence of characters that causes the model to stop generating the response.
      Returns:
      Returns a reference to this object so that method calls can be chained together.