class BuiltinEvaluator
| Language | Type name |
|---|---|
| .NET | Amazon.CDK.AWS.Bedrock.Agentcore.Alpha.BuiltinEvaluator |
| Go | github.com/aws/aws-cdk-go/awsbedrockagentcorealpha/v2#BuiltinEvaluator |
| Java | software.amazon.awscdk.services.bedrock.agentcore.alpha.BuiltinEvaluator |
| Python | aws_cdk.aws_bedrock_agentcore_alpha.BuiltinEvaluator |
| TypeScript (source) | @aws-cdk/aws-bedrock-agentcore-alpha » BuiltinEvaluator |
Built-in evaluators provided by Amazon Bedrock AgentCore.
These evaluators assess different aspects of agent performance at various levels (session, trace, or tool call).
Example
declare const customEvaluator: agentcore.Evaluator;

const evaluation = new agentcore.OnlineEvaluationConfig(this, 'MixedEvaluation', {
  onlineEvaluationConfigName: 'mixed_evaluation',
  evaluators: [
    // Built-in evaluators
    agentcore.EvaluatorReference.builtin(agentcore.BuiltinEvaluator.HELPFULNESS),
    agentcore.EvaluatorReference.builtin(agentcore.BuiltinEvaluator.CORRECTNESS),
    // Custom evaluator
    agentcore.EvaluatorReference.custom(customEvaluator),
  ],
  dataSource: agentcore.DataSourceConfig.fromCloudWatchLogs({
    logGroupNames: ['/aws/bedrock-agentcore/my-agent'],
    serviceNames: ['my-agent.default'],
  }),
});
Initializer
new BuiltinEvaluator(value: string)
Parameters
- value (string): The evaluator identifier string.
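The public initializer suggests the usual enum-like class pattern: predefined static members plus a constructor that wraps any identifier string. The sketch below models that pattern with a hypothetical stand-in class, `EvaluatorIdSketch`, so it runs without the CDK package. It is not the library's source, and the identifier strings are placeholders rather than the real service identifiers.

```typescript
// Hypothetical stand-in illustrating the enum-like class pattern.
// It is NOT the library's implementation of BuiltinEvaluator.
class EvaluatorIdSketch {
  // Placeholder identifier strings (assumptions, not real service values).
  public static readonly HELPFULNESS = new EvaluatorIdSketch('placeholder.helpfulness');
  public static readonly CORRECTNESS = new EvaluatorIdSketch('placeholder.correctness');

  // Mirrors the documented shape: a single string parameter exposed as `value`.
  constructor(public readonly value: string) {}
}

// Predefined members are ordinary instances that expose `value`.
const predefined: EvaluatorIdSketch = EvaluatorIdSketch.HELPFULNESS;

// The constructor can wrap an identifier that has no static member.
const adHoc = new EvaluatorIdSketch('placeholder.some-newer-evaluator');

const values: string[] = [predefined, adHoc].map((e) => e.value);
console.log(values);
```

With the real class, the static members are likely the safer choice in most cases, since an arbitrary string passed to the constructor is presumably not checked until deployment.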
Properties
| Name | Type | Description |
|---|---|---|
| value | string | The string value of the built-in evaluator. |
| static COHERENCE | BuiltinEvaluator | Evaluates whether the response is logically structured and coherent. |
| static CONCISENESS | BuiltinEvaluator | Evaluates whether the response is appropriately brief without missing key information. |
| static CORRECTNESS | BuiltinEvaluator | Evaluates whether the information in the agent's response is factually accurate. |
| static FAITHFULNESS | BuiltinEvaluator | Evaluates whether information in the response is supported by the provided context or sources. |
| static GOAL_SUCCESS_RATE | BuiltinEvaluator | Evaluates whether the conversation successfully meets the user's goals. |
| static HARMFULNESS | BuiltinEvaluator | Evaluates whether the response contains harmful content. |
| static HELPFULNESS | BuiltinEvaluator | Evaluates, from the user's perspective, how useful and valuable the agent's response is. |
| static INSTRUCTION_FOLLOWING | BuiltinEvaluator | Measures how well the agent follows the provided system instructions. |
| static REFUSAL | BuiltinEvaluator | Detects when the agent evades questions or directly refuses to answer. |
| static RESPONSE_RELEVANCE | BuiltinEvaluator | Evaluates whether the response appropriately addresses the user's query. |
| static STEREOTYPING | BuiltinEvaluator | Detects content that makes generalizations about individuals or groups. |
| static TOOL_PARAMETER_ACCURACY | BuiltinEvaluator | Evaluates how accurately the agent extracts parameters from user queries. |
| static TOOL_SELECTION_ACCURACY | BuiltinEvaluator | Evaluates whether the agent selected the appropriate tool for the task. |
value
Type: string
The string value of the built-in evaluator.
static COHERENCE
Type: BuiltinEvaluator
Evaluates whether the response is logically structured and coherent.
static CONCISENESS
Type: BuiltinEvaluator
Evaluates whether the response is appropriately brief without missing key information.
static CORRECTNESS
Type: BuiltinEvaluator
Evaluates whether the information in the agent's response is factually accurate.
static FAITHFULNESS
Type: BuiltinEvaluator
Evaluates whether information in the response is supported by the provided context or sources.
static GOAL_SUCCESS_RATE
Type: BuiltinEvaluator
Evaluates whether the conversation successfully meets the user's goals.
static HARMFULNESS
Type: BuiltinEvaluator
Evaluates whether the response contains harmful content.
static HELPFULNESS
Type: BuiltinEvaluator
Evaluates, from the user's perspective, how useful and valuable the agent's response is.
static INSTRUCTION_FOLLOWING
Type: BuiltinEvaluator
Measures how well the agent follows the provided system instructions.
static REFUSAL
Type: BuiltinEvaluator
Detects when the agent evades questions or directly refuses to answer.
static RESPONSE_RELEVANCE
Type: BuiltinEvaluator
Evaluates whether the response appropriately addresses the user's query.
static STEREOTYPING
Type: BuiltinEvaluator
Detects content that makes generalizations about individuals or groups.
static TOOL_PARAMETER_ACCURACY
Type: BuiltinEvaluator
Evaluates how accurately the agent extracts parameters from user queries.
static TOOL_SELECTION_ACCURACY
Type: BuiltinEvaluator
Evaluates whether the agent selected the appropriate tool for the task.
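Because each evaluator is a named static member, a list of names (for example, read from a configuration file) can be resolved to members with a small lookup that rejects unknown names early. The following is a minimal sketch using a hypothetical stand-in class, `BuiltinSketch`, with placeholder identifier strings. The `resolveBuiltins` helper is an illustration, not part of the library.

```typescript
// Hypothetical stand-in with a subset of the static members listed above.
// Identifier strings are placeholders, not real service values.
class BuiltinSketch {
  public static readonly COHERENCE = new BuiltinSketch('placeholder.coherence');
  public static readonly TOOL_PARAMETER_ACCURACY = new BuiltinSketch('placeholder.tool-parameter-accuracy');
  public static readonly TOOL_SELECTION_ACCURACY = new BuiltinSketch('placeholder.tool-selection-accuracy');

  constructor(public readonly value: string) {}
}

type BuiltinName = 'COHERENCE' | 'TOOL_PARAMETER_ACCURACY' | 'TOOL_SELECTION_ACCURACY';

const KNOWN_NAMES: readonly string[] = [
  'COHERENCE',
  'TOOL_PARAMETER_ACCURACY',
  'TOOL_SELECTION_ACCURACY',
];

// Hypothetical helper: maps names to static members, failing fast on typos.
function resolveBuiltins(names: string[]): BuiltinSketch[] {
  return names.map((name) => {
    const key = name.trim().toUpperCase();
    if (KNOWN_NAMES.indexOf(key) === -1) {
      throw new Error(`Unknown built-in evaluator: ${name}`);
    }
    return BuiltinSketch[key as BuiltinName];
  });
}

const resolved = resolveBuiltins(['coherence', 'TOOL_SELECTION_ACCURACY']);
console.log(resolved.map((e) => e.value));
```

In a real stack, each resolved member would presumably be passed to `EvaluatorReference.builtin(...)` as in the example near the top of this page.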
