Prerequisites Get started API Documentation Streaming response format

Get started

You can use the harness through the AgentCore CLI or directly with AWS SDKs such as boto3. The CLI is the fastest path for most developers; SDKs are for programmatic use from your own application.

Prerequisites

AWS credentials configured in one of the preview regions: US East (N. Virginia), US West (Oregon), Europe (Frankfurt), or Asia Pacific (Sydney).
For the CLI: Node.js 20+, AgentCore CLI installed with the preview channel:
```
npm install -g @aws/agentcore@preview
```
For the SDK/boto3: Python 3.10+, boto3 installed, and an IAM execution role the harness can assume. See the execution role policy for minimum permissions.

Get started

Example

AgentCore CLI

Create a harness project non-interactively with flags:


agentcore create --name myresearchagent --model-provider bedrock

Deploy and invoke:


agentcore deploy
agentcore invoke --harness myresearchagent \
  --session-id "$(uuidgen)" \
  "Research three tropical vacation options under $3k, within five hours of NYC."

The response streams to your terminal. Reuse the same --session-id across invocations to continue the conversation in the same environment.

To add more harnesses to an existing project, use agentcore add harness. To generate a standalone Python invoke script, add --with-invoke-script.

Useful flags:

--no-browser (-b) - use the terminal TUI instead of the browser inspector
--logs (-l) - run in non-interactive mode with logs to stdout
--port <port> (-p) - set the dev server port (default 8080)
--no-traces - disable local OTEL trace collection

Interactive

Run agentcore create without flags to launch the interactive wizard:


agentcore create

Enter your project name:
Select Harness as the project type:
Choose your model provider:
Choose your environment (default, container URI, or Dockerfile):
Configure memory:
Optionally configure advanced settings (tools, auth, network, lifecycle, limits, truncation, session storage):
Review your configuration and confirm:

After confirmation, the wizard scaffolds your project. Deploy with agentcore deploy, then invoke with agentcore invoke.

Check project status at any time with agentcore status:

Local development

If you want to test the harness in your local environment, you can run the dev server:


agentcore dev

When you run agentcore dev, the CLI first deploys your harness resources to AWS, creating the IAM role, harness, and any memory or credentials configured in your project:

Deploy progress: CloudFormation resource creation

Once deployment completes, it starts a local server and opens the agent inspector in your browser where you can chat with the harness, inspect traces, and browse project resources:

Expand Harness Settings to view and override the harness configuration for the current session:

Agent inspector: harness settings and configuration

AWS CLI/boto3

Create the harness with a name and execution role:


aws bedrock-agentcore-control create-harness \
  --harness-name "MyHarness" \
  --execution-role-arn "arn:aws:iam::123456789012:role/MyHarnessRole"

Poll get-harness until "status": "READY". Note the arn in the response.


aws bedrock-agentcore-control get-harness \
  --harness-id "MyHarness-XyZ123"

Invoke from Python. If you don’t specify a model, the harness defaults to Anthropic Claude Sonnet 4.6 on Amazon Bedrock:


import boto3

client = boto3.client("bedrock-agentcore", region_name="us-west-2")

response = client.invoke_harness(
    harnessArn="arn:aws:bedrock-agentcore:us-west-2:123456789012:harness/MyHarness-XyZ123",  # Replace with your harness ARN
    runtimeSessionId="1234abcd-12ab-34cd-56ef-1234567890ab",
    messages=[{
        "role": "user",
        "content": [{"text": "Research three tropical vacation options under $3k."}]
    }],
)

for event in response["stream"]:
    if "contentBlockDelta" in event:
        delta = event["contentBlockDelta"].get("delta", {})
        if "text" in delta:
            print(delta["text"], end="", flush=True)
    elif "runtimeClientError" in event:
        print(f"\nError: {event['runtimeClientError']['message']}")

That’s all you need to get a harness running. The following sections cover everything you can configure.

Note

The runtimeSessionId must be at least 33 characters. Use a UUID or similar identifier. Reuse the same session ID across invocations to continue a conversation in the same environment.

API Documentation

For additional details, see the API Reference:

Streaming response format

InvokeHarness returns a stream of events. The key event types are:

messageStart - beginning of a new message (includes role)
contentBlockStart - beginning of a content block (text, toolUse, or toolResult)
contentBlockDelta - incremental content (text, toolUse input, reasoningContent)
contentBlockStop - end of a content block
messageStop - end of the message (includes stopReason)
metadata - token usage and latency metrics
runtimeClientError - error during execution

The stopReason in messageStop indicates why the agent stopped:

end_turn - the agent finished normally
tool_use - the agent is calling an inline function and waiting for a client-side result
max_tokens - the model’s per-turn token limit was reached
max_iterations_exceeded - the maxIterations limit was hit
timeout_exceeded - the timeoutSeconds limit was hit
max_output_tokens_exceeded - the maxTokens budget was exhausted

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

AgentCore harness

Configure agents and models