Inference using Anthropic Messages API
The Anthropic Messages API provides native access to Claude models using the Anthropic
request and response format. You can use the Messages API on both the
bedrock-mantle and bedrock-runtime endpoints, giving you
flexibility to choose the authentication and integration style that fits your application.
| Endpoint | Base URL | Use case |
|---|---|---|
bedrock-mantle |
https://bedrock-mantle.{region}.api.aws/anthropic/v1/messages |
Recommended for new applications. Supports API key auth and Workspaces (Anthropic-compatible). |
bedrock-runtime |
https://bedrock-runtime.{region}.amazonaws.com |
Use with existing AWS SDK integrations and invocation logging. |
Supported Regions
The Messages API is available on the bedrock-mantle endpoint in all
Regions that support bedrock-mantle (see Supported Regions and Endpoints).
On the bedrock-runtime endpoint, it is available in all Regions where
Claude models are available. See Regional availability for
details.
Prerequisites
Before using the Messages API, ensure you have the following:
-
Model access – Request access to Claude models in the Amazon Bedrock console. See Request access to models.
-
Authentication – Depends on the endpoint:
-
bedrock-mantle– Use a Amazon Bedrock API key (see API keys) or AWS SigV4 credentials. -
bedrock-runtime– Use AWS SigV4 credentials via the AWS SDK (boto3, etc.).
-
-
Anthropic version header – All requests require an API version:
-
bedrock-mantle: Setanthropic-version: 2023-06-01as an HTTP header. -
bedrock-runtime: Set"anthropic_version": "bedrock-2023-05-31"in the request body.
-
Basic request
To send a basic message to a Claude model, choose the tab for your preferred method, and then follow the steps:
Stream responses
To receive response tokens incrementally as they are generated, choose the tab for your preferred method, and then follow the steps:
Supported features
The Messages API supports the following advanced features with Claude models:
-
System prompts – Set model behavior with a
systemparameter. -
Multi-turn conversations – Pass alternating
userandassistantmessages. -
Tool use – Define tools the model can call. See Use a tool to complete an Amazon Bedrock model response.
-
Vision – Send images in the
contentarray alongside text. -
Extended thinking – Enable chain-of-thought reasoning. See Enhance model responses with model reasoning.
-
Prompt caching – Cache frequently used context to reduce latency and cost.
For the full Messages API request and response format, including all parameters and features, see Anthropic Claude Messages API.