For a list of Region codes and endpoints supported in Amazon Bedrock, see Amazon Bedrock endpoints and quotas. This topic describes predefined inference profiles that you can use and the Regions and models that support application inference profiles.
Topics
Supported cross-region inference profiles
You can carry out cross-region inference with cross-region (system-defined) inference profiles. Cross-region inference allows you to seamlessly manage unplanned traffic bursts by utilizing compute across different AWS Regions. With cross-region inference, you can distribute traffic across multiple AWS Regions.
Cross-region (system-defined) inference profiles are named after the model that they support and defined by the Regions that they support. To understand how a cross-region inference profile handles your requests, review the following definitions:
-
Source Region – The Region from which you make the API request that specifies the inference profile.
-
Destination Region – A Region to which the Amazon Bedrock service can route the request from your source Region.
You invoke a cross-region inference profile from a source Region and the Amazon Bedrock service routes your request to any of the destination Regions defined in the inference profile.
Note
Some inference profiles route to different destination Regions depending on the source Region from which you call it. For example, if you call us.anthropic.claude-3-haiku-20240307-v1:0
from US East (Ohio), it can route requests to us-east-1
, us-east-2
, or us-west-2
, but if you call it from US West (Oregon), it can route requests to only us-east-1
and us-west-2
.
To check the source and destination Regions for an inference profile, you can do one of the following:
-
Expand the corresponding section in the list of supported cross-region inference profiles.
-
Send a GetInferenceProfile request with an Amazon Bedrock control plane endpoint from a source Region and specify the Amazon Resource Name (ARN) or ID of the inference profile in the
inferenceProfileIdentifier
field. Themodels
field in the response maps to a list of model ARNs, in which you can identify each destination Region.
Note
Inference profiles are immutable, meaning that we don't add new Regions to an existing inference profile. However, we might create new inference profiles that incorporate new Regions. You can update your systems to use these inference profiles by changing the IDs in your setup to the new ones.
Expand one of the following sections to see information about a cross-region inference profile, the source Regions from which it can be called, and the destination Regions to which it can route requests.
To call the US Nova Lite inference profile, specify the following inference profile ID in one of the source Regions:
us.amazon.nova-lite-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Nova Micro inference profile, specify the following inference profile ID in one of the source Regions:
us.amazon.nova-micro-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Nova Pro inference profile, specify the following inference profile ID in one of the source Regions:
us.amazon.nova-pro-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Anthropic Claude 3.5 Haiku inference profile, specify the following inference profile ID in one of the source Regions:
us.anthropic.claude-3-5-haiku-20241022-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-21 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Anthropic Claude 3.5 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:
us.anthropic.claude-3-5-sonnet-20240620-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-2 |
us-east-1 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Anthropic Claude 3.5 Sonnet v2 inference profile, specify the following inference profile ID in one of the source Regions:
us.anthropic.claude-3-5-sonnet-20241022-v2:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Anthropic Claude 3.7 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:
us.anthropic.claude-3-7-sonnet-20250219-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Anthropic Claude 3 Haiku inference profile, specify the following inference profile ID in one of the source Regions:
us.anthropic.claude-3-haiku-20240307-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Anthropic Claude 3 Opus inference profile, specify the following inference profile ID in one of the source Regions:
us.anthropic.claude-3-opus-20240229-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Anthropic Claude 3 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:
us.anthropic.claude-3-sonnet-20240229-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Meta Llama 3.1 Instruct 405B inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-1-405b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-east-21 |
us-east-1 us-east-2 us-west-2 |
To call the US Meta Llama 3.1 70B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-1-70b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-21 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Meta Llama 3.1 8B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-1-8b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the US Meta Llama 3.2 11B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-2-11b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Meta Llama 3.2 1B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-2-1b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Meta Llama 3.2 3B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-2-3b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Meta Llama 3.2 90B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-2-90b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-west-2 |
To call the US Meta Llama 3.3 70B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
us.meta.llama3-3-70b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
us-west-2 |
us-east-1 us-east-2 us-west-2 |
us-east-2 |
us-east-1 us-east-2 us-west-2 |
us-east-1 |
us-east-1 us-east-2 us-west-2 |
To call the EU Nova Lite inference profile, specify the following inference profile ID in one of the source Regions:
eu.amazon.nova-lite-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-north-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
To call the EU Nova Micro inference profile, specify the following inference profile ID in one of the source Regions:
eu.amazon.nova-micro-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-north-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
To call the EU Nova Pro inference profile, specify the following inference profile ID in one of the source Regions:
eu.amazon.nova-pro-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-north-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-north-1 eu-west-1 eu-west-3 |
To call the EU Anthropic Claude 3.5 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:
eu.anthropic.claude-3-5-sonnet-20240620-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-west-1 eu-west-3 |
To call the EU Anthropic Claude 3 Haiku inference profile, specify the following inference profile ID in one of the source Regions:
eu.anthropic.claude-3-haiku-20240307-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-west-1 eu-west-3 |
To call the EU Anthropic Claude 3 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:
eu.anthropic.claude-3-sonnet-20240229-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-west-1 eu-west-3 |
To call the EU Meta Llama 3.2 1B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
eu.meta.llama3-2-1b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-west-1 eu-west-3 |
To call the EU Meta Llama 3.2 3B Instruct inference profile, specify the following inference profile ID in one of the source Regions:
eu.meta.llama3-2-3b-instruct-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
eu-west-3 |
eu-central-1 eu-west-1 eu-west-3 |
eu-west-1 |
eu-central-1 eu-west-1 eu-west-3 |
eu-central-1 |
eu-central-1 eu-west-1 eu-west-3 |
To call the APAC Nova Lite inference profile, specify the following inference profile ID in one of the source Regions:
apac.amazon.nova-lite-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
ap-southeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-southeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-south-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
To call the APAC Nova Micro inference profile, specify the following inference profile ID in one of the source Regions:
apac.amazon.nova-micro-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
ap-southeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-southeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-south-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
To call the APAC Nova Pro inference profile, specify the following inference profile ID in one of the source Regions:
apac.amazon.nova-pro-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
ap-southeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-southeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-south-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
To call the APAC Anthropic Claude 3.5 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:
apac.anthropic.claude-3-5-sonnet-20240620-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
ap-southeast-2 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-southeast-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-south-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-2 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
To call the APAC Anthropic Claude 3.5 Sonnet v2 inference profile, specify the following inference profile ID in one of the source Regions:
apac.anthropic.claude-3-5-sonnet-20241022-v2:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
ap-southeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-southeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-south-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-south-2 ap-southeast-1 ap-southeast-2 |
ap-south-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-3 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-2 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-1 |
ap-northeast-1 ap-northeast-2 ap-northeast-3 ap-south-1 ap-southeast-1 ap-southeast-2 |
To call the APAC Anthropic Claude 3 Haiku inference profile, specify the following inference profile ID in one of the source Regions:
apac.anthropic.claude-3-haiku-20240307-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
ap-southeast-2 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-southeast-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-south-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-2 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
To call the APAC Anthropic Claude 3 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:
apac.anthropic.claude-3-sonnet-20240229-v1:0
The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:
Source Region | Destination Regions |
---|---|
ap-southeast-2 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-southeast-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-south-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-2 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
ap-northeast-1 |
ap-northeast-1 ap-northeast-2 ap-south-1 ap-southeast-1 ap-southeast-2 |
Note
1 In this Region, the specified model can be optimized for latency. For more information, see Optimize model inference for latency.
Supported Regions and models for application inference profiles
Application inference profiles is supported in the following Regions (for more information about Regions supported in Amazon Bedrock see Amazon Bedrock endpoints and quotas):
-
US East (N. Virginia)
-
US East (Ohio)
-
US West (Oregon)
-
Asia Pacific (Tokyo)
-
Asia Pacific (Seoul)
-
Asia Pacific (Mumbai)
-
Asia Pacific (Singapore)
-
Asia Pacific (Sydney)
-
Canada (Central)
-
Europe (Frankfurt)
-
Europe (Ireland)
-
Europe (London)
-
Europe (Paris)
-
South America (São Paulo)
Application inference profiles is supported for the following foundation models (to see which Regions support each model, refer to Supported foundation models in Amazon Bedrock):
-
Amazon Titan Embeddings G1 - Text
-
Amazon Titan Image Generator G1 v2
-
Amazon Titan Image Generator G1
-
Amazon Titan Text Embeddings V2
-
Anthropic Anthropic Claude 2.1
-
Anthropic Claude 3 Haiku
-
Anthropic Claude 3 Opus
-
Anthropic Claude 3 Sonnet
-
Anthropic Claude 3.5 Sonnet
-
Anthropic Claude 3.7 Sonnet
-
Meta Llama 3 70B Instruct
-
Meta Llama 3 8B Instruct
-
Meta Llama 3.2 11B Instruct
-
Meta Llama 3.2 1B Instruct
-
Meta Llama 3.2 3B Instruct
-
Meta Llama 3.2 90B Instruct
-
Mistral AI Mistral 7B Instruct
-
Mistral AI Mixtral 8x7B Instruct
-
Stability AI SDXL 1.0