Supported Regions and models for inference profiles - Amazon Bedrock

Supported Regions and models for inference profiles

For a list of Region codes and endpoints supported in Amazon Bedrock, see Amazon Bedrock endpoints and quotas. This topic describes predefined inference profiles that you can use and the Regions and models that support application inference profiles.

Supported cross-region inference profiles

You can carry out cross-region inference with cross-region (system-defined) inference profiles. Cross-region inference allows you to seamlessly manage unplanned traffic bursts by utilizing compute across different AWS Regions. With cross-region inference, you can distribute traffic across multiple AWS Regions.

Cross-region (system-defined) inference profiles are named after the model that they support and defined by the Regions that they support. To understand how a cross-region inference profile handles your requests, review the following definitions:

  • Source Region – The Region from which you make the API request that specifies the inference profile.

  • Destination Region – A Region to which the Amazon Bedrock service can route the request from your source Region.

You invoke a cross-region inference profile from a source Region and the Amazon Bedrock service routes your request to any of the destination Regions defined in the inference profile.

Note

Some inference profiles route to different destination Regions depending on the source Region from which you call it. For example, if you call us.anthropic.claude-3-haiku-20240307-v1:0 from US East (Ohio), it can route requests to us-east-1, us-east-2, or us-west-2, but if you call it from US West (Oregon), it can route requests to only us-east-1 and us-west-2.

To check the source and destination Regions for an inference profile, you can do one of the following:

Note

Inference profiles are immutable, meaning that we don't add new Regions to an existing inference profile. However, we might create new inference profiles that incorporate new Regions. You can update your systems to use these inference profiles by changing the IDs in your setup to the new ones.

Expand one of the following sections to see information about a cross-region inference profile, the source Regions from which it can be called, and the destination Regions to which it can route requests.

To call the US Nova Lite inference profile, specify the following inference profile ID in one of the source Regions:

us.amazon.nova-lite-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Nova Micro inference profile, specify the following inference profile ID in one of the source Regions:

us.amazon.nova-micro-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Nova Pro inference profile, specify the following inference profile ID in one of the source Regions:

us.amazon.nova-pro-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Anthropic Claude 3.5 Haiku inference profile, specify the following inference profile ID in one of the source Regions:

us.anthropic.claude-3-5-haiku-20241022-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Anthropic Claude 3.5 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

us.anthropic.claude-3-5-sonnet-20240620-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-2

us-east-1

us-west-2

us-east-1

us-east-1

us-west-2

To call the US Anthropic Claude 3.5 Sonnet v2 inference profile, specify the following inference profile ID in one of the source Regions:

us.anthropic.claude-3-5-sonnet-20241022-v2:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Anthropic Claude 3.7 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

us.anthropic.claude-3-7-sonnet-20250219-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Anthropic Claude 3 Haiku inference profile, specify the following inference profile ID in one of the source Regions:

us.anthropic.claude-3-haiku-20240307-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-west-2

To call the US Anthropic Claude 3 Opus inference profile, specify the following inference profile ID in one of the source Regions:

us.anthropic.claude-3-opus-20240229-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-1

us-east-1

us-west-2

To call the US Anthropic Claude 3 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

us.anthropic.claude-3-sonnet-20240229-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-1

us-east-1

us-west-2

To call the US DeepSeek-R1 inference profile, specify the following inference profile ID in one of the source Regions:

us.deepseek.r1-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Meta Llama 3.1 Instruct 405B inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-1-405b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-east-2

us-east-1

us-east-2

us-west-2

To call the US Meta Llama 3.1 70B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-1-70b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Meta Llama 3.1 8B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-1-8b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US Meta Llama 3.2 11B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-2-11b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-west-2

To call the US Meta Llama 3.2 1B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-2-1b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-west-2

To call the US Meta Llama 3.2 3B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-2-3b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-west-2

To call the US Meta Llama 3.2 90B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-2-90b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-west-2

To call the US Meta Llama 3.3 70B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

us.meta.llama3-3-70b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-west-2

us-east-1

us-east-2

us-west-2

us-east-2

us-east-1

us-east-2

us-west-2

us-east-1

us-east-1

us-east-2

us-west-2

To call the US-GOV Claude 3.5 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

us-gov.anthropic.claude-3-5-sonnet-20240620-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-gov-east-1

us-gov-east-1

us-gov-west-1

To call the US-GOV Claude 3 Haiku inference profile, specify the following inference profile ID in one of the source Regions:

us-gov.anthropic.claude-3-haiku-20240307-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
us-gov-east-1

us-gov-east-1

us-gov-west-1

To call the EU Nova Lite inference profile, specify the following inference profile ID in one of the source Regions:

eu.amazon.nova-lite-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-north-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

To call the EU Nova Micro inference profile, specify the following inference profile ID in one of the source Regions:

eu.amazon.nova-micro-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-north-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

To call the EU Nova Pro inference profile, specify the following inference profile ID in one of the source Regions:

eu.amazon.nova-pro-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-north-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-north-1

eu-west-1

eu-west-3

To call the EU Anthropic Claude 3.5 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

eu.anthropic.claude-3-5-sonnet-20240620-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-west-1

eu-west-3

To call the EU Anthropic Claude 3 Haiku inference profile, specify the following inference profile ID in one of the source Regions:

eu.anthropic.claude-3-haiku-20240307-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-west-1

eu-west-3

To call the EU Anthropic Claude 3 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

eu.anthropic.claude-3-sonnet-20240229-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-west-1

eu-west-3

To call the EU Meta Llama 3.2 1B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

eu.meta.llama3-2-1b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-west-1

eu-west-3

To call the EU Meta Llama 3.2 3B Instruct inference profile, specify the following inference profile ID in one of the source Regions:

eu.meta.llama3-2-3b-instruct-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
eu-west-3

eu-central-1

eu-west-1

eu-west-3

eu-west-1

eu-central-1

eu-west-1

eu-west-3

eu-central-1

eu-central-1

eu-west-1

eu-west-3

To call the APAC Nova Lite inference profile, specify the following inference profile ID in one of the source Regions:

apac.amazon.nova-lite-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
ap-southeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-southeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-south-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

To call the APAC Nova Micro inference profile, specify the following inference profile ID in one of the source Regions:

apac.amazon.nova-micro-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
ap-southeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-southeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-south-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

To call the APAC Nova Pro inference profile, specify the following inference profile ID in one of the source Regions:

apac.amazon.nova-pro-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
ap-southeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-southeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-south-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

To call the APAC Anthropic Claude 3.5 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

apac.anthropic.claude-3-5-sonnet-20240620-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
ap-southeast-2

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-southeast-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-south-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-2

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

To call the APAC Anthropic Claude 3.5 Sonnet v2 inference profile, specify the following inference profile ID in one of the source Regions:

apac.anthropic.claude-3-5-sonnet-20241022-v2:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
ap-southeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-southeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-south-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-south-2

ap-southeast-1

ap-southeast-2

ap-south-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-3

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-2

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-1

ap-northeast-1

ap-northeast-2

ap-northeast-3

ap-south-1

ap-southeast-1

ap-southeast-2

To call the APAC Anthropic Claude 3 Haiku inference profile, specify the following inference profile ID in one of the source Regions:

apac.anthropic.claude-3-haiku-20240307-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
ap-southeast-2

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-southeast-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-south-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-2

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

To call the APAC Anthropic Claude 3 Sonnet inference profile, specify the following inference profile ID in one of the source Regions:

apac.anthropic.claude-3-sonnet-20240229-v1:0

The following table shows the source Regions from which you can call the inference profile and the destination Regions to which the requests can be routed:

Source Region Destination Regions
ap-southeast-2

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-southeast-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-south-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-2

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

ap-northeast-1

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

Note

1 In this Region, the specified model can be optimized for latency. For more information, see Optimize model inference for latency.

Supported Regions and models for application inference profiles

Application inference profiles is supported in the following Regions (for more information about Regions supported in Amazon Bedrock see Amazon Bedrock endpoints and quotas):

  • US East (N. Virginia)

  • US East (Ohio)

  • US West (Oregon)

  • AWS GovCloud (US-East)

  • Asia Pacific (Tokyo)

  • Asia Pacific (Seoul)

  • Asia Pacific (Mumbai)

  • Asia Pacific (Singapore)

  • Asia Pacific (Sydney)

  • Canada (Central)

  • Europe (Frankfurt)

  • Europe (Ireland)

  • Europe (London)

  • Europe (Paris)

  • South America (São Paulo)

Application inference profiles can be created from all models and inference profiles supported in Amazon Bedrock. For more information about models supported in Amazon Bedrock, see Supported foundation models in Amazon Bedrock.