Supported Regions and models for inference profiles - Amazon Bedrock

Supported Regions and models for inference profiles

For a list of Region codes and endpoints supported in Amazon Bedrock, see Amazon Bedrock endpoints and quotas. This topic describes predefined inference profiles that you can use and the Regions and models that support application inference profiles.

Supported cross-region inference profiles

You can carry out cross-region inference with cross region (system-defined) inference profiles. These inference profiles are named after the model that they support and the area whose regions they support. You must call an inference profile from one of the regions it includes.

The following cross region inference profiles are available for use:

Inference profile Inference profile ID Regions included
US Anthropic Claude 3 Sonnet us.anthropic.claude-3-sonnet-20240229-v1:0

us-east-1

us-west-2

US Anthropic Claude 3 Opus us.anthropic.claude-3-opus-20240229-v1:0

us-east-1

us-west-2

US Anthropic Claude 3 Haiku us.anthropic.claude-3-haiku-20240307-v1:0

us-east-1

us-east-21

us-west-2

us-gov-east-1

US Meta Llama 3.2 11B Instruct us.meta.llama3-2-11b-instruct-v1:0

us-east-1

us-east-21

us-west-2

US Meta Llama 3.2 3B Instruct us.meta.llama3-2-3b-instruct-v1:0

us-east-1

us-east-21

us-west-2

US Meta Llama 3.2 90B Instruct us.meta.llama3-2-90b-instruct-v1:0

us-east-1

us-east-21

us-west-2

US Meta Llama 3.2 1B Instruct us.meta.llama3-2-1b-instruct-v1:0

us-east-1

us-east-21

us-west-2

US Anthropic Claude 3.5 Sonnet us.anthropic.claude-3-5-sonnet-20240620-v1:0

us-east-1

us-east-21

us-west-2

us-gov-east-1

US Anthropic Claude 3.5 Sonnet v2 us.anthropic.claude-3-5-sonnet-20241022-v2:0

us-east-1

us-west-2

US Anthropic Claude 3.5 Haiku us.anthropic.claude-3-5-haiku-20241022-v1:0

us-east-1

us-east-21 2

us-west-2

US Meta Llama 3.1 8B Instruct us.meta.llama3-1-8b-instruct-v1:0

us-east-1

us-east-21

us-west-2

US Meta Llama 3.1 70B Instruct us.meta.llama3-1-70b-instruct-v1:0

us-east-1

us-east-21 2

us-west-2

US Nova Lite us.amazon.nova-lite-v1:0

us-east-1

us-east-21

us-west-2

US Nova Pro us.amazon.nova-pro-v1:0

us-east-1

us-east-21

us-west-2

US Nova Micro us.amazon.nova-micro-v1:0

us-east-1

us-east-21

us-west-2

US Meta Llama 3.1 Instruct 405B us.meta.llama3-1-405b-instruct-v1:0

us-east-21 2

EU Anthropic Claude 3 Sonnet eu.anthropic.claude-3-sonnet-20240229-v1:0

eu-central-1

eu-west-1

eu-west-3

EU Anthropic Claude 3.5 Sonnet eu.anthropic.claude-3-5-sonnet-20240620-v1:0

eu-central-1

eu-west-1

eu-west-3

EU Anthropic Claude 3 Haiku eu.anthropic.claude-3-haiku-20240307-v1:0

eu-central-1

eu-west-1

eu-west-3

EU Meta Llama 3.2 3B Instruct eu.meta.llama3-2-3b-instruct-v1:0

eu-central-1

eu-west-1

eu-west-3

EU Meta Llama 3.2 1B Instruct eu.meta.llama3-2-1b-instruct-v1:0

eu-central-1

eu-west-1

eu-west-3

APAC Anthropic Claude 3 Sonnet apac.anthropic.claude-3-sonnet-20240229-v1:0

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

APAC Anthropic Claude 3.5 Sonnet apac.anthropic.claude-3-5-sonnet-20240620-v1:0

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

APAC Anthropic Claude 3 Haiku apac.anthropic.claude-3-haiku-20240307-v1:0

ap-northeast-1

ap-northeast-2

ap-south-1

ap-southeast-1

ap-southeast-2

Note

1 Requests originating from us-east-2 can be routed to us-east-1 or us-west-2. However, requests originating in us-east-1 and us-west-2 won't be routed to us-east-2.

2 In this Region, the specified model can be optimized for latency. For more information, see Optimize model inference for latency.

Supported Regions and models for application inference profiles

Application inference profiles is supported in the following Regions (for more information about Regions supported in Amazon Bedrock see Amazon Bedrock endpoints and quotas):

  • US East (N. Virginia)

  • US East (Ohio)

  • US West (Oregon)

  • Asia Pacific (Tokyo)

  • Asia Pacific (Seoul)

  • Asia Pacific (Mumbai)

  • Asia Pacific (Singapore) (Gated)

  • Asia Pacific (Sydney)

  • Canada (Central)

  • Europe (Frankfurt)

  • Europe (Ireland) (Gated)

  • Europe (London)

  • Europe (Paris)

  • South America (São Paulo)

Application inference profiles is supported for the following foundation models (to see which Regions support each model, refer to Supported foundation models in Amazon Bedrock):

  • Amazon Titan Embeddings G1 - Text

  • Amazon Titan Image Generator G1 v2

  • Amazon Titan Image Generator G1

  • Amazon Titan Text Embeddings V2

  • Anthropic Claude 2.1

  • Anthropic Claude 3 Haiku

  • Anthropic Claude 3 Opus

  • Anthropic Claude 3 Sonnet

  • Anthropic Claude 3.5 Sonnet

  • Meta Llama 3 70B Instruct

  • Meta Llama 3 8B Instruct

  • Meta Llama 3.2 11B Instruct

  • Meta Llama 3.2 1B Instruct

  • Meta Llama 3.2 3B Instruct

  • Meta Llama 3.2 90B Instruct

  • Mistral AI Mistral 7B Instruct

  • Mistral AI Mixtral 8x7B Instruct

  • Stability AI SDXL 1.0