可用的基础模型 - Amazon SageMaker

可用的基础模型

Amazon SageMaker JumpStart 内置提供先进的公开可用和专有的基础模型,可以自定义并集成到您的生成式人工智能工作流中。

公开可用的基础模型

Amazon SageMaker JumpStart 载入并维护来自第三方来源的开源基础模型。要开始使用其中一种公开可用的模型,请查看JumpStart 基础模型的使用或浏览可用的示例笔记本之一。在公开可用模型的给定示例笔记本中,尝试切换模型 ID,以实验同一模型系列中的不同模型。

有关模型 ID 和使用 SageMaker Python SDK 部署公开可用 JumpStart 基础模型的资源的更多信息,请参阅 通过 SageMaker Python SDK 使用基础模型

按照定义,基础模型可以根据多种下游任务进行调整。基础模型在大量的通用领域数据上进行训练,可以针对多种使用案例实施相同的模型或进行自定义。选择基础模型时,首先要定义一项特定任务,例如文本生成或映像生成。

公开可用的时间序列预测模型

时间序列预测模型旨在分析和预测一段时间内的序列数据。这些模型可应用于金融、天气预报或能源需求预测等多个领域。Chronos 模型专为时间序列预测任务定制,可根据历史数据规律进行准确预测。

模型名称 模型 ID 模型来源 可微调

Chronos T5 Small

autogluon-forecasting-chronos-t5-small

Amazon

Chronos T5 Base

autogluon-forecasting-chronos-t5-base

Amazon

Chronos T5 Large

autogluon-forecasting-chronos-t5-large

Amazon

公开可用的文本生成模型

文本生成基础模型可用于多种下游任务,包括文本摘要、文本分类、问题回答、长篇内容生成、简短文案写作、信息提取等。

模型名称 模型 ID 模型来源 可微调

Alexa TM 20B

pytorch-textgeneration1-alexa20b

Amazon

Bloom 1b1

huggingface-textgeneration-bloom-1b1

Hugging Face

Bloom 1b7

huggingface-textgeneration-bloom-1b7

Hugging Face

Bloom 3B

huggingface-textgeneration1-bloom-3b

Hugging Face

Bloom 560m

huggingface-textgeneration-bloom-560m

Hugging Face

Bloom 7B1

huggingface-textgeneration1-bloom-7b1

Hugging Face

Bloomz 1b1

huggingface-textgeneration-bloomz-1b1

Hugging Face

Bloomz 1b7

huggingface-textgeneration-bloomz-1b7

Hugging Face

BloomZ 3B FP16

huggingface-textgeneration1-bloom-3b-fp16

Hugging Face

Bloomz 560m

huggingface-textgeneration-bloomz-560m

Hugging Face

BloomZ 7B1 FP16

huggingface-textgeneration1-bloomz-7b1-fp16

Hugging Face

Code Llama 13B

meta-textgeneration-llama-codellama-13b

Meta

Code Llama 13B Instruct

meta-textgeneration-llama-codellama-13b-instruct

Meta

Code Llama 13B Python

meta-textgeneration-llama-codellama-13b-python

Meta

Code Llama 34B

meta-textgeneration-llama-codellama-34b

Meta

Code Llama 34B Instruct

meta-textgeneration-llama-codellama-34b-instruct

Meta

Code Llama 34B Python

meta-textgeneration-llama-codellama-34b-python

Meta

Code Llama 70B

meta-textgeneration-llama-codellama-70b

Meta

Code Llama 70B Instruct

meta-textgeneration-llama-codellama-70b-instruct

Meta

Code Llama 70B Python

meta-textgeneration-llama-codellama-70b-python

Meta

Code Llama 7B

meta-textgeneration-llama-codellama-7b

Meta

Code Llama 7B Instruct

meta-textgeneration-llama-codellama-7b-instruct

Meta

Code Llama 7B Python

meta-textgeneration-llama-codellama-7b-python

Meta

CyberAgentLM2-7B-Chat (CALM2-7B-Chat)

huggingface-llm-calm2-7b-chat-bf16

Hugging Face

DistilGPT2

huggingface-textgeneration-distilgpt2

Hugging Face

Dolly V2 12b BF16

huggingface-textgeneration-dolly-v2-12b-bf16

Hugging Face

Dolly V2 3b BF16

huggingface-textgeneration-dolly-v2-3b-bf16

Hugging Face

Dolly V2 7b BF16

huggingface-textgeneration-dolly-v2-7b-bf16

Hugging Face

Dolphin 2.2.1 Mistral 7B

huggingface-llm-dolphin-2-2-1-mistral-7b

Hugging Face

Dolphin 2.5 Mixtral 8 7B

huggingface-llm-dolphin-2-5-mixtral-8x7b

Hugging Face

Dolphin 2.7 Mixtral 8 7B

huggingface-llm-dolphin-2-7-mixtral-8x7b

Hugging Face

EleutherAI GPT Neo 2.7B

huggingface-llm-eleutherai-gpt-neo-1-3b

Hugging Face

EleutherAI GPT Neo 2.7B

huggingface-llm-eleutherai-gpt-neo-2-7b

Hugging Face

Falcon 180B BF16

huggingface-llm-falcon-180b-bf16

Hugging Face

Falcon 180B Chat BF16

huggingface-llm-falcon-180b-chat-bf16

Hugging Face

Falcon 40B BF16

huggingface-llm-falcon-40b-bf16

Hugging Face

Falcon 40B Instruct BF16

huggingface-llm-falcon-40b-instruct-bf16

Hugging Face

Falcon 7B BF16

huggingface-llm-falcon-7b-bf16

Hugging Face

Falcon 7B Instruct BF16

huggingface-llm-falcon-7b-instruct-bf16

Hugging Face

Falcon Lite

huggingface-llm-amazon-falconlite

Hugging Face

Falcon Lite 2

huggingface-llm-amazon-falconlite2

Hugging Face

Falcon RW 1B

huggingface-llm-tiiuae-falcon-rw-1b

Hugging Face

Flan-T5 Base

huggingface-text2text-flan-t5-base

Hugging Face

在 Samsum 数据集上微调 Flan-T5 基本模型

huggingface-text2text-flan-t5-base-samsum

Hugging Face

Flan-T5 Large

huggingface-text2text-flan-t5-large

Hugging Face

Flan-T5 Small

huggingface-text2text-flan-t5-small

Hugging Face

Flan-T5 XL

huggingface-text2text-flan-t5-xl

Hugging Face

Flan-T5 XXL

huggingface-text2text-flan-t5-xxl

Hugging Face

Flan-UL2 BF16

huggingface-text2text-flan-ul2-bf16

Hugging Face

Gemma 2B

huggingface-llm-gemma-2b

Hugging Face

Gemma 2B Instruct

huggingface-llm-gemma-2b-instruct

Hugging Face

Gemma 7B

huggingface-llm-gemma-7b

Hugging Face

Gemma 7B Instruct

huggingface-llm-gemma-7b-instruct

Hugging Face

GPT 2

huggingface-textgeneration-gpt2

Hugging Face

GPT NeoX 20B FP16

huggingface-textgeneration2-gpt-neox-20b-fp16

Hugging Face

GPT NeoXT Chat Base 20B FP16

huggingface-textgeneration2-gpt-neoxt-chat-base-20b-fp16

Hugging Face

GPT-2 XL

huggingface-textgeneration1-gpt-2-xl

Hugging Face

GPT-J 6B

huggingface-textgeneration1-gpt-j-6b

Hugging Face

GPT-Neo 1.3B

huggingface-textgeneration1-gpt-neo-1-3b

Hugging Face

GPT-Neo 125M

huggingface-textgeneration1-gpt-neo-125m

Hugging Face

GPT-NEO 2.7B

huggingface-textgeneration1-gpt-neo-2-7b

Hugging Face

Japanese StableLM Instruct Alpha 7B v2

model-textgenerationjp-japanese-stablelm-instruct-alpha-7b-v2

Hugging Face

LightGPT Instruct 6B

huggingface-textgeneration1-lightgpt

Hugging Face

Lite Llama 460M 1T

huggingface-llm-ahxt-litellama-460m-1t

Hugging Face

Llama 2 13B

meta-textgeneration-llama-2-13b

Meta

Llama 2 13B Chat

meta-textgeneration-llama-2-13b-f

Meta

Llama 2 13B Chat Neuron

meta-textgenerationneuron-llama-2-13b-f

Meta

Llama 2 13B Neuron

meta-textgenerationneuron-llama-2-13b

Meta

Llama 2 70B

meta-textgeneration-llama-2-70b

Meta

Llama 2 70B Chat

meta-textgeneration-llama-2-70b-f

Meta

Llama 2 70B Chat Neuron

meta-textgenerationneuron-llama-2-70b-f

Meta

Llama 2 70B Neuron

meta-textgenerationneuron-llama-2-70b

Meta

Llama 2 7B

meta-textgeneration-llama-2-7b

Meta

Llama 2 7B Chat

meta-textgeneration-llama-2-7b-f

Meta

Llama 2 7B Chat Neuron

meta-textgenerationneuron-llama-2-7b-f

Meta

Llama 2 7B Neuron

meta-textgenerationneuron-llama-2-7b

Meta

Llama 3 8B

meta-textgeneration-llama-3-8b

Meta

Llama 3 8B Instruct

meta-textgeneration-llama-3-8b-instruct

Meta

Llama 3 70B

meta-textgeneration-llama-3-70b

Meta

Llama 3 70B Instruct

meta-textgeneration-llama-3-70b-instruct

Meta

Llama Guard 7B

meta-textgeneration-llama-guard-7b

Meta

Mistral 7B

huggingface-llm-mistral-7b

Hugging Face

Mistral 7B Instruct

huggingface-llm-mistral-7b-instruct

Hugging Face

Mistral 7B OpenOrca AWQ

huggingface-llm-thebloke-mistral-7b-openorca-awq

Hugging Face

Mistral 7B SFT Alpha

huggingface-llm-huggingfaceh4-mistral-7b-sft-alpha

Hugging Face

Mistral 7B SFT Beta

huggingface-llm-huggingfaceh4-mistral-7b-sft-beta

Hugging Face

Mistral Lite

huggingface-llm-amazon-mistrallite

Hugging Face

Mistral Trix V1

huggingface-llm-cultrix-mistraltrix-v1

Hugging Face

Mixtral 8x7B

huggingface-llm-mixtral-8x7b

Hugging Face

Mixtral 8x7B Instruct

huggingface-llm-mixtral-8x7b-instruct

Hugging Face

MPT 7B BF16

huggingface-textgeneration1-mpt-7b-bf16

Hugging Face

MPT 7B Instruct BF16

huggingface-textgeneration1-mpt-7b-instruct-bf16

Hugging Face

MPT 7B StoryWriter-65k+ BF16

huggingface-textgeneration1-mpt-7b-storywriter-bf16

Hugging Face

Multilingual GPT

huggingface-llm-ai-forever-mgpt

Hugging Face

Nous Hermes 2 SOLAR 10.7B

huggingface-llm-nousresearch-nous-hermes-2-solar-10-7b

Hugging Face

Nous Hermes Llama 2 13B

huggingface-llm-nousresearch-nous-hermes-llama2-13b

Hugging Face

Nous Hermes Llama 2 7B

huggingface-llm-nousresearch-nous-hermes-llama-2-7b

Hugging Face

Open Hermes 2 Mistral 7B

huggingface-llm-teknium-openhermes-2-mistral-7b

Hugging Face

Open LlaMa

huggingface-textgeneration-open-llama

Hugging Face

Open Llama 7B V2

huggingface-llm-openlm-research-open-llama-7b-v2

Hugging Face

Platypus 2 7B

huggingface-llm-garage-baind-platypus2-7b

Hugging Face

Pythia 160m Deduped

huggingface-llm-eleutherai-pythia-160m-deduped

Hugging Face

Pythia 7m Deduped

huggingface-llm-eleutherai-pythia-70m-deduped

Hugging Face

Quality Controlled Paraphrase Generation

huggingface-text2text-qcpg-sentences

Hugging Face

RedPajama INCITE Base 3B V1

huggingface-textgeneration1-redpajama-incite-base-3B-v1-fp16

Hugging Face

RedPajama INCITE Base 7B V1

huggingface-textgeneration1-redpajama-incite-base-7B-v1-fp16

Hugging Face

RedPajama INCITE Chat 3B V1

huggingface-textgeneration1-redpajama-incite-chat-3B-v1-fp16

Hugging Face

RedPajama INCITE Chat 7B V1

huggingface-textgeneration1-redpajama-incite-chat-7B-v1-fp16

Hugging Face

RedPajama INCITE Instruct 3B V1

huggingface-textgeneration1-redpajama-incite-instruct-3B-v1-fp16

Hugging Face

RedPajama INCITE Instruct 7B V1

huggingface-textgeneration1-redpajama-incite-instruct-7B-v1-fp16

Hugging Face

Rinna Bilingual GPT NeoX 4B Instruction PPO

huggingface-llm-bilingual-rinna-4b-instruction-ppo-bf16

Hugging Face

Rinna Japanese GPT NeoX 3.6B Instruction PPO

huggingface-llm-rinna-3-6b-instruction-ppo-bf16

Hugging Face

Star Chat Alpha

huggingface-llm-huggingfaceh4-starchat-alpha

Hugging Face

Star Chat Beta

huggingface-llm-huggingfaceh4-starchat-beta

Hugging Face

StarCoder

huggingface-llm-starcoder

Hugging Face

StarCoderBase

huggingface-llm-starcoderbase

Hugging Face

T0pp

huggingface-text2text-bigscience-t0pp

Hugging Face

T5 One Line Summary

huggingface-text2text-t5-one-line-summary

Hugging Face

Tiny Llama 1.1B

huggingface-llm-tinyllama-1-1b-intermediate-step-1431k-3

Hugging Face

Tiny Llama 1.1B Chat V0.6

huggingface-llm-tinyllama-tinyllama-1-1b-chat-v0-6

Hugging Face

Tiny Llama 1.1B Chat V1

huggingface-llm-tinyllama-tinyllama-1-1b-chat-v1-0

Hugging Face

Writer Palmyra Small

huggingface-llm-writer-palmyra-small

Hugging Face

YARN Mistral 7B 128k

huggingface-llm-nousresearch-yarn-mistral-7b-128k

Hugging Face

Zephyr 7B Alpha

huggingface-llm-huggingfaceh4-zephyr-7b-alpha

Hugging Face

Zephyr 7B Beta

huggingface-llm-huggingfaceh4-zephyr-7b-beta

Hugging Face

要探索最新的文本生成 JumpStart 基础模型,请使用 Amazon SageMaker JumpStart 入门产品描述页面上的文本生成筛选条件。您也可以直接在 Amazon SageMaker Studio 用户界面或 SageMaker Studio Classic 用户界面中根据任务探索基础模型。只有一部分公开可用的文本生成模型可在 JumpStart 中进行微调。有关更多信息,请参阅 在 Amazon SageMaker Studio Classic 中使用基础模型

公开可用的映像生成模型

JumpStart 提供了各种各样的 Stable Diffusion 映像生成基础模型,包括来自 Stability AI 的基础模型,以及 Hugging Face 中用于特定文本到映像任务的预训练模型。如果您需要微调文本到图像基础模型,可以使用 Stability AI 中的 Stable Diffusion 2.1 基础模型。如果您想探索已经接受过特定艺术风格训练的模型,您可以直接在 Amazon SageMaker Studio 用户界面或 SageMaker Studio Classic 用户界面中探索源自 Hugging Face 的众多第三方模型之一。

要探索最新的图像生成 JumpStart 基础模型,请使用 Amazon SageMaker JumpStart 入门产品描述页面上的文本到图像筛选条件。要开始使用您选择的文本转图像基础模型,请参阅JumpStart 基础模型的使用

专有基础模型

Amazon SageMaker JumpStart 提供了对第三方提供商的专有基础模型的访问,这些提供商包括 AI21 LabsCohereLightOn

要开始使用这些专有模型之一,请参阅JumpStart 基础模型的使用。要使用专有基础模型,您必须先在 AWS Marketplace 中订阅该模型。订阅模型后,在 Studio 或 SageMaker Studio Classic 中找到基础模型。有关更多信息,请参阅 SageMaker JumpStart 预训练模型

要探索适用于各种使用案例的最新专有基础模型,请参阅 Amazon SageMaker JumpStart 入门