可用的基础模型 - 亚马逊 SageMaker AI

本文属于机器翻译版本。若本译文内容与英语原文存在差异,则一律以英文原文为准。

可用的基础模型

Amazon SageMaker JumpStart 提供 state-of-the-art内置的公开可用和专有基础模型,用于自定义并集成到您的生成式 AI 工作流程中。

公开可用的基础模型

Amaz SageMaker JumpStart on 载入并维护来自第三方来源的开源基础模型。要开始使用其中一种公开可用的模型,请查看JumpStart 基础模型用法或浏览可用的示例笔记本之一。在公开可用模型的给定示例笔记本中,尝试切换模型 ID,以实验同一模型系列中的不同模型。

有关模型的更多信息 IDs 以及有关使用部署公开 JumpStart 基础模型的资源 SageMaker Python 软件开发工具包,请参阅将基础模型与 SageMaker Python SDK

按照定义,基础模型可以根据多种下游任务进行调整。基础模型在大量的通用领域数据上进行训练,可以针对多种使用案例实施相同的模型或进行自定义。选择基础模型时,首先要定义一项特定任务,例如文本生成或映像生成。

公开可用的时间序列预测模型

时间序列预测模型旨在分析和预测一段时间内的序列数据。这些模型可应用于金融、天气预报或能源需求预测等多个领域。Chronos 模型专为时间序列预测任务定制,可根据历史数据规律进行准确预测。

模型名称 模型 ID 模型来源 可微调

Chronos T5 Small

autogluon-forecasting-chronos-t5-small

Amazon

Chronos T5 Base

autogluon-forecasting-chronos-t5-base

Amazon

Chronos T5 Large

autogluon-forecasting-chronos-t5-large

Amazon

公开可用的文本生成模型

文本生成基础模型可用于多种下游任务,包括文本摘要、文本分类、问题回答、长篇内容生成、简短文案写作、信息提取等。

模型名称 模型 ID 模型来源 可微调

Alexa TM 20B

pytorch-textgeneration1-alexa20b

Amazon

Bloom 1b1

huggingface-textgeneration-bloom-1b1

Hugging Face

Bloom 1b7

huggingface-textgeneration-bloom-1b7

Hugging Face

Bloom 3B

huggingface-textgeneration1-bloom-3b

Hugging Face

Bloom 560m

huggingface-textgeneration-bloom-560m

Hugging Face

Bloom 7B1

huggingface-textgeneration1-bloom-7b1

Hugging Face

Bloomz 1b1

huggingface-textgeneration-bloomz-1b1

Hugging Face

Bloomz 1b7

huggingface-textgeneration-bloomz-1b7

Hugging Face

BloomZ 3B FP16

huggingface-textgeneration1-bloom-3b-fp16

Hugging Face

Bloomz 560m

huggingface-textgeneration-bloomz-560m

Hugging Face

BloomZ 7B1 FP16

huggingface-textgeneration1-bloomz-7b1-fp16

Hugging Face

Code Llama 13B

meta-textgeneration-llama-codellama-13b

Meta

Code Llama 13B Instruct

meta-textgeneration-llama-codellama-13b-instruct

Meta

Code Llama 13B Python

meta-textgeneration-llama-codellama-13b-python

Meta

Code Llama 34B

meta-textgeneration-llama-codellama-34b

Meta

Code Llama 34B Instruct

meta-textgeneration-llama-codellama-34b-instruct

Meta

Code Llama 34B Python

meta-textgeneration-llama-codellama-34b-python

Meta

Code Llama 70B

meta-textgeneration-llama-codellama-70b

Meta

Code Llama 70B Instruct

meta-textgeneration-llama-codellama-70b-instruct

Meta

Code Llama 70B Python

meta-textgeneration-llama-codellama-70b-python

Meta

Code Llama 7B

meta-textgeneration-llama-codellama-7b

Meta

Code Llama 7B Instruct

meta-textgeneration-llama-codellama-7b-instruct

Meta

Code Llama 7B Python

meta-textgeneration-llama-codellama-7b-python

Meta

CyberAgentLM2-7B-Chat (-7B-Chat) CALM2

huggingface-llm-calm2-7b-chat-bf16

Hugging Face

Distil GPT2

huggingface-textgeneration-distilgpt2

Hugging Face

Dolly V2 12b BF16

huggingface-textgeneration-dolly-v2-12b-bf16

Hugging Face

Dolly V2 3b BF16

huggingface-textgeneration-dolly-v2-3b-bf16

Hugging Face

Dolly V2 7b BF16

huggingface-textgeneration-dolly-v2-7b-bf16

Hugging Face

Dolphin 2.2.1 Mistral 7B

huggingface-llm-dolphin-2-2-1-mistral-7b

Hugging Face

Dolphin 2.5 Mixtral 8 7B

huggingface-llm-dolphin-2-5-mixtral-8x7b

Hugging Face

Dolphin 2.7 Mixtral 8 7B

huggingface-llm-dolphin-2-7-mixtral-8x7b

Hugging Face

EleutherAI GPT Neo 2.7B

huggingface-llm-eleutherai-gpt-neo-1-3b

Hugging Face

EleutherAI GPT Neo 2.7B

huggingface-llm-eleutherai-gpt-neo-2-7b

Hugging Face

Falcon 180B BF16

huggingface-llm-falcon-180b-bf16

Hugging Face

Falcon 180B 聊天室 BF16

huggingface-llm-falcon-180b-chat-bf16

Hugging Face

Falcon 40B BF16

huggingface-llm-falcon-40b-bf16

Hugging Face

Falcon 40B Instruct BF16

huggingface-llm-falcon-40b-instruct-bf16

Hugging Face

Falcon 7B BF16

huggingface-llm-falcon-7b-bf16

Hugging Face

Falcon 7B Instruct BF16

huggingface-llm-falcon-7b-instruct-bf16

Hugging Face

Falcon Lite

huggingface-llm-amazon-falconlite

Hugging Face

Falcon Lite 2

huggingface-llm-amazon-falconlite2

Hugging Face

Falcon RW 1B

huggingface-llm-tiiuae-falcon-rw-1b

Hugging Face

Flan-T5 Base

huggingface-text2text-flan-t5-base

Hugging Face

在 Samsum 数据集上微调 Flan-T5 基本模型

huggingface-text2text-flan-t5-base-samsum

Hugging Face

Flan-T5 Large

huggingface-text2text-flan-t5-large

Hugging Face

Flan-T5 Small

huggingface-text2text-flan-t5-small

Hugging Face

Flan-T5 XL

huggingface-text2text-flan-t5-xl

Hugging Face

Flan-T5 XXL

huggingface-text2text-flan-t5-xxl

Hugging Face

果馅饼-UL2 BF16

huggingface-text2text-flan-ul2-bf16

Hugging Face

Gemma 2B

huggingface-llm-gemma-2b

Hugging Face

Gemma 2B Instruct

huggingface-llm-gemma-2b-instruct

Hugging Face

Gemma 7B

huggingface-llm-gemma-7b

Hugging Face

Gemma 7B Instruct

huggingface-llm-gemma-7b-instruct

Hugging Face

GPT 2

huggingface-textgeneration-gpt2

Hugging Face

GPT NeoX 20B FP16

huggingface-textgeneration2-gpt-neox-20b-fp16

Hugging Face

GPT NeoXT Chat Base 20B FP16

huggingface-textgeneration2-gpt-neoxt-chat-base-20b-fp16

Hugging Face

GPT-2 XL

huggingface-textgeneration1-gpt-2-xl

Hugging Face

GPT-J 6B

huggingface-textgeneration1-gpt-j-6b

Hugging Face

GPT-Neo 1.3B

huggingface-textgeneration1-gpt-neo-1-3b

Hugging Face

GPT-Neo 125M

huggingface-textgeneration1-gpt-neo-125m

Hugging Face

GPT-NEO 2.7B

huggingface-textgeneration1-gpt-neo-2-7b

Hugging Face

Japanese StableLM Instruct Alpha 7B v2

model-textgenerationjp-japanese-stablelm-instruct-alpha-7b-v2

Hugging Face

LightGPT Instruct 6B

huggingface-textgeneration1-lightgpt

Hugging Face

Lite Llama 460M 1T

huggingface-llm-ahxt-litellama-460m-1t

Hugging Face

Llama 2 13B

meta-textgeneration-llama-2-13b

Meta

Llama 2 13B Chat

meta-textgeneration-llama-2-13b-f

Meta

Llama 2 13B Chat Neuron

meta-textgenerationneuron-llama-2-13b-f

Meta

Llama 2 13B Neuron

meta-textgenerationneuron-llama-2-13b

Meta

Llama 2 70B

meta-textgeneration-llama-2-70b

Meta

Llama 2 70B Chat

meta-textgeneration-llama-2-70b-f

Meta

Llama 2 70B Chat Neuron

meta-textgenerationneuron-llama-2-70b-f

Meta

Llama 2 70B Neuron

meta-textgenerationneuron-llama-2-70b

Meta

Llama 2 7B

meta-textgeneration-llama-2-7b

Meta

Llama 2 7B Chat

meta-textgeneration-llama-2-7b-f

Meta

Llama 2 7B Chat Neuron

meta-textgenerationneuron-llama-2-7b-f

Meta

Llama 2 7B Neuron

meta-textgenerationneuron-llama-2-7b

Meta

Llama 3 8B

meta-textgeneration-llama-3-8b

Meta

Llama 3 8B Instruct

meta-textgeneration-llama-3-8b-instruct

Meta

Llama 3 70B

meta-textgeneration-llama-3-70b

Meta

Llama 3 70B Instruct

meta-textgeneration-llama-3-70b-instruct

Meta

Llama Guard 7B

meta-textgeneration-llama-guard-7b

Meta

Mistral 7B

huggingface-llm-mistral-7b

Hugging Face

Mistral 7B Instruct

huggingface-llm-mistral-7b-instruct

Hugging Face

Mistral 7 OpenOrca B AWQ

huggingface-llm-thebloke-mistral-7b-openorca-awq

Hugging Face

Mistral 7B SFT Alpha

huggingface-llm-huggingfaceh4-mistral-7b-sft-alpha

Hugging Face

Mistral 7B SFT Beta

huggingface-llm-huggingfaceh4-mistral-7b-sft-beta

Hugging Face

Mistral Lite

huggingface-llm-amazon-mistrallite

Hugging Face

Mistral Trix V1

huggingface-llm-cultrix-mistraltrix-v1

Hugging Face

Mixtral 8x7B

huggingface-llm-mixtral-8x7b

Hugging Face

Mixtral 8x7B Instruct

huggingface-llm-mixtral-8x7b-instruct

Hugging Face

MPT 7B BF16

huggingface-textgeneration1-mpt-7b-bf16

Hugging Face

MPT 7B Instruct BF16

huggingface-textgeneration1-mpt-7b-instruct-bf16

Hugging Face

MPT 7 StoryWriter B -65k+ BF16

huggingface-textgeneration1-mpt-7b-storywriter-bf16

Hugging Face

Multilingual GPT

huggingface-llm-ai-forever-mgpt

Hugging Face

Nous Hermes 2 SOLAR 10.7B

huggingface-llm-nousresearch-nous-hermes-2-solar-10-7b

Hugging Face

Nous Hermes Llama 2 13B

huggingface-llm-nousresearch-nous-hermes-llama2-13b

Hugging Face

Nous Hermes Llama 2 7B

huggingface-llm-nousresearch-nous-hermes-llama-2-7b

Hugging Face

Open Hermes 2 Mistral 7B

huggingface-llm-teknium-openhermes-2-mistral-7b

Hugging Face

打开 LlaMa

huggingface-textgeneration-open-llama

Hugging Face

Open Llama 7B V2

huggingface-llm-openlm-research-open-llama-7b-v2

Hugging Face

Platypus 2 7B

huggingface-llm-garage-baind-platypus2-7b

Hugging Face

Pythia 160m Deduped

huggingface-llm-eleutherai-pythia-160m-deduped

Hugging Face

Pythia 7m Deduped

huggingface-llm-eleutherai-pythia-70m-deduped

Hugging Face

Quality Controlled Paraphrase Generation

huggingface-text2text-qcpg-sentences

Hugging Face

RedPajama Incite Base 3B V1

huggingface-textgeneration1-redpajama-incite-base-3B-v1-fp16

Hugging Face

RedPajama Incite Base 7B V1

huggingface-textgeneration1-redpajama-incite-base-7B-v1-fp16

Hugging Face

RedPajama INCITE Chat 3B V1

huggingface-textgeneration1-redpajama-incite-chat-3B-v1-fp16

Hugging Face

RedPajama INCITE Chat 7B V1

huggingface-textgeneration1-redpajama-incite-chat-7B-v1-fp16

Hugging Face

RedPajama INCITE Instruct 3B V1

huggingface-textgeneration1-redpajama-incite-instruct-3B-v1-fp16

Hugging Face

RedPajama INCITE Instruct 7B V1

huggingface-textgeneration1-redpajama-incite-instruct-7B-v1-fp16

Hugging Face

Rinna Bilingual GPT NeoX 4B Instruction PPO

huggingface-llm-bilingual-rinna-4b-instruction-ppo-bf16

Hugging Face

Rinna Japanese GPT NeoX 3.6B Instruction PPO

huggingface-llm-rinna-3-6b-instruction-ppo-bf16

Hugging Face

Star Chat Alpha

huggingface-llm-huggingfaceh4-starchat-alpha

Hugging Face

Star Chat Beta

huggingface-llm-huggingfaceh4-starchat-beta

Hugging Face

StarCoder

huggingface-llm-starcoder

Hugging Face

StarCoderBase

huggingface-llm-starcoderbase

Hugging Face

T0pp

huggingface-text2text-bigscience-t0pp

Hugging Face

T5 One Line Summary

huggingface-text2text-t5-one-line-summary

Hugging Face

Tiny Llama 1.1B

huggingface-llm-tinyllama-1-1b-intermediate-step-1431k-3

Hugging Face

Tiny Llama 1.1B Chat V0.6

huggingface-llm-tinyllama-tinyllama-1-1b-chat-v0-6

Hugging Face

Tiny Llama 1.1B Chat V1

huggingface-llm-tinyllama-tinyllama-1-1b-chat-v1-0

Hugging Face

Writer Palmyra Small

huggingface-llm-writer-palmyra-small

Hugging Face

YARN Mistral 7B 128k

huggingface-llm-nousresearch-yarn-mistral-7b-128k

Hugging Face

Zephyr 7B Alpha

huggingface-llm-huggingfaceh4-zephyr-7b-alpha

Hugging Face

Zephyr 7B Beta

huggingface-llm-huggingfaceh4-zephyr-7b-beta

Hugging Face

要探索最新的文本生成 JumpStart 基础模型,请使用 Amazon 入门 SageMaker JumpStart产品描述页面上的 “文本生成” 筛选器。您也可以直接在 Amazon SageMaker Studio 用户界面或 SageMaker Studio Classic 用户界面中根据任务探索基础模型。只有一部分公开可用的文本生成模型可供微调。 JumpStart有关更多信息,请参阅 在 Amazon SageMaker Studio 经典版中使用基础模型

公开可用的映像生成模型

JumpStart 提供各种 Stable Diffusion 图像生成基础模型,包括来自 Stability AI 的基础模型以及用于特定 text-to-image任务的预训练模型 Hugging Face。 如果你需要微调 text-to-image基础模型,你可以使用 Stability AI 中的 Stable Diffusion 2.1 基础版。如果你想探索已经接受过特定艺术风格训练的模型,你可以探索来自的众多第三方模型之一 Hugging Face 直接在 Amazon SageMaker Studio 用户界面或 SageMaker AI Studio 经典界面中。

要探索最新的图片生成 JumpStart 基础模型,请使用 Amazon 入门 SageMaker JumpStart产品描述页面上的 “文字转图片” 筛选器。要开始使用您选择 text-to-image的基础模型,请参阅JumpStart 基础模型用法

专有基础模型

亚马逊允许访问第三方 SageMaker JumpStart 提供商提供的专有基础模型,例如AI21 实验室Coher e 和。LightOn

要开始使用这些专有模型之一,请参阅JumpStart 基础模型用法。要使用专有基础模型,您必须先在 AWS Marketplace中订阅该模型。订阅模型后,在 Studio 或 SageMaker Studio Classic 中找到基础模型。有关更多信息,请参阅 SageMaker JumpStart 预训练模型

要探索适用于各种用例的最新专有基础模型,请参阅 Amazon 入门 SageMaker JumpStart