Prerequisites for Amazon Bedrock Model Distillation

Complete the following prerequisites before you start a model distillation job:

Decide on a teacher model

Choose a teacher model that is significantly larger and more capable than the student model, and whose accuracy you want to achieve for your use case. To make the distillation job more effective, select a model that is already trained on task similar to your use case. For information on the teacher models supported by Amazon Bedrock see Supported models and Regions for Amazon Bedrock Model Distillation.
Decide on a student model

Choose a student model that is significantly smaller in size. For information on the student models that Amazon Bedrock supports, see Supported models and Regions for Amazon Bedrock Model Distillation.
Prepare your input dataset

To prepare input datasets for your custom model, you create .jsonl files, each line of which is a JSON object corresponding to a record. The files you create must conform to the format for the customization method and model that you choose and the records in it must conform to size requirements.
Note
If you are using Anthropic or Meta Llama models, continue with this step.
If you are using Amazon Nova models for distillation, see the following guidelines and then continue with step 4.
- Guidelines for preparing your data for Amazon Nova models.
- Guidelines for model distillation for Amazon Nova.
Provide the input data as prompts. Amazon Bedrock uses the input data to generate responses from the teacher model and uses the generated responses to fine-tune the student model. For more information about inputs Amazon Bedrock uses, and for choosing an option that works best for your use case, see How Amazon Bedrock Model Distillation works.

Choose the option that works best for your use case for instructions on preparing your input dataset:

Option 1: Provide your own prompts

Collect your prompts and store them in a JSON Line (JSONL) format. Each record in the JSONL must use the following structure.
- Include the schemaVersion field that must have the value bedrock-conversion-2024.
- [Optional] Include a system prompt that indicates the role assigned to the model.
- In messages field, include the user role containing the input prompt provided to the model.
- [Optional] In the messages field, include assistant role containing the desired response.
For the preview release Anthropic and Meta Llama models support only single -turn conversation prompts, meaning you can only have one user prompt. The Amazon Nova models support multi-turn conversations, allowing you to provide multiple user and assistant exchanges within one record.

Example format
```
{
    "schemaVersion": "bedrock-conversation-2024",
    "system": [
        {
            "text": "A chat between a curious User and an artificial intelligence Bot. The Bot gives helpful, detailed, and polite answers to the User's questions."
        }
    ],    
    "messages": [
        {
            "role": "user",
            "content": [
                {
                    "text": "why is the sky blue"
                }
            ]
        },
        {
            "role": "assistant"
            "content": [
               {
                   "text": "The sky is blue because molecules in the air scatter blue light from the Sun more than other colors."
               }
            ]
        }
    ]
}
```
Option 2: Use invocation logs

To use invocation logs for model distillation, set the model invocation logging on, use one of the model invocation operations, and make sure that you've set up an Amazon S3 bucket as the destination for the logs. Before you can start the model distillation job, you must provide Amazon Bedrock permissions to access the logs. For more information about setting up the invocation logs, see Monitor model invocation using Amazon CloudWatch Logs.

With this option, you can specify if you want Amazon Bedrock to use only the prompts, or to use prompt-response pairs from the invocation log. If you want Amazon Bedrock to use only prompts, then Amazon Bedrock might add proprietary data synthesis techniques to generate diverse and higher-quality responses from the teacher model. If you want Amazon Bedrock to use prompt-response pairs, then Amazon Bedrock won't re-generate responses from the teacher model. Amazon Bedrock will directly use the responses from the invocation log to fine-tune the student model.
Important
You can provide a maximum of 15K prompts or prompt-response pairs to Amazon Bedrock for fine-tuning the student model. To ensure that the student model is fine-tuned to meet your specific requirements, we highly recommend the following:
- If you want Amazon Bedrock to use prompts only, make sure that there are at least 100 prompt-response pairs generated from across all models.
- If you want Amazon Bedrock to use responses from your invocation logs, make sure that you have at least 100 prompt-response pairs generated from the model in your invocation logs that exactly match with the teacher model you've chosen.
You can optionally add request metadata to the prompt-response pairs in the invocation log using one of the model invocation operations and then later use it to filter the logs. Amazon Bedrock can use the filtered logs to fine-tune the student model.

To filter the logs using multiple request metadata, use a single operation Boolean operator AND, OR, or NOT. You cannot combine operations. For single request metadata filtering, use the Boolean operator NOT.
If you do not already have an IAM service role with proper permissions, create a new custom AWS Identity and Access Management (IAM) service role with the proper permissions by following the instructions at Create a service role for model customization to set up the role. You can skip this prerequisite if you plan to use the AWS Management Console to automatically create a service role for you.
(Optional) Set up extra security configurations.
- You can encrypt input and output data, customization jobs, or inference requests made to custom models. For more information, see Encryption of model customization jobs and artifacts.
- You can create a virtual private cloud (VPC) to protect your customization jobs. For more information, see [Optional] Protect your model customization jobs using a VPC.

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Supported models and Regions for model distillation

Add request metadata

Prerequisites for Amazon Bedrock Model Distillation

Note

Important