Create a knowledge base by connecting to a structured data store

PDF

RSS

Focus mode

Create a knowledge base by connecting to a structured data store - Amazon Bedrock

To connect a knowledge base to a structured data store, you specify the following components:

The data store containing your data. You can connect to the following data stores:
- Amazon Redshift
- AWS Glue Data Catalog (AWS Lake Formation)
The query engine (currently, only Amazon Redshift is supported) to use to convert natural language user queries into SQL queries that can be used to extract data from your data store.
- The authentication method for using the query engine. The following options are available:
  - IAM role – Authenticate using the IAM service role with permissions to manage your knowledge base.
  - Temporary credentials user name – Authenticate using the query engine database user.
  - Secrets Manager – Authenticate with an AWS Secrets Manager secret that is linked to your database credentials.
  The authentication methods available differ by the query engine and data store that you use. To see support for different authentication types, see Set up query engine for your structured data store in Amazon Bedrock Knowledge Bases and Allow your Amazon Bedrock Knowledge Bases service role to access your data store.
- (Optional) Query configurations for improving the accuracy of SQL generation:
  - Maximum query time – The amount of time after which the query times out.
  - Descriptions – Provides metadata or supplementary information about tables or columns. You can include descriptions of the tables or columns, usage notes, or any additional attributes. The descriptions you add can improve SQL query generation by providing extra context and information about the structure of the tables or columns.
  - Inclusions and Exclusions – Specifies a set of tables or columns to be included or excluded for SQL generation. This field is crucial if you want to limit the scope of SQL queries to a defined subset of available tables or columns. This option can help optimize the generation process by reducing unnecessary table or column references.
    
    If you specify inclusions, all other tables and columns are ignored. If you specify exclusions, the tables and columns you specify are ignored.
    
    Note
    Inclusions and exclusions aren't a substitute for guardrails and is only intended for improving model accuracy.
  - Curated queries – A set of predefined question and answer examples. Questions are written as natural language queries (NLQ) and answers are the corresponding SQL query. These examples help the SQL generation process by providing examples of the kinds of queries that should be generated. They serve as reference points to improve the accuracy and relevance of generative SQL outputs.

Expand the section that corresponds to your use case:

To connect to a structured data store using the AWS Management Console, do the following:

Sign in to the AWS Management Console using an IAM role with Amazon Bedrock permissions, and open the Amazon Bedrock console at https://console.aws.amazon.com/bedrock/.
In the left navigation pane, choose Knowledge bases.
In the Knowledge bases section, choose Create and then select Knowledge base with structured data store.
Set up the following details for the knowledge base:
1. (Optional) Change the default name and provide a description for your knowledge base.
2. Select the query engine to use for retrieving data from your data store.
3. Choose an IAM service role with the proper permissions to create and manage this knowledge base. You can let Amazon Bedrock create the service role or choose a custom role that you have created. For more information about creating a custom role, see Prerequisites for creating an Amazon Bedrock knowledge base with a structured data store.
4. (Optional) Add tags to associate with your knowledge base. For more information, see Tagging Amazon Bedrock resources.
5. Choose Next.
Configure your query engine:
1. Select the service in which you created a cluster or workgroup. Then choose the cluster or workgroup to use.
2. Select the authentication method and provide the necessary fields.
3. Select the data store in which to store your metadata. Then, choose or enter the name of the database.
4. (Optional) Modify the query configurations as necessary. Refer to the beginning of this topic for more information about different configurations.
5. Choose Next.
Review your knowledge base configurations and edit any sections as necessary. Confirm to create your knowledge base.

Use the console

To connect to a structured data store using the AWS Management Console, do the following:

Sign in to the AWS Management Console using an IAM role with Amazon Bedrock permissions, and open the Amazon Bedrock console at https://console.aws.amazon.com/bedrock/.
In the left navigation pane, choose Knowledge bases.
In the Knowledge bases section, choose Create and then select Knowledge base with structured data store.
Set up the following details for the knowledge base:
1. (Optional) Change the default name and provide a description for your knowledge base.
2. Select the query engine to use for retrieving data from your data store.
3. Choose an IAM service role with the proper permissions to create and manage this knowledge base. You can let Amazon Bedrock create the service role or choose a custom role that you have created. For more information about creating a custom role, see Prerequisites for creating an Amazon Bedrock knowledge base with a structured data store.
4. (Optional) Add tags to associate with your knowledge base. For more information, see Tagging Amazon Bedrock resources.
5. Choose Next.
Configure your query engine:
1. Select the service in which you created a cluster or workgroup. Then choose the cluster or workgroup to use.
2. Select the authentication method and provide the necessary fields.
3. Select the data store in which to store your metadata. Then, choose or enter the name of the database.
4. (Optional) Modify the query configurations as necessary. Refer to the beginning of this topic for more information about different configurations.
5. Choose Next.
Review your knowledge base configurations and edit any sections as necessary. Confirm to create your knowledge base.

To connect to a structured data store using the Amazon Bedrock API, send a CreateKnowledgeBase request with an Agents for Amazon Bedrock build-time endpoint with the following general request body:


{
    "name": "string",
    "roleArn": "string",
    "knowledgeBaseConfiguration": {
        "type": "SQL",
        "sqlKnowledgeBaseConfiguration": SqlKnowledgeBaseConfiguration
    },
    "description": "string",
    "clientToken": "string",
    "tags": {
        "string": "string"
    }
}

The following fields are required.

Field	Basic description
Name	A name for the knowledge base
roleArn	A knowledge base service role with the proper permissions. You can use the console to automatically create a service role with the proper permissions.
knowledgeBaseConfiguration	Contains configurations for the knowledge base. For a structured database, specify `SQL` as the `type` and include the `sqlKnowledgeBaseConfiguration` field.

The following fields are optional.

Field	Use
description	To include a description for the knowledge base.
clientToken	To ensure the API request completes only once. For more information, see Ensuring idempotency.
tags	To associate tags with the flow. For more information, see Tagging Amazon Bedrock resources.

The SQLKnowledgeBaseConfiguration depends on the query engine that you use. For Amazon Redshift, specify the type field as REDSHIFT and include the redshiftConfiguration field, which maps to a RedshiftConfiguration. For the RedshiftConfiguration, you configure the following fields:

You can configure the following types of query engine:

If your Amazon Redshift databases are provisioned on dedicated compute nodes, the value of the queryEngineConfiguration field should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "PROVISIONED",
    "provisionedConfiguration": {
        "clusterIdentifier": "string",
        "authConfiguration": RedshiftProvisionedAuthConfiguration
    },
}

Specify the ID of the cluster in the clusterIdentifier field. The RedshiftProvisionedAuthConfiguration depends on the type of authorization you're using. Select the tab that matches your authorization method:

If you're using Amazon Redshift Serverless, the value of the queryConfigurationfield should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "SERVERLESS",
    "serverlessConfiguration": {
        "workgroupArn": "string",
        "authConfiguration": 
    }
}

Specify the ARN of your workgroup in the workgroupArn field. The RedshiftServerlessAuthConfiguration depends on the type of authorization you're using. Select the tab that matches your authorization method:

This field maps to an array containing a single RedshiftQueryEngineStorageConfiguration, whose format depends on where your data is stored.

If your data is stored in AWS Glue Data Catalog, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "AWS_DATA_CATALOG",
    "awsDataCatalogConfiguration": {
        "tableNames": ["string"]
    }
}

Add the name of each table that you want to connect your knowledge base to in the array that tableNames maps to.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

If your data is stored in an Amazon Redshift database, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "string",
    "redshiftConfiguration": {
        "databaseName": "string"
    }
}

Specify the name of your Amazon Redshift database in the databaseName field.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

If your database is mounted through Amazon SageMaker AI Lakehouse, the database name is in the format ${db}@${schema}.

This field maps to the following QueryGenerationConfiguration that you can use to configure how your data is queried:


{
    "executionTimeoutSeconds": number,
    "generationContext": {
        "tables": [
            {
                "name": "string",
                "description": "string",
                "inclusion": "string",
                "columns": [
                    {
                        "name": "string",
                        "description": "string",
                        "inclusion": "string"
                    },
                    ...
                ]
            },
            ...
        ],
        "curatedQueries": [
            {
                "naturalLanguage": "string",
                "sql": "string"
            },
            ...
        ]
    }
}

If you want the query to time out, specify the timeout duration in seconds in the executionTimeoutSeconds field.

The generationContext field maps to a QueryGenerationContext object in which you can configure as many of the following options as you need.

Important

If you include a generation context, the query engine makes a best effort attempt to apply it when generating SQL. The generation context is non-deterministic and is only intended for improving model accuracy. To ensure accuracy, verify the generated SQL queries.

For information about generation contexts that you can include, expand the following sections:

To improve the accuracy of SQL generation for querying the database, you can provide a description for the table or column that provides more context than a short table or column name. You can do the following:

To add a description for a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "description": "Description for Table A"
}
```
To add a description for a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA.columnA",
    "columns": [
        {
            "name": "Column A",
            "description": "Description for Column A"
        }
    ]
}
```

You can add a description for both a table and a column in it, as in the following example:


{
    "name": "database.schema.tableA",
    "description": "Description for Table A",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "description": "Description for Column A"
        }
    ]
}

Note

Enter table and column names in the pattern described in Cross-database queries. If your database is in AWS Glue Data Catalog, the format is awsdatacatalog.gluedatabase.table.

You can suggest tables or columns to include or exclude when generating SQL by using the inclusion field in the QueryGenerationTable and QueryGenerationColumn objects. You can specify one of the following values in the inclusion field:

INCLUDE – Only the tables or columns that you specify are included as context when generating SQL.
EXCLUDE – The tables or columns that you specify are excluded as context when generating SQL.

You can specify whether to include or exclude tables or columns in the following ways:

To include or exclude a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "inclusion": "EXCLUDE"
}
```
The query engine doesn't add Table A in the additional context for generating SQL.
To include or exclude a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}
```
The SQL generation ignores Column A in Table A in the context when generating SQL.

You can combine tables and columns when specifying inclusions or exclusions, as in the following example:


{
    "name": "database.schema.tableA",
    "inclusion": "INCLUDE",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}

SQL generation includes Table A, but excludes Column A within it when adding context for generating SQL.

Important

Table and column exclusions aren't substitutes for guardrails. These table and column inclusions and exclusions are used as additional context for model to consider when generating SQL.

To improve a query engine's accuracy in converting user queries into SQL queries, you can provide it examples in the curatedQueries field in the QueryGenerationContext object, which maps to an array of CuratedQuery objects. Each object contains the following fields:

naturalLanguage – An example of a query in natural language.
sql – The SQL query that corresponds to the natural language query.

Use the API

To connect to a structured data store using the Amazon Bedrock API, send a CreateKnowledgeBase request with an Agents for Amazon Bedrock build-time endpoint with the following general request body:


{
    "name": "string",
    "roleArn": "string",
    "knowledgeBaseConfiguration": {
        "type": "SQL",
        "sqlKnowledgeBaseConfiguration": SqlKnowledgeBaseConfiguration
    },
    "description": "string",
    "clientToken": "string",
    "tags": {
        "string": "string"
    }
}

The following fields are required.

Field	Basic description
Name	A name for the knowledge base
roleArn	A knowledge base service role with the proper permissions. You can use the console to automatically create a service role with the proper permissions.
knowledgeBaseConfiguration	Contains configurations for the knowledge base. For a structured database, specify `SQL` as the `type` and include the `sqlKnowledgeBaseConfiguration` field.

The following fields are optional.

Field	Use
description	To include a description for the knowledge base.
clientToken	To ensure the API request completes only once. For more information, see Ensuring idempotency.
tags	To associate tags with the flow. For more information, see Tagging Amazon Bedrock resources.

You can configure the following types of query engine:

If your Amazon Redshift databases are provisioned on dedicated compute nodes, the value of the queryEngineConfiguration field should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "PROVISIONED",
    "provisionedConfiguration": {
        "clusterIdentifier": "string",
        "authConfiguration": RedshiftProvisionedAuthConfiguration
    },
}

If you're using Amazon Redshift Serverless, the value of the queryConfigurationfield should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "SERVERLESS",
    "serverlessConfiguration": {
        "workgroupArn": "string",
        "authConfiguration": 
    }
}

queryEngineConfiguration

You can configure the following types of query engine:

If your Amazon Redshift databases are provisioned on dedicated compute nodes, the value of the queryEngineConfiguration field should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "PROVISIONED",
    "provisionedConfiguration": {
        "clusterIdentifier": "string",
        "authConfiguration": RedshiftProvisionedAuthConfiguration
    },
}

Amazon Redshift Provisioned

If your Amazon Redshift databases are provisioned on dedicated compute nodes, the value of the queryEngineConfiguration field should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "PROVISIONED",
    "provisionedConfiguration": {
        "clusterIdentifier": "string",
        "authConfiguration": RedshiftProvisionedAuthConfiguration
    },
}

anchor anchor anchor

If you authorize with your IAM role, you need to specify only IAM as the type in the RedshiftProvisionedAuthConfiguration with no additional fields.


{
    "type": "IAM"
}

If you're using Amazon Redshift Serverless, the value of the queryConfigurationfield should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "SERVERLESS",
    "serverlessConfiguration": {
        "workgroupArn": "string",
        "authConfiguration": 
    }
}

Amazon Redshift Serverless

If you're using Amazon Redshift Serverless, the value of the queryConfigurationfield should be a RedshiftQueryEngineConfiguration in the following format:


{
    "type": "SERVERLESS",
    "serverlessConfiguration": {
        "workgroupArn": "string",
        "authConfiguration": 
    }
}

anchor anchor

If you authorize with your IAM role, you need to specify only IAM as the type in the RedshiftServerlessAuthConfiguration with no additional fields.


{
    "type": "IAM"
}

This field maps to an array containing a single RedshiftQueryEngineStorageConfiguration, whose format depends on where your data is stored.

If your data is stored in AWS Glue Data Catalog, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "AWS_DATA_CATALOG",
    "awsDataCatalogConfiguration": {
        "tableNames": ["string"]
    }
}

Add the name of each table that you want to connect your knowledge base to in the array that tableNames maps to.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

If your data is stored in an Amazon Redshift database, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "string",
    "redshiftConfiguration": {
        "databaseName": "string"
    }
}

Specify the name of your Amazon Redshift database in the databaseName field.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

If your database is mounted through Amazon SageMaker AI Lakehouse, the database name is in the format ${db}@${schema}.

storageConfigurations

This field maps to an array containing a single RedshiftQueryEngineStorageConfiguration, whose format depends on where your data is stored.

If your data is stored in AWS Glue Data Catalog, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "AWS_DATA_CATALOG",
    "awsDataCatalogConfiguration": {
        "tableNames": ["string"]
    }
}

Add the name of each table that you want to connect your knowledge base to in the array that tableNames maps to.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

AWS Glue Data Catalog

If your data is stored in AWS Glue Data Catalog, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "AWS_DATA_CATALOG",
    "awsDataCatalogConfiguration": {
        "tableNames": ["string"]
    }
}

Add the name of each table that you want to connect your knowledge base to in the array that tableNames maps to.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

If your data is stored in an Amazon Redshift database, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "string",
    "redshiftConfiguration": {
        "databaseName": "string"
    }
}

Specify the name of your Amazon Redshift database in the databaseName field.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

If your database is mounted through Amazon SageMaker AI Lakehouse, the database name is in the format ${db}@${schema}.

Amazon Redshift databases

If your data is stored in an Amazon Redshift database, the RedshiftQueryEngineStorageConfiguration should be in the following format:


{
    "type": "string",
    "redshiftConfiguration": {
        "databaseName": "string"
    }
}

Specify the name of your Amazon Redshift database in the databaseName field.

Note

Enter table names in the pattern described in Cross-database queries (${databaseName}.${tableName}). You can include all tables by specifying ${databaseName.*}.

If your database is mounted through Amazon SageMaker AI Lakehouse, the database name is in the format ${db}@${schema}.

This field maps to the following QueryGenerationConfiguration that you can use to configure how your data is queried:


{
    "executionTimeoutSeconds": number,
    "generationContext": {
        "tables": [
            {
                "name": "string",
                "description": "string",
                "inclusion": "string",
                "columns": [
                    {
                        "name": "string",
                        "description": "string",
                        "inclusion": "string"
                    },
                    ...
                ]
            },
            ...
        ],
        "curatedQueries": [
            {
                "naturalLanguage": "string",
                "sql": "string"
            },
            ...
        ]
    }
}

If you want the query to time out, specify the timeout duration in seconds in the executionTimeoutSeconds field.

The generationContext field maps to a QueryGenerationContext object in which you can configure as many of the following options as you need.

Important

For information about generation contexts that you can include, expand the following sections:

To add a description for a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "description": "Description for Table A"
}
```
To add a description for a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA.columnA",
    "columns": [
        {
            "name": "Column A",
            "description": "Description for Column A"
        }
    ]
}
```

You can add a description for both a table and a column in it, as in the following example:


{
    "name": "database.schema.tableA",
    "description": "Description for Table A",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "description": "Description for Column A"
        }
    ]
}

Note

Enter table and column names in the pattern described in Cross-database queries. If your database is in AWS Glue Data Catalog, the format is awsdatacatalog.gluedatabase.table.

INCLUDE – Only the tables or columns that you specify are included as context when generating SQL.
EXCLUDE – The tables or columns that you specify are excluded as context when generating SQL.

You can specify whether to include or exclude tables or columns in the following ways:

To include or exclude a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "inclusion": "EXCLUDE"
}
```
The query engine doesn't add Table A in the additional context for generating SQL.
To include or exclude a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}
```
The SQL generation ignores Column A in Table A in the context when generating SQL.

You can combine tables and columns when specifying inclusions or exclusions, as in the following example:


{
    "name": "database.schema.tableA",
    "inclusion": "INCLUDE",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}

SQL generation includes Table A, but excludes Column A within it when adding context for generating SQL.

Important

Table and column exclusions aren't substitutes for guardrails. These table and column inclusions and exclusions are used as additional context for model to consider when generating SQL.

naturalLanguage – An example of a query in natural language.
sql – The SQL query that corresponds to the natural language query.

queryGenerationConfiguration

This field maps to the following QueryGenerationConfiguration that you can use to configure how your data is queried:


{
    "executionTimeoutSeconds": number,
    "generationContext": {
        "tables": [
            {
                "name": "string",
                "description": "string",
                "inclusion": "string",
                "columns": [
                    {
                        "name": "string",
                        "description": "string",
                        "inclusion": "string"
                    },
                    ...
                ]
            },
            ...
        ],
        "curatedQueries": [
            {
                "naturalLanguage": "string",
                "sql": "string"
            },
            ...
        ]
    }
}

If you want the query to time out, specify the timeout duration in seconds in the executionTimeoutSeconds field.

The generationContext field maps to a QueryGenerationContext object in which you can configure as many of the following options as you need.

Important

For information about generation contexts that you can include, expand the following sections:

To add a description for a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "description": "Description for Table A"
}
```
To add a description for a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA.columnA",
    "columns": [
        {
            "name": "Column A",
            "description": "Description for Column A"
        }
    ]
}
```

You can add a description for both a table and a column in it, as in the following example:


{
    "name": "database.schema.tableA",
    "description": "Description for Table A",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "description": "Description for Column A"
        }
    ]
}

Note

Enter table and column names in the pattern described in Cross-database queries. If your database is in AWS Glue Data Catalog, the format is awsdatacatalog.gluedatabase.table.

Add descriptions for tables or columns in the database

To add a description for a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "description": "Description for Table A"
}
```
To add a description for a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and a description in the description field, as in the following example:
```
{
    "name": "database.schema.tableA.columnA",
    "columns": [
        {
            "name": "Column A",
            "description": "Description for Column A"
        }
    ]
}
```

You can add a description for both a table and a column in it, as in the following example:


{
    "name": "database.schema.tableA",
    "description": "Description for Table A",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "description": "Description for Column A"
        }
    ]
}

Note

Enter table and column names in the pattern described in Cross-database queries. If your database is in AWS Glue Data Catalog, the format is awsdatacatalog.gluedatabase.table.

INCLUDE – Only the tables or columns that you specify are included as context when generating SQL.
EXCLUDE – The tables or columns that you specify are excluded as context when generating SQL.

You can specify whether to include or exclude tables or columns in the following ways:

To include or exclude a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "inclusion": "EXCLUDE"
}
```
The query engine doesn't add Table A in the additional context for generating SQL.
To include or exclude a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}
```
The SQL generation ignores Column A in Table A in the context when generating SQL.

You can combine tables and columns when specifying inclusions or exclusions, as in the following example:


{
    "name": "database.schema.tableA",
    "inclusion": "INCLUDE",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}

SQL generation includes Table A, but excludes Column A within it when adding context for generating SQL.

Important

Table and column exclusions aren't substitutes for guardrails. These table and column inclusions and exclusions are used as additional context for model to consider when generating SQL.

Include or exclude tables or columns in the database

INCLUDE – Only the tables or columns that you specify are included as context when generating SQL.
EXCLUDE – The tables or columns that you specify are excluded as context when generating SQL.

You can specify whether to include or exclude tables or columns in the following ways:

To include or exclude a table, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "inclusion": "EXCLUDE"
}
```
The query engine doesn't add Table A in the additional context for generating SQL.
To include or exclude a column, include a QueryGenerationTable object in the tables array. In that object, specify the name of the table in the name field and include the columns field, which maps to an array of QueryGenerationColumn. In a QueryGenerationColumn object, include the name of the column in the name field and whether to include or exclude it in the inclusion field, as in the following example:
```
{
    "name": "database.schema.tableA",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}
```
The SQL generation ignores Column A in Table A in the context when generating SQL.

You can combine tables and columns when specifying inclusions or exclusions, as in the following example:


{
    "name": "database.schema.tableA",
    "inclusion": "INCLUDE",
    "columns": [
        {
            "name": "database.schema.tableA.columnA",
            "inclusion": "EXCLUDE"
        }
    ]
}

SQL generation includes Table A, but excludes Column A within it when adding context for generating SQL.

Important

Table and column exclusions aren't substitutes for guardrails. These table and column inclusions and exclusions are used as additional context for model to consider when generating SQL.

naturalLanguage – An example of a query in natural language.
sql – The SQL query that corresponds to the natural language query.

Give the query engine example mappings of natural language to SQL queries

naturalLanguage – An example of a query in natural language.
sql – The SQL query that corresponds to the natural language query.

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Allow your service role to access your data store

Sync a structured data store

Next topic:

Sync a structured data store

Previous topic:

Allow your service role to access your data store

Need help?

Did this page help you?

Provide feedback

Privacy Site terms Cookie preferences

Select your cookie preferences

Create a knowledge base by connecting to a structured data store

Note

Use the console

Note

Note

Important

Note

Important

Use the API

queryEngineConfiguration

Amazon Redshift Provisioned

Amazon Redshift Serverless

Note

Note

storageConfigurations

Note

AWS Glue Data Catalog

Note

Note

Amazon Redshift databases

Note

Important

Note

Important

queryGenerationConfiguration

Important

Note

Add descriptions for tables or columns in the database

Note

Important

Include or exclude tables or columns in the database

Important

Give the query engine example mappings of natural language to SQL queries

Next topic:

Previous topic:

Need help?

Did this page help you?