# Amazon DocumentDB generative artificial intelligence
<a name="generative-ai"></a>

Amazon DocumentDB offers capabilities to enable machine learning (ML) and generative artificial intelligence (AI) models to work with data stored in Amazon DocumentDB in real time. Customers no longer have to spend time managing separate infrastructure, writing code to connect with another service, and duplicating data from their primary database.

For more information on artificial intelligence and how AWS can support your AI needs, see this ["What-is"](https://aws.amazon.com/what-is/artificial-intelligence/) article.

**Topics**
+ [

# No-code machine learning with Amazon SageMaker AI Canvas
](no-code-machine-learning.md)
+ [

# Vector search for Amazon DocumentDB
](vector-search.md)

# No-code machine learning with Amazon SageMaker AI Canvas
<a name="no-code-machine-learning"></a>

[Amazon SageMaker AI Canvas](https://docs.aws.amazon.com/sagemaker/latest/dg/canvas.html) enables you to build your own AI/ML models without having to write a single line of code. You can build ML models for common use cases such as regression and forecasting and can access and evaluate foundation models (FMs) from Amazon Bedrock. You can also access public FMs from Amazon SageMaker AI JumpStart for content generation, text extraction, and text summarization to support generative AI solutions.

## How to build no-code ML models with SageMaker AI Canvas
<a name="w2aac23b9b5"></a>

Amazon DocumentDB now integrates with Amazon SageMaker AI Canvas to enable no-code machine learning (ML) with data stored in Amazon DocumentDB. You can now build ML models for regression and forecasting needs and use foundation models for content summarization and generation using data stored in Amazon DocumentDB without writing a single line of code.

SageMaker AI Canvas provides a visual interface that allows Amazon DocumentDB customers to generate predictions without requiring any AI/ML expertise or write a single line of code. Customers can now launch the SageMaker AI Canvas workspace from the AWS Management Console, import and join Amazon DocumentDB data for data preparation and model training. Data in Amazon DocumentDB can now be used in SageMaker AI Canvas to build and augment models to predict customer churn, detect fraud, predict maintenance failures, forecast business metrics, and generate content. Customers can now publish and share ML-driven insights across teams using SageMaker AI Canvas’s native integration with Quick. Data ingestion pipelines in SageMaker AI Canvas run on Amazon DocumentDB secondary instances by default, ensuring that the performance of application and SageMaker AI Canvas ingestion workloads are not hindered.

Amazon DocumentDB customers can get started with SageMaker AI Canvas by navigating to the new Amazon DocumentDB No-Code ML Console page and connecting to new or available SageMaker AI Canvas workspaces.

## Configuring the SageMaker AI domain and user profile
<a name="sagemaker-domain"></a>

You can connect to Amazon DocumentDB clusters from SageMaker AI domains that are running in VPC Only mode. By launching a SageMaker AI domain in your VPC, you can control the data flow from your SageMaker AI Studio and Canvas environments. This allows you to restrict internet access, monitor and inspect traffic using standard AWS networking and security capabilities, and connect to other AWS resources through VPC endpoints. Please refer to [Amazon SageMaker AI Canvas Getting started](https://docs.aws.amazon.com/sagemaker/latest/dg/canvas-getting-started.html) and [Configure Amazon SageMaker AI Canvas in a VPC without internet access](https://docs.aws.amazon.com/sagemaker/latest/dg/canvas-vpc.html) located in the *Amazon SageMaker AI Developer Guide* to create your SageMaker AI domain to connect to your Amazon DocumentDB cluster.

## Configuring IAM access permissions for Amazon DocumentDB and SageMaker AI Canvas
<a name="iam-access-canvas"></a>

An Amazon DocumentDB user that has `AmazonDocDBConsoleFullAccess` attached to their associated role and identity can access the AWS Management Console. Add the following actions to the aforementioned role or identity to provide access to no-code machine learning with Amazon SageMaker AI Canvas.

```
"sagemaker:CreatePresignedDomainUrl",
"sagemaker:DescribeDomain",
"sagemaker:ListDomains",
"sagemaker:ListUserProfiles"
```

## Creating database users and roles for SageMaker AI Canvas
<a name="w2aac23b9c11"></a>

You can restrict access to the actions that users can perform on databases using role-based access control (RBAC) in Amazon DocumentDB. RBAC works by granting one or more roles to a user. These roles determine the operations that a user can perform on database resources. 

As a Canvas user, you connect to a Amazon DocumentDB database with username and password credentials. You can create a database user/role for a Canvas user that has read access to the specific databases using Amazon DocumentDB RBAC functionality.

For example, use the `createUser` operation:

```
db.createUser({
user: "canvas_user", 
pwd: "<insert-password>", 
roles: [{role: "read", db: "sample-database-1"}]
})
```

This creates a `canvas_user` which has read permissions to the `sample-database-1` database. Your Canvas analysts can use this credential to access data in your Amazon DocumentDB cluster. Refer to [Database access using Role-Based Access Control](role_based_access_control.md) to learn more. 

## Available regions
<a name="available-regions"></a>

The no-code integration is available in regions where both Amazon DocumentDB and Amazon SageMaker AI Canvas are supported. The regions include:
+ us-east-1 (N. Virginia)
+ us-east-2 (Ohio)
+ us-west-2 (Oregon)
+ ap-northeast-1 (Tokyo)
+ ap-northeast-2 (Seoul)
+ ap-south-1 (Mumbai)
+ ap-southeast-1 (Singapore)
+ ap-southeast-2 (Sydney)
+ eu-central-1 (Frankfurt)
+ eu-west-1 (Ireland)

Please refer to [Amazon SageMaker AI Canvas](https://docs.aws.amazon.com/sagemaker/latest/dg/canvas.html) in the *Amazon SageMaker AI Developer Guide* for the latest region availability.

# Vector search for Amazon DocumentDB
<a name="vector-search"></a>

Vector search is a method used in machine learning to find similar data points to a given data point by comparing their vector representations using distance or similarity metrics. The closer the two vectors are in the vector space, the more similar the underlying items are considered to be. This technique helps capture the semantic meaning of the data. This approach is useful in various applications, such as recommendation systems, natural language processing, and image recognition.

Vector search for Amazon DocumentDB combines the flexibility and rich querying capability of a JSON-based document database with the power of vector search. If you want to use your existing Amazon DocumentDB data or a flexible document data structure to build machine learning and generative AI use cases, such as semantic search experience, product recommendation, personalization, chatbots, fraud detection, and anomaly detection, then vector search for Amazon DocumentDB is an ideal choice for you. Vector search is available on Amazon DocumentDB 5.0 instance-based clusters.

**Topics**
+ [

## Inserting vectors
](#w2aac23c11b9)
+ [

## Creating a vector index
](#w2aac23c11c11)
+ [

## Getting an index definition
](#w2aac23c11c13)
+ [

## Querying vectors
](#w2aac23c11c15)
+ [

## Features and limitations
](#vector-limitations)
+ [

## Best practices
](#w2aac23c11c19)

## Inserting vectors
<a name="w2aac23c11b9"></a>

To insert vectors into your Amazon DocumentDB database, you can use existing insert methods: 

**Example**

In the following example, a collection of five documents within a test database is created. Each document includes two fields: the product name and its corresponding vector embedding.

```
db.collection.insertMany([
  {"product_name": "Product A", "vectorEmbedding": [0.2, 0.5, 0.8]},
  {"product_name": "Product B", "vectorEmbedding": [0.7, 0.3, 0.9]},
  {"product_name": "Product C", "vectorEmbedding": [0.1, 0.2, 0.5]},
  {"product_name": "Product D", "vectorEmbedding": [0.9, 0.6, 0.4]},
  {"product_name": "Product E", "vectorEmbedding": [0.4, 0.7, 0.2]}
]);
```

## Creating a vector index
<a name="w2aac23c11c11"></a>

Amazon DocumentDB supports both Hierarchical Navigable Small World (HNSW) indexing and Inverted File with Flat Compression (IVFFlat) indexing methods. An IVFFlat index segregates vectors into lists and subsequently searches a selected subset of those lists that are nearest to the query vector. On the other hand, an HNSW index organizes the vector data into a multi-layered graph. Although HNSW has slower build times compared to IVFFlat, it delivers better query performance and recall. Unlike IVFFlat, HNSW has no training step involved, allowing the index to be generated without any initial data load. For the majority of use cases, we recommend using the HNSW index type for vector search.

If you do not create a vector index, Amazon DocumentDB performs an exact nearest neighbor search, ensuring perfect recall. However, in production scenarios, speed is crucial. We recommend using vector indexes, which may trade some recall for improved speed. It's important to note that adding a vector index can lead to different query results.

**Templates**

You can use the following `createIndex` or `runCommand` templates to build a vector index on a vector field:

------
#### [ Using createIndex ]

In certain drivers, such as mongosh and Java, using the `vectorOptions` parameters in `createIndex` may result in an error. In such cases, we recommend using `runCommand`:

```
db.collection.createIndex(
  { "<vectorField>": "vector" },
  { "name": "<indexName>",
    "vectorOptions": {
      "type": " <hnsw> | <ivfflat> ",
      "dimensions": <number_of_dimensions>,
      "similarity": " <euclidean> | <cosine> | <dotProduct> ",
      "lists": <number_of_lists> [applicable for IVFFlat],
      "m": <max number of connections> [applicable for HNSW],
      "efConstruction": <size of the dynamic list for index build> [applicable for HNSW]
    }
  }
);
```

------
#### [ Using runCommand ]

In certain drivers, such as mongosh and Java, using the `vectorOptions` parameters in `createIndex` may result in an error. In such cases, we recommend using `runCommand`:

```
db.runCommand(
  { "createIndexes": "<collection>", 
  "indexes": [{
      key: { "<vectorField>": "vector" },
      vectorOptions: {
          type: " <hnsw> | <ivfflat> ",
          dimensions: <number of dimensions>,
          similarity: " <euclidean> | <cosine> | <dotProduct> ",
          lists: <number_of_lists> [applicable for IVFFlat],
          m: <max number of connections> [applicable for HNSW],
          efConstruction: <size of the dynamic list for index build> [applicable for HNSW]
          },
      name: "myIndex" 
      }] 
  }
);
```

------


| Parameter | Requirement | Data type | Description | Value(s) | 
| --- | --- | --- | --- | --- | 
|  **name**  |  optional  |  string  |  Specifies the name of the index.  |  Alphanumeric  | 
|  **type**  |  optional  |    |  Specifies the type of index.  |  Supported: hnsw or ivfflat Default: HNSW (engine patch 3.0.4574 onwards)  | 
|  **dimensions**  |  required  |  integer  |  Specifies the number of dimensions in the vector data.  |  Maximum of 2,000 dimensions.  | 
|  **similarity**  |  required  |  string  |  Specifies the distance metric used for the similarity calculation.  |  [\[See the AWS documentation website for more details\]](http://docs.aws.amazon.com/documentdb/latest/developerguide/vector-search.html)  | 
|  **lists**  |  required for IVFFlat  |  integer  |  Specifies the number of clusters that the IVFFlat index uses to group the vector data. The recommended setting is the \$1 of documents/1000 for up to 1M documents and `sqrt(# of documents)` for over 1M documents.  |  Minimum: 1 Maximum: Refer to the lists per instance type table in [Features and limitations](#vector-limitations) below.  | 
|  **m**  |  optional  |  integer  |  Specifies the max number of connections for an HNSW index  |  Default: 16 Range [2, 100]  | 
|  **efConstruction**  |  optional  |  integer  |  Specifies the size of the dynamic candidate list for constructing the graph for HNSW index. `efConstruction` must be greater than or equal to (2 \$1 m)  |  Default: 64 Range [4, 1000]  | 

It is important that you set the value of sub-parameters such as `lists` for IVFFlat and `m` and `efConstruction` for HNSW appropriately as it will affect the accuracy/recall, build time, and performance of your search. A higher list value increases the speed of the query as it reduces the number of vectors in each list, resulting in smaller regions. However, a smaller region size may lead to more recall errors, resulting in lower accuracy. For HNSW, increasing the value of `m` and `efConstruction` increases the accuracy, but also increases index build time and size. See the following examples:

**Examples**

------
#### [ HNSW ]

```
db.collection.createIndex(
  { "vectorEmbedding": "vector" },
  { "name": "myIndex",
    "vectorOptions": {
      "type": "hnsw",
      "dimensions": 3,
      "similarity": "euclidean",
      "m": 16,
      "efConstruction": 64
    }
  }
);
```

------
#### [ IVFFlat ]

```
db.collection.createIndex(
  { "vectorEmbedding": "vector" },
  { "name": "myIndex",
    "vectorOptions": {
      "type": "ivfflat",
      "dimensions": 3,
      "similarity": "euclidean",
      "lists":1
    }
  }
)
```

------

## Getting an index definition
<a name="w2aac23c11c13"></a>

You can view the details of your indexes, including vector indexes, using the `getIndexes` command:

**Example**

```
db.collection.getIndexes()
```

**Example output**

```
[
 {
  "v" : 4,
  "key" : {
   "_id" : 1
  },
  "name" : "_id_",
  "ns" : "test.collection"
 },
 {
  "v" : 4,
  "key" : {
   "vectorEmbedding" : "vector"
  },
  "name" : "myIndex",
  "vectorOptions" : {
   "type" : "ivfflat",
   "dimensions" : 3,
   "similarity" : "euclidean",
   "lists" : 1
  },
  "ns" : "test.collection"
 }
]
```

## Querying vectors
<a name="w2aac23c11c15"></a>

Amazon DocumentDB supports two vector search operators for querying vectors:

### Classic vector search operator
<a name="w2aac23c11c15b5"></a>

Use the following template to query a vector:

```
db.collection.aggregate([
  {
    $search: {
      "vectorSearch": {
        "vector": <query vector>, 
        "path": "<vectorField>", 
        "similarity": "<distance metric>",
        "k": <number of results>,
        "probes":<number of probes> [applicable for IVFFlat],
        "efSearch":<size of the dynamic list during search> [applicable for HNSW]
      }
    }
  }
]);
```


| Parameter | Requirement | Type | Description | Value(s) | 
| --- | --- | --- | --- | --- | 
|  **vectorSearch**  |  required  |  operator  |  Used inside \$1search command to query the vectors.  |    | 
|  **vector**  |  required  |  array  |  Indicates the query vector that will be used to find similar vectors.  |    | 
|  **path**  |  required  |  string  |  Defines the name of the vector field.  |    | 
|  **k**  |  required  |  integer  |  Specifies the number of results that the search returns.  |    | 
|  **similarity**  |  required  |  string  |  Specifies the distance metric used for the similarity calculation.  |  [\[See the AWS documentation website for more details\]](http://docs.aws.amazon.com/documentdb/latest/developerguide/vector-search.html)  | 
|  **probes**  |  optional  |  integer  |  The number of clusters you want vector search to inspect. A higher value provides better recall at the cost of speed. It can be set to the number of lists for exact nearest neighbor search (at which point the planner won’t use the index). The recommended setting to start fine-tuning is `sqrt(# of lists)`.  |  Default: 1  | 
|  **efSearch**  |  optional  |  integer  |  Specifies the size of the dynamic candidate list that HNSW index uses during search. A higher value of `efSearch` provides better recall at cost of speed.  |  Default: 40 Range [1, 1000]  | 

It is important to fine tune the value of `efSearch` (HNSW) or `probes` (IVFFlat) to achieve your desired performance and accuracy. See the following example operations:

------
#### [ HNSW ]

```
db.collection.aggregate([
  {
    $search: {
      "vectorSearch": {
        "vector": [0.2, 0.5, 0.8], 
        "path": "vectorEmbedding", 
        "similarity": "euclidean",
        "k": 2,
        "efSearch": 40
      }
    }
  }
]);
```

------
#### [ IVFFlat ]

```
db.collection.aggregate([
  {
    $search: {
      "vectorSearch": {
        "vector": [0.2, 0.5, 0.8], 
        "path": "vectorEmbedding", 
        "similarity": "euclidean",
        "k": 2,
        "probes": 1
      }
    }
  }
]);
```

------

**Example output**

Output from this operation looks something like the following:

```
{ "_id" : ObjectId("653d835ff96bee02cad7323c"), "product_name" : "Product A", "vectorEmbedding" : [ 0.2, 0.5, 0.8 ] }
{ "_id" : ObjectId("653d835ff96bee02cad7323e"), "product_name" : "Product C", "vectorEmbedding" : [ 0.1, 0.2, 0.5 ] }
```

### `$vectorSearch` operator (available in Amazon DocumentDB 8.0 onwards)
<a name="w2aac23c11c15b7"></a>

Use the following template to query a vector:

```
db.collection.aggregate([
{
  "$vectorSearch": {
    "exact": true | false,
    "index": "<index-name>" [supports only HNSW index],
    "limit": <number-of-results> [same as k],
    "path": "<vector field-to-search>",
    "queryVector": <array-of-numbers>,
    "numCandidates": <number-of-candidates> [same as efSearch], 
  }
}])
```

## Features and limitations
<a name="vector-limitations"></a>

**Version compatibility**
+ Vector search for Amazon DocumentDB is only available on Amazon DocumentDB 5.0\$1 instance-based clusters.

**Vectors**
+ Amazon DocumentDB can index vectors of up to 2,000 dimensions. However, up to 16,000 dimensions can be stored without an index.

**Indexes**
+ For IVFFlat index creation, the recommended setting for lists parameter is the number of documents/1000 for up to 1M documents and `sqrt(# of documents)` for over 1M documents. Due to a working memory limit, Amazon DocumentDB supports a certain maximum value of the lists parameter depending on the number of dimensions. For your reference, the following table provides the maximum values of lists parameter for vectors of 500, 1000, and 2,000 dimensions:    
[\[See the AWS documentation website for more details\]](http://docs.aws.amazon.com/documentdb/latest/developerguide/vector-search.html)
+ No other index options such as `compound`, `sparse` or `partial` are supported with vector indexes.
+ Parallel index build is not supported for HNSW index in Amazon DocumentDB 5.0.

**Vector query**
+ For vector search query, it is important to fine tune the parameters such as `probes` or `efSearch` for optimum results. The higher the value of `probes` or `efSearch` parameter, the higher the recall and lower the speed. The recommended setting to start fine tuning the probes parameter is `sqrt(# of lists)`. 

## Best practices
<a name="w2aac23c11c19"></a>

Learn best practices for working with vector search in Amazon DocumentDB. This section is continually updated as new best practices are identified.
+ Inverted File with Flat Compression (IVFFlat) index creation involves clustering and organizing the data points based on similarities. Hence, in order for an index to be more effective, we recommend that you at least load some data before creating the index. 
+ For vector search queries, it is important to fine tune the parameters such as `probes` or `efSearch` for optimum results. The higher the value of the `probes` or `efSearch` parameter, the higher is the recall and lower is the speed. The recommended setting to start fine tuning the `probes` parameter is `sqrt(lists)`. 

**Resources**
+ [Vector search what's new blog post](https://aws.amazon.com/blogs/aws/vector-search-for-amazon-documentdb-with-mongodb-compatibility-is-now-generally-available)
+ [Semantic search code sample](https://github.com/aws-samples/amazon-documentdb-samples/tree/master/blogs/semanticsearch-docdb)
+ [Amazon DocumentDB vector search code samples](https://github.com/aws-samples/amazon-documentdb-samples/tree/master/samples/vector-search)