StartIngestionJob
Begins a data ingestion job. Data sources are ingested into your knowledge base so that Large Language Models (LLMs) can use your data.
Request Syntax
PUT /knowledgebases/knowledgeBaseId
/datasources/dataSourceId
/ingestionjobs/ HTTP/1.1
Content-type: application/json
{
"clientToken": "string
",
"description": "string
"
}
URI Request Parameters
The request uses the following URI parameters.
- dataSourceId
-
The unique identifier of the data source you want to ingest into your knowledge base.
Pattern:
^[0-9a-zA-Z]{10}$
Required: Yes
- knowledgeBaseId
-
The unique identifier of the knowledge base for the data ingestion job.
Pattern:
^[0-9a-zA-Z]{10}$
Required: Yes
Request Body
The request accepts the following data in JSON format.
- clientToken
-
A unique, case-sensitive identifier to ensure that the API request completes no more than one time. If this token matches a previous request, Amazon Bedrock ignores the request, but does not return an error. For more information, see Ensuring idempotency.
Type: String
Length Constraints: Minimum length of 33. Maximum length of 256.
Pattern:
^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,256}$
Required: No
- description
-
A description of the data ingestion job.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 200.
Required: No
Response Syntax
HTTP/1.1 202
Content-type: application/json
{
"ingestionJob": {
"dataSourceId": "string",
"description": "string",
"failureReasons": [ "string" ],
"ingestionJobId": "string",
"knowledgeBaseId": "string",
"startedAt": "string",
"statistics": {
"numberOfDocumentsDeleted": number,
"numberOfDocumentsFailed": number,
"numberOfDocumentsScanned": number,
"numberOfMetadataDocumentsModified": number,
"numberOfMetadataDocumentsScanned": number,
"numberOfModifiedDocumentsIndexed": number,
"numberOfNewDocumentsIndexed": number
},
"status": "string",
"updatedAt": "string"
}
}
Response Elements
If the action is successful, the service sends back an HTTP 202 response.
The following data is returned in JSON format by the service.
- ingestionJob
-
Contains information about the data ingestion job.
Type: IngestionJob object
Errors
For information about the errors that are common to all actions, see Common Errors.
- AccessDeniedException
-
The request is denied because of missing access permissions.
HTTP Status Code: 403
- ConflictException
-
There was a conflict performing an operation.
HTTP Status Code: 409
- InternalServerException
-
An internal server error occurred. Retry your request.
HTTP Status Code: 500
- ResourceNotFoundException
-
The specified resource Amazon Resource Name (ARN) was not found. Check the Amazon Resource Name (ARN) and try your request again.
HTTP Status Code: 404
- ServiceQuotaExceededException
-
The number of requests exceeds the service quota. Resubmit your request later.
HTTP Status Code: 402
- ThrottlingException
-
The number of requests exceeds the limit. Resubmit your request later.
HTTP Status Code: 429
- ValidationException
-
Input validation failed. Check your request parameters and retry the request.
HTTP Status Code: 400
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: