DeleteInferenceProfile
Deletes an application inference profile. For more information, see Increase throughput and resilience with cross-region inference in Amazon Bedrock. in the Amazon Bedrock User Guide.
Request Syntax
DELETE /inference-profiles/inferenceProfileIdentifier
HTTP/1.1
URI Request Parameters
The request uses the following URI parameters.
- inferenceProfileIdentifier
-
The Amazon Resource Name (ARN) or ID of the application inference profile to delete.
Length Constraints: Minimum length of 1. Maximum length of 2048.
Pattern:
^(arn:aws(|-us-gov|-cn|-iso|-iso-b):bedrock:(|[0-9a-z-]{0,20}):(|[0-9]{12}):(inference-profile|application-inference-profile)/)?[a-zA-Z0-9-:.]+$
Required: Yes
Request Body
The request does not have a request body.
Response Syntax
HTTP/1.1 200
Response Elements
If the action is successful, the service sends back an HTTP 200 response with an empty HTTP body.
Errors
For information about the errors that are common to all actions, see Common Errors.
- AccessDeniedException
-
The request is denied because of missing access permissions.
HTTP Status Code: 403
- ConflictException
-
Error occurred because of a conflict while performing an operation.
HTTP Status Code: 400
- InternalServerException
-
An internal server error occurred. Retry your request.
HTTP Status Code: 500
- ResourceNotFoundException
-
The specified resource Amazon Resource Name (ARN) was not found. Check the Amazon Resource Name (ARN) and try your request again.
HTTP Status Code: 404
- ThrottlingException
-
The number of requests exceeds the limit. Resubmit your request later.
HTTP Status Code: 429
- ValidationException
-
Input validation failed. Check your request parameters and retry the request.
HTTP Status Code: 400
Examples
Delete an application inference profile
Assuming you've created an application inference profile called USClaudeSonnetApplicationIP
, run the following example to delete it:
Sample Request
DELETE /inference-profiles/USClaudeSonnetApplicationIP HTTP/1.1
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: