
Request Inferences from a Deployed Service (AWS CLI)

Once your Amazon SageMaker AI endpoint is InService, you can make inference requests with the AWS Command Line Interface (AWS CLI) by calling sagemaker-runtime invoke-endpoint. The following example shows how to send an image for inference:

aws sagemaker-runtime invoke-endpoint --endpoint-name 'insert name of your endpoint here' --body fileb://image.jpg --content-type=application/x-image output_file.txt

If the inference request succeeds, the CLI writes the response from your endpoint to output_file.txt.
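The format of the returned body depends on your model and its serving container. For a model that returns text or JSON, you can inspect the result directly:

cat output_file.txt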

For TensorFlow models, submit an input with application/json as the content type:

aws sagemaker-runtime invoke-endpoint --endpoint-name 'insert name of your endpoint here' --body fileb://input.json --content-type=application/json output_file.txt
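Endpoints that use TensorFlow Serving accept requests in the TensorFlow Serving REST API format. The following is a minimal sketch of what input.json might look like, assuming a hypothetical model that expects a single flat vector of three numeric features; the actual shape and keys depend on your model's signature:

{
  "instances": [
    [1.0, 2.0, 5.0]
  ]
}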