DetectText
Detects text in the input image and converts it into machine-readable text.
Pass the input image as base64-encoded image bytes or as a reference to an image in an Amazon S3 bucket. If you use the AWS CLI to call Amazon Rekognition operations, you must pass it as a reference to an image in an Amazon S3 bucket. For the AWS CLI, passing image bytes is not supported. The image must be either a .png or .jpeg formatted file.
The DetectText operation returns text in an array of TextDetection elements, TextDetections. Each TextDetection element provides information about a single word or line of text that was detected in the image.
A word is one or more script characters that are not separated by spaces. DetectText can detect up to 100 words in an image.
A line is a string of equally spaced words. A line isn't necessarily a complete sentence. For example, a driver's license number is detected as a line. A line ends when there is no aligned text after it. Also, a line ends when there is a large gap between words, relative to the length of the words. This means, depending on the gap between words, Amazon Rekognition may detect multiple lines in text aligned in the same direction. Periods don't represent the end of a line. If a sentence spans multiple lines, the DetectText operation returns multiple lines.
To determine whether a TextDetection element is a line of text or a word, use the TextDetection object Type field.
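The relationship between lines and words can be sketched as follows. This is a minimal illustration, not part of the service API: it assumes that Type takes the values LINE and WORD and that a word's ParentId holds the Id of the line it belongs to, as described for the TextDetection data type. The function name and the sample data are hypothetical.

```python
from collections import defaultdict


def group_words_by_line(text_detections):
    """Return a list of (line_text, [word_text, ...]) pairs.

    Assumes each element has "Type" ("LINE" or "WORD"), "Id", and
    "DetectedText", and that words carry the Id of their line in "ParentId".
    """
    lines = {}                 # line Id -> detected line text
    words = defaultdict(list)  # line Id -> words detected in that line
    for detection in text_detections:
        if detection["Type"] == "LINE":
            lines[detection["Id"]] = detection["DetectedText"]
        elif detection["Type"] == "WORD":
            words[detection.get("ParentId")].append(detection["DetectedText"])
    return [(text, words[line_id]) for line_id, text in lines.items()]


# Hypothetical TextDetections array, trimmed to the fields used above.
sample = [
    {"Type": "LINE", "Id": 0, "DetectedText": "IT'S MONDAY"},
    {"Type": "WORD", "Id": 1, "ParentId": 0, "DetectedText": "IT'S"},
    {"Type": "WORD", "Id": 2, "ParentId": 0, "DetectedText": "MONDAY"},
]
grouped = group_words_by_line(sample)
```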
To be detected, text must be within +/- 90 degrees orientation of the horizontal axis.
For more information, see Detecting text.
Request Syntax
{
"Filters": {
"RegionsOfInterest": [
{
"BoundingBox": {
"Height": number,
"Left": number,
"Top": number,
"Width": number
},
"Polygon": [
{
"X": number,
"Y": number
}
]
}
],
"WordFilter": {
"MinBoundingBoxHeight": number,
"MinBoundingBoxWidth": number,
"MinConfidence": number
}
},
"Image": {
"Bytes": blob,
"S3Object": {
"Bucket": "string",
"Name": "string",
"Version": "string"
}
}
}
Request Parameters
For information about the parameters that are common to all actions, see Common Parameters.
The request accepts the following data in JSON format.
- Filters
-
Optional parameters that let you set the criteria that the text must meet to be included in your response.
Type: DetectTextFilters object
Required: No
- Image
-
The input image as base64-encoded bytes or an Amazon S3 object. If you use the AWS CLI to call Amazon Rekognition operations, you can't pass image bytes.
If you are using an AWS SDK to call Amazon Rekognition, you might not need to base64-encode image bytes passed using the Bytes field. For more information, see Image specifications.
Type: Image object
Required: Yes
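As an illustration of how the request parameters fit together, the following sketch builds the request for an image stored in Amazon S3. The helper name, its arguments, and the bucket and object names are hypothetical. The sketch also assumes that MinConfidence is a percentage from 0 to 100 and that bounding box values are ratios of the overall image dimensions.

```python
def build_detect_text_request(bucket, name, min_confidence=None, region=None):
    """Build the DetectText request parameters for an image in Amazon S3.

    region is an optional (left, top, width, height) tuple, assumed to be
    expressed as ratios of the overall image width and height.
    """
    request = {"Image": {"S3Object": {"Bucket": bucket, "Name": name}}}
    filters = {}
    if min_confidence is not None:
        filters["WordFilter"] = {"MinConfidence": float(min_confidence)}
    if region is not None:
        left, top, width, height = region
        filters["RegionsOfInterest"] = [
            {"BoundingBox": {"Left": left, "Top": top,
                             "Width": width, "Height": height}}
        ]
    if filters:  # Filters is optional, so omit it when nothing was requested
        request["Filters"] = filters
    return request


# Hypothetical bucket and object names.
request = build_detect_text_request(
    "amzn-s3-demo-bucket", "street-sign.png",
    min_confidence=80, region=(0.1, 0.2, 0.5, 0.3),
)
```

With the AWS SDK for Python (Boto3), a dictionary shaped like this could be passed as keyword arguments, for example `boto3.client("rekognition").detect_text(**request)`.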
Response Syntax
{
"TextDetections": [
{
"Confidence": number,
"DetectedText": "string",
"Geometry": {
"BoundingBox": {
"Height": number,
"Left": number,
"Top": number,
"Width": number
},
"Polygon": [
{
"X": number,
"Y": number
}
]
},
"Id": number,
"ParentId": number,
"Type": "string"
}
],
"TextModelVersion": "string"
}
Response Elements
If the action is successful, the service sends back an HTTP 200 response.
The following data is returned in JSON format by the service.
- TextDetections
-
An array of text that was detected in the input image.
Type: Array of TextDetection objects
- TextModelVersion
-
The model version used to detect text.
Type: String
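When drawing detected text on the source image, the Geometry values usually have to be converted to pixels. The sketch below assumes that BoundingBox values are ratios of the overall image width and height, so the caller must supply the image dimensions. The function name and the sample values are hypothetical.

```python
def bounding_box_to_pixels(box, image_width, image_height):
    """Convert a ratio-based BoundingBox to (left, top, right, bottom) pixels."""
    left = round(box["Left"] * image_width)
    top = round(box["Top"] * image_height)
    right = round((box["Left"] + box["Width"]) * image_width)
    bottom = round((box["Top"] + box["Height"]) * image_height)
    return left, top, right, bottom


# Hypothetical box covering the middle half of the width of an 800x600 image.
box = {"Left": 0.25, "Top": 0.5, "Width": 0.5, "Height": 0.25}
pixels = bounding_box_to_pixels(box, image_width=800, image_height=600)
```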
Errors
For information about the errors that are common to all actions, see Common Errors.
- AccessDeniedException
-
You are not authorized to perform the action.
HTTP Status Code: 400
- ImageTooLargeException
-
The input image size exceeds the allowed limit. If you are calling DetectProtectiveEquipment, the image size or resolution exceeds the allowed limit. For more information, see Guidelines and quotas in Amazon Rekognition.
HTTP Status Code: 400
- InternalServerError
-
Amazon Rekognition experienced a service issue. Try your call again.
HTTP Status Code: 500
- InvalidImageFormatException
-
The provided image format is not supported.
HTTP Status Code: 400
- InvalidParameterException
-
Input parameter violated a constraint. Validate your parameter before calling the API operation again.
HTTP Status Code: 400
- InvalidS3ObjectException
-
Amazon Rekognition is unable to access the S3 object specified in the request.
HTTP Status Code: 400
- ProvisionedThroughputExceededException
-
The number of requests exceeded your throughput limit. If you want to increase this limit, contact Amazon Rekognition.
HTTP Status Code: 400
- ThrottlingException
-
Amazon Rekognition is temporarily unable to process the request. Try your call again.
HTTP Status Code: 500
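The errors whose descriptions say to try the call again (InternalServerError and ThrottlingException) can be handled with a retry loop. The following is a minimal sketch, not a prescribed pattern: the helper names, attempt count, and delays are assumptions, and the error code is read from a `response` attribute shaped like the one on botocore's `ClientError`. AWS SDKs may already retry some errors on their own.

```python
import random
import time

# Codes whose descriptions above say "Try your call again".
RETRYABLE_CODES = {"InternalServerError", "ThrottlingException"}


def error_code(exc):
    """Return the service error code carried by an exception, if any."""
    response = getattr(exc, "response", None) or {}
    return response.get("Error", {}).get("Code")


def call_with_retries(operation, max_attempts=4, base_delay=0.5, sleep=time.sleep):
    """Call operation(), retrying retryable errors with exponential backoff."""
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except Exception as exc:
            # Re-raise non-retryable errors, and the last retryable failure.
            if error_code(exc) not in RETRYABLE_CODES or attempt == max_attempts:
                raise
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay))  # add jitter to the wait
```

A call would be wrapped in a zero-argument callable, for example `call_with_retries(lambda: client.detect_text(**request))`, where `client` and `request` are supplied by the caller.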
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: