Analyzing Identity Documents
Amazon Textract can extract relevant information from passports, driver licenses, and other identity documentation issued by the US Government using the AnalyzeID API. With Analyze ID, businesses can quickly, and accurately extract information from IDs such as US driver licenses, and passports that have different template or format. AnalyzeID API returns three categories of data types:
-
Key-value pairs available on ID such as Date of Birth, Date of Issue, ID #, Class, and Restrictions.
-
Implied fields on the document that may not have explicit keys associated with them such as Name, Address, and Issued By.
-
The text of the document, the same as would be returned by document text detection.
Key names are standardized within the response. For example, if your driver license says LIC# (license number) and passport says Passport No, Analyze ID response will return the standardized key as “Document ID” along with the raw key (such as LIC#). This standardization lets customers combine information across many IDs that use different terms for the same concept.
Analyze ID returns information in the structures called
IdentityDocumentFields
. These are JSON
structures
containing two pieces of information: the normalized Type and the Value associated with
the Type. These both also have a confidence score. For more information, see Identity Documentation Response
Objects. For more
information regarding the text detection returned by Analyze ID, see Text Detection and Document Analysis
Response Objects
You can use synchronous operations to analyze a driver's license or passport. To
analyze these documents, you use the AnalyzeID operation and pass an identity document
to it. AnalyzeID
returns the entire set of results. For more information,
see Analyzing Identity Documentation with Amazon Textract.
Note
Some identity documents, such as driver's licenses, have two sides. You can pass the front and back images of driver licenses as separate images within the same Analyze ID API request.