Select your cookie preferences

We use essential cookies and similar tools that are necessary to provide our site and services. We use performance cookies to collect anonymous statistics, so we can understand how customers use our site and make improvements. Essential cookies cannot be deactivated, but you can choose “Customize” or “Decline” to decline performance cookies.

If you agree, AWS and approved third parties will also use cookies to provide useful site features, remember your preferences, and display relevant content, including relevant advertising. To accept or decline all non-essential cookies, choose “Accept” or “Decline.” To make more detailed choices, choose “Customize.”

Running analysis jobs for custom entity recognition

Focus mode
Running analysis jobs for custom entity recognition - Amazon Comprehend

You can run an asynchronous analysis job to detect custom entities in a set of one or more documents.

Before you begin

You need a custom entity recognition model (also known as a recognizer) before you can detect custom entities. For more information about these models, see Training custom entity recognizer models.

A recognizer that is trained with plain-text annotations supports entity detection for plain-text documents only. A recognizer that is trained with PDF document annotations supports entity detection for plain-text documents, images, PDF files, and Word documents. For files other than text files, Amazon Comprehend performs text extraction before running the analysis. For information about the input files, see Inputs for asynchronous custom analysis.

If you plan to analyze image files or scanned PDF documents, your IAM policy must grant permissions to use two Amazon Textract API methods (DetectDocumentText and AnalyzeDocument). Amazon Comprehend invokes these methods during text extraction. For an example policy, see Permissions required to perform document analysis actions.

To run an async analysis job, you perform the following overall steps:

  1. Store the documents in an Amazon S3 bucket.

  2. Use the API or console to start the analysis job.

  3. Monitor the progress of the analysis job.

  4. After the job runs to completion, retrieve the results of the analysis from the S3 bucket that you specified when you started the job.

PrivacySite termsCookie preferences
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.