# DatasetEntityRecognizerDocuments

Describes the documents submitted with a dataset for an entity recognizer model.

## Contents

**S3Uri**

Specifies the Amazon S3 location where the documents for the dataset are located.

Type: String

Length Constraints: Maximum length of 1024.

Pattern: `s3://[a-z0-9][\.\-a-z0-9]{1,61}[a-z0-9](/.*)?`

Required: Yes

**InputFormat**

Specifies how the text in an input file should be processed. This is optional, and the default is ONE_DOC_PER_LINE.

+ ONE_DOC_PER_FILE - Each file is considered a separate document. Use this option when you are processing large documents, such as newspaper articles or scientific papers.
+ ONE_DOC_PER_LINE - Each line in a file is considered a separate document. Use this option when you are processing many short documents, such as text messages.

Type: String

Valid Values: `ONE_DOC_PER_FILE | ONE_DOC_PER_LINE`

Required: No

## See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following:

+ [AWS SDK for C++](https://docs.aws.amazon.com/goto/SdkForCpp/comprehend-2017-11-27/DatasetEntityRecognizerDocuments)
+ [AWS SDK for Java V2](https://docs.aws.amazon.com/goto/SdkForJavaV2/comprehend-2017-11-27/DatasetEntityRecognizerDocuments)
+ [AWS SDK for Ruby V3](https://docs.aws.amazon.com/goto/SdkForRubyV3/comprehend-2017-11-27/DatasetEntityRecognizerDocuments)
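
## Example

The constraints listed under Contents can be checked on the client before a request is sent. The following is a minimal sketch, not part of any AWS SDK: the helper name `build_documents` is hypothetical, and it assumes the documented pattern must match the whole `S3Uri` string. The service performs its own validation regardless.

```python
import re

# Constraints copied from the Contents section of this page.
S3_URI_PATTERN = re.compile(r"s3://[a-z0-9][\.\-a-z0-9]{1,61}[a-z0-9](/.*)?")
S3_URI_MAX_LENGTH = 1024
VALID_INPUT_FORMATS = {"ONE_DOC_PER_FILE", "ONE_DOC_PER_LINE"}


def build_documents(s3_uri, input_format=None):
    """Hypothetical helper: build a DatasetEntityRecognizerDocuments-shaped dict.

    Raises ValueError if a documented constraint is violated.
    """
    if len(s3_uri) > S3_URI_MAX_LENGTH:
        raise ValueError("S3Uri exceeds the maximum length of 1024")
    # Assumption: the pattern is applied to the entire string.
    if S3_URI_PATTERN.fullmatch(s3_uri) is None:
        raise ValueError("S3Uri does not match the documented pattern")

    documents = {"S3Uri": s3_uri}

    # InputFormat is optional; when omitted, the documented default
    # (ONE_DOC_PER_LINE) applies, so the key is simply left out.
    if input_format is not None:
        if input_format not in VALID_INPUT_FORMATS:
            raise ValueError("InputFormat must be ONE_DOC_PER_FILE or ONE_DOC_PER_LINE")
        documents["InputFormat"] = input_format

    return documents
```

For instance, `build_documents("s3://my-bucket/train/docs/", "ONE_DOC_PER_FILE")` returns a dictionary with both members set, while a URI such as `https://example.com/docs` raises `ValueError`. The bucket name and prefix here are placeholders.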