Classifier
Classifiers are triggered during a crawl task. A classifier checks whether a given file is
in a format it can handle. If it is, the classifier creates a schema in the form of a
StructType object that matches that data format.
You can use the standard classifiers that AWS Glue provides, or you can write your own
classifiers to best categorize your data sources and specify the appropriate schemas to use
for them. A classifier can be a grok classifier, an XML classifier, a JSON classifier,
or a custom CSV classifier, as specified in one of the fields in the Classifier object.
Contents
- CsvClassifier
  A classifier for comma-separated values (CSV).
  Type: CsvClassifier object
  Required: No
- GrokClassifier
  A classifier that uses grok.
  Type: GrokClassifier object
  Required: No
- JsonClassifier
  A classifier for JSON content.
  Type: JsonClassifier object
  Required: No
- XMLClassifier
  A classifier for XML content.
  Type: XMLClassifier object
  Required: No
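
Because every field above is optional and the classifier's kind is given by whichever field is set, client code usually has to inspect the object to find out what it holds. The following is a minimal sketch of that check in Python. It assumes the Classifier has already been deserialized into a plain dict and that exactly one of the four fields is populated. The helper name `classifier_kind`, the sample data, and the nested keys shown in the sample are illustrative assumptions, not part of this reference.

```python
from typing import Any, Dict

# Field names taken from the Contents list of the Classifier object.
CLASSIFIER_FIELDS = (
    "CsvClassifier",
    "GrokClassifier",
    "JsonClassifier",
    "XMLClassifier",
)


def classifier_kind(classifier: Dict[str, Any]) -> str:
    """Return the name of the single populated field in a Classifier-shaped dict.

    Hypothetical helper: raises ValueError unless exactly one of the four
    known fields is present with a non-None value.
    """
    populated = [
        field for field in CLASSIFIER_FIELDS
        if classifier.get(field) is not None
    ]
    if len(populated) != 1:
        raise ValueError(
            f"expected exactly one classifier field, found {populated}"
        )
    return populated[0]


# Hypothetical sample; the nested keys and values are assumptions
# chosen only for illustration.
sample = {
    "GrokClassifier": {
        "Name": "example-grok",
        "Classification": "example-logs",
        "GrokPattern": "%{GREEDYDATA:message}",
    }
}

kind = classifier_kind(sample)
print(kind, "->", sample[kind]["Name"])
```

Raising an error when no field, or more than one field, is set is a design choice of this sketch. Code that prefers to tolerate unexpected shapes could return the list of populated fields instead.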
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: