AWS::Glue::Classifier CsvClassifier
A classifier for custom CSV
content.
Syntax
To declare this entity in your AWS CloudFormation template, use the following syntax:
JSON
{ "AllowSingleColumn" :
Boolean
, "ContainsCustomDatatype" :[ String, ... ]
, "ContainsHeader" :String
, "CustomDatatypeConfigured" :Boolean
, "Delimiter" :String
, "DisableValueTrimming" :Boolean
, "Header" :[ String, ... ]
, "Name" :String
, "QuoteSymbol" :String
}
YAML
AllowSingleColumn:
Boolean
ContainsCustomDatatype:- String
ContainsHeader:String
CustomDatatypeConfigured:Boolean
Delimiter:String
DisableValueTrimming:Boolean
Header:- String
Name:String
QuoteSymbol:String
Properties
AllowSingleColumn
-
Enables the processing of files that contain only one column.
Required: No
Type: Boolean
Update requires: No interruption
ContainsCustomDatatype
-
Indicates whether the CSV file contains custom data types.
Required: No
Type: Array of String
Update requires: No interruption
ContainsHeader
-
Indicates whether the CSV file contains a header.
A value of
UNKNOWN
specifies that the classifier will detect whether the CSV file contains headings.A value of
PRESENT
specifies that the CSV file contains headings.A value of
ABSENT
specifies that the CSV file does not contain headings.Required: No
Type: String
Allowed values:
UNKNOWN | PRESENT | ABSENT
Update requires: No interruption
CustomDatatypeConfigured
-
Enables the configuration of custom data types.
Required: No
Type: Boolean
Update requires: No interruption
Delimiter
-
A custom symbol to denote what separates each column entry in the row.
Required: No
Type: String
Pattern:
[^\r\n]
Minimum:
1
Maximum:
1
Update requires: No interruption
DisableValueTrimming
-
Specifies not to trim values before identifying the type of column values. The default value is
true
.Required: No
Type: Boolean
Update requires: No interruption
Header
-
A list of strings representing column names.
Required: No
Type: Array of String
Update requires: No interruption
Name
-
The name of the classifier.
Required: No
Type: String
Pattern:
[\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*
Minimum:
1
Maximum:
255
Update requires: No interruption
QuoteSymbol
-
A custom symbol to denote what combines content into a single column value. It must be different from the column delimiter.
Required: No
Type: String
Pattern:
[^\r\n]
Minimum:
1
Maximum:
1
Update requires: No interruption
Examples
Create a custom classifier test
With AWS::Glue::Classifier
you can create a custom classifier test.
JSON
{ "Description": "AWS Glue custom classifier test", "Resources": { "MyCSVclassifier": { "Type": "AWS::Glue::Classifier", "Properties": { "CsvClassifier": { "AllowSingleColumn": true, "ContainsHeader": "PRESENT", "Delimiter": ",", "Header": [ "id", "name" ], "Name": "csvclassify", "QuoteSymbol": "\"" } } } } }
YAML
Description: AWS Glue custom classifier test Resources: MyCSVclassifier: Type: 'AWS::Glue::Classifier' Properties: CsvClassifier: AllowSingleColumn: true ContainsHeader: PRESENT Delimiter: ',' Header: - id - name Name: csvclassify QuoteSymbol: '"'