AWS::Glue::Table StorageDescriptor
Describes the physical storage of table data.
Syntax
To declare this entity in your AWS CloudFormation template, use the following syntax:
JSON
{
  "BucketColumns" : [ String, ... ],
  "Columns" : [ Column, ... ],
  "Compressed" : Boolean,
  "InputFormat" : String,
  "Location" : String,
  "NumberOfBuckets" : Integer,
  "OutputFormat" : String,
  "Parameters" : Json,
  "SchemaReference" : SchemaReference,
  "SerdeInfo" : SerdeInfo,
  "SkewedInfo" : SkewedInfo,
  "SortColumns" : [ Order, ... ],
  "StoredAsSubDirectories" : Boolean
}
YAML
BucketColumns:
  - String
Columns:
  - Column
Compressed: Boolean
InputFormat: String
Location: String
NumberOfBuckets: Integer
OutputFormat: String
Parameters: Json
SchemaReference:
  SchemaReference
SerdeInfo:
  SerdeInfo
SkewedInfo:
  SkewedInfo
SortColumns:
  - Order
StoredAsSubDirectories: Boolean
Properties
BucketColumns
A list of reducer grouping columns, clustering columns, and bucketing columns in the table.
Required: No
Type: Array of String
Update requires: No interruption
Columns
A list of the Columns in the table.
Required: No
Type: Array of Column
Update requires: No interruption
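As an illustration, a Columns list might look like the following sketch. The column names, types, and comment are hypothetical; Name, Type, and Comment are properties of the Column property type.

```yaml
Columns:
  - Name: order_id          # hypothetical column name
    Type: bigint
    Comment: Unique order identifier
  - Name: order_date        # hypothetical column name
    Type: string
```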
Compressed
True if the data in the table is compressed, or False if not.
Required: No
Type: Boolean
Update requires: No interruption
InputFormat
The input format: SequenceFileInputFormat (binary), or TextInputFormat, or a custom format.
Required: No
Type: String
Pattern: [\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*
Maximum: 128
Update requires: No interruption
Location
The physical location of the table. By default, this takes the form of the warehouse location, followed by the database location in the warehouse, followed by the table name.
Required: No
Type: String
Pattern: [\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\r\n\t]*
Maximum: 2056
Update requires: No interruption
NumberOfBuckets
Must be specified if the table contains any dimension columns.
Required: No
Type: Integer
Update requires: No interruption
OutputFormat
The output format: SequenceFileOutputFormat (binary), or IgnoreKeyTextOutputFormat, or a custom format.
Required: No
Type: String
Pattern: [\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*
Maximum: 128
Update requires: No interruption
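The InputFormat and OutputFormat descriptions use short format names. In templates, these values are typically written as fully qualified Hadoop or Hive class names. The following sketch assumes a delimited-text table and is one possible pairing, not the only valid one.

```yaml
# Assumed scenario: a plain-text (for example, CSV) table
InputFormat: org.apache.hadoop.mapred.TextInputFormat
OutputFormat: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
```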
Parameters
The user-supplied properties in key-value form.
Required: No
Type: Json
Update requires: No interruption
SchemaReference
An object that references a schema stored in the AWS Glue Schema Registry.
Required: No
Type: SchemaReference
Update requires: No interruption
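A SchemaReference might be declared along the lines of the following sketch. The registry name, schema name, and version number are hypothetical placeholders, and the schema is assumed to exist already in the AWS Glue Schema Registry.

```yaml
SchemaReference:
  SchemaId:
    RegistryName: example-registry    # hypothetical registry name
    SchemaName: example-orders-schema # hypothetical schema name
  SchemaVersionNumber: 1              # hypothetical version
```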
SerdeInfo
The serialization/deserialization (SerDe) information.
Required: No
Type: SerdeInfo
Update requires: No interruption
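As a minimal sketch, a SerdeInfo for comma-delimited text could look like the following. It assumes the Hive LazySimpleSerDe class and its field.delim parameter; other data formats need a different SerializationLibrary and parameters.

```yaml
SerdeInfo:
  # Assumed SerDe for delimited text; substitute the class that matches your data
  SerializationLibrary: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
  Parameters:
    field.delim: ","   # column separator passed to the SerDe
```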
SkewedInfo
The information about values that appear frequently in a column (skewed values).
Required: No
Type: SkewedInfo
Update requires: No interruption
SortColumns
A list specifying the sort order of each bucket in the table.
Required: No
Type: Array of Order
Update requires: No interruption
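BucketColumns, NumberOfBuckets, and SortColumns are usually set together for a bucketed table. The following sketch uses hypothetical column names and an arbitrary bucket count. In the Order property type, a SortOrder of 1 means ascending and 0 means descending.

```yaml
BucketColumns:
  - customer_id          # hypothetical bucketing column
NumberOfBuckets: 8       # arbitrary example value
SortColumns:
  - Column: order_date   # hypothetical sort column within each bucket
    SortOrder: 1         # 1 = ascending, 0 = descending
```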
StoredAsSubDirectories
True if the table data is stored in subdirectories, or False if not.
Required: No
Type: Boolean
Update requires: No interruption
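To show how these properties fit together, the following sketch declares an AWS::Glue::Table resource whose StorageDescriptor describes Parquet data in Amazon S3. The logical ID, database name, table name, bucket path, and columns are all hypothetical, and the database is assumed to exist already.

```yaml
Resources:
  OrdersTable:                        # hypothetical logical ID
    Type: AWS::Glue::Table
    Properties:
      CatalogId: !Ref AWS::AccountId
      DatabaseName: sales_db          # hypothetical; must already exist
      TableInput:
        Name: orders                  # hypothetical table name
        TableType: EXTERNAL_TABLE
        Parameters:
          classification: parquet
        StorageDescriptor:
          Location: s3://amzn-s3-demo-bucket/orders/   # placeholder bucket
          InputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat
          OutputFormat: org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat
          Compressed: false
          StoredAsSubDirectories: false
          SerdeInfo:
            SerializationLibrary: org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe
          Columns:
            - Name: order_id          # hypothetical columns
              Type: bigint
            - Name: customer_id
              Type: string
```

Every StorageDescriptor property is optional, so the sketch sets only the ones that matter for this scenario.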
See also
StorageDescriptor Structure in the AWS Glue Developer Guide