AWS::KinesisFirehose::DeliveryStream ParquetSerDe
A serializer to use for converting data to the Parquet format before storing it in
Amazon S3. For more information, see Apache Parquet.
Syntax
To declare this entity in your AWS CloudFormation template, use the following syntax:
JSON
{
  "BlockSizeBytes" : Integer,
  "Compression" : String,
  "EnableDictionaryCompression" : Boolean,
  "MaxPaddingBytes" : Integer,
  "PageSizeBytes" : Integer,
  "WriterVersion" : String
}
YAML
BlockSizeBytes: Integer
Compression: String
EnableDictionaryCompression: Boolean
MaxPaddingBytes: Integer
PageSizeBytes: Integer
WriterVersion: String
Properties
BlockSizeBytes
The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.
Required: No
Type: Integer
Minimum: 67108864
Update requires: No interruption
Compression
The compression codec to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP, with the default being SNAPPY. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.
Required: No
Type: String
Allowed values: UNCOMPRESSED | GZIP | SNAPPY
Update requires: No interruption
EnableDictionaryCompression
Indicates whether to enable dictionary compression.
Required: No
Type: Boolean
Update requires: No interruption
MaxPaddingBytes
The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.
Required: No
Type: Integer
Minimum: 0
Update requires: No interruption
PageSizeBytes
The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
Required: No
Type: Integer
Minimum: 65536
Update requires: No interruption
WriterVersion
Indicates the version of row format to output. The possible values are V1 and V2. The default is V1.
Required: No
Type: String
Allowed values: V1 | V2
Update requires: No interruption
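Example
The following is a minimal sketch of how these properties might be set together in YAML. It assumes the ParquetSerDe block is nested under the Serializer property of an output format configuration, which this page does not describe, and it shows only that fragment. The values are illustrative and respect the minimums listed above; they are not recommendations.

```yaml
# Illustrative fragment only: the enclosing Serializer key is an assumption
# about where ParquetSerDe is declared; the rest of the template is omitted.
Serializer:
  ParquetSerDe:
    BlockSizeBytes: 134217728          # 128 MiB; must be at least 67108864 (64 MiB)
    Compression: GZIP                  # favors compression ratio over decompression speed
    EnableDictionaryCompression: true
    MaxPaddingBytes: 0                 # same as the documented default
    PageSizeBytes: 1048576             # 1 MiB, the documented default; minimum is 65536 (64 KiB)
    WriterVersion: V1                  # the documented default
```

All size properties take plain integers in bytes, so a value such as 128 MiB is written as 128 * 1024 * 1024 = 134217728.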