interface DataFormatConversionConfigurationProperty
Language | Type name |
---|---|
.NET | Amazon.CDK.AWS.KinesisFirehose.CfnDeliveryStream.DataFormatConversionConfigurationProperty |
Go | github.com/aws/aws-cdk-go/awscdk/v2/awskinesisfirehose#CfnDeliveryStream_DataFormatConversionConfigurationProperty |
Java | software.amazon.awscdk.services.kinesisfirehose.CfnDeliveryStream.DataFormatConversionConfigurationProperty |
Python | aws_cdk.aws_kinesisfirehose.CfnDeliveryStream.DataFormatConversionConfigurationProperty |
TypeScript | aws-cdk-lib » aws_kinesisfirehose » CfnDeliveryStream » DataFormatConversionConfigurationProperty |
Specifies that you want Kinesis Data Firehose to convert data from the JSON format to the Parquet or ORC format before writing it to Amazon S3.
Kinesis Data Firehose uses the serializer and deserializer that you specify, together with the column information from the AWS Glue table, to deserialize your input data from JSON and then serialize it to the Parquet or ORC format. For more information, see Kinesis Data Firehose Record Format Conversion.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_kinesisfirehose as kinesisfirehose } from 'aws-cdk-lib';

const dataFormatConversionConfigurationProperty: kinesisfirehose.CfnDeliveryStream.DataFormatConversionConfigurationProperty = {
  enabled: false,
  inputFormatConfiguration: {
    deserializer: {
      hiveJsonSerDe: {
        timestampFormats: ['timestampFormats'],
      },
      openXJsonSerDe: {
        caseInsensitive: false,
        columnToJsonKeyMappings: {
          columnToJsonKeyMappingsKey: 'columnToJsonKeyMappings',
        },
        convertDotsInJsonKeysToUnderscores: false,
      },
    },
  },
  outputFormatConfiguration: {
    serializer: {
      orcSerDe: {
        blockSizeBytes: 123,
        bloomFilterColumns: ['bloomFilterColumns'],
        bloomFilterFalsePositiveProbability: 123,
        compression: 'compression',
        dictionaryKeyThreshold: 123,
        enablePadding: false,
        formatVersion: 'formatVersion',
        paddingTolerance: 123,
        rowIndexStride: 123,
        stripeSizeBytes: 123,
      },
      parquetSerDe: {
        blockSizeBytes: 123,
        compression: 'compression',
        enableDictionaryCompression: false,
        maxPaddingBytes: 123,
        pageSizeBytes: 123,
        writerVersion: 'writerVersion',
      },
    },
  },
  schemaConfiguration: {
    catalogId: 'catalogId',
    databaseName: 'databaseName',
    region: 'region',
    roleArn: 'roleArn',
    tableName: 'tableName',
    versionId: 'versionId',
  },
};
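The generated example above fills in every field with a placeholder, including both deserializers and both serializers. As a more realistic illustration, the following is a minimal sketch of a JSON-to-Parquet configuration that picks one deserializer (OpenX JSON SerDe) and one serializer (Parquet). The helper `buildParquetConversion`, the `ParquetConversionOptions` interface, and all database, table, role, and Region values are hypothetical names introduced for this sketch. The function returns a plain object with the same field names as this property type, so it avoids importing `aws-cdk-lib`; in a CDK app you would assign the result to a value of this type.

```typescript
// Hypothetical inputs for this sketch; these names are not part of the CDK API.
interface ParquetConversionOptions {
  databaseName: string; // AWS Glue database that holds the table
  tableName: string;    // AWS Glue table that supplies the column information
  roleArn: string;      // role that Firehose assumes to read the Glue table
  region: string;       // Region of the AWS Glue Data Catalog
}

// Builds a plain object shaped like
// CfnDeliveryStream.DataFormatConversionConfigurationProperty.
function buildParquetConversion(opts: ParquetConversionOptions) {
  return {
    enabled: true,
    inputFormatConfiguration: {
      // Only one deserializer is set: the OpenX JSON SerDe.
      deserializer: {
        openXJsonSerDe: {
          caseInsensitive: true,
          convertDotsInJsonKeysToUnderscores: true,
        },
      },
    },
    outputFormatConfiguration: {
      // Only one serializer is set: Parquet with Snappy compression.
      serializer: {
        parquetSerDe: { compression: 'SNAPPY' },
      },
    },
    schemaConfiguration: {
      databaseName: opts.databaseName,
      tableName: opts.tableName,
      roleArn: opts.roleArn,
      region: opts.region,
      versionId: 'LATEST', // assumption: always track the newest table version
    },
  };
}

// Placeholder values; replace them with your own resources.
const parquetConversion = buildParquetConversion({
  databaseName: 'my_database',
  tableName: 'my_table',
  roleArn: 'arn:aws:iam::123456789012:role/my-firehose-role',
  region: 'us-east-1',
});
```

The resulting object would typically be passed as the data format conversion configuration of an extended S3 destination, since format conversion applies to data written to Amazon S3.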
Properties
Name | Type | Description |
---|---|---|
enabled? | boolean \| IResolvable | Defaults to true. |
inputFormatConfiguration? | IResolvable \| InputFormatConfigurationProperty | Specifies the deserializer that you want Firehose to use to convert the format of your data from JSON. |
outputFormatConfiguration? | IResolvable \| OutputFormatConfigurationProperty | Specifies the serializer that you want Firehose to use to convert the format of your data to the Parquet or ORC format. |
schemaConfiguration? | IResolvable \| SchemaConfigurationProperty | Specifies the AWS Glue Data Catalog table that contains the column information. |
enabled?
Type: boolean | IResolvable (optional)
Defaults to true. Set it to false if you want to disable format conversion while preserving the configuration details.
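As a rough illustration of disabling conversion without discarding its settings, the sketch below copies an existing configuration and flips only `enabled`. The `ConversionSettings` interface is a simplified local stand-in for this property type, and `withConversionDisabled` is a hypothetical helper, not part of the CDK.

```typescript
// Simplified local stand-in for the property type; the real type also
// accepts IResolvable values, which this sketch does not model.
interface ConversionSettings {
  enabled?: boolean;
  inputFormatConfiguration?: object;
  outputFormatConfiguration?: object;
  schemaConfiguration?: object;
}

// Returns a copy with conversion switched off and every other field kept.
function withConversionDisabled(config: ConversionSettings): ConversionSettings {
  return { ...config, enabled: false };
}

// Placeholder configuration; the nested values are illustrative only.
const activeConversion: ConversionSettings = {
  enabled: true,
  inputFormatConfiguration: { deserializer: { openXJsonSerDe: {} } },
  outputFormatConfiguration: { serializer: { parquetSerDe: {} } },
  schemaConfiguration: { databaseName: 'my_database', tableName: 'my_table' },
};

// The original object is left unchanged; only the copy is disabled.
const pausedConversion = withConversionDisabled(activeConversion);
```

Because the copy keeps the input, output, and schema settings, setting `enabled` back to true later would restore the same conversion behavior.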
inputFormatConfiguration?
Type: IResolvable | InputFormatConfigurationProperty (optional)
Specifies the deserializer that you want Firehose to use to convert the format of your data from JSON. This parameter is required if Enabled is set to true.
outputFormatConfiguration?
Type: IResolvable | OutputFormatConfigurationProperty (optional)
Specifies the serializer that you want Firehose to use to convert the format of your data to the Parquet or ORC format. This parameter is required if Enabled is set to true.
schemaConfiguration?
Type: IResolvable | SchemaConfigurationProperty (optional)
Specifies the AWS Glue Data Catalog table that contains the column information. This parameter is required if Enabled is set to true.
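All three nested configurations are typed as optional but are described as required when Enabled is set to true, so a missing one would not be caught by the type checker. The following sketch shows one way to check this before synthesis. `ConversionSettings` is a simplified local stand-in for this property type and `missingConversionSettings` is a hypothetical helper; the check assumes plain values and does not handle IResolvable tokens.

```typescript
// Simplified local stand-in for the property type (plain values only).
interface ConversionSettings {
  enabled?: boolean;
  inputFormatConfiguration?: object;
  outputFormatConfiguration?: object;
  schemaConfiguration?: object;
}

// Lists the nested configurations that are missing while conversion is enabled.
function missingConversionSettings(config: ConversionSettings): string[] {
  // An omitted `enabled` is treated as true, matching the documented default.
  if (config.enabled === false) {
    return [];
  }
  const required = [
    'inputFormatConfiguration',
    'outputFormatConfiguration',
    'schemaConfiguration',
  ] as const;
  return required.filter((name) => config[name] === undefined);
}

// Enabled by default, but only the input side is configured.
const missingSettings = missingConversionSettings({
  inputFormatConfiguration: { deserializer: { openXJsonSerDe: {} } },
});
// missingSettings is ['outputFormatConfiguration', 'schemaConfiguration']
```

A caller could throw an error when the returned list is not empty, which surfaces the problem at synthesis time rather than at deployment.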