Reading non-printable UTF-8 character data

To read and write non-printable UTF-8 character data, use the STORED AS SEQUENCEFILE clause when you create a Hive table. A SequenceFile is a Hadoop binary file format, so you need Hadoop to read the resulting file. The following example exports data from DynamoDB to Amazon S3 in SequenceFile format. Use this approach when your data contains non-printable UTF-8 encoded characters.


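-- External table backed by Amazon S3, stored in the SequenceFile binary format.
CREATE EXTERNAL TABLE s3_export(a_col string, b_col bigint, c_col array<string>)
STORED AS SEQUENCEFILE
LOCATION 's3://bucketname/path/subpath/';

-- Copy every row from hiveTableName (the Hive table that references DynamoDB) into Amazon S3.
INSERT OVERWRITE TABLE s3_export SELECT *
FROM hiveTableName;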
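
Because the exported files are binary, they are read back through Hadoop rather than opened as text. The following is a minimal sketch of one way to do that from Hive, assuming the export above has completed and the files are still at the same Amazon S3 location. The table name s3_import is hypothetical; the column definitions are assumed to match those of s3_export.

```sql
-- Hypothetical table name (s3_import); columns mirror s3_export above.
-- Declaring STORED AS SEQUENCEFILE tells Hive to read the files as SequenceFiles.
CREATE EXTERNAL TABLE s3_import(a_col string, b_col bigint, c_col array<string>)
STORED AS SEQUENCEFILE
LOCATION 's3://bucketname/path/subpath/';

-- Inspect a few rows to check that the exported values are readable.
SELECT a_col, b_col, c_col
FROM s3_import
LIMIT 10;
```

Because both tables are external, dropping either one removes only the Hive metadata and leaves the files in Amazon S3 in place.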
