This is the new AWS CloudFormation Template Reference Guide. Please update your bookmarks and links. For help getting started with CloudFormation, see the AWS CloudFormation User Guide.
AWS::SageMaker::ProcessingJob DatasetDefinition
Configuration for Dataset Definition inputs. The Dataset Definition input must specify
exactly one of either AthenaDatasetDefinition or
RedshiftDatasetDefinition types.
Syntax
To declare this entity in your AWS CloudFormation template, use the following syntax:
JSON
{ "AthenaDatasetDefinition" :AthenaDatasetDefinition, "DataDistributionType" :String, "InputMode" :String, "LocalPath" :String, "RedshiftDatasetDefinition" :RedshiftDatasetDefinition}
YAML
AthenaDatasetDefinition:AthenaDatasetDefinitionDataDistributionType:StringInputMode:StringLocalPath:StringRedshiftDatasetDefinition:RedshiftDatasetDefinition
Properties
AthenaDatasetDefinition-
Configuration for Athena Dataset Definition input.
Required: No
Type: AthenaDatasetDefinition
Update requires: Replacement
DataDistributionType-
Whether the generated dataset is
FullyReplicatedorShardedByS3Key(default).Required: No
Type: String
Allowed values:
FullyReplicated | ShardedByS3KeyUpdate requires: Replacement
InputMode-
Whether to use
FileorPipeinput mode. InFile(default) mode, Amazon SageMaker copies the data from the input source onto the local Amazon Elastic Block Store (Amazon EBS) volumes before starting your training algorithm. This is the most commonly used input mode. InPipemode, Amazon SageMaker streams input data from the source directly to your algorithm without using the EBS volume.Required: No
Type: String
Allowed values:
File | PipeUpdate requires: Replacement
LocalPath-
The local path where you want Amazon SageMaker to download the Dataset Definition inputs to run a processing job.
LocalPathis an absolute path to the input data. This is a required parameter whenAppManagedisFalse(default).Required: No
Type: String
Pattern:
.*Minimum:
0Maximum:
256Update requires: Replacement
RedshiftDatasetDefinition-
Configuration for Redshift Dataset Definition input.
Required: No
Type: RedshiftDatasetDefinition
Update requires: Replacement