class SageMakerCreateTransformJob (construct)
Language | Type name |
---|---|
.NET | Amazon.CDK.AWS.StepFunctions.Tasks.SageMakerCreateTransformJob |
Java | software.amazon.awscdk.services.stepfunctions.tasks.SageMakerCreateTransformJob |
Python | aws_cdk.aws_stepfunctions_tasks.SageMakerCreateTransformJob |
TypeScript (source) | @aws-cdk/aws-stepfunctions-tasks » SageMakerCreateTransformJob |
Implements
IConstruct, IDependable, IChainable, INextable
Class representing the SageMaker Create Transform Job task.
Example
// Imports assumed by this example (CDK v1 module names).
import * as ec2 from '@aws-cdk/aws-ec2';
import * as tasks from '@aws-cdk/aws-stepfunctions-tasks';
import { Duration } from '@aws-cdk/core';

// Run inside a construct or stack, where `this` is the scope.
new tasks.SageMakerCreateTransformJob(this, 'Batch Inference', {
  transformJobName: 'MyTransformJob',
  modelName: 'MyModelName',
  modelClientOptions: {
    invocationsMaxRetries: 3, // default is 0
    invocationsTimeout: Duration.minutes(5), // default is 60 seconds
  },
  transformInput: {
    transformDataSource: {
      s3DataSource: {
        s3Uri: 's3://inputbucket/train',
        s3DataType: tasks.S3DataType.S3_PREFIX,
      },
    },
  },
  transformOutput: {
    s3OutputPath: 's3://outputbucket/TransformJobOutputPath',
  },
  transformResources: {
    instanceCount: 1,
    instanceType: ec2.InstanceType.of(ec2.InstanceClass.M4, ec2.InstanceSize.XLARGE),
  },
});
Initializer
new SageMakerCreateTransformJob(scope: Construct, id: string, props: SageMakerCreateTransformJobProps)
Parameters
- scope
Construct
- id
string
- props
SageMakerCreateTransformJobProps
Construct Props
Name | Type | Description |
---|---|---|
modelName | string | Name of the model that you want to use for the transform job. |
transformInput | TransformInput | Dataset to be transformed and the Amazon S3 location where it is stored. |
transformJobName | string | Transform Job Name. |
transformOutput | TransformOutput | S3 location where you want Amazon SageMaker to save the results from the transform job. |
batchStrategy? | BatchStrategy | Number of records to include in a mini-batch for an HTTP inference request. |
comment? | string | An optional description for this state. |
environment? | { [string]: string } | Environment variables to set in the Docker container. |
heartbeat? | Duration | Timeout for the heartbeat. |
inputPath? | string | JSONPath expression to select part of the state to be the input to this state. |
integrationPattern? | IntegrationPattern | AWS Step Functions integrates with services directly in the Amazon States Language. |
maxConcurrentTransforms? | number | Maximum number of parallel requests that can be sent to each instance in a transform job. |
maxPayload? | Size | Maximum allowed size of the payload, in MB. |
modelClientOptions? | ModelClientOptions | Configures the timeout and maximum number of retries for processing a transform job invocation. |
outputPath? | string | JSONPath expression to select a portion of the state output to pass to the next state. |
resultPath? | string | JSONPath expression to indicate where to inject the state's output. |
resultSelector? | { [string]: any } | The JSON that will replace the state's raw result and become the effective result before ResultPath is applied. |
role? | IRole | Role for the Transform Job. |
tags? | { [string]: string } | Tags to be applied to the transform job. |
timeout? | Duration | Timeout for the state machine. |
transformResources? | TransformResources | ML compute instances for the transform job. |
modelName
Type:
string
Name of the model that you want to use for the transform job.
transformInput
Type:
TransformInput
Dataset to be transformed and the Amazon S3 location where it is stored.
transformJobName
Type:
string
Transform Job Name.
transformOutput
Type:
TransformOutput
S3 location where you want Amazon SageMaker to save the results from the transform job.
batchStrategy?
Type:
BatchStrategy
(optional, default: No batch strategy)
Number of records to include in a mini-batch for an HTTP inference request.
comment?
Type:
string
(optional, default: No comment)
An optional description for this state.
environment?
Type:
{ [string]: string }
(optional, default: No environment variables)
Environment variables to set in the Docker container.
heartbeat?
Type:
Duration
(optional, default: None)
Timeout for the heartbeat.
inputPath?
Type:
string
(optional, default: The entire task input (JSON path '$'))
JSONPath expression to select part of the state to be the input to this state.
May also be the special value JsonPath.DISCARD, which will cause the effective input to be the empty object {}.
integrationPattern?
Type:
IntegrationPattern
(optional, default: IntegrationPattern.REQUEST_RESPONSE for most tasks. IntegrationPattern.RUN_JOB for the following exceptions: BatchSubmitJob, EmrAddStep, EmrCreateCluster, EmrTerminateCluster, and EmrContainersStartJobRun.)
AWS Step Functions integrates with services directly in the Amazon States Language.
You can control these AWS services using service integration patterns.
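As a rough illustration of what the integration pattern changes: in the Amazon States Language, the pattern shows up as a suffix on the task's Resource ARN. The sketch below is not this construct's implementation. The `Pattern` type and the `resourceArn` helper are hypothetical names introduced here, and the standard `aws` partition is assumed.

```typescript
// Hypothetical stand-in for the two IntegrationPattern values named above.
type Pattern = 'REQUEST_RESPONSE' | 'RUN_JOB';

// Suffix appended to the service integration ARN for each pattern.
const RESOURCE_SUFFIX: Record<Pattern, string> = {
  REQUEST_RESPONSE: '', // call the API and continue immediately
  RUN_JOB: '.sync',     // call the API and wait for the job to finish
};

// Build the Resource ARN of a Task state (assumes the "aws" partition).
function resourceArn(service: string, api: string, pattern: Pattern): string {
  return `arn:aws:states:::${service}:${api}${RESOURCE_SUFFIX[pattern]}`;
}

console.log(resourceArn('sagemaker', 'createTransformJob', 'RUN_JOB'));
```

With `RUN_JOB`, the state machine pauses at this task until the transform job completes, which is usually what a batch inference step needs before downstream states read its output.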
maxConcurrentTransforms?
Type:
number
(optional, default: Amazon SageMaker checks the optional execution-parameters to determine the settings for your chosen algorithm.
If the execution-parameters endpoint is not enabled, the default value is 1.)
Maximum number of parallel requests that can be sent to each instance in a transform job.
maxPayload?
Type:
Size
(optional, default: 6)
Maximum allowed size of the payload, in MB.
modelClientOptions?
Type:
ModelClientOptions
(optional, default: 0 retries and 60 seconds of timeout)
Configures the timeout and maximum number of retries for processing a transform job invocation.
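A minimal sketch of how these options might map onto the `ModelClientConfig` field of the SageMaker CreateTransformJob request, with the documented defaults filled in. The `renderModelClientConfig` helper and its input shape are hypothetical; the timeout is given in plain seconds here rather than as a `Duration`.

```typescript
// Hypothetical plain-object version of the options (seconds instead of Duration).
interface ClientOptionsSketch {
  invocationsMaxRetries?: number;
  invocationsTimeoutSeconds?: number;
}

// Apply the documented defaults: 0 retries and a 60-second timeout.
function renderModelClientConfig(options: ClientOptionsSketch = {}) {
  return {
    ModelClientConfig: {
      InvocationsMaxRetries: options.invocationsMaxRetries ?? 0,
      InvocationsTimeoutInSeconds: options.invocationsTimeoutSeconds ?? 60,
    },
  };
}

// Same values as the class example: 3 retries and a 5-minute timeout.
console.log(renderModelClientConfig({ invocationsMaxRetries: 3, invocationsTimeoutSeconds: 5 * 60 }));
```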
outputPath?
Type:
string
(optional, default: The entire JSON node determined by the state input, the task result, and resultPath is passed to the next state (JSON path '$'))
JSONPath expression to select a portion of the state output to pass to the next state.
May also be the special value JsonPath.DISCARD, which will cause the effective output to be the empty object {}.
resultPath?
Type:
string
(optional, default: Replaces the entire input with the result (JSON path '$'))
JSONPath expression to indicate where to inject the state's output.
May also be the special value JsonPath.DISCARD, which will cause the state's input to become its output.
resultSelector?
Type:
{ [string]: any }
(optional, default: None)
The JSON that will replace the state's raw result and become the effective result before ResultPath is applied.
You can use ResultSelector to create a payload with values that are static or selected from the state's raw result.
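The interplay of `inputPath`, `resultSelector`, `resultPath`, and `outputPath` is easier to follow with a small model. The sketch below is a simplified simulation, not Step Functions or CDK code: it handles only `$` and dotted paths such as `$.a.b`, models `JsonPath.DISCARD` as `null`, and treats `resultSelector` as a map from output key to a path in the raw result (static values are left out). All names in it are hypothetical.

```typescript
type Json = { [key: string]: any };

// Stand-in for JsonPath.DISCARD in this sketch.
const DISCARD = null;

interface PathsSketch {
  inputPath?: string | null;
  resultSelector?: { [key: string]: string }; // output key -> path into the raw result
  resultPath?: string | null;
  outputPath?: string | null;
}

// Read a simple path such as '$' or '$.a.b'; DISCARD yields the empty object.
function select(doc: any, path: string | null): any {
  if (path === null) return {};
  if (path === '$') return doc;
  let node = doc;
  for (const key of path.slice(2).split('.')) {
    node = node === undefined || node === null ? undefined : node[key];
  }
  return node;
}

// Place `result` at `path` inside a copy of the state input.
function inject(input: Json, path: string | null, result: any): any {
  if (path === null) return input; // DISCARD: the input becomes the output
  if (path === '$') return result; // default: the result replaces the input
  const copy: Json = JSON.parse(JSON.stringify(input));
  const keys = path.slice(2).split('.');
  let node = copy;
  for (const key of keys.slice(0, -1)) {
    if (typeof node[key] !== 'object' || node[key] === null) node[key] = {};
    node = node[key];
  }
  node[keys[keys.length - 1]] = result;
  return copy;
}

// Order: inputPath -> task -> resultSelector -> resultPath -> outputPath.
function runState(input: Json, paths: PathsSketch, task: (effectiveInput: any) => Json): any {
  const effectiveInput = select(input, paths.inputPath === undefined ? '$' : paths.inputPath);
  const raw = task(effectiveInput);
  let result: any = raw;
  const selector = paths.resultSelector;
  if (selector) {
    result = {};
    for (const key of Object.keys(selector)) result[key] = select(raw, selector[key]);
  }
  const combined = inject(input, paths.resultPath === undefined ? '$' : paths.resultPath, result);
  return select(combined, paths.outputPath === undefined ? '$' : paths.outputPath);
}

const output = runState(
  { job: { name: 'MyTransformJob' }, other: 1 },
  { inputPath: '$.job', resultSelector: { arn: '$.TransformJobArn' }, resultPath: '$.transform' },
  () => ({ TransformJobArn: 'arn:example', SdkHttpMetadata: {} }),
);
console.log(JSON.stringify(output));
```

Here the task sees only the `job` object, the selector keeps a single field of the raw result, and `resultPath` stores it under `transform` next to the untouched original input.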
role?
Type:
IRole
(optional, default: A role is created with the AmazonSageMakerFullAccess managed policy)
Role for the Transform Job.
tags?
Type:
{ [string]: string }
(optional, default: No tags)
Tags to be applied to the transform job.
timeout?
Type:
Duration
(optional, default: None)
Timeout for the state machine.
transformResources?
Type:
TransformResources
(optional, default: 1 instance of type M4.XLarge)
ML compute instances for the transform job.
Properties
Name | Type | Description |
---|---|---|
endStates | INextable[] | Continuable states of this Chainable. |
id | string | Descriptive identifier for this chainable. |
node | ConstructNode | The construct tree node associated with this construct. |
role | IRole | The execution role for the SageMaker transform job. |
startState | State | First state of this Chainable. |
stateId | string | Tokenized string that evaluates to the state's ID. |
taskMetrics? | TaskMetricsConfig | |
taskPolicies? | PolicyStatement[] | |
endStates
Type:
INextable[]
Continuable states of this Chainable.
id
Type:
string
Descriptive identifier for this chainable.
node
Type:
ConstructNode
The construct tree node associated with this construct.
role
Type:
IRole
The execution role for the SageMaker transform job.
Only available after task has been added to a state machine.
startState
Type:
State
First state of this Chainable.
stateId
Type:
string
Tokenized string that evaluates to the state's ID.
taskMetrics?
Type:
TaskMetricsConfig
(optional)
taskPolicies?
Type:
PolicyStatement[]
(optional)
Methods
Name | Description |
---|---|
addCatch(handler, props?) | Add a recovery handler for this state. |
addPrefix(x) | Add a prefix to the stateId of this state. |
addRetry(props?) | Add retry configuration for this state. |
bindToGraph(graph) | Register this state as part of the given graph. |
metric(metricName, props?) | Return the given named metric for this Task. |
metricFailed(props?) | Metric for the number of times this activity fails. |
metricHeartbeatTimedOut(props?) | Metric for the number of times the heartbeat times out for this activity. |
metricRunTime(props?) | The interval, in milliseconds, between the time the Task starts and the time it closes. |
metricScheduleTime(props?) | The interval, in milliseconds, for which the activity stays in the schedule state. |
metricScheduled(props?) | Metric for the number of times this activity is scheduled. |
metricStarted(props?) | Metric for the number of times this activity is started. |
metricSucceeded(props?) | Metric for the number of times this activity succeeds. |
metricTime(props?) | The interval, in milliseconds, between the time the activity is scheduled and the time it closes. |
metricTimedOut(props?) | Metric for the number of times this activity times out. |
next(next) | Continue normal execution with the given state. |
toStateJson() | Return the Amazon States Language object for this state. |
toString() | Returns a string representation of this construct. |
addCatch(handler, props?)
public addCatch(handler: IChainable, props?: CatchProps): TaskStateBase
Parameters
- handler
IChainable
- props
CatchProps
Returns
TaskStateBase
Add a recovery handler for this state.
When a particular error occurs, execution will continue at the error handler instead of failing the state machine execution.
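To make "execution will continue at the error handler" concrete, here is a simplified model of how a list of catchers is matched against an error name. It loosely follows the `Catch` field of the Amazon States Language and is not the construct's code. `CatcherSketch` and `findHandler` are hypothetical names, the state IDs are invented, and the sketch treats `States.ALL` as matching every error name, ignoring the few terminal errors the real service excludes.

```typescript
interface CatcherSketch {
  errorEquals: string[]; // error names; 'States.ALL' is treated as a wildcard
  next: string;          // ID of the handler state
}

// Return the handler for `error`, or undefined if no catcher matches
// (in which case the state machine execution fails).
function findHandler(catchers: CatcherSketch[], error: string): string | undefined {
  for (const catcher of catchers) {
    const names = catcher.errorEquals;
    if (names.indexOf(error) !== -1 || names.indexOf('States.ALL') !== -1) {
      return catcher.next; // first matching catcher wins
    }
  }
  return undefined;
}

const catchers: CatcherSketch[] = [
  { errorEquals: ['States.Timeout'], next: 'NotifyTimeout' },
  { errorEquals: ['States.ALL'], next: 'NotifyFailure' },
];

console.log(findHandler(catchers, 'States.Timeout'));
console.log(findHandler(catchers, 'SageMaker.AmazonSageMakerException'));
```

Because catchers are checked in order, specific error names should be registered before a catch-all handler.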
addPrefix(x)
public addPrefix(x: string): void
Parameters
- x
string
Add a prefix to the stateId of this state.
addRetry(props?)
public addRetry(props?: RetryProps): TaskStateBase
Parameters
- props
RetryProps
Returns
TaskStateBase
Add retry configuration for this state.
This controls if and how the execution will be retried if a particular error occurs.
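As an illustration of "how" a retry is carried out, the sketch below computes the wait before each retry from an interval, a maximum number of attempts, and a backoff rate. It is a model of the Amazon States Language `Retry` behavior, not the construct's code. `RetrySketch` and `retryDelays` are hypothetical names, and the defaults used (1 second, 3 attempts, rate 2) follow the Amazon States Language defaults, which is an assumption about what applies when `props` is omitted.

```typescript
interface RetrySketch {
  intervalSeconds?: number; // wait before the first retry
  maxAttempts?: number;     // maximum number of retries
  backoffRate?: number;     // multiplier applied to the wait after each retry
}

// Wait time, in seconds, before each successive retry.
function retryDelays(retry: RetrySketch = {}): number[] {
  const interval = retry.intervalSeconds ?? 1;
  const attempts = retry.maxAttempts ?? 3;
  const rate = retry.backoffRate ?? 2;
  const delays: number[] = [];
  for (let attempt = 0; attempt < attempts; attempt++) {
    delays.push(interval * Math.pow(rate, attempt));
  }
  return delays;
}

console.log(retryDelays());                                        // waits of 1, 2 and 4 seconds
console.log(retryDelays({ intervalSeconds: 10, maxAttempts: 2 })); // waits of 10 and 20 seconds
```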
bindToGraph(graph)
public bindToGraph(graph: StateGraph): void
Parameters
- graph
StateGraph
Register this state as part of the given graph.
Don't call this. It will be called automatically when you work with states normally.
metric(metricName, props?)
public metric(metricName: string, props?: MetricOptions): Metric
Parameters
- metricName
string
- props
MetricOptions
Returns
Metric
Return the given named metric for this Task.
metricFailed(props?)
public metricFailed(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
Metric for the number of times this activity fails.
metricHeartbeatTimedOut(props?)
public metricHeartbeatTimedOut(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
Metric for the number of times the heartbeat times out for this activity.
metricRunTime(props?)
public metricRunTime(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
The interval, in milliseconds, between the time the Task starts and the time it closes.
metricScheduleTime(props?)
public metricScheduleTime(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
The interval, in milliseconds, for which the activity stays in the schedule state.
metricScheduled(props?)
public metricScheduled(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
Metric for the number of times this activity is scheduled.
metricStarted(props?)
public metricStarted(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
Metric for the number of times this activity is started.
metricSucceeded(props?)
public metricSucceeded(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
Metric for the number of times this activity succeeds.
metricTime(props?)
public metricTime(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
The interval, in milliseconds, between the time the activity is scheduled and the time it closes.
metricTimedOut(props?)
public metricTimedOut(props?: MetricOptions): Metric
Parameters
- props
MetricOptions
Returns
Metric
Metric for the number of times this activity times out.
next(next)
public next(next: IChainable): Chain
Parameters
- next
IChainable
Returns
Chain
Continue normal execution with the given state.
toStateJson()
public toStateJson(): json
Returns
json
Return the Amazon States Language object for this state.
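For orientation, the sketch below builds an object with the approximate shape of an Amazon States Language Task state for a transform job. It is not the output of `toStateJson()`, which may contain additional or differently arranged fields. The `Parameters` keys follow the SageMaker CreateTransformJob request; `TransformJobSketch` and `sketchStateJson` are hypothetical names, and the `aws` partition and request-response pattern are assumed.

```typescript
// Hypothetical, simplified inputs (plain strings instead of CDK types).
interface TransformJobSketch {
  transformJobName: string;
  modelName: string;
  s3Uri: string;
  s3OutputPath: string;
  instanceCount: number;
  instanceType: string; // for example 'ml.m4.xlarge'
}

// Build a Task state; it either points at the next state or ends the execution.
function sketchStateJson(job: TransformJobSketch, next?: string) {
  return {
    Type: 'Task',
    Resource: 'arn:aws:states:::sagemaker:createTransformJob',
    Parameters: {
      ModelName: job.modelName,
      TransformJobName: job.transformJobName,
      TransformInput: {
        DataSource: { S3DataSource: { S3Uri: job.s3Uri, S3DataType: 'S3Prefix' } },
      },
      TransformOutput: { S3OutputPath: job.s3OutputPath },
      TransformResources: { InstanceCount: job.instanceCount, InstanceType: job.instanceType },
    },
    ...(next ? { Next: next } : { End: true }),
  };
}

const exampleJob: TransformJobSketch = {
  transformJobName: 'MyTransformJob',
  modelName: 'MyModelName',
  s3Uri: 's3://inputbucket/train',
  s3OutputPath: 's3://outputbucket/TransformJobOutputPath',
  instanceCount: 1,
  instanceType: 'ml.m4.xlarge',
};

console.log(JSON.stringify(sketchStateJson(exampleJob), null, 2));
```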
toString()
public toString(): string
Returns
string
Returns a string representation of this construct.