AWS Data Pipeline 2012-10-29
- Client: Aws\DataPipeline\DataPipelineClient
- Service ID: datapipeline
- Version: 2012-10-29
This page describes the parameters and results for the operations of the AWS Data Pipeline (2012-10-29), and shows how to use the Aws\DataPipeline\DataPipelineClient object to call the described operations. This documentation is specific to the 2012-10-29 API version of the service.
Operation Summary
Each of the following operations can be created from a client using
$client->getCommand('CommandName')
, where "CommandName" is the
name of one of the following operations. Note: a command is a value that
encapsulates an operation and the parameters used to create an HTTP request.
You can also create and send a command immediately using the magic methods
available on a client object: $client->commandName(/* parameters */)
.
You can send the command asynchronously (returning a promise) by appending the
word "Async" to the operation name: $client->commandNameAsync(/* parameters */)
.
- ActivatePipeline ( array $params = [] )
- Validates the specified pipeline and starts processing pipeline tasks.
- AddTags ( array $params = [] )
- Adds or modifies tags for the specified pipeline.
- CreatePipeline ( array $params = [] )
- Creates a new, empty pipeline.
- DeactivatePipeline ( array $params = [] )
- Deactivates the specified running pipeline.
- DeletePipeline ( array $params = [] )
- Deletes a pipeline, its pipeline definition, and its run history.
- DescribeObjects ( array $params = [] )
- Gets the object definitions for a set of objects associated with the pipeline.
- DescribePipelines ( array $params = [] )
- Retrieves metadata about one or more pipelines.
- EvaluateExpression ( array $params = [] )
- Task runners call EvaluateExpression to evaluate a string in the context of the specified object.
- GetPipelineDefinition ( array $params = [] )
- Gets the definition of the specified pipeline.
- ListPipelines ( array $params = [] )
- Lists the pipeline identifiers for all active pipelines that you have permission to access.
- PollForTask ( array $params = [] )
- Task runners call PollForTask to receive a task to perform from AWS Data Pipeline.
- PutPipelineDefinition ( array $params = [] )
- Adds tasks, schedules, and preconditions to the specified pipeline.
- QueryObjects ( array $params = [] )
- Queries the specified pipeline for the names of objects that match the specified set of conditions.
- RemoveTags ( array $params = [] )
- Removes existing tags from the specified pipeline.
- ReportTaskProgress ( array $params = [] )
- Task runners call ReportTaskProgress when assigned a task to acknowledge that it has the task.
- ReportTaskRunnerHeartbeat ( array $params = [] )
- Task runners call ReportTaskRunnerHeartbeat every 15 minutes to indicate that they are operational.
- SetStatus ( array $params = [] )
- Requests that the status of the specified physical or logical pipeline objects be updated in the specified pipeline.
- SetTaskStatus ( array $params = [] )
- Task runners call SetTaskStatus to notify AWS Data Pipeline that a task is completed and provide information about the final status.
- ValidatePipelineDefinition ( array $params = [] )
- Validates the specified pipeline definition to ensure that it is well formed and can be run without error.
Paginators
Paginators handle automatically iterating over paginated API results. Paginators are associated with specific API operations, and they accept the parameters that the corresponding API operation accepts. You can get a paginator from a client class using getPaginator($paginatorName, $operationParameters). This client supports the following paginators:
Operations
ActivatePipeline
$result = $client->activatePipeline
([/* ... */]); $promise = $client->activatePipelineAsync
([/* ... */]);
Validates the specified pipeline and starts processing pipeline tasks. If the pipeline does not pass validation, activation fails.
If you need to pause the pipeline to investigate an issue with a component, such as a data source or script, call DeactivatePipeline.
To activate a finished pipeline, modify the end date for the pipeline and then activate it.
Parameter Syntax
$result = $client->activatePipeline([ 'parameterValues' => [ [ 'id' => '<string>', // REQUIRED 'stringValue' => '<string>', // REQUIRED ], // ... ], 'pipelineId' => '<string>', // REQUIRED 'startTimestamp' => <integer || string || DateTime>, ]);
Parameter Details
Members
- parameterValues
-
- Type: Array of ParameterValue structures
A list of parameter values to pass to the pipeline at activation.
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
- startTimestamp
-
- Type: timestamp (string|DateTime or anything parsable by strtotime)
The date and time to resume the pipeline. By default, the pipeline resumes from the last completed execution.
Result Syntax
[]
Result Details
Errors
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
AddTags
$result = $client->addTags
([/* ... */]); $promise = $client->addTagsAsync
([/* ... */]);
Adds or modifies tags for the specified pipeline.
Parameter Syntax
$result = $client->addTags([ 'pipelineId' => '<string>', // REQUIRED 'tags' => [ // REQUIRED [ 'key' => '<string>', // REQUIRED 'value' => '<string>', // REQUIRED ], // ... ], ]);
Parameter Details
Members
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
- tags
-
- Required: Yes
- Type: Array of Tag structures
The tags to add, as key/value pairs.
Result Syntax
[]
Result Details
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
CreatePipeline
$result = $client->createPipeline
([/* ... */]); $promise = $client->createPipelineAsync
([/* ... */]);
Creates a new, empty pipeline. Use PutPipelineDefinition to populate the pipeline.
Parameter Syntax
$result = $client->createPipeline([ 'description' => '<string>', 'name' => '<string>', // REQUIRED 'tags' => [ [ 'key' => '<string>', // REQUIRED 'value' => '<string>', // REQUIRED ], // ... ], 'uniqueId' => '<string>', // REQUIRED ]);
Parameter Details
Members
- description
-
- Type: string
The description for the pipeline.
- name
-
- Required: Yes
- Type: string
The name for the pipeline. You can use the same name for multiple pipelines associated with your AWS account, because AWS Data Pipeline assigns each pipeline a unique pipeline identifier.
- tags
-
- Type: Array of Tag structures
A list of tags to associate with the pipeline at creation. Tags let you control access to pipelines. For more information, see Controlling User Access to Pipelines in the AWS Data Pipeline Developer Guide.
- uniqueId
-
- Required: Yes
- Type: string
A unique identifier. This identifier is not the same as the pipeline identifier assigned by AWS Data Pipeline. You are responsible for defining the format and ensuring the uniqueness of this identifier. You use this parameter to ensure idempotency during repeated calls to
CreatePipeline
. For example, if the first call toCreatePipeline
does not succeed, you can pass in the same unique identifier and pipeline name combination on a subsequent call toCreatePipeline
.CreatePipeline
ensures that if a pipeline already exists with the same name and unique identifier, a new pipeline is not created. Instead, you'll receive the pipeline identifier from the previous attempt. The uniqueness of the name and unique identifier combination is scoped to the AWS account or IAM user credentials.
Result Syntax
[ 'pipelineId' => '<string>', ]
Result Details
Members
- pipelineId
-
- Required: Yes
- Type: string
The ID that AWS Data Pipeline assigns the newly created pipeline. For example,
df-06372391ZG65EXAMPLE
.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
DeactivatePipeline
$result = $client->deactivatePipeline
([/* ... */]); $promise = $client->deactivatePipelineAsync
([/* ... */]);
Deactivates the specified running pipeline. The pipeline is set to the DEACTIVATING
state until the deactivation process completes.
To resume a deactivated pipeline, use ActivatePipeline. By default, the pipeline resumes from the last completed execution. Optionally, you can specify the date and time to resume the pipeline.
Parameter Syntax
$result = $client->deactivatePipeline([ 'cancelActive' => true || false, 'pipelineId' => '<string>', // REQUIRED ]);
Parameter Details
Members
- cancelActive
-
- Type: boolean
Indicates whether to cancel any running objects. The default is true, which sets the state of any running objects to
CANCELED
. If this value is false, the pipeline is deactivated after all running objects finish. - pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
Result Syntax
[]
Result Details
Errors
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
DeletePipeline
$result = $client->deletePipeline
([/* ... */]); $promise = $client->deletePipelineAsync
([/* ... */]);
Deletes a pipeline, its pipeline definition, and its run history. AWS Data Pipeline attempts to cancel instances associated with the pipeline that are currently being processed by task runners.
Deleting a pipeline cannot be undone. You cannot query or restore a deleted pipeline. To temporarily pause a pipeline instead of deleting it, call SetStatus with the status set to PAUSE
on individual components. Components that are paused by SetStatus can be resumed.
Parameter Syntax
$result = $client->deletePipeline([ 'pipelineId' => '<string>', // REQUIRED ]);
Parameter Details
Members
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
Result Syntax
[]
Result Details
Errors
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
DescribeObjects
$result = $client->describeObjects
([/* ... */]); $promise = $client->describeObjectsAsync
([/* ... */]);
Gets the object definitions for a set of objects associated with the pipeline. Object definitions are composed of a set of fields that define the properties of the object.
Parameter Syntax
$result = $client->describeObjects([ 'evaluateExpressions' => true || false, 'marker' => '<string>', 'objectIds' => ['<string>', ...], // REQUIRED 'pipelineId' => '<string>', // REQUIRED ]);
Parameter Details
Members
- evaluateExpressions
-
- Type: boolean
Indicates whether any expressions in the object should be evaluated when the object descriptions are returned.
- marker
-
- Type: string
The starting point for the results to be returned. For the first call, this value should be empty. As long as there are more results, continue to call
DescribeObjects
with the marker value from the previous call to retrieve the next set of results. - objectIds
-
- Required: Yes
- Type: Array of strings
The IDs of the pipeline objects that contain the definitions to be described. You can pass as many as 25 identifiers in a single call to
DescribeObjects
. - pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline that contains the object definitions.
Result Syntax
[ 'hasMoreResults' => true || false, 'marker' => '<string>', 'pipelineObjects' => [ [ 'fields' => [ [ 'key' => '<string>', 'refValue' => '<string>', 'stringValue' => '<string>', ], // ... ], 'id' => '<string>', 'name' => '<string>', ], // ... ], ]
Result Details
Members
- hasMoreResults
-
- Type: boolean
Indicates whether there are more results to return.
- marker
-
- Type: string
The starting point for the next page of results. To view the next page of results, call
DescribeObjects
again with this marker value. If the value is null, there are no more results. - pipelineObjects
-
- Required: Yes
- Type: Array of PipelineObject structures
An array of object definitions.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
DescribePipelines
$result = $client->describePipelines
([/* ... */]); $promise = $client->describePipelinesAsync
([/* ... */]);
Retrieves metadata about one or more pipelines. The information retrieved includes the name of the pipeline, the pipeline identifier, its current state, and the user account that owns the pipeline. Using account credentials, you can retrieve metadata about pipelines that you or your IAM users have created. If you are using an IAM user account, you can retrieve metadata about only those pipelines for which you have read permissions.
To retrieve the full pipeline definition instead of metadata about the pipeline, call GetPipelineDefinition.
Parameter Syntax
$result = $client->describePipelines([ 'pipelineIds' => ['<string>', ...], // REQUIRED ]);
Parameter Details
Members
- pipelineIds
-
- Required: Yes
- Type: Array of strings
The IDs of the pipelines to describe. You can pass as many as 25 identifiers in a single call. To obtain pipeline IDs, call ListPipelines.
Result Syntax
[ 'pipelineDescriptionList' => [ [ 'description' => '<string>', 'fields' => [ [ 'key' => '<string>', 'refValue' => '<string>', 'stringValue' => '<string>', ], // ... ], 'name' => '<string>', 'pipelineId' => '<string>', 'tags' => [ [ 'key' => '<string>', 'value' => '<string>', ], // ... ], ], // ... ], ]
Result Details
Members
- pipelineDescriptionList
-
- Required: Yes
- Type: Array of PipelineDescription structures
An array of descriptions for the specified pipelines.
Errors
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
EvaluateExpression
$result = $client->evaluateExpression
([/* ... */]); $promise = $client->evaluateExpressionAsync
([/* ... */]);
Task runners call EvaluateExpression
to evaluate a string in the context of the specified object. For example, a task runner can evaluate SQL queries stored in Amazon S3.
Parameter Syntax
$result = $client->evaluateExpression([ 'expression' => '<string>', // REQUIRED 'objectId' => '<string>', // REQUIRED 'pipelineId' => '<string>', // REQUIRED ]);
Parameter Details
Members
- expression
-
- Required: Yes
- Type: string
The expression to evaluate.
- objectId
-
- Required: Yes
- Type: string
The ID of the object.
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
Result Syntax
[ 'evaluatedExpression' => '<string>', ]
Result Details
Members
- evaluatedExpression
-
- Required: Yes
- Type: string
The evaluated expression.
Errors
- InternalServiceError:
An internal service error occurred.
- TaskNotFoundException:
The specified task was not found.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
GetPipelineDefinition
$result = $client->getPipelineDefinition
([/* ... */]); $promise = $client->getPipelineDefinitionAsync
([/* ... */]);
Gets the definition of the specified pipeline. You can call GetPipelineDefinition
to retrieve the pipeline definition that you provided using PutPipelineDefinition.
Parameter Syntax
$result = $client->getPipelineDefinition([ 'pipelineId' => '<string>', // REQUIRED 'version' => '<string>', ]);
Parameter Details
Members
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
- version
-
- Type: string
The version of the pipeline definition to retrieve. Set this parameter to
latest
(default) to use the last definition saved to the pipeline oractive
to use the last definition that was activated.
Result Syntax
[ 'parameterObjects' => [ [ 'attributes' => [ [ 'key' => '<string>', 'stringValue' => '<string>', ], // ... ], 'id' => '<string>', ], // ... ], 'parameterValues' => [ [ 'id' => '<string>', 'stringValue' => '<string>', ], // ... ], 'pipelineObjects' => [ [ 'fields' => [ [ 'key' => '<string>', 'refValue' => '<string>', 'stringValue' => '<string>', ], // ... ], 'id' => '<string>', 'name' => '<string>', ], // ... ], ]
Result Details
Members
- parameterObjects
-
- Type: Array of ParameterObject structures
The parameter objects used in the pipeline definition.
- parameterValues
-
- Type: Array of ParameterValue structures
The parameter values used in the pipeline definition.
- pipelineObjects
-
- Type: Array of PipelineObject structures
The objects defined in the pipeline.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
ListPipelines
$result = $client->listPipelines
([/* ... */]); $promise = $client->listPipelinesAsync
([/* ... */]);
Lists the pipeline identifiers for all active pipelines that you have permission to access.
Parameter Syntax
$result = $client->listPipelines([ 'marker' => '<string>', ]);
Parameter Details
Members
- marker
-
- Type: string
The starting point for the results to be returned. For the first call, this value should be empty. As long as there are more results, continue to call
ListPipelines
with the marker value from the previous call to retrieve the next set of results.
Result Syntax
[ 'hasMoreResults' => true || false, 'marker' => '<string>', 'pipelineIdList' => [ [ 'id' => '<string>', 'name' => '<string>', ], // ... ], ]
Result Details
Members
- hasMoreResults
-
- Type: boolean
Indicates whether there are more results that can be obtained by a subsequent call.
- marker
-
- Type: string
The starting point for the next page of results. To view the next page of results, call
ListPipelinesOutput
again with this marker value. If the value is null, there are no more results. - pipelineIdList
-
- Required: Yes
- Type: Array of PipelineIdName structures
The pipeline identifiers. If you require additional information about the pipelines, you can use these identifiers to call DescribePipelines and GetPipelineDefinition.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
PollForTask
$result = $client->pollForTask
([/* ... */]); $promise = $client->pollForTaskAsync
([/* ... */]);
Task runners call PollForTask
to receive a task to perform from AWS Data Pipeline. The task runner specifies which tasks it can perform by setting a value for the workerGroup
parameter. The task returned can come from any of the pipelines that match the workerGroup
value passed in by the task runner and that was launched using the IAM user credentials specified by the task runner.
If tasks are ready in the work queue, PollForTask
returns a response immediately. If no tasks are available in the queue, PollForTask
uses long-polling and holds on to a poll connection for up to a 90 seconds, during which time the first newly scheduled task is handed to the task runner. To accomodate this, set the socket timeout in your task runner to 90 seconds. The task runner should not call PollForTask
again on the same workerGroup
until it receives a response, and this can take up to 90 seconds.
Parameter Syntax
$result = $client->pollForTask([ 'hostname' => '<string>', 'instanceIdentity' => [ 'document' => '<string>', 'signature' => '<string>', ], 'workerGroup' => '<string>', // REQUIRED ]);
Parameter Details
Members
- hostname
-
- Type: string
The public DNS name of the calling task runner.
- instanceIdentity
-
- Type: InstanceIdentity structure
Identity information for the EC2 instance that is hosting the task runner. You can get this value from the instance using
http://169.254.169.254/latest/meta-data/instance-id
. For more information, see Instance Metadata in the Amazon Elastic Compute Cloud User Guide. Passing in this value proves that your task runner is running on an EC2 instance, and ensures the proper AWS Data Pipeline service charges are applied to your pipeline. - workerGroup
-
- Required: Yes
- Type: string
The type of task the task runner is configured to accept and process. The worker group is set as a field on objects in the pipeline when they are created. You can only specify a single value for
workerGroup
in the call toPollForTask
. There are no wildcard values permitted inworkerGroup
; the string must be an exact, case-sensitive, match.
Result Syntax
[ 'taskObject' => [ 'attemptId' => '<string>', 'objects' => [ '<id>' => [ 'fields' => [ [ 'key' => '<string>', 'refValue' => '<string>', 'stringValue' => '<string>', ], // ... ], 'id' => '<string>', 'name' => '<string>', ], // ... ], 'pipelineId' => '<string>', 'taskId' => '<string>', ], ]
Result Details
Members
- taskObject
-
- Type: TaskObject structure
The information needed to complete the task that is being assigned to the task runner. One of the fields returned in this object is
taskId
, which contains an identifier for the task being assigned. The calling task runner usestaskId
in subsequent calls to ReportTaskProgress and SetTaskStatus.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- TaskNotFoundException:
The specified task was not found.
PutPipelineDefinition
$result = $client->putPipelineDefinition
([/* ... */]); $promise = $client->putPipelineDefinitionAsync
([/* ... */]);
Adds tasks, schedules, and preconditions to the specified pipeline. You can use PutPipelineDefinition
to populate a new pipeline.
PutPipelineDefinition
also validates the configuration as it adds it to the pipeline. Changes to the pipeline are saved unless one of the following three validation errors exists in the pipeline.
- An object is missing a name or identifier field.
- A string or reference field is empty.
- The number of objects in the pipeline exceeds the maximum allowed objects.
- The pipeline is in a FINISHED state.
Pipeline object definitions are passed to the PutPipelineDefinition
action and returned by the GetPipelineDefinition action.
Parameter Syntax
$result = $client->putPipelineDefinition([ 'parameterObjects' => [ [ 'attributes' => [ // REQUIRED [ 'key' => '<string>', // REQUIRED 'stringValue' => '<string>', // REQUIRED ], // ... ], 'id' => '<string>', // REQUIRED ], // ... ], 'parameterValues' => [ [ 'id' => '<string>', // REQUIRED 'stringValue' => '<string>', // REQUIRED ], // ... ], 'pipelineId' => '<string>', // REQUIRED 'pipelineObjects' => [ // REQUIRED [ 'fields' => [ // REQUIRED [ 'key' => '<string>', // REQUIRED 'refValue' => '<string>', 'stringValue' => '<string>', ], // ... ], 'id' => '<string>', // REQUIRED 'name' => '<string>', // REQUIRED ], // ... ], ]);
Parameter Details
Members
- parameterObjects
-
- Type: Array of ParameterObject structures
The parameter objects used with the pipeline.
- parameterValues
-
- Type: Array of ParameterValue structures
The parameter values used with the pipeline.
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
- pipelineObjects
-
- Required: Yes
- Type: Array of PipelineObject structures
The objects that define the pipeline. These objects overwrite the existing pipeline definition.
Result Syntax
[ 'errored' => true || false, 'validationErrors' => [ [ 'errors' => ['<string>', ...], 'id' => '<string>', ], // ... ], 'validationWarnings' => [ [ 'id' => '<string>', 'warnings' => ['<string>', ...], ], // ... ], ]
Result Details
Members
- errored
-
- Required: Yes
- Type: boolean
Indicates whether there were validation errors, and the pipeline definition is stored but cannot be activated until you correct the pipeline and call
PutPipelineDefinition
to commit the corrected pipeline. - validationErrors
-
- Type: Array of ValidationError structures
The validation errors that are associated with the objects defined in
pipelineObjects
. - validationWarnings
-
- Type: Array of ValidationWarning structures
The validation warnings that are associated with the objects defined in
pipelineObjects
.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
QueryObjects
$result = $client->queryObjects
([/* ... */]); $promise = $client->queryObjectsAsync
([/* ... */]);
Queries the specified pipeline for the names of objects that match the specified set of conditions.
Parameter Syntax
$result = $client->queryObjects([ 'limit' => <integer>, 'marker' => '<string>', 'pipelineId' => '<string>', // REQUIRED 'query' => [ 'selectors' => [ [ 'fieldName' => '<string>', 'operator' => [ 'type' => 'EQ|REF_EQ|LE|GE|BETWEEN', 'values' => ['<string>', ...], ], ], // ... ], ], 'sphere' => '<string>', // REQUIRED ]);
Parameter Details
Members
- limit
-
- Type: int
The maximum number of object names that
QueryObjects
will return in a single call. The default value is 100. - marker
-
- Type: string
The starting point for the results to be returned. For the first call, this value should be empty. As long as there are more results, continue to call
QueryObjects
with the marker value from the previous call to retrieve the next set of results. - pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
- query
-
- Type: Query structure
The query that defines the objects to be returned. The
Query
object can contain a maximum of ten selectors. The conditions in the query are limited to top-level String fields in the object. These filters can be applied to components, instances, and attempts. - sphere
-
- Required: Yes
- Type: string
Indicates whether the query applies to components or instances. The possible values are:
COMPONENT
,INSTANCE
, andATTEMPT
.
Result Syntax
[ 'hasMoreResults' => true || false, 'ids' => ['<string>', ...], 'marker' => '<string>', ]
Result Details
Members
- hasMoreResults
-
- Type: boolean
Indicates whether there are more results that can be obtained by a subsequent call.
- ids
-
- Type: Array of strings
The identifiers that match the query selectors.
- marker
-
- Type: string
The starting point for the next page of results. To view the next page of results, call
QueryObjects
again with this marker value. If the value is null, there are no more results.
Errors
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
RemoveTags
$result = $client->removeTags
([/* ... */]); $promise = $client->removeTagsAsync
([/* ... */]);
Removes existing tags from the specified pipeline.
Parameter Syntax
$result = $client->removeTags([ 'pipelineId' => '<string>', // REQUIRED 'tagKeys' => ['<string>', ...], // REQUIRED ]);
Parameter Details
Members
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
- tagKeys
-
- Required: Yes
- Type: Array of strings
The keys of the tags to remove.
Result Syntax
[]
Result Details
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
ReportTaskProgress
$result = $client->reportTaskProgress
([/* ... */]); $promise = $client->reportTaskProgressAsync
([/* ... */]);
Task runners call ReportTaskProgress
when assigned a task to acknowledge that it has the task. If the web service does not receive this acknowledgement within 2 minutes, it assigns the task in a subsequent PollForTask call. After this initial acknowledgement, the task runner only needs to report progress every 15 minutes to maintain its ownership of the task. You can change this reporting time from 15 minutes by specifying a reportProgressTimeout
field in your pipeline.
If a task runner does not report its status after 5 minutes, AWS Data Pipeline assumes that the task runner is unable to process the task and reassigns the task in a subsequent response to PollForTask. Task runners should call ReportTaskProgress
every 60 seconds.
Parameter Syntax
$result = $client->reportTaskProgress([ 'fields' => [ [ 'key' => '<string>', // REQUIRED 'refValue' => '<string>', 'stringValue' => '<string>', ], // ... ], 'taskId' => '<string>', // REQUIRED ]);
Parameter Details
Members
- fields
-
- Type: Array of Field structures
Key-value pairs that define the properties of the ReportTaskProgressInput object.
- taskId
-
- Required: Yes
- Type: string
The ID of the task assigned to the task runner. This value is provided in the response for PollForTask.
Result Syntax
[ 'canceled' => true || false, ]
Result Details
Members
- canceled
-
- Required: Yes
- Type: boolean
If true, the calling task runner should cancel processing of the task. The task runner does not need to call SetTaskStatus for canceled tasks.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- TaskNotFoundException:
The specified task was not found.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
ReportTaskRunnerHeartbeat
$result = $client->reportTaskRunnerHeartbeat
([/* ... */]); $promise = $client->reportTaskRunnerHeartbeatAsync
([/* ... */]);
Task runners call ReportTaskRunnerHeartbeat
every 15 minutes to indicate that they are operational. If the AWS Data Pipeline Task Runner is launched on a resource managed by AWS Data Pipeline, the web service can use this call to detect when the task runner application has failed and restart a new instance.
Parameter Syntax
$result = $client->reportTaskRunnerHeartbeat([ 'hostname' => '<string>', 'taskrunnerId' => '<string>', // REQUIRED 'workerGroup' => '<string>', ]);
Parameter Details
Members
- hostname
-
- Type: string
The public DNS name of the task runner.
- taskrunnerId
-
- Required: Yes
- Type: string
The ID of the task runner. This value should be unique across your AWS account. In the case of AWS Data Pipeline Task Runner launched on a resource managed by AWS Data Pipeline, the web service provides a unique identifier when it launches the application. If you have written a custom task runner, you should assign a unique identifier for the task runner.
- workerGroup
-
- Type: string
The type of task the task runner is configured to accept and process. The worker group is set as a field on objects in the pipeline when they are created. You can only specify a single value for
workerGroup
. There are no wildcard values permitted inworkerGroup
; the string must be an exact, case-sensitive, match.
Result Syntax
[ 'terminate' => true || false, ]
Result Details
Members
- terminate
-
- Required: Yes
- Type: boolean
Indicates whether the calling task runner should terminate.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
SetStatus
$result = $client->setStatus
([/* ... */]); $promise = $client->setStatusAsync
([/* ... */]);
Requests that the status of the specified physical or logical pipeline objects be updated in the specified pipeline. This update might not occur immediately, but is eventually consistent. The status that can be set depends on the type of object (for example, DataNode or Activity). You cannot perform this operation on FINISHED
pipelines and attempting to do so returns InvalidRequestException
.
Parameter Syntax
$result = $client->setStatus([ 'objectIds' => ['<string>', ...], // REQUIRED 'pipelineId' => '<string>', // REQUIRED 'status' => '<string>', // REQUIRED ]);
Parameter Details
Members
- objectIds
-
- Required: Yes
- Type: Array of strings
The IDs of the objects. The corresponding objects can be either physical or components, but not a mix of both types.
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline that contains the objects.
- status
-
- Required: Yes
- Type: string
The status to be set on all the objects specified in
objectIds
. For components, usePAUSE
orRESUME
. For instances, useTRY_CANCEL
,RERUN
, orMARK_FINISHED
.
Result Syntax
[]
Result Details
Errors
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
SetTaskStatus
$result = $client->setTaskStatus
([/* ... */]); $promise = $client->setTaskStatusAsync
([/* ... */]);
Task runners call SetTaskStatus
to notify AWS Data Pipeline that a task is completed and provide information about the final status. A task runner makes this call regardless of whether the task was sucessful. A task runner does not need to call SetTaskStatus
for tasks that are canceled by the web service during a call to ReportTaskProgress.
Parameter Syntax
$result = $client->setTaskStatus([ 'errorId' => '<string>', 'errorMessage' => '<string>', 'errorStackTrace' => '<string>', 'taskId' => '<string>', // REQUIRED 'taskStatus' => 'FINISHED|FAILED|FALSE', // REQUIRED ]);
Parameter Details
Members
- errorId
-
- Type: string
If an error occurred during the task, this value specifies the error code. This value is set on the physical attempt object. It is used to display error information to the user. It should not start with string "Service_" which is reserved by the system.
- errorMessage
-
- Type: string
If an error occurred during the task, this value specifies a text description of the error. This value is set on the physical attempt object. It is used to display error information to the user. The web service does not parse this value.
- errorStackTrace
-
- Type: string
If an error occurred during the task, this value specifies the stack trace associated with the error. This value is set on the physical attempt object. It is used to display error information to the user. The web service does not parse this value.
- taskId
-
- Required: Yes
- Type: string
The ID of the task assigned to the task runner. This value is provided in the response for PollForTask.
- taskStatus
-
- Required: Yes
- Type: string
If
FINISHED
, the task successfully completed. IfFAILED
, the task ended unsuccessfully. Preconditions use false.
Result Syntax
[]
Result Details
Errors
- InternalServiceError:
An internal service error occurred.
- TaskNotFoundException:
The specified task was not found.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
ValidatePipelineDefinition
$result = $client->validatePipelineDefinition
([/* ... */]); $promise = $client->validatePipelineDefinitionAsync
([/* ... */]);
Validates the specified pipeline definition to ensure that it is well formed and can be run without error.
Parameter Syntax
$result = $client->validatePipelineDefinition([ 'parameterObjects' => [ [ 'attributes' => [ // REQUIRED [ 'key' => '<string>', // REQUIRED 'stringValue' => '<string>', // REQUIRED ], // ... ], 'id' => '<string>', // REQUIRED ], // ... ], 'parameterValues' => [ [ 'id' => '<string>', // REQUIRED 'stringValue' => '<string>', // REQUIRED ], // ... ], 'pipelineId' => '<string>', // REQUIRED 'pipelineObjects' => [ // REQUIRED [ 'fields' => [ // REQUIRED [ 'key' => '<string>', // REQUIRED 'refValue' => '<string>', 'stringValue' => '<string>', ], // ... ], 'id' => '<string>', // REQUIRED 'name' => '<string>', // REQUIRED ], // ... ], ]);
Parameter Details
Members
- parameterObjects
-
- Type: Array of ParameterObject structures
The parameter objects used with the pipeline.
- parameterValues
-
- Type: Array of ParameterValue structures
The parameter values used with the pipeline.
- pipelineId
-
- Required: Yes
- Type: string
The ID of the pipeline.
- pipelineObjects
-
- Required: Yes
- Type: Array of PipelineObject structures
The objects that define the pipeline changes to validate against the pipeline.
Result Syntax
[ 'errored' => true || false, 'validationErrors' => [ [ 'errors' => ['<string>', ...], 'id' => '<string>', ], // ... ], 'validationWarnings' => [ [ 'id' => '<string>', 'warnings' => ['<string>', ...], ], // ... ], ]
Result Details
Members
- errored
-
- Required: Yes
- Type: boolean
Indicates whether there were validation errors.
- validationErrors
-
- Type: Array of ValidationError structures
Any validation errors that were found.
- validationWarnings
-
- Type: Array of ValidationWarning structures
Any validation warnings that were found.
Errors
- InternalServiceError:
An internal service error occurred.
- InvalidRequestException:
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
- PipelineNotFoundException:
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
- PipelineDeletedException:
The specified pipeline has been deleted.
Shapes
Field
Description
A key-value pair that describes a property of a pipeline object. The value is specified as either a string value (StringValue
) or a reference to another object (RefValue
) but not as both.
Members
- key
-
- Required: Yes
- Type: string
The field identifier.
- refValue
-
- Type: string
The field value, expressed as the identifier of another object.
- stringValue
-
- Type: string
The field value, expressed as a String.
InstanceIdentity
Description
Identity information for the EC2 instance that is hosting the task runner. You can get this value by calling a metadata URI from the EC2 instance. For more information, see Instance Metadata in the Amazon Elastic Compute Cloud User Guide. Passing in this value proves that your task runner is running on an EC2 instance, and ensures the proper AWS Data Pipeline service charges are applied to your pipeline.
Members
- document
-
- Type: string
A description of an EC2 instance that is generated when the instance is launched and exposed to the instance via the instance metadata service in the form of a JSON representation of an object.
- signature
-
- Type: string
A signature which can be used to verify the accuracy and authenticity of the information provided in the instance identity document.
InternalServiceError
Description
An internal service error occurred.
Members
- message
-
- Type: string
Description of the error message.
InvalidRequestException
Description
The request was not valid. Verify that your request was properly formatted, that the signature was generated with the correct credentials, and that you haven't exceeded any of the service limits for your account.
Members
- message
-
- Type: string
Description of the error message.
Operator
Description
Contains a logical operation for comparing the value of a field with a specified value.
Members
- type
-
- Type: string
The logical operation to be performed: equal (
EQ
), equal reference (REF_EQ
), less than or equal (LE
), greater than or equal (GE
), or between (BETWEEN
). Equal reference (REF_EQ
) can be used only with reference fields. The other comparison types can be used only with String fields. The comparison types you can use apply only to certain object fields, as detailed below.The comparison operators EQ and REF_EQ act on the following fields:
- name
- @sphere
- parent
- @componentParent
- @instanceParent
- @status
- @scheduledStartTime
- @scheduledEndTime
- @actualStartTime
- @actualEndTime
The comparison operators
GE
,LE
, andBETWEEN
act on the following fields:- @scheduledStartTime
- @scheduledEndTime
- @actualStartTime
- @actualEndTime
Note that fields beginning with the at sign (@) are read-only and set by the web service. When you name fields, you should choose names containing only alpha-numeric values, as symbols may be reserved by AWS Data Pipeline. User-defined fields that you add to a pipeline should prefix their name with the string "my".
- values
-
- Type: Array of strings
The value that the actual field value will be compared with.
ParameterAttribute
Description
The attributes allowed or specified with a parameter object.
Members
- key
-
- Required: Yes
- Type: string
The field identifier.
- stringValue
-
- Required: Yes
- Type: string
The field value, expressed as a String.
ParameterObject
Description
Contains information about a parameter object.
Members
- attributes
-
- Required: Yes
- Type: Array of ParameterAttribute structures
The attributes of the parameter object.
- id
-
- Required: Yes
- Type: string
The ID of the parameter object.
ParameterValue
Description
A value or list of parameter values.
Members
- id
-
- Required: Yes
- Type: string
The ID of the parameter value.
- stringValue
-
- Required: Yes
- Type: string
The field value, expressed as a String.
PipelineDeletedException
Description
The specified pipeline has been deleted.
Members
- message
-
- Type: string
Description of the error message.
PipelineDescription
Description
Contains pipeline metadata.
Members
- description
-
- Type: string
Description of the pipeline.
- fields
-
- Required: Yes
- Type: Array of Field structures
A list of read-only fields that contain metadata about the pipeline: @userId, @accountId, and @pipelineState.
- name
-
- Required: Yes
- Type: string
The name of the pipeline.
- pipelineId
-
- Required: Yes
- Type: string
The pipeline identifier that was assigned by AWS Data Pipeline. This is a string of the form
df-297EG78HU43EEXAMPLE
. - tags
-
- Type: Array of Tag structures
A list of tags to associated with a pipeline. Tags let you control access to pipelines. For more information, see Controlling User Access to Pipelines in the AWS Data Pipeline Developer Guide.
PipelineIdName
Description
Contains the name and identifier of a pipeline.
Members
- id
-
- Type: string
The ID of the pipeline that was assigned by AWS Data Pipeline. This is a string of the form
df-297EG78HU43EEXAMPLE
. - name
-
- Type: string
The name of the pipeline.
PipelineNotFoundException
Description
The specified pipeline was not found. Verify that you used the correct user and account identifiers.
Members
- message
-
- Type: string
Description of the error message.
PipelineObject
Description
Contains information about a pipeline object. This can be a logical, physical, or physical attempt pipeline object. The complete set of components of a pipeline defines the pipeline.
Members
- fields
-
- Required: Yes
- Type: Array of Field structures
Key-value pairs that define the properties of the object.
- id
-
- Required: Yes
- Type: string
The ID of the object.
- name
-
- Required: Yes
- Type: string
The name of the object.
Query
Description
Defines the query to run against an object.
Members
- selectors
-
- Type: Array of Selector structures
List of selectors that define the query. An object must satisfy all of the selectors to match the query.
Selector
Description
A comparision that is used to determine whether a query should return this object.
Members
- fieldName
-
- Type: string
The name of the field that the operator will be applied to. The field name is the "key" portion of the field definition in the pipeline definition syntax that is used by the AWS Data Pipeline API. If the field is not set on the object, the condition fails.
- operator
-
- Type: Operator structure
Contains a logical operation for comparing the value of a field with a specified value.
Tag
Description
Tags are key/value pairs defined by a user and associated with a pipeline to control access. AWS Data Pipeline allows you to associate ten tags per pipeline. For more information, see Controlling User Access to Pipelines in the AWS Data Pipeline Developer Guide.
Members
- key
-
- Required: Yes
- Type: string
The key name of a tag defined by a user. For more information, see Controlling User Access to Pipelines in the AWS Data Pipeline Developer Guide.
- value
-
- Required: Yes
- Type: string
The optional value portion of a tag defined by a user. For more information, see Controlling User Access to Pipelines in the AWS Data Pipeline Developer Guide.
TaskNotFoundException
Description
The specified task was not found.
Members
- message
-
- Type: string
Description of the error message.
TaskObject
Description
Contains information about a pipeline task that is assigned to a task runner.
Members
- attemptId
-
- Type: string
The ID of the pipeline task attempt object. AWS Data Pipeline uses this value to track how many times a task is attempted.
- objects
-
- Type: Associative array of custom strings keys (id) to PipelineObject structures
Connection information for the location where the task runner will publish the output of the task.
- pipelineId
-
- Type: string
The ID of the pipeline that provided the task.
- taskId
-
- Type: string
An internal identifier for the task. This ID is passed to the SetTaskStatus and ReportTaskProgress actions.
ValidationError
Description
Defines a validation error. Validation errors prevent pipeline activation. The set of validation errors that can be returned are defined by AWS Data Pipeline.
Members
- errors
-
- Type: Array of strings
A description of the validation error.
- id
-
- Type: string
The identifier of the object that contains the validation error.
ValidationWarning
Description
Defines a validation warning. Validation warnings do not prevent pipeline activation. The set of validation warnings that can be returned are defined by AWS Data Pipeline.
Members
- id
-
- Type: string
The identifier of the object that contains the validation warning.
- warnings
-
- Type: Array of strings
A description of the validation warning.