start-export-task

This command starts an export task on a graph in Neptune Analytics. It allows you to export your graph into columnar structured .csv and .parquet files. Calling export on a graph will generate a unique taskId that you can use to track the progress of your export. When an export is triggered, a clone of your graph is created to process the export request, allowing your graph to continue servicing queries and analytics with no performance impact.

start-export-task syntax


aws neptune-graph start-export-task \
 --graph-identifier <GRAPH_ID> \
 --region <region> \
 --role-arn <arn> \
 --format <format> \
 [--parquet-type <parquet-type>] \
 --kms-key-identifier <kms-key> \
 --destination <s3-url> \
 [--export-filter <filter-json>] #See filtering section for details.

start-export-task inputs

--graph-identifier <GRAPH_ID> - The unique identifier of the Neptune Analytics graph to export.
--region <region> - The AWS region where the Neptune Analytics graph is located.
--role-arn <arn> - The ARN of an IAM role that grants Neptune Analytics the necessary permissions to access the Amazon S3 bucket for the export.
--format <format> - The output format for the exported data, either CSV or PARQUET.
--parquet-type <parquet-type> - Optional. The Parquet type to use when the format is PARQUET, for example COLUMNAR (the value shown in the start-export-task output).
--kms-key-identifier <kms-key> - The AWS KMS key to use for server-side encryption of the exported data in Amazon S3. For more information, see Create and configure IAM role and AWS KMS key.
--destination <s3-url> - The Amazon S3 location where the exported data will be written. The provided role-arn must have permission to write to this location. Exported data will be written to this folder in a sub-directory given by the export taskId. See start-export-task output for more information.
--export-filter <filter-json> - A JSON object that specifies which vertices and edges to include in the export, based on their labels and properties. This field is optional; if it is not provided, a value of '{}' is used, which exports the whole property graph. For expanded syntax and examples of the export filter JSON object, see Specifying a filter.
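
The inputs above can be assembled programmatically. The following is a minimal Python sketch, not an official helper: the function name build_start_export_command and every identifier value shown (graph ID, role ARN, KMS key, bucket) are hypothetical placeholders. It only builds the argument list from the flags documented above and does not call AWS.

```python
import json
import shlex

# The two output formats named in the inputs list above.
VALID_FORMATS = {"CSV", "PARQUET"}


def build_start_export_command(graph_id, region, role_arn, export_format,
                               kms_key, destination,
                               parquet_type=None, export_filter=None):
    """Return the argv list for `aws neptune-graph start-export-task`."""
    if export_format not in VALID_FORMATS:
        raise ValueError(f"format must be one of {sorted(VALID_FORMATS)}")
    cmd = [
        "aws", "neptune-graph", "start-export-task",
        "--graph-identifier", graph_id,
        "--region", region,
        "--role-arn", role_arn,
        "--format", export_format,
        "--kms-key-identifier", kms_key,
        "--destination", destination,
    ]
    # Optional flags are appended only when a value is supplied.
    if parquet_type is not None:
        cmd += ["--parquet-type", parquet_type]
    if export_filter is not None:
        cmd += ["--export-filter", json.dumps(export_filter)]
    return cmd


# All values below are placeholders for illustration only.
example = build_start_export_command(
    graph_id="g-0123456789",
    region="us-east-1",
    role_arn="arn:aws:iam::123456789012:role/ExampleExportRole",
    export_format="PARQUET",
    kms_key="arn:aws:kms:us-east-1:123456789012:key/example-key-id",
    destination="s3://example-bucket/exports/",
    parquet_type="COLUMNAR",
    export_filter={},  # '{}' corresponds to exporting the whole graph
)
print(shlex.join(example))
```

To actually start an export, the resulting list could be passed to subprocess.run(example, check=True) on a machine where the AWS CLI and credentials are configured.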

start-export-task output

The response from start-export-task is a JSON string. The taskId is the most important value in the response: it identifies the export when you call get-export-task or list-export-tasks, and it identifies the export process in CloudWatch logs. The other values record which export parameters were used for a given taskId.


{
    "graphId": "$GRAPH_ID", // The identifier of the graph being exported.
    "roleArn": "$arn", // The ARN of the IAM role being used to give
                       // export the required permissions.
    "taskId": "$taskId", // A unique id corresponding to the requested export. 
    "status": "INITIALIZING", // The status of the export.
                              // One of INITIALIZING,
                              //  EXPORTING,
                              //  SUCCEEDED,
                              //  FAILED,
                              //  CANCELLING,
                              //  CANCELLED
    "format": "PARQUET", // The requested format of the export.
                         // One of CSV or PARQUET.
    "destination": "$s3-uri", // The Amazon S3 location where the exported
                              // data will be written. 
    "kmsKeyIdentifier": "$kms_key", // The AWS KMS key to use for server-side
                                    // encryption of the exported data in Amazon S3.
    "parquetType": "COLUMNAR" // If a Parquet export was requested,
                              // gives the Parquet type.
}
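
As a rough illustration of how the response might be consumed, the Python sketch below parses a response, pulls out the taskId, and derives the Amazon S3 prefix where the data is expected to land (the destination plus a sub-directory named after the taskId, as described in the inputs section). The function name summarize_export_response, the sample values, the exact trailing-slash layout of the prefix, and the choice of which statuses count as terminal are all assumptions made for this example.

```python
import json

# Assumption: these statuses mean the task will not change state again.
TERMINAL_STATUSES = {"SUCCEEDED", "FAILED", "CANCELLED"}


def summarize_export_response(response_text):
    """Extract the fields most useful for tracking an export."""
    response = json.loads(response_text)
    task_id = response["taskId"]
    status = response["status"]
    # Exported data is written under the destination, in a
    # sub-directory named after the taskId.
    output_prefix = response["destination"].rstrip("/") + "/" + task_id + "/"
    return {
        "task_id": task_id,
        "status": status,
        "is_terminal": status in TERMINAL_STATUSES,
        "output_prefix": output_prefix,
    }


# Placeholder response; a real response contains no comments.
sample_response = json.dumps({
    "graphId": "g-0123456789",
    "roleArn": "arn:aws:iam::123456789012:role/ExampleExportRole",
    "taskId": "t-example123",
    "status": "INITIALIZING",
    "format": "PARQUET",
    "destination": "s3://example-bucket/exports/",
    "kmsKeyIdentifier": "arn:aws:kms:us-east-1:123456789012:key/example-key-id",
    "parquetType": "COLUMNAR",
})

print(summarize_export_response(sample_response))
```

The task_id returned here is the value to pass to get-export-task when checking on progress.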
