Corrections to a dataset - Amazon FinSpace

Corrections to a dataset

Important

Amazon FinSpace Dataset Browser will be discontinued on November 29, 2024. Starting November 29, 2023, FinSpace will no longer accept the creation of new Dataset Browser environments. Customers using Amazon FinSpace with Managed Kdb Insights will not be affected. For more information, review the FAQ or contact AWS Support to assist with your transition.

A changeset can be ingested as a correction to an already created changeset. This action does not delete the prior ingested set but signifies that the replaced changeset will be used when a view is created if both changesets fall under the specified date and time of the view.

To create a changeset that is a replacement to an existing changeset

  1. From the homepage, search for the dataset that you want to make corrections to.

  2. Choose the dataset name to view the dataset details page.

  3. Choose the All Data Views tab.

  4. Under the Dataset Update History table, from the list of changesets identify the changeset to be replaced and then choose the corrections icon (Two curved arrows forming a circular shape, indicating a refresh or sync operation.).

  5. Choose Choose CSV File to select and upload a file from your desktop.

  6. Once the file is uploaded, choose the input format for the ingested data from the following options:

    • Delimiter – Specifies the delimiter character. The default value is Comma.

    • Escape Character – Specifies a character to use for escaping. The default value is None.

    • Quotes – Specifies the character to use for quoting. The default value is Double Quotes (").

    • Multiline Records – Specifies whether a single record can span multiple lines. By default this option is disabled. Enable this option if you want any record to span multiple lines.

    • Treat First Line As Header – Specifies whether to treat the first line as a header. By default this option is disabled.

    • Skip First Data Line – Specifies whether to skip the first data line. By default this option is disabled.

  7. Choose Save. The changeset is added to the Dataset Update History table with a Pending or Running status that changes to Available once the update is successful.