Creating a dataset
Important
Amazon FinSpace Dataset Browser will be discontinued on March 26,
2025
. Starting November 29, 2023
, FinSpace will no longer accept the creation of new Dataset Browser
environments. Customers using Amazon FinSpace with Managed Kdb Insights
Note
In order to create and manage datasets, you must be a superuser or a member of a group with necessary permissions – Create Datasets.
A dataset can be created by loading a file using the Amazon FinSpace web application.
To create a dataset
Sign in to the FinSpace web application. For more information, see Signing in to the Amazon FinSpace web application.
On the left navigation bar of the home page, choose Add Data.
-
Drag and drop a .csv file or choose Browse Files to select a file. Once the file is detected by the web application, schema of the file will be displayed. The column names are read from the file and data types are inferred.
-
Change the data types as required by choosing Edit Derived Schema. Take note of the data types and formats that are supported.
-
Choose Save Schema.
-
Choose Confirm Schema & Upload File. This action starts the following process:
-
Create a dataset with name of the .csv file that was loaded and takes you to the dataset details page.
-
Once the upload of the sample data file is complete, a changeset is created with the content of the data file. Verify by checking the Dataset Update History table under All Data Views tab.
-
Data view creation process is started. Once the upload of the sample data file is complete, a process is kicked off to create a data view that can be analyzed in a notebook.
For small files of up to 100 megabytes, data view creation takes approximately 2 minutes. For larger files of around 1 gigabyte, expect data view creation to take approximately 3-4 minutes. Views with partitioning and sorting schemes may take longer.
Once a dataset is created, you can start adding data to it. A new set of data added to a dataset creates a corresponding changeset.
-