Add a step to your data flow - Amazon SageMaker AI

Add a step to your data flow

In your Data Wrangler data flows, you can add steps that represent data transformations and analyses.

To add a step to your data flow, select + next to any dataset node or previously added step. Then, select one of the following options:

  • Edit data types (For a Data types step only): If you have not added any transforms to a Data types step, you can double-click on the Data types step in your flow to open the Data tab and edit the data types that Data Wrangler inferred when importing your dataset.

  • Add transform: Adds a new transform step. See Transform data to learn more about the data transformations you can add.

  • Get data insights: Add analyses, such as histograms or custom visualizations. You can use this option to analyze your data at any point in the data flow. See Perform exploratory data analysis (EDA) to learn more about the analyses you can add.

  • Join: Find this option under Combine data to join two datasets and add the resulting dataset to the data flow. To learn more, see Join Datasets.

  • Concatenate: Find this option under Combine data to concatenate two datasets and add the resulting dataset to the data flow. To learn more, see Concatenate Datasets.