AWS Glue DataBrew
AWS Glue DataBrew is a fully managed visual data preparation service for cleaning, normalizing, and transforming data. It differs from AWS Glue ETL in that you don't have write code to work with it. DataBrew provides more than 250 built-in transformations, with a visual point-and-click interface for creating and managing data transformation jobs.
DataBrew is available in a separate console view from AWS Glue. It is natively integrated with several AWS services and supports many different file formats. For more information, see Product and service integrations.
DataBrew is based on the following six core concepts:
-
Project – The entire data preparation workspace in DataBrew
-
Dataset – A collection of structured or semi-structured data
-
Recipe – A set of data transformation steps; each step can contain many actions
-
Job – A set of instructions to run a recipe or a data profile job
-
Data lineage – The tracking of data in a visual interface to identify its origin
-
Data profile – A summary view of the shape of your data
AWS Glue DataBrew is integrated with AWS Glue Studio