Amazon Glue DataBrew
AWS Glue DataBrew is a visual data preparation tool that makes it easy for data analysts and data scientists to clean and normalize data to prepare it for analytics and machine learning. It provides over 250 pre-built transformations to automate data preparation tasks.
APIs
AWS Glue DataBrew API
The AWS Glue DataBrew API provides programmatic access to create and manage datasets, recipes, projects, jobs, and rulesets for visual data preparation and transformation workfl...
Features
Apply over 250 ready-to-use transformations without writing code, including filtering, normalizing, aggregating, and reformatting data.
Interactive visual interface to explore and transform data without writing code.
Save transformation steps as reusable recipes that can be versioned and shared across teams.
Automatically profile datasets to understand data quality, distribution, and statistics.
Define and enforce data quality rules with rulesets to validate data before processing.
Create shared projects for team-based data preparation with centralized management.
Use Cases
Clean, normalize, and transform raw data for business analytics dashboards and reports.
Prepare and transform features from raw data for training machine learning models.
Profile datasets and apply quality rules to ensure data meets standards before processing.
Automate recurring data transformation jobs as part of data pipeline workflows.