Why Glue Data Quality?
Without proper oversight, data lakes can turn into data swamps. Setting up data quality checks is time-consuming, tedious, and error-prone: you must manually create data quality rules, write code to monitor data pipelines, and alert data consumers when data quality deteriorates. AWS Glue Data Quality reduces this manual effort from days to hours. It automatically computes statistics, recommends quality rules, monitors your data, and alerts you when it detects issues. For hidden and hard-to-find issues, Glue Data Quality uses ML algorithms. The combination of rule-based and ML approaches, together with a serverless, scalable, and open solution, enables you to deliver high-quality data for confident business decisions.