Listing Thumbnail

    GenAI-powered Data Quality

     Info
    Our solution accelerator, a GenAI-powered Data Quality engine for data pipelines, is designed to enable an automated in-built data quality check for data ingestion.

    Overview

    Data Quality has always been an important issue for CDOs and often plagued with complexity and the need for attention to detail that can require significant human resources. Recent developments in Generative AI provided breakthroughs for dealing with Data Quality enabling a level of automation in data quality analysis improving productivity, reducing costs and improving business outcomes.

    Our GenAI-powered engine for data quality in data pipelines is built using AWS CDK (Infrastructure as Code) and performs two critical functions: 1) enables automatic checks of tabular data using the rules provided by the user and 2) autonomously generates data quality rules, further enhancing its adaptability and functionality. It also provides a brief report that justifies the rules that were generated and applied, explaining what they intend to do, thus addressing an important issue of explainability. Human-in-the-loop logic system enables an immediate rectification of any errors or discrepancies, preventing any corrupted data from moving forward in the process.

    The engine is triggered by Amazon EventBridge, which initiates the GenAI pipeline. It utilizes AWS Lambda for compute tasks, for data pre-processing before it goes Amazon S3 for secured storage of data. Amazon SNS notifies stakeholders of any potential data quality issues. AWS CDK implements a CI/CD pipeline for continuous improvement of the system, ensuring that updates and enhancements are seamlessly integrated.

    Highlights

    • Automated tabular data quality checks enhancing productivity, driving efficiencies, reducing human errors and improving business outcomes
    • Continuous improvement through the autonomously generated rules
    • In-built explainability and 'human in the loop' ensures data integrity

    Details

    Delivery method

    Deployed on AWS

    Pricing

    Custom pricing options

    Pricing is based on your specific requirements and eligibility. To get a custom quote for your needs, request a private offer.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Resources

    Vendor resources

    Support

    Vendor support

    Access to an AWS POA funding during a POC/Pilot/MVP phase
    Post MVP - basic support :documentation, FAQs, and email support during business hours

    contact: a.main@reply.com