Amazon SageMaker Data and AI Governance
Discover, govern, and collaborate on data and AI securelyOverview
The next generation of Amazon SageMaker simplifies the discovery, governance, and collaboration for data and AI across your lakehouse, AI models, and applications. With Amazon SageMaker Catalog, built on Amazon DataZone, users can securely discover and access approved data and models using semantic search with generative AI created metadata, or you could just ask Q Developer with natural language to find your data. Users can define and enforce access policies consistently using a single permission model with fine-grained access controls centrally in the SageMaker Unified Studio (preview). Seamlessly share and collaborate on data and AI assets through easy publishing and subscribing workflows. With Amazon SageMaker, you can safeguard and protect your AI models using Amazon Bedrock guardrails and implement responsible AI policies. Build trust throughout your organization with data quality monitoring and automation, sensitive data detection and data and ML lineage.
Benefits
Features
Curated data for context and findability
The Catalog brings business context to your technical metadata and enables you to enrich it with business context. You can make data visible with business context for all your users to find, understand, and trust data quickly and easily.
Automated metadata recommendations
Automate adding business descriptions and names to data, which helps you easily understand context and helps you avoid dealing with cryptic technical names. This automation is powered by large language models (LLMs) to increase accuracy and consistency.
Bring a consistent level of AI safety across all your applications
Amazon Bedrock guardrails help evaluate user inputs and FM responses based on use case specific policies, and provides an additional layer of safeguards regardless of the underlying Foundation Models.
Quickly audit and track models
Quickly audit and troubleshoot performance for all models, endpoints, and model monitoring jobs through a unified view. Track deviations from expected model behavior, as well as missing or inactive monitoring jobs, with automated alerts.
Data quality
With data quality statistics, data consumers can see data quality metrics from AWS or third-party systems. Data consumers can trust the data sources they use for decisions, and have data quality context as they search for assets. Data producers and IT teams can also use APIs to incorporate the data quality statistics from third-party systems into a unified, out-of-console portal.
Data and ML lineage
Understand the movement of data and models over time. Lineage can raise trust and an organization’s data and AI literacy by helping data consumers understand where data came from, how it changed, and its consumption. You can reduce time spent in mapping a data and AI assets and its relationships, troubleshooting and developing pipelines, and asserting data and AI governance practices.
Get started with SageMaker Data and AI Governance
Did you find what you were looking for today?
Let us know so we can improve the quality of the content on our pages.