Overview
IBM watsonx.data PayGo is an open, hybrid data lakehouse offering flexible usage-based pricing for analytics and AI workloads on AWS. It supports open table formats such as Apache Iceberg and Parquet and provides a unified metadata layer for querying structured and unstructured data across AWS, multi-cloud, and on-prem environments - without requiring ETL. Using Presto SQL and Apache Spark, PayGo enables federated, multi-engine analytics optimized for cost and performance.
watsonx.data offers enterprise-grade deployment flexibility and security, including VPCbased deployments, AWS PrivateLink, and support for FedRAMP (Medium) and HIPPA for AWS GovCloud. With builtin governance, automation, and meta-data-driven access controls, watsonx.data PayGo helps teams enhance data trust while simplifying setup and hybrid analytics. Native integrations with Db2 Warehouse on AWS RDS and Netezza on AWS allow organizations to augment existing data warehouse workloads, reducing storage and compute costs by shifting eligible workloads to more efficient lakehouse engines. Customers can reduce data warehouse costs by up to 50% when optimizing across engines and storage tiers.
Because watsonx.data PayGo uses a consumption-based pricing model, organizations can scale data engineering workloads, AI exploration, and business analytics on demand - ideal for dynamic or seasonal workloads. This makes PayGo a flexible option for teams building generative AI pipelines, hybrid analytics, and data modernization initiatives while maintaining governed access to all data across clouds and on-premises systems.
Q: What is the watsonx.data PayGo model?
PayGo offers flexible, consumption-based pricing that allows teams to scale analytics and AI workloads up or down without long-term contracts.
Q: How does watsonx.data support hybrid cloud analytics?
watsonx.data provides a unified entry point across AWS, on-prem, and multi-cloud environments using shared metadata and open table formats like Iceberg and Parquet.
Q: How can watsonx.data help reduce data warehouse costs?
Organizations can cut warehouse costs by up to 50% by offloading workloads to Presto and Spark and optimizing storage tiers.
Q: Who is watsonx.data PayGo best suited for?
Teams with variable or exploratory workloads - such as AI prototyping, seasonal analytics, or data engineering spikes - benefit from usage-based scaling.
Highlights
- Scale on demand: Pay only for what you use with usage-based billing optimized for variable analytics and AI workloads on AWS
- Hybrid data unification: Query AWS, on-prem, and multi-cloud data through shared metadata using Iceberg, Parquet, Presto, and Spark
- Reduce warehouse costs: Lower data warehouse workloads by up to 50% with multi-engine compute and storage optimization
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/unit |
|---|---|---|
WXD_PG_SL1 | IBM watsonx.data as service pay per use 1 RU | $1.00 |
Vendor refund policy
Please contact your client account team for refund information
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
This product includes enterprise-grade support designed for fast deployment and low operational risk. Customers have access to comprehensive public documentation, step-by-step integration guides, and architecture references aligned with AWS best practices. Technical support is available through defined support channels with documented SLAs, and our team actively assists with onboarding, configuration, and troubleshooting. https://www.ibm.com/mysupport/s/?language=en_US
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products



Customer reviews
Powerful Query Performance and Governance, But a Steep Onboarding Learning Curve
I also appreciate the interoperability with existing tools and open formats. Our engineering team didn’t have to completely rebuild pipelines or retrain users from scratch, which made adoption smoother internally.
Another big advantage has been governance and data visibility. In a regulated fintech environment, having stronger control over data access and lineage tracking became extremely important, especially for audit and compliance requirements.
From a business perspective, watsonx.data helped reduce infrastructure inefficiencies while improving access to analytics across teams. Analysts, data engineers, and operations teams were able to work from a more unified environment instead of constantly moving data between disconnected systems.
We also experienced a steeper learning curve around setup, integration, and governance policies compared to some lighter-weight analytics platforms we evaluated. Certain workflows required more technical involvement from our data engineering team than we originally expected.
Another area that could improve is the user experience within parts of the interface. While the platform is powerful, some administrative and configuration tasks don’t always feel as intuitive or streamlined as newer cloud-native tools in the market.
Performance has generally been strong for large workloads, but during early implementation we had to spend time tuning queries and optimizing storage configurations to get consistent results across different environments.
Pricing and infrastructure planning can also become a consideration for organizations scaling large enterprise deployments. Smaller teams without dedicated data engineering resources may find adoption more challenging initially.
One of the biggest problems was handling growing volumes of financial and operational data efficiently without constantly increasing infrastructure costs. Traditional warehouse scaling was becoming expensive, especially as our analytics workloads expanded across departments.
With watsonx.data, we were able to centralize access to structured and semi-structured data while still keeping flexibility in how the data was stored and queried. That significantly improved reporting speed and reduced the amount of manual data movement our engineering team had to manage.
A major benefit for us has been faster analytics and better visibility across teams. Earlier, generating large operational or customer-risk reports could take hours because data pipelines were fragmented. After implementation, analysts were able to query datasets more efficiently and collaborate from a more unified environment.
Unified Data Management with Learning Curve
Complex Setup and Rising Costs at Scale Despite a Strong Lakehouse Foundation
It also delivers strong performance with built-in query optimization and integrates easily with existing data tools, making analytics faster and simpler.
It can also become expensive at scale, particularly when handling large workloads or advanced features.
This benefits you by reducing data duplication, lowering costs, and enabling faster, more efficient analytics and decision-making.
Efficient and Scalable Lakehouse Platform for Modern Data Analytics
Another major advantage is its scalability and governance. The platform reliably supports high-volume enterprise data workloads while also providing strong security controls and solid data governance features.
I also think some UI workflows and monitoring features could be more intuitive. At times, troubleshooting performance issues or managing integrations across different environments takes extra effort than it should. Additionally, pricing and resource consumption can become expensive for large-scale deployments, so more transparent cost-optimization tools and simpler management features would help improve the overall experience.
With IBM watsonx.data, we can now query data across different sources more efficiently, without unnecessary duplication or migration. This has improved analytics performance, lowered storage and operational costs, and helped our teams reach insights faster to support decision-making. The platform’s scalability, along with its integration with AI and analytics tools, has also boosted productivity by simplifying big data processing and enabling quicker development of data-driven solutions. Overall, it has helped us streamline our data architecture while strengthening governance, flexibility, and operational efficiency.