IBM watsonx.data PayGo Usage-Based Hybrid Data Lakehouse on AWS

IBM watsonx.data PayGo is an open, hybrid data lakehouse with usage-based pricing for governed analytics and AI workloads across AWS environments

4.4

View purchase options

Overview

Try agent mode

Create proposal

Ask question

IBM watsonx.data PayGo is an open, hybrid data lakehouse offering flexible usage-based pricing for analytics and AI workloads on AWS. It supports open table formats such as Apache Iceberg and Parquet and provides a unified metadata layer for querying structured and unstructured data across AWS, multi-cloud, and on-prem environments - without requiring ETL. Using Presto SQL and Apache Spark, PayGo enables federated, multi-engine analytics optimized for cost and performance.

watsonx.data offers enterprise-grade deployment flexibility and security, including VPCbased deployments, AWS PrivateLink, and support for FedRAMP (Medium) and HIPPA for AWS GovCloud. With builtin governance, automation, and meta-data-driven access controls, watsonx.data PayGo helps teams enhance data trust while simplifying setup and hybrid analytics. Native integrations with Db2 Warehouse on AWS RDS and Netezza on AWS allow organizations to augment existing data warehouse workloads, reducing storage and compute costs by shifting eligible workloads to more efficient lakehouse engines. Customers can reduce data warehouse costs by up to 50% when optimizing across engines and storage tiers.

Because watsonx.data PayGo uses a consumption-based pricing model, organizations can scale data engineering workloads, AI exploration, and business analytics on demand - ideal for dynamic or seasonal workloads. This makes PayGo a flexible option for teams building generative AI pipelines, hybrid analytics, and data modernization initiatives while maintaining governed access to all data across clouds and on-premises systems.

Q: What is the watsonx.data PayGo model?

PayGo offers flexible, consumption-based pricing that allows teams to scale analytics and AI workloads up or down without long-term contracts.

Q: How does watsonx.data support hybrid cloud analytics?

watsonx.data provides a unified entry point across AWS, on-prem, and multi-cloud environments using shared metadata and open table formats like Iceberg and Parquet.

Q: How can watsonx.data help reduce data warehouse costs?

Organizations can cut warehouse costs by up to 50% by offloading workloads to Presto and Spark and optimizing storage tiers.

Q: Who is watsonx.data PayGo best suited for?

Teams with variable or exploratory workloads - such as AI prototyping, seasonal analytics, or data engineering spikes - benefit from usage-based scaling.

Highlights

Scale on demand: Pay only for what you use with usage-based billing optimized for variable analytics and AI workloads on AWS
Hybrid data unification: Query AWS, on-prem, and multi-cloud data through shared metadata using Iceberg, Parquet, Presto, and Spark
Reduce warehouse costs: Lower data warehouse workloads by up to 50% with multi-engine compute and storage optimization

Details

Sold by

IBM Software

Introducing multi-product solutions

You can now purchase comprehensive solutions tailored to use cases and industries.

Learn more

Explore multi-product solutions

Features and programs

Financing for AWS Marketplace purchases

AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

View financing details

Pricing

IBM watsonx.data PayGo Usage-Based Hybrid Data Lakehouse on AWS

Info

View purchase options

Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time.

Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

Usage costs (1)

Info

Dimension	Description	Cost/unit
WXD_PG_SL1	IBM watsonx.data as service pay per use 1 RU	$1.00

Vendor refund policy

Please contact your client account team for refund information

How can we make this page better?

Tell us how we can improve this page, or report an issue with this product.

Legal

Vendor terms and conditions

Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Delivery details

Software as a Service (SaaS)

SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

Resources

Vendor resources

IBM watsonx.data documentation

watsonx.data community

Support

Vendor support

This product includes enterprise-grade support designed for fast deployment and low operational risk. Customers have access to comprehensive public documentation, step-by-step integration guides, and architecture references aligned with AWS best practices. Technical support is available through defined support channels with documented SLAs, and our team actively assists with onboarding, configuration, and troubleshooting. https://www.ibm.com/mysupport/s/?language=en_US

AWS infrastructure support

AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

Get support

Similar products

IBM Security QRadar SIEM v7.5.0 (BYOL)

By IBM Security

IBM QRadar SIEM empowers security analysts and security operations teams with the visibility, automation and insights needed to quickly detect anomalies and uncover advanced threats in real-time.

View product

IBM Granite 3.2 Instruct 8B

By IBM Data and AI

IBM Granite 3.2 Instruct is an open-source model with controllable reasoning, offering strong performance and enhanced complex thinking.

View product

IBM Verify Identity Access v11

By IBM Security

IBM Verify Identity Access helps you simplify your users' access while more securely adopting web, mobile and cloud technologies.

View product

IBM Granite 3.0 8B Instruct

By IBM Data and AI

IBM's Granite 3.0 8B Instruct is an 8B-parameter AI model for enterprise use, excelling in multilingual and code tasks; Apache 2.0 licensed.

View product

IBM Granite 3.0 2B Instruct

By IBM Data and AI

IBM's Granite 3.0 2B Instruct is a 2B-parameter AI model for enterprise use, excelling in multilingual and code tasks; Apache 2.0 licensed.

View product

Customer reviews

Leave a review

Ratings and reviews

Info

4.4

164 ratings

5 star

4 star

3 star

2 star

1 star

59%

37%

0 AWS reviews

164 external reviews

External reviews are from G2 .

Nikita S.

Open Lakehouse Architecture with Seamless Integration and High-Performance Querying

Reviewed on Jul 26, 2026

Review provided by G2

What do you like best about the product?

I like its open lakehouse architecture, seamless integration with multiple data sources, high-performance querying, and scalability. Together, these strengths make data management and AI analytics more efficient.

What do you dislike about the product?

The setup can feel complex, and some of the more advanced features come with a steep learning curve. The interface and documentation could also be made more beginner-friendly, as they aren’t always easy to navigate when you’re just getting started.

What problems is the product solving and how is that benefiting you?

It helps break down data silos and makes it easier to access large datasets. As a result, I can analyze data more efficiently, with better performance and less time spent when working on AI and analytics projects.

Information Technology and Services

Robust Data Storage and Maintenance for Managing Complex Data Flows

Reviewed on Jul 24, 2026

Review provided by G2

What do you like best about the product?

IBM watsonx.data has robust data storage and maintenance capabilities. It’s a powerful tool that has helped me manage data flow for semantic platforms and for the tools built for business intelligence and reporting.

What do you dislike about the product?

The ecosystem and setup process feel somewhat complex. There’s a slow learning curve to get fully engaged, and the UI is less intuitive compared to other available tools that offer similar functionality.

What problems is the product solving and how is that benefiting you?

It helps organize and process large-scale TPA data by unifying it in a single platform, where later stages of ETL processes can run smoothly. It also serves as a single, governed data layer that is retrieved from many different sources.

Chirag S.

Flexible Open Lakehouse with Iceberg Support and Multi-Engine Choice

Reviewed on Jul 23, 2026

Review provided by G2

What do you like best about the product?

Its focus is on giving organizations flexibility without forcing them into a single storage format or query engine. A few aspects stand out as particularly compelling. The open data lakehouse architecture is designed to work with open table formats such as Apache Iceberg, which helps reduce vendor lock-in and makes data more portable across different tools and platforms. The separation of storage and compute also matters: you can scale compute resources independently of storage, which can improve cost efficiency for workloads that fluctuate over time. Finally, instead of relying on one query engine, it supports multiple engines optimized for different workloads, letting users choose the best fit for analytics, SQL, or AI use cases.

What do you dislike about the product?

IBM watsonx.data has several strengths, but it also comes with trade-offs that some users and organizations may find limiting. One is complexity: compared with fully managed cloud data warehouses, watsonx.data can require more upfront planning and ongoing operational expertise, particularly when you’re configuring multiple query engines, storage layers, and governance components. Another is the learning curve: teams that aren’t already familiar with lakehouse concepts, Apache Iceberg, or IBM’s data ecosystem may need additional time before they can become fully productive.

What problems is the product solving and how is that benefiting you?

IBM watsonx.data helps solve the problem of fragmented data and inefficient analytics by offering a unified, open lakehouse platform. For me, the main benefits are that it makes data easier to access, improves performance for AI and analytics workloads, helps lower infrastructure costs, and provides flexibility by supporting open data formats.

Arkajit D.

Powerful Query Performance and Governance, But a Steep Onboarding Learning Curve

Reviewed on May 19, 2026

Review provided by G2

What do you like best about the product?

One feature that stood out for us was the query performance optimization, especially for large reporting and analytics workloads. We process high-volume financial and customer behavior data, and the platform handled complex queries much more efficiently than our previous setup.

I also appreciate the interoperability with existing tools and open formats. Our engineering team didn’t have to completely rebuild pipelines or retrain users from scratch, which made adoption smoother internally.

Another big advantage has been governance and data visibility. In a regulated fintech environment, having stronger control over data access and lineage tracking became extremely important, especially for audit and compliance requirements.

From a business perspective, watsonx.data helped reduce infrastructure inefficiencies while improving access to analytics across teams. Analysts, data engineers, and operations teams were able to work from a more unified environment instead of constantly moving data between disconnected systems.

What do you dislike about the product?

One challenge with IBM watsonx.data is that the platform can feel quite complex during the initial onboarding phase, especially for teams that are newer to lakehouse architectures or hybrid data environments. There are a lot of capabilities available, but understanding how to configure and optimize everything properly takes time.

We also experienced a steeper learning curve around setup, integration, and governance policies compared to some lighter-weight analytics platforms we evaluated. Certain workflows required more technical involvement from our data engineering team than we originally expected.

Another area that could improve is the user experience within parts of the interface. While the platform is powerful, some administrative and configuration tasks don’t always feel as intuitive or streamlined as newer cloud-native tools in the market.

Performance has generally been strong for large workloads, but during early implementation we had to spend time tuning queries and optimizing storage configurations to get consistent results across different environments.

Pricing and infrastructure planning can also become a consideration for organizations scaling large enterprise deployments. Smaller teams without dedicated data engineering resources may find adoption more challenging initially.

What problems is the product solving and how is that benefiting you?

IBM watsonx.data helped us solve a major issue around fragmented data management and slow analytics processing across multiple business systems. Before implementation, our teams were pulling data from separate cloud platforms, transactional databases, and reporting tools, which created delays, duplication, and inconsistent reporting.

One of the biggest problems was handling growing volumes of financial and operational data efficiently without constantly increasing infrastructure costs. Traditional warehouse scaling was becoming expensive, especially as our analytics workloads expanded across departments.

With watsonx.data, we were able to centralize access to structured and semi-structured data while still keeping flexibility in how the data was stored and queried. That significantly improved reporting speed and reduced the amount of manual data movement our engineering team had to manage.

A major benefit for us has been faster analytics and better visibility across teams. Earlier, generating large operational or customer-risk reports could take hours because data pipelines were fragmented. After implementation, analysts were able to query datasets more efficiently and collaborate from a more unified environment.

Anchal P.

Unified Data Management with Learning Curve

Reviewed on May 15, 2026

Review provided by G2

What do you like best about the product?

What I like most about IBM watsonx.data is its ability to unify data from multiple sources without complex migrations or duplication, which saves time and reduces storage costs. Its open lakehouse architecture delivers strong performance for analytics, reporting, and AI workloads while remaining cost-efficient and scalable. I also appreciate the clean and organized UI/UX, which makes navigating datasets, managing workloads, and monitoring data operations more efficient for enterprise teams. The built-in governance, hybrid cloud flexibility, and smooth integrations further simplify data management and support scalable AI and analytics initiatives across environments.

What do you dislike about the product?

One area IBM watsonx.data could improve is the initial setup and configuration, which can feel complex for new users or smaller teams. Some integrations and advanced features also come with a learning curve and would benefit from clearer, more detailed documentation. In certain situations, query performance and troubleshooting can take extra effort, especially when working with very large or highly diverse data environments.

What problems is the product solving and how is that benefiting you?

I use IBM watsonx.data to manage and analyze large data sets across hybrid cloud environments. It streamlines integration, boosts query performance, and provides trusted data access for AI. It simplifies complexity, enhances team collaboration, and controls costs across multiple sources.

View all reviews