
Overview
For more information or customized pricing, please email us: cpd_on_aws@wwpdl.vnet.ibm.com
IBM Cloud Pak for Data is a unified data and AI platform that connects the right data, at the right time, to the right people anywhere. Available on AWS and running on Red Hat OpenShift, the platform simplifies data access, automates data discovery and curation, and safeguards sensitive information by automating policy enforcement for all users in your organization. Make better data driven decisions and lay the foundation for AI with a data fabric that connects siloed data on premises or across multiple clouds without data movement. Discover actionable insights and apply trusted data to build, run, automate and manage AI models.
Outcomes:
- Data access and availability: Eliminate data silos and simplify your data landscape to enable faster, cost-effective extraction of value from your data.
- Data quality and governance: Apply governance solutions and methodologies to deliver trusted, business data.
- Data privacy and security: Fully understand and manage sensitive data with a pervasive privacy framework.
- Batch data integration: Design, develop and run jobs that move and transform data with powerful automated integration capabilities.
- 360 entity data: Enable agility and accelerated ROI for consolidated and governed views of critical enterprise data.
Product Version 4.7.x
Standard Min: 48 VPCs Enterprise Min: 72 VPCs
Already have a CP4D License? Deploy from the BYOL Listing today!
Highlights
- Deliver data responsibly with a data fabric. Unify and access disparate data with AutoSQL, a universal query engine. Discover and classify data in real time with Watson Knowledge Catalog. Protect sensitive data with automated policy enforcement.
- Scale trustworthy AI: Synchronize application and model pipelines while reducing drift, bias, and risk with ModelOps on Watson Studio. Monitor and govern AI models to meet regulations, manage risk and enhance transparency.
- Recognized by analysts as a Leader in core data and AI segments: The Forrester Wave™: Machine Learning Data Catalogs, Q4 2020; 2021 Gartner Magic Quadrant for Data Science and Machine Learning; The Forrester Wave™: Multi modal Predictive Analytics and Machine Learning, Q3 2020.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Buyer guide

Financing for AWS Marketplace purchases
Pricing
Dimension | Description | Cost/month |
|---|---|---|
Standard Option | Cloud Pak for Data Standard Option: 48 VPCs | $19,824.00 |
Enterprise Option | Cloud Pak for Data Enterprise Option: 72 VPCs | $59,400.00 |
Vendor refund policy
Please contact your rep for any questions.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.


Standard contract
Customer reviews
Unified data workspace has enabled secure collaboration and improved automated AI lifecycle
What is our primary use case?
I use IBM Cloud Pak for Data to connect it with our product for integration with the cloud, and it helps to enable all of our data users to collaborate from a single, unified interface that supports many services that are designed to work together.
I do use the data virtualization features in IBM Cloud Pak for Data , and this has positively impacted my data analysis operations.
What is most valuable?
What I appreciate most about IBM Cloud Pak for Data is the all-in-one cloud-native data and AI platform in a single platform. It enables us to collect, organize, and analyze data and provides unprecedented simplicity and agility within a pre-configured and governed environment.
What needs improvement?
Areas within IBM Cloud Pak for Data that have room for improvement include user interface design and integration capabilities.
For how long have I used the solution?
I have been using IBM Cloud Pak for Data since 2020, so that is approximately five years.
What do I think about the scalability of the solution?
For scalability, I rate it a nine out of ten because it is a very scalable solution that has been able to handle my organization's growth efficiently.
How are customer service and support?
I rate the technical support from IBM a nine out of ten because the support has been very top-notch, unparalleled, and also very professional.
It is much more cost-effective compared to other solutions such as SAP, and the customer and technical support have always been very proactive and helpful.
Which other solutions did I evaluate?
I compare IBM Cloud Pak for Data with other solutions and find it stands out in several key aspects.
What other advice do I have?
My relationship with IBM Cloud Pak for Data is that I am a customer.
I use IBM Data Stage for ETL processes, and this has helped my data integration significantly.
The Watson Knowledge Catalog has helped improve my data governance by providing several benefits.
I assess the impact of the automated AI lifecycle management on my project development times as quite significant.
IBM Cloud Pak for Data is a great tool because if you need to connect to or get data from multiple data sources, it is the best solution, enabling data-driven decisions that are very accurate. I would rate this solution a nine out of ten overall.
Integrated data tools have unified governance and AI workflows for complex enterprise projects
What is our primary use case?
I believe IBM Cloud Pak for Data is suitable for mid-size to bigger companies. It is not tailored for smaller customers.
My customers use IBM DataStage for ETL processes.
One client has implemented automated AI lifecycle management, and their journey with IBM Cloud Pak for Data was very successful. They are one of the first banks in Jordan to implement AI in achievable and beneficial use cases that benefit their internal and external clients. The implementation was very successful.
What is most valuable?
The features I find most valuable in IBM Cloud Pak for Data are the data warehouse, data repository, and the data governance tools that exist there, including data masking, data quality, and ETL. The best thing about IBM Cloud Pak for Data is that you are getting all the products or all the needs in one license, and as you utilize it, you can add more licenses.
IBM DataStage has helped their data integration efforts by providing enhanced ETL tools. It is not just extracting the data and loading it, but it has some advanced tools to classify the data and transform the right format of the data, helping my customers have more cleansed data. It enables the customer to cleanse their data and have the right data to utilize. This was a value-add for the customers and it is very good. It connects quickly, so the experience with the implementation integration is stable. Since 2019, my customers have been working with no issues, and it is serving them well.
Watson Knowledge Catalog has helped improve data governance in my customers' organizations. I have some customers who needed data masking, and the WKC supported them in that need. I have a customer who only purchased IBM Cloud Pak for Data for this requirement of data masking, and it helped them a lot.
What needs improvement?
I see room for improvement in IBM Cloud Pak for Data, as it lacked the lake house. However, IBM issued the new product which is Watsonx.data. This is a new product for IBM and it provides all the missing capabilities that were lacking because this technology was released before the concept of lake house was established. The new product covers all those requirements.
I believe IBM Cloud Pak for Data could learn from its competitors in terms of data governance tools. Previously, Informatica had a more robust data governance offering, but with their sale to Salesforce , I believe they are not a threat to IBM. IBM has an even better story. Oracle has better go-to-market options, but it is not an issue with the product itself, rather the way they do their sales. I believe we have a strong product. Nothing is missing except the strategy from IBM on how to sell, not the product itself.
For how long have I used the solution?
Since its release, I have been dealing with IBM Cloud Pak for Data. It was previously a different product earlier, known as Netezza and different products. Since we named it IBM Cloud Pak for Data, we have sold it. I believe the first one was in 2019.
What do I think about the stability of the solution?
The overall performance of IBM Cloud Pak for Data, particularly with IBM DataStage for ETL processes, is very good. The customers I have are all very happy. They are looking for expansion and the experience is very good with it. It is providing them with the results needed and more. This was one of the successful products that I have worked on with IBM.
How are customer service and support?
I would rate the technical support by IBM as an eight to nine.
The response time for IBM's technical support is excellent. I gave them an eight to nine to cover the market here, but they really are excellent. I had a case a couple of days ago where the IBM team was with us 24/7 until the issue was closed. They need more enhancement in level one support, but overall it is good and effective.
How was the initial setup?
I believe the initial setup and configuration of IBM Cloud Pak for Data is straightforward. The implementation was needed first because it was from the first project I implemented with IBM. However, the support and implementation afterward, and all the enhancements, were done by the customer themselves. It was straightforward.
Which other solutions did I evaluate?
I think IBM Cloud Pak for Data is the best option on the market at the moment with the addition of Watsonx.data. For the current needs, it depends on what you need. It is not a magic ingredient. You need to understand your business requirements and accordingly select the right product for your need. It is a tool, but it has all the ingredients to succeed. You need to select the right thing in order to implement it well.
What other advice do I have?
I believe the licensing model and pricing of IBM Cloud Pak for Data is fair. If you get the right discount, it is fair and competitive.
Not all of my customers utilize data virtualization features. Mostly, my customers had their own data visualization tools earlier, so they kept using what they have in order not to buy more. They have seamlessly integrated the product they have with IBM Cloud Pak for Data.
I would not expect additional functionalities regarding AI from them in the future. It is already there, and the customers who purchased IBM Cloud Pak for Data do not need to change or replace the product. They can keep what they have and utilize Watsonx.data on top of it to enable them for the new features, so they do not lose anything.
My customers usually do not purchase IBM Cloud Pak for Data from the AWS Marketplace . They purchase it from partners and implement it on-premises, not on cloud.
I would rate this review a nine overall.
Data integration has accelerated financial workflows and now supports reliable AI-driven projects
What is our primary use case?
I usually recommend IBM Cloud Pak for Data for companies in the financial sector, as we are mostly working with local insurance companies and banks within Hungary where we are located.
For IBM Cloud Pak for Data setup and configuration, I think it is outstanding. The documentation is comprehensive, and we did not have any issues with that.
What is most valuable?
The features I find most valuable in IBM Cloud Pak for Data are the integration feature, specifically Message Queue and App Connect Enterprise.
I assess the impact of automated AI lifecycle management on project development times as positive since it accelerates our processes.
What needs improvement?
I think we are happy with IBM Cloud Pak for Data, and there is no specific idea that comes to my mind regarding room for improvement. We are following the progress and the new features, so overall we are quite content and satisfied with it.
I don't have any specific idea regarding additional features they could incorporate in the future to make it even better.
For how long have I used the solution?
I have been dealing with IBM Cloud Pak for Data for more than ten years now since the company is working with the IBM Integration portfolio. IBM Cloud Pak for Data itself is younger, but we started to work with it from the very beginning. I have been working with it for at least five years.
What do I think about the stability of the solution?
I did not have any problems while integrating it with any particular solutions that I can recall.
How are customer service and support?
I would rate the technical support by IBM as adequate. We have submitted some trouble tickets, and there was always an answer provided, so overall it is satisfactory.
They do not provide local support in a local language, as it is provided in English, but that is acceptable to us. I think that the local language market is not substantial enough, as there are not enough customers in Hungary to justify localization, but it is not an issue. Usually, our enterprise customers are comfortable with English.
What's my experience with pricing, setup cost, and licensing?
Regarding the price, I know IBM is traditionally relatively expensive in the Hungarian market, but we work together with the local IBM sales team, and on a project basis they manage to negotiate the prices. We rarely can sell it at the list price of course. Overall, the challenge is to let the customer see the value, so I do not have too many price concerns. The list price is high, but the flexibility in pricing is adequate.
I think the licensing model is acceptable and there is no need for change. Custom project-based pricing is usually possible with some customer discounts if the project is substantial enough, so overall we could sell many IBM licenses.
Which other solutions did I evaluate?
We usually go with IBM Cloud Pak for Data first when recommending products for smaller businesses, but in other cases, the customer may have an existing install base from some competitor, and that affects the recommendation.
What other advice do I have?
I do not have a specific opinion about its influence on decision-making accuracy.
In terms of data virtualization features, we do not use that, but we use some virtualization features.
At the moment we do not use Watson Knowledge Catalog, so it has not helped improve data governance for us.
Regarding Data Stage for ETL processes, we do not use that.
I think AI capabilities are coming regardless, and the product is progressing. IBM can be slightly slow with introducing new features, but I do not feel it is lacking in this respect. The new agents and assistants within the product are beneficial.
For us, IBM Cloud Pak for Data is the best option on the market at the moment. In its own category, I think it is the best, and we are satisfied with it.
In the financial segment where we are working, I think IBM Cloud Pak for Data is the market leader in our local territory, but there could be more marketing and promotion.
I do not have significant insight into other industries because our company really focuses on the financial sector. As far as I know, IBM is also strong in manufacturing, but SAP itself is very strong in Hungary in manufacturing, providing end-to-end solutions which means there is less room for platforms like IBM. In financial institutions, SAP is not strong at all, so I think IBM is the strongest in this respect for these platforms.
I would recommend IBM Cloud Pak for Data for different types of companies because the solution itself is not industry-specific. I mention finance only because my company focuses on that type of customer. Different IBM partners focus on different customers. There is a need for a minimum customer size, but I would not recommend IBM Cloud Pak for Data for smaller companies, as they might not need the higher reliability that IBM provides. Conversely, they might want a simpler, cheaper solution because their needs are not as comprehensive. For really large to medium-sized enterprises with very mission-critical applications and systems, that is what I would recommend.
I would give this product a rating of nine out of ten.
Data fabric has streamlined predictive analytics and has transformed how we manage hybrid data
What is our primary use case?
My main use case for IBM Cloud Pak for Data includes storing utility data to build a smart utility solution for the prediction of future trends. All data is stored, and with the help of AI and machine learning algorithms, analytics dashboards are built on the same platform. It helps manage and store high volumes of both structured and unstructured data and deliver desired results in optimum time.
A specific example of how I used IBM Cloud Pak for Data in one of my projects is that it serves as a very fully scalable platform for data and analytics. We use it to provide data solutions for our customers and provide various industry solutions for clients that need a cloud data platform that can be used for data analytics, data science, and data visualization.
What is most valuable?
The best features that IBM Cloud Pak for Data offers include data analytics, data science, data management, data catalog, inbuilt AI and machine learning capability, and integration with other applications, along with very safe cloud storage.
I find all of these features to be the most valuable for my daily work because they increase our impact by combining AI skills with advanced analytics and machine learning in an easy-to-use visual interface. Visualization and reporting are great in IBM Cloud Pak for Data, allowing me to manage data spread across distributed stores and clouds.
IBM Cloud Pak for Data has positively impacted my organization as it saves a lot of time by predicting outcomes faster using a platform built with data fabric architecture. It is easy to collect, organize, and analyze data no matter where it is located. Manual cataloging is eliminated, thereby saving a lot of cost, approximately 50 to 60 percent. Additionally, it significantly reduces data footprint, and AI and ML analysis for predictive analytics are excellent.
What needs improvement?
To improve IBM Cloud Pak for Data, I suggest more out-of-the-box integration, enhancement of analytics to be sharper, and the deployment options should be very flexible.
For how long have I used the solution?
I have been using IBM Cloud Pak for Data for the past eight years, even in my previous organization.
What other advice do I have?
I would add that our clients who want to address requirements alongside data catalog, data governance, and data visualization will benefit from investing in IBM Cloud Pak for Data.
I rate IBM Cloud Pak for Data a nine out of ten. I choose nine out of ten because this tool enables us to connect multiple data sources, ingest data, and run AI and ML algorithms. IBM Cloud Pak for Data is a very good solution for that work.
My advice to others looking into using IBM Cloud Pak for Data is that it has played a huge role in automating our end-to-end life cycles in the network dimension, giving us an extensive view of data across diverse data sources for required features at any moment without needing to migrate to a central repository. The platform has made our data management across multiple clouds much easier, thereby saving a lot of time. I find it to be a very commendable tool for that work.
IBM Cloud Pak for Data has enabled us to access diffuse data quickly across hybrid networks and given our teams an edge in data management through automation while adhering to compliance regulations. Additionally, it is a speedy solution, increasing our client satisfaction. I would like to note that my overall rating for IBM Cloud Pak for Data is nine out of ten.
Data workflows have become more transparent and now support faster, trusted decisions
What is our primary use case?
IBM Cloud Pak for Data is a powerful cloud-native all-in-one easy-to-use solution that enables us to put data to work quickly and effectively. These tools enable us to approach analytics our way with code, low-code, and no-code options that allow us to collaborate on one platform. It is easy to transform structured and unstructured data into analytics insights where we are able to make data-driven decisions easily. We can build and test models with best-in-class AI and analytics. The support team is generally the most proactive and supportive 24/7.
What is most valuable?
A quick specific example of how I use IBM Cloud Pak for Data in my day-to-day work is that, unlike other analytics tools, it provides out-of-the-box privacy, model interpretability, and fairness monitoring, along with automatic explanation of data and models written in business language. It is a great tool that all businesses should emulate while it provides a great user experience because every feature is functional and improved constantly.
IBM Cloud Pak for Data is driving our business productivity by reducing time spent reading and analyzing data. We always use this tool in all departments that need to gather relevant information in the cloud from a single centralized point for better reporting of data and data-driven decisions.
IBM Cloud Pak for Data has positively impacted my organization by rapidly providing business-ready data to all users equally, hence being able to make data-driven decisions easily. We can manage and analyze data no matter where it is. The manual catalog has been eliminated to save cost. It drives responsible, transparent, and explainable AI workflows to operationalize AI and mitigate risk and regulatory compliance.
A specific example that shows how IBM Cloud Pak for Data has helped my organization is that it has improved our decision-making, saved us money, and saves a lot of time by predicting outcomes faster using a platform built with data fabric architecture.
The best features IBM Cloud Pak for Data offers include robust data visualization.
The data visualization feature stands out for me because it increases our impact by combining BI skills with advanced analytics and machine learning in an easy-to-use visual interface.
What needs improvement?
IBM Cloud Pak for Data can be improved because processing speeds are sometimes slow.
The slowness in IBM Cloud Pak for Data is mainly during data processing, and pricing is something that affects most small businesses, so it should be affordable. It takes some time, something that affects most small businesses and enterprises.
Occasionally, IBM Cloud Pak for Data is very slow.
For how long have I used the solution?
I have been using IBM Cloud Pak for Data for the past three years.
What do I think about the stability of the solution?
IBM Cloud Pak for Data is very stable.
What do I think about the scalability of the solution?
IBM Cloud Pak for Data's scalability is very good; it can be used by any size of organization.
How are customer service and support?
The customer support for IBM Cloud Pak for Data is great and responsive.
I would rate the customer support for IBM Cloud Pak for Data a nine out of ten.
Which solution did I use previously and why did I switch?
We switched from Azure Databricks to IBM Cloud Pak for Data because IBM is very easy to use. It has a flexible way to deploy and enables us to collect, connect, catalog, transform, and analyze data regardless of the area.
What was our ROI?
I have seen a return on investment because the manual catalog is eliminated, hence saving a lot of cost. It is easy to collect, organize, and analyze data no matter where it is, hence being able to make data-driven decisions.
What's my experience with pricing, setup cost, and licensing?
My experience with pricing, setup cost, and licensing has been around on-premises.
Which other solutions did I evaluate?
Before choosing IBM Cloud Pak for Data, I evaluated other options such as Cloudera Data Platform .
What other advice do I have?
I advise others looking into using IBM Cloud Pak for Data to know that, unlike other analytics tools, it provides out-of-the-box privacy, model interpretability, and fairness monitoring, along with automatic explanation of data and models written in business language and a great user experience because every feature is functional and improved consistently. I would rate this product a nine out of ten overall.