Listing Thumbnail

    IBM Cloud Pak for Data

     Info
    Deployed on AWS
    IBM Cloud Pak® for Data is a unified data and AI platform that runs on any cloud. Utilize a data fabric to automatically break down data siloes, improve data quality and enhance data privacy and security. Build and infuse trustworthy AI across your business to drive digital transformation.
    4.2

    Overview

    For more information or customized pricing, please email us: cpd_on_aws@wwpdl.vnet.ibm.com 

    IBM Cloud Pak for Data is a unified data and AI platform that connects the right data, at the right time, to the right people anywhere. Available on AWS and running on Red Hat OpenShift, the platform simplifies data access, automates data discovery and curation, and safeguards sensitive information by automating policy enforcement for all users in your organization. Make better data driven decisions and lay the foundation for AI with a data fabric that connects siloed data on premises or across multiple clouds without data movement. Discover actionable insights and apply trusted data to build, run, automate and manage AI models.

    Outcomes:

    • Data access and availability: Eliminate data silos and simplify your data landscape to enable faster, cost-effective extraction of value from your data.
    • Data quality and governance: Apply governance solutions and methodologies to deliver trusted, business data.
    • Data privacy and security: Fully understand and manage sensitive data with a pervasive privacy framework.
    • Batch data integration: Design, develop and run jobs that move and transform data with powerful automated integration capabilities.
    • 360 entity data: Enable agility and accelerated ROI for consolidated and governed views of critical enterprise data.

    Product Version 4.7.x

    Standard Min: 48 VPCs Enterprise Min: 72 VPCs

    Already have a CP4D License? Deploy from the BYOL Listing today!

    Highlights

    • Deliver data responsibly with a data fabric. Unify and access disparate data with AutoSQL, a universal query engine. Discover and classify data in real time with Watson Knowledge Catalog. Protect sensitive data with automated policy enforcement.
    • Scale trustworthy AI: Synchronize application and model pipelines while reducing drift, bias, and risk with ModelOps on Watson Studio. Monitor and govern AI models to meet regulations, manage risk and enhance transparency.
    • Recognized by analysts as a Leader in core data and AI segments: The Forrester Wave™: Machine Learning Data Catalogs, Q4 2020; 2021 Gartner Magic Quadrant for Data Science and Machine Learning; The Forrester Wave™: Multi modal Predictive Analytics and Machine Learning, Q3 2020.

    Details

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    IBM Cloud Pak for Data

     Info
    Pricing is based on the duration and terms of your contract with the vendor. This entitles you to a specified quantity of use for the contract duration. If you choose not to renew or replace your contract before it ends, access to these entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    1-month contract (2)

     Info
    Dimension
    Description
    Cost/month
    Standard Option
    Cloud Pak for Data Standard Option: 48 VPCs
    $19,824.00
    Enterprise Option
    Cloud Pak for Data Enterprise Option: 72 VPCs
    $59,400.00

    Vendor refund policy

    Please contact your rep for any questions.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    50
    In Data Preparation
    Top
    10
    In Data Catalogs, Data Governance, Master Data Management
    Top
    10
    In Data Catalogs, Data Governance

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Universal Query Engine
    AutoSQL provides a universal query engine for unified data access across disparate data sources.
    Data Discovery and Classification
    Watson Knowledge Catalog enables real-time discovery and classification of data with automated cataloging capabilities.
    Automated Policy Enforcement
    Pervasive privacy framework with automated policy enforcement for sensitive data protection across all users in the organization.
    Model Operations and Governance
    ModelOps on Watson Studio synchronizes application and model pipelines while monitoring and governing AI models to manage risk, reduce drift and bias, and enhance transparency.
    Data Fabric Architecture
    Data fabric technology connects siloed data on premises or across multiple clouds without requiring data movement, enabling consolidated and governed views of enterprise data.
    Metadata Centralization
    Centralizes metadata from disparate sources into a unified platform for discovering, describing, governing, and managing data assets including data, BI reports, and AI models.
    Behavioral Analysis Engine
    Incorporates a Behavioral Analysis Engine to provide advanced analytics and insights across data assets.
    Data Lineage and Tracking
    Enables documentation of insights and tracking of data lineage across teams for transparency and compliance purposes.
    Self-Service Analytics
    Supports self-service analytics capabilities allowing users to independently discover and analyze data assets.
    AI Governance Framework
    Provides an AI governance framework that ensures data quality, transparency, and compliance for AI initiatives.
    AI Governance Framework
    Active metadata-based governance with rules, processes and responsibilities to ensure ethical AI practices, mitigate risk, adhere to legal requirements, and protect privacy
    Automated Data Lineage
    End-to-end lineage tracking providing transparency into data transformation and flow across systems, including both summary-level business lineage and detailed technical lineage
    Unified Data Catalog
    Multi-cloud and hybrid environment data discovery with business context including data origin, ownership, usage patterns, and access to reports, AI models and data products
    Data Quality Automation
    Automated monitoring and rule management system for enterprise-wide data quality management replacing manual processes
    Privacy and Compliance Workflow
    Centralized automation of privacy workflows to operationalize privacy requirements and address global regulatory compliance

    Contract

     Info
    Standard contract
    No
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    4.2
    96 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    52%
    44%
    4%
    0%
    0%
    1 AWS reviews
    |
    95 external reviews
    External reviews are from G2  and PeerSpot .
    Bálint Tóth

    Data integration has accelerated financial workflows and now supports reliable AI-driven projects

    Reviewed on Mar 02, 2026
    Review provided by PeerSpot

    What is our primary use case?

    I usually recommend IBM Cloud Pak for Data  for companies in the financial sector, as we are mostly working with local insurance companies and banks within Hungary where we are located.

    For IBM Cloud Pak for Data  setup and configuration, I think it is outstanding. The documentation is comprehensive, and we did not have any issues with that.

    What is most valuable?

    The features I find most valuable in IBM Cloud Pak for Data are the integration feature, specifically Message Queue and App Connect Enterprise.

    I assess the impact of automated AI lifecycle management on project development times as positive since it accelerates our processes.

    What needs improvement?

    I think we are happy with IBM Cloud Pak for Data, and there is no specific idea that comes to my mind regarding room for improvement. We are following the progress and the new features, so overall we are quite content and satisfied with it.

    I don't have any specific idea regarding additional features they could incorporate in the future to make it even better.

    For how long have I used the solution?

    I have been dealing with IBM Cloud Pak for Data for more than ten years now since the company is working with the IBM Integration portfolio. IBM Cloud Pak for Data itself is younger, but we started to work with it from the very beginning. I have been working with it for at least five years.

    What do I think about the stability of the solution?

    I did not have any problems while integrating it with any particular solutions that I can recall.

    How are customer service and support?

    I would rate the technical support by IBM as adequate. We have submitted some trouble tickets, and there was always an answer provided, so overall it is satisfactory.

    They do not provide local support in a local language, as it is provided in English, but that is acceptable to us. I think that the local language market is not substantial enough, as there are not enough customers in Hungary to justify localization, but it is not an issue. Usually, our enterprise customers are comfortable with English.

    How would you rate customer service and support?

    Positive

    What's my experience with pricing, setup cost, and licensing?

    Regarding the price, I know IBM is traditionally relatively expensive in the Hungarian market, but we work together with the local IBM sales team, and on a project basis they manage to negotiate the prices. We rarely can sell it at the list price of course. Overall, the challenge is to let the customer see the value, so I do not have too many price concerns. The list price is high, but the flexibility in pricing is adequate.

    I think the licensing model is acceptable and there is no need for change. Custom project-based pricing is usually possible with some customer discounts if the project is substantial enough, so overall we could sell many IBM licenses.

    Which other solutions did I evaluate?

    We usually go with IBM Cloud Pak for Data first when recommending products for smaller businesses, but in other cases, the customer may have an existing install base from some competitor, and that affects the recommendation.

    What other advice do I have?

    I do not have a specific opinion about its influence on decision-making accuracy.

    In terms of data virtualization features, we do not use that, but we use some virtualization features.

    At the moment we do not use Watson Knowledge Catalog, so it has not helped improve data governance for us.

    Regarding Data Stage for ETL processes, we do not use that.

    I think AI capabilities are coming regardless, and the product is progressing. IBM can be slightly slow with introducing new features, but I do not feel it is lacking in this respect. The new agents and assistants within the product are beneficial.

    For us, IBM Cloud Pak for Data is the best option on the market at the moment. In its own category, I think it is the best, and we are satisfied with it.

    In the financial segment where we are working, I think IBM Cloud Pak for Data is the market leader in our local territory, but there could be more marketing and promotion.

    I do not have significant insight into other industries because our company really focuses on the financial sector. As far as I know, IBM is also strong in manufacturing, but SAP itself is very strong in Hungary in manufacturing, providing end-to-end solutions which means there is less room for platforms like IBM. In financial institutions, SAP is not strong at all, so I think IBM is the strongest in this respect for these platforms.

    I would recommend IBM Cloud Pak for Data for different types of companies because the solution itself is not industry-specific. I mention finance only because my company focuses on that type of customer. Different IBM partners focus on different customers. There is a need for a minimum customer size, but I would not recommend IBM Cloud Pak for Data for smaller companies, as they might not need the higher reliability that IBM provides. Conversely, they might want a simpler, cheaper solution because their needs are not as comprehensive. For really large to medium-sized enterprises with very mission-critical applications and systems, that is what I would recommend.

    I would give this product a rating of nine out of ten.

    Eunice Romper

    Data fabric has streamlined predictive analytics and has transformed how we manage hybrid data

    Reviewed on Feb 26, 2026
    Review provided by PeerSpot

    What is our primary use case?

    My main use case for IBM Cloud Pak for Data  includes storing utility data to build a smart utility solution for the prediction of future trends. All data is stored, and with the help of AI and machine learning algorithms, analytics dashboards are built on the same platform. It helps manage and store high volumes of both structured and unstructured data and deliver desired results in optimum time.

    A specific example of how I used IBM Cloud Pak for Data  in one of my projects is that it serves as a very fully scalable platform for data and analytics. We use it to provide data solutions for our customers and provide various industry solutions for clients that need a cloud data platform that can be used for data analytics, data science, and data visualization.

    What is most valuable?

    The best features that IBM Cloud Pak for Data offers include data analytics, data science, data management, data catalog, inbuilt AI and machine learning capability, and integration with other applications, along with very safe cloud storage.

    I find all of these features to be the most valuable for my daily work because they increase our impact by combining AI skills with advanced analytics and machine learning in an easy-to-use visual interface. Visualization and reporting are great in IBM Cloud Pak for Data, allowing me to manage data spread across distributed stores and clouds.

    IBM Cloud Pak for Data has positively impacted my organization as it saves a lot of time by predicting outcomes faster using a platform built with data fabric architecture. It is easy to collect, organize, and analyze data no matter where it is located. Manual cataloging is eliminated, thereby saving a lot of cost, approximately 50 to 60 percent. Additionally, it significantly reduces data footprint, and AI and ML analysis for predictive analytics are excellent.

    What needs improvement?

    To improve IBM Cloud Pak for Data, I suggest more out-of-the-box integration, enhancement of analytics to be sharper, and the deployment options should be very flexible.

    For how long have I used the solution?

    I have been using IBM Cloud Pak for Data for the past eight years, even in my previous organization.

    What other advice do I have?

    I would add that our clients who want to address requirements alongside data catalog, data governance, and data visualization will benefit from investing in IBM Cloud Pak for Data.

    I rate IBM Cloud Pak for Data a nine out of ten. I choose nine out of ten because this tool enables us to connect multiple data sources, ingest data, and run AI and ML algorithms. IBM Cloud Pak for Data is a very good solution for that work.

    My advice to others looking into using IBM Cloud Pak for Data is that it has played a huge role in automating our end-to-end life cycles in the network dimension, giving us an extensive view of data across diverse data sources for required features at any moment without needing to migrate to a central repository. The platform has made our data management across multiple clouds much easier, thereby saving a lot of time. I find it to be a very commendable tool for that work.

    IBM Cloud Pak for Data has enabled us to access diffuse data quickly across hybrid networks and given our teams an edge in data management through automation while adhering to compliance regulations. Additionally, it is a speedy solution, increasing our client satisfaction. I would like to note that my overall rating for IBM Cloud Pak for Data is nine out of ten.

    Nikolas Vulai

    Data workflows have become more transparent and now support faster, trusted decisions

    Reviewed on Feb 25, 2026
    Review provided by PeerSpot

    What is our primary use case?

    IBM Cloud Pak for Data  is a powerful cloud-native all-in-one easy-to-use solution that enables us to put data to work quickly and effectively. These tools enable us to approach analytics our way with code, low-code, and no-code options that allow us to collaborate on one platform. It is easy to transform structured and unstructured data into analytics insights where we are able to make data-driven decisions easily. We can build and test models with best-in-class AI and analytics. The support team is generally the most proactive and supportive 24/7.

    What is most valuable?

    A quick specific example of how I use IBM Cloud Pak for Data  in my day-to-day work is that, unlike other analytics tools, it provides out-of-the-box privacy, model interpretability, and fairness monitoring, along with automatic explanation of data and models written in business language. It is a great tool that all businesses should emulate while it provides a great user experience because every feature is functional and improved constantly.

    IBM Cloud Pak for Data is driving our business productivity by reducing time spent reading and analyzing data. We always use this tool in all departments that need to gather relevant information in the cloud from a single centralized point for better reporting of data and data-driven decisions.

    IBM Cloud Pak for Data has positively impacted my organization by rapidly providing business-ready data to all users equally, hence being able to make data-driven decisions easily. We can manage and analyze data no matter where it is. The manual catalog has been eliminated to save cost. It drives responsible, transparent, and explainable AI workflows to operationalize AI and mitigate risk and regulatory compliance.

    A specific example that shows how IBM Cloud Pak for Data has helped my organization is that it has improved our decision-making, saved us money, and saves a lot of time by predicting outcomes faster using a platform built with data fabric architecture.

    The best features IBM Cloud Pak for Data offers include robust data visualization.

    The data visualization feature stands out for me because it increases our impact by combining BI skills with advanced analytics and machine learning in an easy-to-use visual interface.

    What needs improvement?

    IBM Cloud Pak for Data can be improved because processing speeds are sometimes slow.

    The slowness in IBM Cloud Pak for Data is mainly during data processing, and pricing is something that affects most small businesses, so it should be affordable. It takes some time, something that affects most small businesses and enterprises.

    Occasionally, IBM Cloud Pak for Data is very slow.

    For how long have I used the solution?

    I have been using IBM Cloud Pak for Data for the past three years.

    What do I think about the stability of the solution?

    IBM Cloud Pak for Data is very stable.

    What do I think about the scalability of the solution?

    IBM Cloud Pak for Data's scalability is very good; it can be used by any size of organization.

    How are customer service and support?

    The customer support for IBM Cloud Pak for Data is great and responsive.

    I would rate the customer support for IBM Cloud Pak for Data a nine out of ten.

    How would you rate customer service and support?

    Positive

    Which solution did I use previously and why did I switch?

    We switched from Azure Databricks  to IBM Cloud Pak for Data because IBM is very easy to use. It has a flexible way to deploy and enables us to collect, connect, catalog, transform, and analyze data regardless of the area.

    What was our ROI?

    I have seen a return on investment because the manual catalog is eliminated, hence saving a lot of cost. It is easy to collect, organize, and analyze data no matter where it is, hence being able to make data-driven decisions.

    What's my experience with pricing, setup cost, and licensing?

    My experience with pricing, setup cost, and licensing has been around on-premises.

    Which other solutions did I evaluate?

    Before choosing IBM Cloud Pak for Data, I evaluated other options such as Cloudera Data Platform .

    What other advice do I have?

    I advise others looking into using IBM Cloud Pak for Data to know that, unlike other analytics tools, it provides out-of-the-box privacy, model interpretability, and fairness monitoring, along with automatic explanation of data and models written in business language and a great user experience because every feature is functional and improved consistently. I would rate this product a nine out of ten overall.

    ArchanaSingh

    Collaborative data platform has transformed analytics and now drives faster decisions

    Reviewed on Jan 26, 2026
    Review from a verified AWS customer

    What is our primary use case?

    IBM Cloud Pak for Data  is a powerful cloud-native, all-in-one, easy-to-use solution that enables us to put data to work quickly and effectively. This tool enables us to approach analytics our way with code, low-code, and no-code options that allow us to collaborate on one platform easily.

    It is easy to transform structured and unstructured data into analytics insights, where we use those insights to make data-driven decisions easily. We build and test models with best-in-class AI analytics.

    As a team, we connect IBM Cloud Pak for Data  with our product for integration with cloud, and it enables all of our data users to collaborate from a single, unified interface that supports many services that are designed to work together. This helps to give our users recommendations as they want to store the data in the cloud.

    Regarding my main use case, it is a very great tool that enables our users to collaborate by using single data that is stored in a centralized, unified place where they can access it at any time. It also helps to drive business productivity by reducing the time spent reading and analyzing data. We use this tool in all departments that need to gather relevant information in the cloud from a single centralized platform for better reporting of data. It is possible to analyze data from many sources in a short period of time.

    What is most valuable?

    The best features IBM Cloud Pak for Data offers include robust data visualization, centralized data analytics, data reliability, and compatibility with hybrid and multi-cloud environments.

    The compatibility with hybrid and multi-cloud environments has helped our organization as data visualization is very simple. Migrations, reading, analysis, and data management from other sources are performed without problems of requirements. We have a team of experts in IBM Cloud Pak for Data to maintain security and correct data management easily.

    I find this cloud excellent for visualizing and managing data across networks and also fulfilling fastest data storage, making it less complex and completely improving productivity in my organization. Everything is managed in multiple environments without any problem.

    IBM Cloud Pak for Data has positively impacted my organization, and I have noticed some improvement since we started using this tool. Configuration with hybrid and multi-cloud environments has been very seamless and easy. It is a robust platform capable of working with multiple data sources where we gain insights to make data-driven decisions easily. It automates data analysis for quick and better performance. We have seen improvements in analysis and data correction from multiple sources. Our productivity in the company is growing, thanks to the data analysis team. We have also seen a robust hybrid and multi-cloud access system working seamlessly.

    I can share specific outcomes that show how productivity has grown and how performance has improved since the data is automated, and the analysis is done much faster, saving us a lot of time. We have been able to save approximately 80 percent of our time. We are not doing data analysis manually, so this relieves our data department of dealing with data. We have been relieved of a lot of duties, and now we are able to focus on other strategic tasks. Our productivity has greatly increased since we are able to make concrete and data-driven decisions easily.

    What needs improvement?

    Setting up the hybrid and multi-cloud environments is a long job and it takes time.

    Additionally, the customer support should be more responsive and reach and respond on time.

    The two main challenges that I face are setup complexity and customer support responsiveness. Customer support needs some improvement, as they are not always unresponsive, but sometimes they are not quick to respond to our queries. They should improve on that.

    For how long have I used the solution?

    I have been using IBM Cloud Pak for Data for the past five years.

    What do I think about the stability of the solution?

    IBM Cloud Pak for Data is stable.

    What do I think about the scalability of the solution?

    IBM Cloud Pak for Data's scalability has been great since we started using this platform. I have not noticed any downtime or lagging, especially when dealing with large data, so it is relatively very scalable.

    How are customer service and support?

    Customer support should be more responsive and reach and respond on time.

    Customer support needs some improvement, as they are not always unresponsive, but sometimes they are not quick to respond to our queries. They should improve on that.

    How would you rate customer service and support?

    Positive

    Which solution did I use previously and why did I switch?

    We switched from using Azure Databricks , and the reason why we switched is that IBM Cloud Pak for Data has been very helpful and innovative because it increased our workflow and collaboration using an integrated multi-cloud platform. It also enables us to deploy in any flexible way, on-premises or cloud, which saves time and hard disk space.

    How was the initial setup?

    Setting up the hybrid and multi-cloud environments is a long job and it takes time.

    The two main challenges that I face are setup complexity and customer support responsiveness.

    Regarding my experience with pricing, setup cost, and licensing, for a small organization, the price might be relatively high, but for huge enterprises such as ours, the price is relatively affordable. The setup is easy, with no complexity, and no time wasting.

    What was our ROI?

    I have seen a return on investment, and I can share that we save a lot of time by predicting outcomes faster using the platform built with data fabric architecture. We save at least 70 percent of our time. It has been easy to collect, organize, and analyze data no matter where it is from multiple sources. The manual catalog is eliminated to save costs by 50 to 60 percent. We have been able to drive responsible, transparent, and explainable AI workflow to operationalize AI and mitigate risk and regulatory compliance easily.

    What's my experience with pricing, setup cost, and licensing?

    Regarding my experience with pricing, setup cost, and licensing, for a small organization, the price might be relatively high, but for huge enterprises such as ours, the price is relatively affordable.

    Which other solutions did I evaluate?

    Before choosing IBM Cloud Pak for Data, I evaluated other options, including Cloudera Data Platform .

    What other advice do I have?

    I would advise others looking into using IBM Cloud Pak for Data that it is a very great tool that is an all-in-one, real-time data analytics solution that provides a phenomenal user experience. It increases data transparency, saves a lot of time, and it saves cost as well. It is a great tool that transforms and analyzes data regardless of the area, making it a sure-bet tool. I would rate this product a nine out of ten.

    reviewer2648136

    Longstanding reporting platform has supported reliable dashboards and regulatory compliance

    Reviewed on Dec 19, 2025
    Review provided by PeerSpot

    What is our primary use case?

    The main use case for IBM Cognos  is for business intelligence and reporting.

    What is most valuable?

    IBM Cognos  has been available for many years, and we use regular dashboarding and for producing scheduled reports and some mandatory regulatory reporting. All our departments use Cognos, and the actual Cognos reports are developed by those different teams.

    IBM Cognos is very stable and has been around for many years, with many users familiar with it, making it a reliable solution for our institution. Because of our long association with Cognos, we have good pricing.

    The benefits of choosing IBM Cognos, in addition to saving on cost, include having institutional knowledge about maintaining this infrastructure and enough people who have developed on Cognos in the past, which creates comfort in its use. Cognos is a reliable solution, and developer productivity is high because of the long history of development on it.

    What needs improvement?

    I do not know if Cognos has all the features that users are looking for since we provide it as our standard and do not maintain infrastructure for other tools.

    For how long have I used the solution?

    I am actually very new to the organization and have been here for less than a year.

    How are customer service and support?

    I would rate IBM's support at about a seven or eight out of ten because we have good support coverage owing to our long association with IBM. We are good on the support front. IBM support is very supportive, and I would rate them an eight out of ten based on our long relationship with them.

    How would you rate customer service and support?

    Positive

    How was the initial setup?

    DataStage is not difficult to set up, but we had a lot of challenges in setting up IBM Cloud Pak for Data  cluster on-premises. Our infrastructure team faced many challenges when they were doing it because we had to first stand up an OpenShift cluster on-premises before deploying IBM Cloud Pak for Data  solution.

    The setup for IBM Cloud Pak for Data is very complex, and our teams responsible for standing up the environment struggled a lot. This might also be due to the learning curve since we had not used containerized solutions in the past.

    What's my experience with pricing, setup cost, and licensing?

    The pricing and setup cost are handled by a different procurement team. Our IT procurement team is centralized, so licensing and the actual cost of the software are taken care of by a different team altogether.

    Which other solutions did I evaluate?

    I am not sure about the main differences between IBM Cognos and some other business intelligence tools such as Tableau or Microsoft because many members of the user community have previously experienced those reporting tools before joining our college. However, due to the variety of cloud offerings, users are often able to subscribe directly without having to approach IT for reporting tools, given they have the budget.

    What other advice do I have?

    I do not utilize Dell PowerStore  or Dremio  because I work for a university setting with a very simple infrastructure, where we just use Cognos and IBM DataStage.

    I do not know if my organization uses AWS  as a main cloud provider. We are not on the cloud in a major way and are still on-premises for most of our solutions. In fact, even IBM DataStage, we are using IBM Cloud Pak for Data version, but it is installed on-premises, and we haven't progressed much on how to migrate to the cloud yet.

    I am not sure if we use AWS  as a cloud provider since we do have some SaaS applications that we subscribe to, but I do not know where they are hosted. I just know we have access to the application for the user interface, and the data is pulled out using an API, but we do not know where it is hosted.

    I do not utilize Cognos ad hoc reporting because I do not develop reports. We only host the Cognos infrastructure for our different user groups, and the report development is completed by them. Our infrastructure team provides the hardware, and our system engineering team provides the installation and application maintenance for Cognos.

    I think some users are using the interactive dashboards feature, and there are also other tools such as Power BI and Tableau that some users automatically use. However, our IT organization only provides Cognos as an enterprise business intelligence and reporting tool. Other tools are subscribed to separately by different people.

    I am not the right person to speak on the machine learning capabilities, as my responsibility is to work with different IT teams who maintain systems across the university. I connect to them using IBM DataStage to fetch their data, perform ETL activities, and load the data into an Oracle database. My team maintains the infrastructure for DataStage and Cognos, but actual development is done by other people.

    I use IBM DataStage, which we call IBM Cloud Pak for Data, as we migrated from InfoSphere DataStage to IBM Cloud Pak for Data, and it is installed on-premises in our data center. IBM Cloud Pak for Data version is more or less a modern OpenShift cluster-based platform.

    The best features of IBM Cloud Pak for Data include a very modern approach to providing data capabilities under one umbrella, with various services such as artificial intelligence and machine learning capabilities, real-time integration, and data virtualization, though each has separate licenses associated with them. We are currently only using the DataStage license.

    We have not evaluated data virtualization, but I recognize it as a good capability for exploring and experimenting with data, especially for those unfamiliar with data modeling. However, we are not using it due to cost considerations.

    The developer productivity for DataStage on IBM Cloud Pak for Data is the same as on the old tool, InfoSphere. It does not change anything because the core capabilities remain consistent.

    Overall, I would rate Cognos a nine out of ten from a pure infrastructure stability and support perspective because we are comfortable and know what to do, considering the long-term use of Cognos.

    Overall, I would rate IBM Cloud Pak for Data a nine out of ten in terms of capabilities. It mirrors the traditional InfoSphere version of DataStage with a good ETL tool that covers all features expected from such tools.

    We did not purchase through a marketplace such as AWS. This is all from a long association with IBM directly through negotiations with our procurement team, as we have been a large IBM customer for many years. I would rate this review a nine out of ten overall.

    View all reviews