Listing Thumbnail

    DataHub

     Info
    Sold by: Datahub 
    Deployed on AWS
    DataHub vision is to bring clarity to your data through its next-generation multi-cloud metadata management platform. The technology is based on LinkedIn DataHub and Apache Gobblin - two successful open-source projects incubated at LinkedIn and battle-hardened in production at scale at major enterprises.
    4.2

    Overview

    DataHub is an AI & Data Context Platform adopted by over 3,000 enterprises including Apple, CVS Health, Netflix, and Visa. Innovated jointly with a thriving open-source community of 13,000+ members, DataHub's metadata graph provides in-depth context of AI and data assets with best-in-class scalability and extensibility. The company's enterprise SaaS offering, DataHub Cloud, delivers a fully-managed solution with AI-powered discovery, observability, and governance capabilities. Organizations rely on DataHub solutions to accelerate time-to-value from their data investments, ensure AI system reliability, and implement unified governance - enabling AI & data to work together and bring order to data chaos.

    For Data Analysts, developers, data scientists, and automated workflows:
    Easily find trusted datasets with the most current data

    • Access data where you work with a chrome extension for BI tools
    • Discover data your way - personalization for multiple business and technical user profiles
    • Support AI models and automations with a metadata graph that keeps up with today's data volume and velocity
    • Understand data provenance with table, column, and job level lineage graphs
    • Auto-enrich metadata with no-code automation
    • Use AI-generated documentation and propagation to better understand context
    • Always stay up-to-date with subscriptions to assets, activity and notifications

    For Data Engineers:
    Deliver reliable data quality

    • Provide end-to-end observability with user-created data quality checks and reports
    • Surface data quality results and impact analysis across all points in lineage
    • Monitor freshness SLAs, data volume, table schemas, column quality, and custom SQL
    • Use AI Anomaly Detection for freshness, volume, and column stats
    • Easily keep an eye on data quality with assertions and AI-based smart assertions
    • Evaluate data contracts and quality checks on-demand with API
    • Get notified where you work (slack, email, and more)
    • Easily manage data quality with a data health dashboard

    For Data Governance:
    Ensure continuous AI & data governance in production versus episodic compliance checks

    • Ensure every AI & data asset is accounted for by defining and enforcing documentation standards
    • Integrate governance practices early with automated shift-left governance
    • Automatically classify your data as it moves and transforms with lineage-driven compliance
    • Keep tags harmonized with seamless metadata flow between DataHub and source systems
    • Deliver continuous compliance monitoring with forms, impact analysis, and reporting
    • Create and implement bespoke compliance approval workflows

    Highlights

    • Search All Corners of Your Data Stack- DataHub's unified search experience surfaces results across databases, data lakes, BI platforms, ML feature stores, orchestration tools, and more.
    • Trace End-to-End Lineage- Quickly understand the end-to-end journey of data by tracing lineage across platforms, datasets, ETL/ELT pipelines, charts, dashboards, and beyond.
    • View Metadata 360 at a Glance- Combine technical, operational and business metadata to provide a 360 degree view of your data entities.Generate Dataset Stats to understand the shape & distribution of the data.

    Details

    Sold by

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Pricing is based on the duration and terms of your contract with the vendor. This entitles you to a specified quantity of use for the contract duration. If you choose not to renew or replace your contract before it ends, access to these entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    12-month contract (1)

     Info
    Dimension
    Description
    Cost/12 months
    Discover & Govern
    Up to 20 Monthly Active Users
    $75,000.00

    Vendor refund policy

    All fees are non-cancellable and non-refundable except as required by law.

    How can we make this page better?

    Tell us how we can improve this page, or report an issue with this product.
    Tell us how we can improve this page, or report an issue with this product.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Resources

    Support

    Vendor support

    Email support is offered Monday - Friday during regular business hours.
    marketplace@datahub.com 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Data Catalogs
    Top
    10
    In Data Catalogs, Data Governance, Master Data Management
    Top
    10
    In Data Catalogs, Data Governance

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Unified Search Across Data Stack
    Search functionality that surfaces results across databases, data lakes, BI platforms, ML feature stores, and orchestration tools within a multi-cloud environment.
    End-to-End Lineage Tracing
    Lineage tracking capability that traces data journey across platforms, datasets, ETL/ELT pipelines, charts, and dashboards at table, column, and job levels.
    AI-Powered Metadata Management
    Metadata graph with AI-generated documentation, AI anomaly detection for freshness and volume metrics, and smart assertions for data quality monitoring.
    Data Quality Monitoring and Observability
    End-to-end observability with user-created data quality checks, freshness SLA monitoring, schema tracking, column quality assessment, and custom SQL evaluation through API.
    Automated Governance and Compliance
    Lineage-driven compliance classification, automated shift-left governance integration, continuous compliance monitoring with forms and impact analysis, and metadata harmonization across source systems.
    Metadata Centralization
    Centralizes metadata from disparate sources into a unified platform for discovering, describing, governing, and managing data assets including data, BI reports, and AI models.
    Behavioral Analysis Engine
    Incorporates a Behavioral Analysis Engine to provide advanced analytics and insights across data assets.
    Data Lineage and Tracking
    Enables documentation of insights and tracking of data lineage across teams for transparency and compliance purposes.
    Self-Service Analytics
    Supports self-service analytics capabilities allowing users to independently discover and analyze data assets.
    AI Governance Framework
    Provides an AI governance framework that ensures data quality, transparency, and compliance for AI initiatives.
    AI Governance Framework
    Active metadata-based governance with rules, processes and responsibilities to ensure ethical AI practices, mitigate risk, adhere to legal requirements, and protect privacy
    Automated Data Lineage
    End-to-end lineage tracking providing transparency into data transformation and flow across systems, including both summary-level business lineage and detailed technical lineage
    Unified Data Catalog
    Multi-cloud and hybrid environment data discovery with business context including data origin, ownership, usage patterns, and access to reports, AI models and data products
    Data Quality Automation
    Automated monitoring and rule management system for enterprise-wide data quality management replacing manual processes
    Privacy and Compliance Workflow
    Centralized automation of privacy workflows to operationalize privacy requirements and address global regulatory compliance

    Contract

     Info
    Standard contract
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    4.2
    14 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    43%
    57%
    0%
    0%
    0%
    10 AWS reviews
    |
    4 external reviews
    External reviews are from PeerSpot .
    RohitJoshi1

    Metadata lineage tracking has improved governance and currently supports clear data observability

    Reviewed on Jun 22, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for Data Hub is data lineage tracking. With Data Hub, we track multiple sources, ingestion sources, and different sources where the data resides in S3 . We bring all that metadata into Data Hub to track lineage on the data ingestion patterns that we perform or transformations that we do, and how they move from different tables or assets or the data pipelines. Whatever transformations we do with Spark and S3 , Snowflake , all those are being tracked via Data Hub. We have S3 buckets and Snowflake  tables, and all those lineage tracking is managed through the platform.

    My main use case is mostly covered as we used Data Hub for metadata tracking and lineage for whatever transformations that we do so that we can track each transformation down the line.

    What is most valuable?

    In my experience, the best features Data Hub offers include lineage tracking, which is mostly on the asset level, a good glossary, and good connector support.

    Regarding asset level and the good glossary, we need the glossary of our products so that it is easy to track which product, what went at what time on that particular product, how many assets are related, and so on. For asset integrations, Data Hub makes it easy to ingest all that metadata of those particular assets from S3 via connectors, which is quite easy. It has good connector support, although limited in some cases.

    Overall, Data Hub is a good tool. If we talk about lineage, metadata, and observability on some high level, including domain descriptions, PII classification, datasets, and keeping datasets in one place along with policies, it is good in that particular sense. We do have a plan based on project-to-project usage, but in some of the projects, we do use Data Hub as well.

    What needs improvement?

    I would like to add that for the connectors, there is sometimes limited support for using wildcards to get the items or assets ingested from sources like S3; it does not support very good wildcard filters. Additionally, Data Hub has a problem with column-level lineage support, especially regarding non-pro users or those without any plans. If I talk about the free features of Data Hub open source, those two I found could be improved during my use case.

    Regarding improvements needed for Data Hub, I have already mentioned the limitations on the usage of wildcards in the ingestion or connectors; that can be worked upon, especially regarding the open-source part of Data Hub. The rest is that I hope the UI is quite good.

    For how long have I used the solution?

    I used Data Hub for one and a half years.

    What other advice do I have?

    My advice for others looking into using Data Hub is that it is a good tool if you want to capture all that metadata, lineage, keep track of governance, security, and observability. It just depends on how you want to use it; you can choose the open-source version or the paid version and subscription-based model. The paid versions have more features, but open-source Data Hub, which most people will try to go for, has some limitations, such as the missing column-level lineage with Spark. You need to consider those points, but overall, it is good. I would rate this product an 8 out of 10.

    Which deployment model are you using for this solution?

    Private Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Gbytyqi Gbytyqi

    Data mesh has connected 2,000 colleagues and has made cross‑team collaboration transparent

    Reviewed on Jun 15, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for Data Hub involves integrating our HR system or Active Directory, which automatically pulls in all 2,000 workers and groups them into their respective project squads and R&D teams. Each team gets its own team profile page in Data Hub, which helps solve the classic corporate headache of determining who to ask for specific information.

    When a team builds a data pipeline, a Kafka topic for telecom signals, or a dashboard, it is tagged explicitly with their team profile as the owner in Data Hub. This means that if a developer in Split , working in the same company, needs to find a specific network dataset, they do not waste days spamming Slack channels; they can simply look it up in Data Hub and find the team profile that owns it along with the direct contact info or Slack channel.

    Additionally, it enables us to run a data mesh model with 2,000 people, allowing one central IT team to manage everything while Data Hub facilitates splitting the company into logical domains such as electronic health, telecom networks, IoT, or smart cities.

    What is most valuable?

    The best features that Data Hub offers include the ability to centralize everything in one platform, such as creating profiles and organizing them into separate domains like engineering, health teams, supporting teams, and HR teams. This allows information to be shared across different domains.

    Utilizing the data mesh model enables the company to maximize functionality using a single solution. Data Hub supports collaboration between different teams and departments significantly, as evidenced when we created various data mesh modules and established different domains such as E-Health, telecom networks, and IoT. This allowed us to share datasets effectively, and with authenticated users, the communication and responses were much quicker.

    Among those features, I find the collaborative aspects the most valuable in my work because it has greatly improved our operations over the past year. We evaluated various licenses and methods to integrate data catalog platforms, ultimately deciding to move forward with Data Hub since it was more compatible with our company's security requirements. Compared to other tools, it received better support from the community, which is updated daily, allowing us to collaborate effectively through contact sharing.

    Data Hub has positively impacted my organization by functioning as an all-in-one solution. It uses data mesh and separates domains to manage privileged access based on user validation, allowing us to share data sets across the company, which informs everyone about internal regulations. Furthermore, it significantly aids new joiners in understanding the operations and knowing who works on specific projects, while also providing updates on changes occurring within various sectors and domains.

    The frequency and quality of updates or new features released for Data Hub have been impressive. This extensive community support was a key factor for us at Ericsson Nikola Tesla to choose Data Hub as our data catalog.

    What needs improvement?

    Regarding how Data Hub can be improved, I believe they should focus on enhancing their marketing efforts. Within our company, we were unaware of the Data Hub platform while searching for data catalog options that offered strong security and collaboration. Better marketing would help other companies learn about this effective solution.

    My rating of eight rather than a nine or ten pertains to the connections with different systems. Specifically, the integration with Slack and Azure , as well as how we link our HR system to Data Hub, could be improved for better compatibility.

    Integrating Data Hub with our existing tools and systems was not very easy, which is why my rating is an eight. We attempted to incorporate our HR system with Data Hub, aiming to set governance status for the 2,000 employees in our organization, but I did not complete this aspect before leaving the organization.

    For how long have I used the solution?

    I have been using Data Hub for at least six months at the company called Ericsson Nikola Tesla in Zagreb, which has a massive operation with an entire ICT and R&D division of around 2,000 workers.

    What do I think about the scalability of the solution?

    In terms of scalability, I believe Data Hub performs exceptionally well as more teams come on board, making it efficient for large organizations with approximately 2,000 employees. It adequately supports the scalability of data sets and the implementation of data mesh models.

    How was the initial setup?

    During implementation, the documentation and support resources from Data Hub were very helpful. I followed the guidelines, accessed each section, and understood the platform effectively, which made the initial setup easy.

    What other advice do I have?

    Data Hub is flexible, optimistic, and user-friendly in terms of its interface and experience. I rate Data Hub an eight on a scale of one to ten.

    The learning curve for new users adopting Data Hub is addressed through their learning section that guides users on how to navigate the platform. I found it quite simple and effective to follow.

    We purchased Data Hub through the AWS Marketplace .

    As for specific outcomes or metrics, I currently do not possess numbers since we are still in the early stages of implementing Data Hub within our company. However, the HR department reported significant time savings in completing tasks before and after adopting Data Hub, which has resulted in faster completion and better collaboration without interrupting others.

    Data Hub has worked for me personally, as I noticed that after we began ingesting Data Hub into our Ericsson Nikola Tesla company network, it proved to be incredibly helpful for easier access to information. By positioning team profiles at the center of Data Hub, it prevents the duplication of data sets, accelerates onboarding for new engineers, and fosters more connected and collaborative teams within our large employee base. Personally, it has helped me specify tasks and has contributed to the company's progress with the data catalog we chose.

    My advice for others considering using Data Hub is to understand how it works and explore its integration potential within their organization. Engaging with community support can also be beneficial, as the team's collaborative approach is impressive.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Alireza Khorami

    Centralizes data lineage and ownership and has improved our organization-wide data governance

    Reviewed on Jun 15, 2026
    Review provided by PeerSpot

    What is our primary use case?

    I use Data Hub for our data lineage, data management, data heritage, and our data dictionary. Our organization is quite large, with about 2,000 people working on different initiatives, and everyone wants to connect to a database somehow. As the data engineering team, we are responsible for connecting every single data source that we have, defining each one, and providing an accurate single source of truth for the data so everyone can have the same understanding of the data they are discussing. Since we are ingesting every database that the company has into our own data infrastructure through different tools, we needed to have a clear understanding of data quality, data lineage, and the data discovery part of the process.

    What is most valuable?

    The Helm chart of Data Hub is designed really well, which makes our deployment strategies lean and operational. The UI is excellent, and I really appreciated the ability to treat data gathering and data ingestion as GitHub  workflows. Data Hub is one of the services that was truly scalable, at least in the open-source version, which is one of the things we valued since you could scale every part of the system it used, including its internal MySQL  or metadata database, Elasticsearch, and the search capabilities. Everything about Data Hub was quite scalable due to its excellent Helm chart, as they really focused on the Kubernetes  aspect.

    What needs improvement?

    We encountered some issues when we wanted to connect our streaming infrastructure to Data Hub, which was somewhat problematic.

    In our data streaming infrastructure, we had a database CDC'd through Kafka Connect to a Kafka topic, and at the end of the pipeline, it would go to either an OLAP or a data lakehouse. However, the problem with visualizing this data lineage was that while the connection between MySQL  and Kafka worked, when we wanted to track data from Kafka to other services, we couldn't track everything back because the IDs were generated randomly and couldn't be connected. We had to fix this manually by stating where the data had gone, which was tedious.

    Data Hub's GMS service, or General Metadata Service, is a good service that I used regularly, but the CLI version had considerable changes across different versions. When I installed a different version, there wasn't enough consistency to ensure that commands I used would work in future versions of Data Hub's GMS CLI, which was frustrating. I also recall that setting up Kafka without Zookeeper was not possible, which was inconvenient, though I should verify this as I don't remember if they fixed it. At least from my recollection, when I wanted to set it up one and a half years ago, they did not have direct support for KRAFT in their Helm chart.

    For how long have I used the solution?

    I have been using Data Hub for approximately one and a half years.

    Which solution did I use previously and why did I switch?

    Before adopting Data Hub, we considered moving forward with OpenMetadata but decided against it since it couldn't support MySQL version 5.

    How was the initial setup?

    The setup of Data Hub was quite straightforward. One aspect of the architecture I appreciated is that Data Hub relies heavily on Cron jobs and jobs in Kubernetes . Whenever it needs to fix something, it initiates a job to repair its MySQL or its Elasticsearch. Operationally, I find it to be an excellent service, as they worked well on that aspect with the open-source version. However, the lack of support for KRAFT out of the box was somewhat problematic.

    Which other solutions did I evaluate?

    I previously evaluated OpenMetadata as a tool we considered before choosing Data Hub. In comparison to OpenMetadata, the lack of support for more databases and data sources was the deciding factor, whereas for Data Hub, we didn't encounter any problems; it worked really well.

    What other advice do I have?

    Data Hub helped us by making it clear who owned which data and who needed to make changes to clean the deprecated data models and infrastructures we had, which was the most significant benefit. Using a tool that Data Hub provided made us visible to the faults and bugs in our different data sources.

    I would recommend that organizations considering Data Hub adopt GitOps practices, as we implemented it where every single ingestion or transformation was triggered by GitLab  CI/CD, making it straightforward for everyone to use. That was the most innovative approach we took by running every single ingestion job as a Cron job in Kubernetes through our GitOps.

    I would rate this product a nine out of ten.

    Henrique dos Anjos

    Data catalog has unified business terms and democratized access to our data lake

    Reviewed on Jun 12, 2026
    Review provided by PeerSpot

    What is our primary use case?

    My main use case for Data Hub is to implement a data catalog for one of the clients that the consultancy I work at is serving.

    A specific example of how the data catalog was used for that client is that it was used to define business terms and to explore the terms from the data glossary by adding definitions. It was also used to capture all the tables and fields that were connected to a data lake, allowing me to explore the entire production data lake and tag the tables and fields, segmenting these tables by domains such as sales tables and marketing tables.

    What is most valuable?

    Data Hub offers several best features including the tagging capability, domain segmentation, data exploration, and creation of a data glossary, which was very interesting to me. Additionally, the ease of plugging in new data sources is exceptional. Data Hub can be easily integrated with a data lake, and the environment can be explored through the metadata via Data Hub. I found the connection part straightforward.

    Data Hub had a positive impact on my organization by disclosing to the organization and to business users what existed in the data lake. The interface that the technical team has with the tables and fields is designed for professionals in the technical area. Having a data catalog helps provide a better interface for data discovery and data democratization within the organization since everyone should have access to what types of data the organization has, and that was the biggest impact.

    What needs improvement?

    I started using the quality part for consistency, but I had limited contact with it and we did not progress much.

    I believe the data quality module can always be improved by examining what is available in the market and making appropriate improvements to the tool. The data quality part is very important and it is not always fully leveraged as it should be. I also think that providing consulting or support with professionals who are qualified to use Data Hub would be interesting, along with providing training and certifications for the tool so that those who are implementing it can specialize increasingly in its features.

    For how long have I used the solution?

    I have been using Data Hub for around one year.

    What do I think about the stability of the solution?

    Data Hub is stable, and I did not have any stability problems when I was working with the tool.

    What do I think about the scalability of the solution?

    Data Hub's scalability is very easy, as we were able to add users and new datasets very quickly and smoothly.

    Which solution did I use previously and why did I switch?

    I was not previously using a different solution. The implementation was already directly part of a data governance initiative and it was done directly with Data Hub, meaning there was no previous solution.

    What about the implementation team?

    I believe the consultancy has some kind of commercial relationship with Data Hub to promote and offer Data Hub as a data catalog solution.

    Which other solutions did I evaluate?

    Before choosing Data Hub, the consultancy worked with some tools such as Google's DataPlex and Purview .

    What other advice do I have?

    My advice for others thinking about using Data Hub is to have the governance initiative well-structured and to have all the documentation for data owners and data stewardship so you know who will be the points of contact when the tool starts being configured, ensuring that you have people responsible for doing reviews and approvals in the tool. I would rate this product an eight out of ten.

    Matheus Costa

    Data governance has unified domains and now supports conversational discovery for all teams

    Reviewed on Jun 07, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My main use case for Data Hub is to build data products inside Natura; primarily, I built data products for CRM , which is customer relationship management, and also for some data products for the product field, such as analytic fields. I used it by dividing the company into domains, and each domain has its own functionality and its own structure. With that approach, I used it extensively for building domains. I also used it to build data lineage across the entire data journey, from the ingestion of the data to the use of the data in the final part, such as in a data product or in a dashboard.

    A specific example of how I used Data Hub for building domains and data lineage is the domain called GenAI, which is primarily built for products based on AI, mainly generative AI. To accomplish this, I used Data Hub to track the data from the ingestion field. I used some CDP tools such as Segment  and I also have data in an S3  bucket that was ingested to Databricks  using Airflow . With that setup, I track this lineage from the origin system. After that, I performed a lot of transformation of the data inside Databricks  to clean the data and conduct some data augmentation. After that, this data is used to train some models using Databricks LLM. With that, I ingest all this metadata into Data Hub and I can see from where the data is coming from and to where the data is going. This is primarily for LLMs to help consultants at the end of the product.

    What is most valuable?

    The best features that Data Hub offers include the capability to make conversational questions inside the platform, which I believe is the best thing that they built in the past year. It is also easy to connect different data sources. Since data lakes, I have connectors to some databases and also to some business analyst tools and other tools. I can connect many types of data inside Data Hub and see what is going on and how we govern the data. Data Hub is a pretty good tool for that. I also value very much the open-source version because it is free and everyone can use it.

    I do not have much experience using the conversational questions feature, but I do not need to go to the asset to see from where the data is coming from and where it is going. I can simply ask, 'How can we calculate the sales in this month?' and Data Hub will identify which table will be used and from where this data is coming from and where this data is going. This is very effective.

    Data Hub has impacted my organization positively by helping us build a data governance environment and share the knowledge about the data for the entire company. As we used the open-source version, we have no limitation in how many people can use the tool, which is excellent. I can conduct many tests and test as quickly as possible. It is very good for building POCs, for example. Data Hub also helps to give this understanding of the data for the entire company. Everyone in the company can see the data and know where the data is coming from and where it is going. I believe it is very effective and all the people in the organization, not only the data field personnel, can understand more about the data and also help to build better products.

    What needs improvement?

    I believe Data Hub could provide more functionalities in the free version. I understand that we have to pay the persons who build the platform, but the free version has some limitations. Some capabilities of the paid version being included in the free version would be beneficial. Another improvement that is needed in Data Hub is how I can get data from Data Hub to build some metrics. I know that I have the API and the GraphQL API, but I believe it could be better. If this is improved, it would be very helpful.

    For how long have I used the solution?

    I used Data Hub for over two years, mostly in the open-source version.

    What other advice do I have?

    I do not have actual metrics to provide currently; I only have some metrics. I certainly improved the data discovery part of building a data product because it is really fast to know if the data product already exists or does not exist. In the past, I had many products that were the same, and with that, I had a lot of work doing it twice or three times in different parts of the process. This is very good. I do not actually know the exact number of time saved, but I certainly saved time. I have a metric that before Data Hub, I believe 20 to 30 persons used and had knowledge about the data. Currently, I have almost 250 persons using Data Hub.

    I did not use many AI features in Data Hub, as I stopped using Data Hub before it started offering these functionalities.

    My advice to others looking into using Data Hub is to start by trying the tool using the free version. If it is sufficient and you already understand how to use it, you can transition to the paid version. However, you can accomplish everything in the free version. I would rate this review an eight out of ten.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)
    View all reviews