
    Cloudera Data Lineage

    Sold by: Cloudera 
    Metadata Management and Automated Data Lineage Platform
    4.2

    Overview

    Data Lineage Made Easy

    Cloudera Data Lineage simplifies data management with an AI-powered metadata platform that provides clear insights into data flow, making it easier to understand, trust, and manage your data.

    With quick setup in under an hour, Cloudera Data Lineage automates metadata harvesting across complex hybrid ecosystems, saving time on tasks like debugging, data analysis, and troubleshooting. It helps reduce full-time equivalent hours spent on ad-hoc analysis, vendor management, and system issues.

    Beyond immediate efficiency gains, Cloudera Data Lineage accelerates strategic initiatives like cloud migrations, data literacy, and risk management. It offers real-time insights into data lineage, helping track changes and flow across systems, from databases and pipelines to BI tools and ML models.

    Key features include:

    • Cross-platform metadata support with more than 50 technologies
    • Upstream and downstream impact analysis to evaluate the ripple effects of data changes
    • Entity-relationship exploration and data transformation lifecycle insights for in-depth understanding
    • End-to-end asset lineage tracking for compliance, optimization, and root cause analysis
    • Automated data discovery and technical documentation to streamline data management and decision-making

    Cloudera Data Lineage supports self-service, data democratization, cloud transformations, and the reduction of technical debt, all while empowering teams with reliable, real-time data insights.

    Highlights

    • Our Automated Metadata Harvesting Agent provides a centralized metadata knowledge hub that empowers effective data governance, ensures regulatory compliance, and enhances data quality and control. It also supports digital transformation efforts and helps organizations prepare for AI readiness.
    • Cloudera Data Lineage brings Generative AI to data management with Octomize, the AI automation agent built for data teams. Octomize gives data teams a real-time, unified workspace that automates, optimizes, and interprets scripts while providing immediate insights into data lineage. It also lets data users easily create business and technical documentation and debug and fix data errors in the Cloudera Data Lineage platform.

    Details

    Sold by

    Delivery method

    Deployed on AWS

    Features and programs

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

    Pricing

    Cloudera Data Lineage

    Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

    12-month contract (1)

    Dimension | Description | Cost/12 months
    Octopai | Tiered-based, 5 connectors, 10 Lineage users, 25 Catalog viewers | $100,000.00

    Additional usage costs (1)


    The following dimensions are not included in the contract terms and will be charged based on your usage.

    Dimension | Cost/unit
    Additional connectors, users or viewers | $0.01
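
    As a rough illustration of how the contract price and usage-based charges combine, the sketch below adds overage charges on top of the 12-month contract price. It assumes that overages are billed linearly at the listed cost per unit and that every additional connector, user, or viewer counts as one unit; neither assumption is confirmed by the listing, and the function name `estimate_annual_cost` is hypothetical.

```python
from decimal import Decimal

# Figures taken from the pricing tables in this listing.
CONTRACT_PRICE = Decimal("100000.00")  # 12-month contract
OVERAGE_UNIT_PRICE = Decimal("0.01")   # cost per additional unit


def estimate_annual_cost(extra_units: int) -> Decimal:
    """Return contract price plus overage charges.

    Assumption: overages are billed linearly per unit. The actual
    billing terms are defined by the contract with the vendor.
    """
    if extra_units < 0:
        raise ValueError("extra_units must be non-negative")
    return CONTRACT_PRICE + OVERAGE_UNIT_PRICE * extra_units


if __name__ == "__main__":
    # Example: 250 additional units beyond the contract entitlement.
    print(estimate_annual_cost(250))
```

    This estimate excludes any additional AWS infrastructure costs.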

    Vendor refund policy

    No refunds available


    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA).

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information


    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Resources

    Vendor resources

    Support

    Vendor support

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Similar products

    Customer reviews

    Ratings and reviews

    4.2
    130 ratings
    5 star: 50%
    4 star: 40%
    3 star: 6%
    2 star: 2%
    1 star: 2%
    0 AWS reviews | 130 external reviews
    External reviews are from G2.
    Computer Software

    Easy to Use, Reliable, and Great for Team Collaboration

    Reviewed on Apr 27, 2026
    Review provided by G2
    What do you like best about the product?
    Easy to use, collaboration among the team and reliability.
    What do you dislike about the product?
    Nothing that comes to mind. A better UI could be helpful.
    What problems is the product solving and how is that benefiting you?
    A cloud platform for arranging classes and running virtual machines easily.
    Paritosh C.

    Reliable Platform for Managing Large-Scale Data Pipelines

    Reviewed on Jul 30, 2025
    Review provided by G2
    What do you like best about the product?
    Cloudera Data Engineering provides a solid environment for building and managing data pipelines at scale.

    I like the way it integrates with Apache Spark and Airflow, making batch processing and scheduling efficient.
    What do you dislike about the product?
    Initial setup and configuration can be complex, especially in hybrid cloud environments.
    What problems is the product solving and how is that benefiting you?
    We had challenges with slow and unreliable data processing in our ETL pipelines. With Cloudera Data Engineering, we were able to automate our workflows, schedule tasks reliably, and scale up when needed. This significantly improved our data delivery times and overall team productivity.
    Asif A.

    Storage product by Cloudera

    Reviewed on Mar 11, 2024
    Review provided by G2
    What do you like best about the product?
    Usability and security are the core features that I like the most about Cloudera DB. It is highly scalable, and implementation is not a big deal.
    What do you dislike about the product?
    Expensive, and it does not support working over an intranet.
    What problems is the product solving and how is that benefiting you?
    It solves the very crucial problem of storing data easily with high security. Being a research guy, I have gone through many companies' DB systems, and without a doubt this is one of the best and easiest to use.
    Information Technology and Services

    Review CloudEra data

    Reviewed on Dec 26, 2023
    Review provided by G2
    What do you like best about the product?
    Cloudera Data Platform is like a super-smart organizer for data, helping companies handle lots of information easily and securely. It works well with big data and lets businesses analyze and use their data smartly, making decisions based on facts.
    What do you dislike about the product?
    Some folks find Cloudera Data Platform a bit tricky to set up and costly to run, like a high-maintenance gadget. It might feel a bit complicated for beginners, and the buttons aren't as easy to figure out as some other tools.
    What problems is the product solving and how is that benefiting you?
    Cloudera Data Platform helps to organize and make sense of big data, making it easier to find valuable insights and make smart decisions.
    Azam A.

    Big data technology leader

    Reviewed on Dec 26, 2023
    Review provided by G2
    What do you like best about the product?
    The way it bundles useful big data technologies in one product, and that it is easy to install and use.
    What do you dislike about the product?
    CDP is too costly compared to the open source software available.
    What problems is the product solving and how is that benefiting you?
    Solving problems related to big data.