Listing Thumbnail

    Databricks Data Intelligence Platform

     Info
    Deployed on AWS
    Free Trial
    The Databricks Data Intelligence Platform unlocks the power of data and AI for your entire organization. Enjoy up to $400 in usage credits during your 14-day free trial. Cancel anytime. After your trial ends, you will automatically be enrolled into a Databricks pay-as-you-go plan.
    4.6

    Overview

    Play video

    Get started today with up to $400 in usage credits during your 14-day free trial. Trial ends the earlier of when credits are consumed or the 14-day period expires. After your trial ends, you will be automatically enrolled into a Databricks pay-as-you-go plan using the payment method associated with your AWS Marketplace account, paying only for what you use and you can cancel anytime. You can view the full per-product rates for Databricks Units (DBUs) at https://www.databricks.com/product/pricing 

    The Databricks Data Intelligence Platform allows your entire organization to use data and AI. Its built on a lakehouse to provide an open, unified foundation for all your data and governance. And its powered by a Data Intelligence Engine that speaks the language of your organization so anyone can access the data and insights they need.

    The Data Intelligence Platform simplifies your modern data stack by eliminating the data silos that traditionally separate and complicate data engineering, analytics, BI, data science and machine learning. Databricks is built on open source and open standards to maximize flexibility. And the platforms common approach to data management, security and governance helps you operate more efficiently and innovate faster across all analytics use cases.

    Reach out to sales@databricks.com  to get specialized configurations and pricing for Databricks on AWS Marketplace on a contract basis.

    ** Technical Support: For help setting up your account, connecting to data, or exploring the platform please reach out to awsmp-onboarding-help@databricks.com **

    Highlights

    • Simple: Databricks provides a simplified data architecture by unifying data, analytics and AI workloads on one common platform running on Amazon S3.
    • Open: Built on top of the world's most successful open source data projects, the Lakehouse Platform unifies your data ecosystem with open standards and formats.
    • Collaborative: With native collaboration capabilities, the Databricks Lakehouse Platform unifies data teams to collaborate across the entire data and AI workflow.

    Details

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Free trial

    Try this product free according to the free trial terms set by the vendor.

    Databricks Data Intelligence Platform

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (1)

     Info
    Dimension
    Cost/unit
    Databricks Consumption Units
    $1.00

    Vendor refund policy

    No refunds

    Custom pricing options

    Request a private offer to receive a custom quote.

    How can we make this page better?

    Tell us how we can improve this page, or report an issue with this product.
    Tell us how we can improve this page, or report an issue with this product.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Please reach out to sales@databricks.com  with any questions or for options on contract or pricing terms.

    Technical Support: For help setting up your account, connecting to data, or exploring the platform please reach out to awsmp-onboarding-help@databricks.com 

    For additional training:

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Databases & Analytics Platforms, ML Solutions, Data Analytics
    Top
    10
    In ML Solutions
    Top
    10
    In Data Analysis

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Lakehouse Architecture
    Built on a lakehouse foundation providing unified data storage and governance across data engineering, analytics, BI, data science, and machine learning workloads
    Open Source Integration
    Constructed on open source data projects and open standards to maximize flexibility and interoperability across the data ecosystem
    Data Intelligence Engine
    Powered by a Data Intelligence Engine that enables organizational access to data and insights across diverse user roles and technical skill levels
    Unified Data Platform
    Consolidates data, analytics, and AI workloads on a single common platform running on Amazon S3, eliminating traditional data silos
    Collaborative Capabilities
    Provides native collaboration features enabling data teams to work together across the entire data and AI workflow
    AWS Data Source Integration
    Secure connectivity to Amazon S3, Amazon Redshift, and Amazon RDS with push-down computation capabilities.
    Elastic Compute Scaling
    Distributed data and machine learning processing powered by Amazon EKS supporting Python, R, Spark, and additional frameworks.
    AWS AI Service Integration
    Pre-built workflows integrating AWS AI services including Amazon SageMaker and Amazon Comprehend for accelerated AI development.
    Large Language Model Connectivity
    LLM Mesh capability enabling connections to Amazon Bedrock for Chat, Retrieval-Augmented Generation (RAG), and Agentic workflows.
    Visual Analytics and ML Interface
    Low-code visual platform for data preparation, pipeline creation, and machine learning model development accessible to both technical and non-technical users.
    Workload Auto-scaling
    Intelligently autoscales workloads up and down across hybrid and public cloud environments for optimized cloud infrastructure utilization.
    Multi-function Analytics Platform
    Provides integrated data warehouse, machine learning, and custom analytics capabilities with unified analytic functions to eliminate data silos.
    Shared Data Experience (SDX)
    Implements security and governance policies that are set once and applied consistently across all data and workloads, with portability across supported infrastructures.
    Data Lifecycle Management
    Manages complete data lifecycle functions including ingestion, transformation, querying, optimization, and predictive analytics across multiple cloud environments.
    Unified Security and Governance
    Ensures all workloads share common security, governance, and metadata with capabilities for data discovery, curation, and self-service access controls.

    Contract

     Info
    Standard contract
    No
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    4.6
    824 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    76%
    23%
    1%
    0%
    0%
    10 AWS reviews
    |
    814 external reviews
    External reviews are from G2  and PeerSpot .
    Gunther C.

    Databricks Makes Large-Scale Data Transformations Easy to Run

    Reviewed on Jun 05, 2026
    Review provided by G2
    What do you like best about the product?
    Databricks simplifies the process of running data transformation operations on massive datasets. Although it can be a bit of a paradigm shift from classic asynchronous processing architectures, it is extremely easy to get started with. Simply put, the thing I like best about it is it's ability to do work at scale.
    What do you dislike about the product?
    The inability to run a copy of Databricks locally to test changes before deploying them to production is a significant hindrance. Creating per-developer staging environments might be a close solution l, but might be a lot of work to manage.
    What problems is the product solving and how is that benefiting you?
    Databricks is making it possible to process tremendous amounts of data efficiently, while simultaneously not requiring a large amount of engineering effort to be applied towards designing the system itself (engineers can focus on solving data problems rather than scaling problems)
    Aruna P.

    This is very powerful for big data and machine learning but watch the cluster costs!

    Reviewed on Jun 04, 2026
    Review provided by G2
    What do you like best about the product?
    The best thing is that we don't have to do any infrastructure to manage now. My team was spending too much time on setting up Apache Spark cluster, managing yarn, and memory crashes on-premise before. With Databricks, we could – within 2-3 clicks – spin up a cluster; collaborative notebooks are very nice! Data engineers and data scientists share the same notebook, so they can collaborate on the same notebook, at the same time. We can have Python, Scala and SQL together in one place without changing any environments. Another super solid feature is delta lake; those provide us with transactions over raw data, this saved us from a lot of data corruption issues since in the past.
    What do you dislike about the product?
    Frankly, its cost is quite high. They charge for DBUs (Databricks Units) and then the cloud provider charge (we're using AWS). Unless you are monitoring, the bill will shoot like a rocket. At times, my developers forget to shutdown the clusters and, if auto-termination is not configured correctly, it's running all night and we get an interest big wake up call in the billing portal the next morning.
    What problems is the product solving and how is that benefiting you?
    We had our data in a lot of different places prior to Databricks. The marketing data was somewhere else, transactional database was somewhere else. We may have had a lot of problems with silo thinking. We are porting Databricks to implement our Lakehouse architecture.We are deploying Databricks to create our Lakehouse Architecture. Now, all raw data will be transferred to S3 and will be cleaned, processed and BI Reporting will be done on Databricks. Went a long way to help resolve our speed issue. Our daily ETLs used to run from 6 - 8 hours. Today, the same pipelines are completing within 45 minutes or less, thanks to Spark optimization in Databricks. The reporting of my business team is providing on time in the morning; thus, the decision making is very fast.
    Sachin G.

    Eliminates the fragmentation tax for ML teams, but Unity Catalog migration takes patience

    Reviewed on Jun 03, 2026
    Review provided by G2
    What do you like best about the product?
    Managing end-to-end machine learning pipelines, specifically training and deploying multi-agent models and recommendation engines.What I appreciate most about Databricks is how it completely eliminates the coordination overhead—the fragmentation tax—between our data engineering and data science teams. Before Databricks, we were losing hours every day moving data between unmanaged data lakes, proprietary data warehouses, and our isolated machine learning compute clusters. Having MLflow natively managed inside the Databricks workspace is a massive advantage for my day-to-day workflow. I no longer have to worry about setting up tracking servers or maintaining infrastructure just to log my training metrics, because Databricks handles the automatic updates and maintenance seamlessly. Every experiment is automatically tracked, and the model registry seamlessly handles version control, making the handoff from experimentation to production deployment incredibly smooth. Additionally, the recent updates to MLflow for evaluating GenAI agents, specifically the ability to use trace-derived baselines to generate runnable evaluation scripts, have saved me countless hours of manual assembly.
    What do you dislike about the product?
    The transition to Unity Catalog has been a significant hurdle for our team. Upgrading our legacy workspace to support Unity Catalog's centralized access control and lineage tracking involved a steep learning curve, especially when dealing with privilege inheritance and ensuring the correct schema privileges were granted across the board. Furthermore, while the platform beautifully abstracts away a lot of DevOps work, it can obscure underlying infrastructure costs. It is far too easy for an engineer to spin up an oversized compute cluster for a simple exploratory data analysis task, leading to sudden and severe spikes in our monthly cloud bill. You have to be extremely disciplined with setting strict auto-termination policies and cluster management rules to keep costs in check. The user interface can also feel a bit tedious at times, requiring you to click through multiple layers in the Catalog Explorer just to view the model details page and trace table-to-model lineage.
    What problems is the product solving and how is that benefiting you?
    The primary problem Databricks solved for us was the massive bottleneck in deploying machine learning models to production. We used to struggle with the classic issue where a model worked perfectly in a local notebook but failed in production due to environment mismatches and a lack of proper version control. By standardizing on Databricks and the managed MLflow environment, we established a strict, documented approval chain that satisfies both our engineering standards and our strict compliance requirements. A real-life example of this was when we recently deployed a multi-agent system for customer churn prevention. We were able to run the inference, monitor the agent's safety and relevance metrics using MLflow's built-in judges, and continuously track the outputs all in one unified platform. This consolidated architecture cut our deployment timelines drastically and significantly reduced the time we spent debugging production errors.
    Anita P.

    Unified Scalable Data Processing and Machine Learning Platform

    Reviewed on Jun 03, 2026
    Review provided by G2
    What do you like best about the product?
    As a Data Scientist working for a mid-size company, my main use case for Databricks is as the central engine for all of our data processing and predictive modeling pipeline. I use it every day to pull raw dirty data from our cloud storage, explore it with complicated SQL queries and then create and train machine learning models with PySpark and Python. Basically it gives our data engineering and data science teams a common place to play on the same huge data sets at the same time without having to endlessly exchange files or credentials.From a day-to-day workflow perspective, I love the fluidity of the collaborative notebook environment. The ability to work with different languages in the same workplace is a great advantage. I can perform an optimized SQL query to pull in a hefty data set in one cell, then process it in the next using PySpark, and visualize it with Python libraries straight after. This fully removes the need to constantly bounce between different tools or IDEs. Another big victory for my daily work is the out-of-the-box connection with MLflow. It makes it very easy to roll back to a previous version, automatically tracks hyperparameter tuning, compares several model runs, and manages the full lifespan of a model. I really enjoy how Databricks takes away the effort of managing Spark clusters, you can spin up a distributed cluster with a few clicks, and focus on writing algorithms vs playing DevOps.
    What do you dislike about the product?
    And despite all its potential, working with Databricks does come with certain daily difficulties. What is most important for a mid-sized company like us is the aggressive pricing model for compute costs. The monthly payment can get out of control very rapidly, if you’re not compulsively watching your cluster configurations and auto-termination settings especially if a high-memory cluster is unintentionally left operating over the weekend. Another major pain point is the built-in Git integration. Databricks Repos has been helpful however managing complicated merge conflicts or branch management still feels unexpectedly clumsy compared to a regular local IDE like VS Code. Lastly, the learning curve is rather severe for new employees. The user interface might be complicated and debugging distributed computing failures can be a major bottleneck for young data scientists getting up to speed.
    What problems is the product solving and how is that benefiting you?
    The largest basic problem that Databricks tackled for our business was breaking down the separate silos between our data engineers and data science team. We saw this effect in the real world recently when we were working on a project to build a fraud detection algorithm. In the prior approach, I would have to submit a ticket to data engineering, wait days for them to extract and clean the data, and then try to train the model locally. I would get out of date data by the time I got it, and my machine would crash all the time owing to memory constraints. I could immediately connect to our Delta Lake, utilize PySpark to process the huge data size without any memory issues and train the model on a scalable cluster, all in the same ecosystem using Databricks. This one-stop-shop decreased our model deployment duration from about a month to a couple of days, dramatically enhancing how fast we offer actionable business value.
    Keshav R.

    Excellent for big data team but very tricky to manage costs and access

    Reviewed on Jun 03, 2026
    Review provided by G2
    What do you like best about the product?
    The main benefit from IT side is that Databricks removes the infrastructure headache. Earlier our data engineers were always asking for setting up Spark clusters, managing libraries, and handling VM failures. Databricks does all this automatically. The auto-scaling is quite smooth; it adds nodes when workload is high and removes them later, so infrastructure utilization is very efficient.

    Also, the integration with AWS and Azure IAM roles is very solid. We can easily connect it with our active directory for single sign-on SSO, which makes user onboarding very fast. The notebook sharing feature is also liked by my teams because they can collaborate without sharing code files over email or Slack.
    What do you dislike about the product?
    The biggest pain point for IT Operations is cost control. Databricks billing uses DBUs Databricks Units, and it is very difficult to predict monthly budget. Another issue is the cluster startup time. It takes around 4 to 7 minutes to spin up a new cluster.
    What problems is the product solving and how is that benefiting you?
    We are using Databricks to centralize our entire data processing and machine learning pipelines. Before this, data was scattered in different silos, and maintaining different environments for data engineers and data scientists was an operational nightmare.

    Now, Databricks gives a single platform. From an operations perspective, it reduces my team's support ticket load by at least 40% because users can self-serve their clusters within the limits we set. It saves a lot of engineering hours that we used to spend on maintaining open-source Apache Spark infrastructure.
    View all reviews