Sign in Agent Mode
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Reviews from AWS customer

7 AWS reviews

External reviews

17 reviews
from

External reviews are not included in the AWS star rating for the product.


    Satyam Wagh

Unified data workflows have cut ticket processing times and are driving faster business insights

  • February 03, 2026
  • Review provided by PeerSpot

What is our primary use case?

My main use case for Databricks involves the pipelines and ETL processes that we are implementing. Following the Medallion architecture with Gold, Silver, and Bronze layers, we filter the data, perform transformations, and integrate AI. Databricks has made this process significantly easier.

I worked for an airline company where they experienced substantial delays in data processing. When a passenger booked a ticket, it took 20 to 25 minutes for the transaction to reflect in the system. Using Databricks, we compressed that time from 10 to 6 minutes initially and eventually reduced it to just a few seconds. After setting up all the pipelines and leveraging Databricks features to enhance and accelerate the process, this project became truly impactful and time-based, resulting in reduced processing time and ultimately increased profit for the airline company.

What is most valuable?

The best features Databricks offers are Unity Catalog, Databricks Workflow, Databricks AI, Agentic AI, and the automated pipelines that utilize AI. The AI models are very easy to create and deploy in just a few seconds. These are helpful and user-friendly tools.

I find myself using Unity Catalog most frequently because it provides a unified governance solution for all data and AI needs on Databricks, offering centralized access control, auditing, lineage, and data discovery capabilities across the platform. The main features include access control, security compliance standard models, built-in auditing, and lineage tracking. Most of my projects have involved integrating Unity Catalog into systems and providing overall security, including a migration project to transition to Unity Catalog.

The platform's unified data intelligence capabilities allow teams to analyze, manage, and activate data at scale, leading to faster time to insights, cleaner data pipelines, and significant savings on infrastructure and engineering efforts. Databricks eliminates data silos, accelerates the time to insight, empowers all data personnel, and provides built-in governance and security. It also supports AI and ML, which is an added advantage in today's AI-driven field.

What needs improvement?

Databricks already provides monthly updates and continuously works on delivering new features while enhancing existing ones. However, the platform could become easier to use. While instruction-led workshops are available, offering more free instructional workshops would allow a wider audience to access and learn about Databricks. Additionally, providing use cases would help beginners gain more knowledge and hands-on experience.

Regarding my experience, I was initially unfamiliar with the platform and had to conduct research and learn through various videos. I did find some instruction-led classes, but several of those required payment. The platform should provide more free resources to enable a broader audience to access and learn about Databricks. The platform itself is user-friendly and easy to use without complex issues, so I believe it does not need improvement in its core functionality. Rather, supporting aspects can be enhanced.

For how long have I used the solution?

I have been working as a data engineer for four years. Initially, I was a software engineer, but my career has progressed as a data engineer over this four-year period.

What was our ROI?

Definitely. As I mentioned regarding my airline project, it was impactful because the cost was reduced by 60 to 70 percent. The company was initially using Azure Blob storage, and in Databricks, the cluster and associated infrastructure were cheaper than other platforms. This reduction in both time and money resulted in real-time impact and significant cost savings.

What other advice do I have?

For advice for others considering Databricks, it is important to start by understanding its place in the data ecosystem and how it fits into your specific needs. Key points to consider include familiarizing yourself with Databricks, learning the basics, starting with data engineering, and incorporating ETL processes. You can then dive deeper into Databricks features such as notebooks, clusters, and jobs. Achieving certification enhances your skills validation. For best practices, it is critical to optimize performance and minimize complexity while continuously learning to stay competitive in the data field. Following these steps will be very beneficial for anyone pursuing a career as a data engineer and Databricks engineer.

Databricks is a truly essential platform for data engineering needs, and I recommend it to anyone looking to advance in the data engineering field. It is a very important platform and tool for every data engineer. I encourage everyone to learn and explore this product and to maximize its potential. I rate this product a 9 out of 10.


    SimonRobinson

Improved data governance has enabled sensitive data tracking but cost management still needs work

  • January 12, 2026
  • Review provided by PeerSpot

What is our primary use case?

My usual use case for Databricks as an end-user mostly involves exporting data. This typically entails writing directly into a web interface to get the data out, so probably with Python.

What is most valuable?

The most significant benefit Databricks has brought to my company is the Unity Catalog. Previously, with our data warehouse, we weren't able to track where sensitive data was. The Unity Catalog has been a big improvement, even though we haven't gotten the rest right.

The user interface is very useful, especially in writing directly into a web interface.

From my perspective, the ability to export data effectively and use Python within Databricks are key valuable features.

What needs improvement?

I believe we could improve Databricks integration with cloud service providers. The impact of our current integration has not been particularly good, and it's becoming very expensive for us. The inefficiencies in our implementation, such as not shutting down warehouses when they're not in use or reserving the right number of credits, have led to increased costs.

We made several beginner mistakes, such as not taking advantage of incremental loading and running overly complicated queries all the time. We should be using ETL tools to help us instead of doing it directly in Databricks. We need more experienced professionals to manage Databricks effectively, as it's not as forgiving as other platforms such as Snowflake.

I think introducing customer repositories would facilitate easier implementation with Databricks.

For how long have I used the solution?

I have been working with Databricks for the last six months.

What do I think about the stability of the solution?

As a platform, Databricks is fine. However, our implementation isn't particularly reliable.

We've suffered from the lack of professionals with previous experience, which makes it difficult to dig ourselves out of the situation we've found ourselves in.

What do I think about the scalability of the solution?

The scalability level of Databricks at the moment exceeds our needs. It's not a problem for us.

The sky's the limit with Databricks.

How are customer service and support?

I have addressed technical support about our issue with Databricks. It was the team that engaged with them, and I believe our development teams also reach out for support, though I'm not sure what level of support they get.

Previously, when using Snowflake, we had customer reps who were really knowledgeable and helped us to avoid beginner mistakes. With Databricks, it seems we could have benefited from similar support. Our implementation team had no experience and made obvious mistakes. It may be that we opted not to have that support, but I believe we should have.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

Before Databricks, I used SQL Server.

The big decision to switch from SQL Server to Databricks was motivated by the lack of auditing, lineage, and tracking sensitive data in SQL Server, along with a need for more flexibility.

How was the initial setup?

I did not participate in the initial setup of Databricks.

What about the implementation team?

We use a consultancy, Avanade, for our Databricks implementation. They had previously done a Databricks implementation for another part of our organization. Our implementation team lacked experience which resulted in several beginner mistakes.

What was our ROI?

So far, we're not measuring any return on investment, such as saving time, money, or resources with Databricks. We're still in the phase where our old system and the new system are running simultaneously, so everything is twice as expensive and much effort is doubled. We haven't progressed far enough yet to realize any ROI.

What's my experience with pricing, setup cost, and licensing?

I believe that in terms of credits for Databricks, we're spending between £15,000 and £20,000 a month.

I think Databricks is priced correctly. If we managed our resources better, we wouldn't be paying anywhere near that amount. The issue is with our management of resources.

Which other solutions did I evaluate?

No other options were considered because we used the consultancy Avanade, who had done a previous Databricks implementation for another part of our organization. We used them to recreate our implementation.

What other advice do I have?

I'm probably not the best person to discuss certain aspects of Databricks since I haven't explored it deeply and am not part of the team developing it.

We haven't utilized Databricks' machine learning capabilities.

From my company, data ingestion and transformation are done with Databricks, though I don't do it directly.

I don't use Databricks' features for managing data, such as data lake and warehouse operations.

Most of our current work with Databricks isn't really live yet, so measuring savings in time and money or identifying any return on investment isn't applicable right now.

I would rate this review a 7 overall.


    AvivCohen

Experiencing smooth performance and cost advantages over previous tools

  • May 28, 2025
  • Review from a verified AWS customer

What is our primary use case?

The use case for Databricks is that we use the clustering for high big data processing within the cluster.

What is most valuable?

I think it is difficult to determine which feature of Databricks I enjoy the most since there are many valuable features.

What's valuable about Databricks to my organization is that it is more cost-effective and provides better performance than the current AWS tools and services they offer.

What needs improvement?

I am uncertain about specific improvements for Databricks.

It would be beneficial to make Databricks even more cost-effective.

For how long have I used the solution?

I have been using Databricks for two years.

What do I think about the stability of the solution?

My experience with Databricks has been smooth, and I haven't encountered any issues.

Databricks is definitely a very stable product and reliable.

How are customer service and support?

I have not used Databricks customer service or support.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

Before Databricks, I used Batch processing, Fargate, and possibly Kubernetes.

I switched from my previous solutions because they were either too expensive or too difficult to configure.

Which other solutions did I evaluate?

I have considered other solutions besides Databricks, such as Snowflake, but we haven't explored it extensively yet.

We are still early in our Snowflake experience, so we don't know the pros and cons compared to Databricks.

What other advice do I have?

My deployment model for Databricks is limited as I'm not a heavy user.

I am not the person who purchased Databricks, but it was possibly acquired through the AWS Marketplace.

I may not have utilized Databricks machine learning capabilities.

My experience with the pricing and licensing model is that it remains relatively expensive. Though it's less expensive than AWS, we still need a more cost-effective solution.

I would rate Databricks overall a nine out of ten.

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)


    Lax Kas

Unifying data for analytical insights with smooth AI and machine learning integration

  • May 15, 2025
  • Review provided by PeerSpot

What is our primary use case?

A typical use case for the solution is to build the data lakehouse for the client because they have a variety of source systems, and they want to unify that data into the lakehouse platform, where they want to use the data for analytical purposes and insights.

What is most valuable?

The most valuable features of Databricks are especially the Delta Lake and the Unity Catalog; those are the main features. The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse. Currently, they're coming up with workflow jobs, along with other supporting elements to create an end-to-end solution.

What needs improvement?

In my opinion, areas of Databricks that have room for improvement involve the dashboards. Until recently, everyone used third-party systems such as Power BI to connect to Databricks for dashboards and reports, but they're now coming up with their IBI dashboard, and I think they're on the right track to improve that even further.

For how long have I used the solution?

I have approximately four years of experience working with Databricks.

What do I think about the stability of the solution?

I would rate the stability of Databricks as highly stable, around nine out of ten.

What do I think about the scalability of the solution?

I would rate the scalability of this solution as very high, about nine out of ten.

How are customer service and support?

I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features. For us, it's so far so good with no problems, and I would rate the support quality as eight out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup of the Databricks solution is reasonably fair enough. It doesn't give any trouble to implement the solution, and I think it's fairly easy to set up and work on Databricks.

What was our ROI?

I can't say if there's seen an ROI from the solution because I do not have exposure in that area, although I think the people who decided to implement Databricks might have done all this analysis and POCs.

What other advice do I have?

My relationship with the vendor is that I'm not a partner of Databricks; I work for a client where we use the Databricks software for implementing the solutions.

My clients are usually enterprise-level organizations, but the area where they're implementing is medium level here, although it might go into enterprise level in the future.

Regarding the price of Databricks, I don't involve myself in those decisions.

I think Databricks is very good at facilitating AI and machine learning projects; they implement AI and machine learning models very well, and clients can run their models on Databricks. I believe they are in a better place compared to competitors such as Snowflake, and they are tying up with important companies such as SAP and Palantir.

Based on my experience, I would recommend Databricks to other people. Overall, I would rate this solution as one of the best, about eight out of ten, although I might not know some of the pitfalls; it's based on use case to use case, but for us, it's working well.


    Prabhakar Bonam

Cloud platform enables advanced collaboration but new SAP data feature could enhance its capabilities

  • April 28, 2025
  • Review from a verified AWS customer

What is our primary use case?

I am currently working as an IT architect. We have an AWS analytics platform, a cloud-based platform. We use Databricks for our AI/ML requirements and also the Databricks platform. For the past year, we have been using Databricks for our data scientist community to build their apps.

What is most valuable?

Databricks has a Unified Catalog that assists with secured access and governance. Additionally, serverless computing is crucial for our computing needs. Its collaboration features, such as data sharing capabilities, are also outstanding. Overall, the platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.

What needs improvement?

I heard that a new feature is being developed for SAP that can bring SAP data directly into the platform for generating reports. This feature, if made publicly available, may act as a game-changer, considering many global organizations use SAP data for their ERP requirements.

For how long have I used the solution?

We have been using Databricks for one year.

What was my experience with deployment of the solution?

We did not experience deployment issues as we have an internal database AWS cloud admin team. We worked directly with Databricks personnel to configure our requirements.

What do I think about the stability of the solution?

Although it is too early to definitively state the platform's stability, we have not encountered any issues so far. However, since we are still in the process of building much on the platform, we are still observing its stability.

What do I think about the scalability of the solution?

Databricks is an easily scalable platform. It leverages the cloud advantage directly, and scalability is one of the great features of the cloud platform.

How are customer service and support?

We have a customer portal where we can raise issues. As of now, we are raising issues and they are providing solutions without any problems.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

Before Databricks, we were using the R Studio platform for our advanced analytics requirements. We switched to Databricks because it is an advanced platform compared to R Studio.

How was the initial setup?

The initial setup of Databricks could be complex. We set it up with AWS as the backend. It's crucial to have both cloud-specific knowledge and how to configure its features in the platform.

What about the implementation team?

We have our internal database AWS cloud admin team. We did not use external integrators or consultants.

What was our ROI?

We haven't yet seen an ROI from this solution, as we are still using and observing the platform.

What's my experience with pricing, setup cost, and licensing?

There are some issues with pricing specific to our accounts, and Databricks is investigating. It is not a cheap solution.

Which other solutions did I evaluate?

We previously used R Studio before switching to Databricks.

What other advice do I have?

I would recommend Databricks as an advanced analytics platform. It offers features to quickly pull data from various sources required for report generation. Overall, I rate this solution as seven or eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)


    IshwarSukheja

Unified platform simplifies end-to-end processes with intuitive data access solutions

  • January 10, 2025
  • Review provided by PeerSpot

What is our primary use case?

I use Databricks for various purposes, including data engineering, MLOps, machine learning training and deployment, the entire ML cycle, and dashboards. It serves different purposes for different projects.

What is most valuable?

Unity Catalog is a feature I am currently using extensively. I am migrating many projects to Unity Catalog. MLflow, which I use for model registering and creating the lineage of models, is also valuable.

Additionally, Databricks serves as a single platform for conducting the entire end-to-end lifecycle of machine learning models or AI ops. I don't need to switch between various tools, making it an all-encompassing solution for development and research. I use the lake house and utilize features effectively.

What needs improvement?

There has been a significant evolution in databases. One area of improvement is the Databricks File System (DBFS), where command-line challenges arise when accessing files. Standardization of file paths on the system could help, as engineers sometimes struggle.

It would be beneficial to have utilities where code snippets are readily available. This would allow engineers to easily click a snippet and import it into the notebook, enabling quick modifications to variables or paths for fetching files, such as reading data from DBFS files. If I could right-click to copy absolute paths or to read files directly into a data frame, it would standardize and simplify the process.

For how long have I used the solution?

I have used the solution for five years plus.

What do I think about the stability of the solution?

I would rate stability seven to eight out of ten.

What do I think about the scalability of the solution?

I would rate scalability seven to eight out of ten.

How are customer service and support?

I do not have any issues that require support. Many resources are available online.

How would you rate customer service and support?

Neutral

How was the initial setup?

I use infrastructure as code on the cloud to deploy the infrastructure. I have all the Git repositories and code repositories for deploying the code and models in the workspace. My setup includes a shared workspace, shared clusters, and integration with Unity Catalog.

What about the implementation team?

I have a team of 100 engineers working with me, and I head the Center of Excellence (COE).

What was our ROI?

I believe it is competitive across clouds. When it comes to big data processing, I prefer Databricks over other solutions. Cost-wise, it is very competitive. The setup process is straightforward, thanks to the use of Spark clusters. This allows for faster turnaround times with Databricks.

What other advice do I have?

The product rating is nine out of ten.

Databricks serves as a single platform that can handle numerous end-to-end machine learning tasks. The configuration is simple, scalability is excellent, and monitoring cluster utilization facilitates informed business decisions.

It's easy to schedule jobs, pipelines, and handle multiple use cases in parallel, providing countless benefits.


    Rama Subba Reddy Thavva

Shared notebooks and scheduling enhance cost efficiency

  • January 08, 2025
  • Review from a verified AWS customer

What is our primary use case?

We work on three platforms. Databricks is hosted on Azure for us, so we work with ADFS, Azure Data Factory, and also the AWS Cloud. We work for some customers.

What is most valuable?

The notebooks and the ability to share them with collaborators are valuable, as multiple developers can use a single cluster. This reduces costs. The scheduling part is managed by Databricks itself, for example, when it is idle, it will automatically turn off. All these features are handled by Databricks, reducing costs. We do not need to schedule separately.

For example, on AWS EC2, we have to create a Lambda function or use System Manager templates to schedule EC2 and EMRs. Here, it is taken care of, saving significant resources.

Additionally, notebooks can be shared within the development team which saves effort. Developers can share their notebooks. Git and Azure DevOps integration on the Databricks side is also very helpful.

What needs improvement?

The API deployment and model deployment are not easy on the Databricks side. We use MLflow for managing MLOps, however, further improvement would be beneficial, especially for large language models and related tools. Moreover, the API deployment should be simplified for ease of deployment and consumption.

For how long have I used the solution?

I have been using Databricks for approximately two and a half to three years.

What do I think about the scalability of the solution?

We have not faced any shortages so far. The clusters are available on demand, thus we have not encountered any scalability issues.

How are customer service and support?

We mostly had limited data support required from Databricks. Whenever we did need support, within two or three days the problem was solved. I would rate them ten out of ten.

How would you rate customer service and support?

Positive

What about the implementation team?

We bought it as a service, which is why we never implemented it ourselves. We do not have any implementation team.

Which other solutions did I evaluate?

For companies focused solely on data transformation, transferring data between databases, and not tackling machine learning or deep learning problems, I recommend ADF. It would be sufficient and cost-saving compared to a full-fledged solution like Databricks. However, for data analytics and solving ETL problems, one should consider Databricks.

What other advice do I have?

I would rate it nine out of ten.


    Parag Bhosale

Integrating engineering and learning, but cost challenges arise with cluster management

  • January 08, 2025
  • Review provided by PeerSpot

What is our primary use case?

I usually handle data ingestion and create warehouses. I also assist other teams, such as analytics, to create reports or perform other tasks.

What is most valuable?

Having one solution for everything, from data engineering to machine learning, is beneficial since everything comes under one hood.

What needs improvement?

We often use a single cluster to ingest Databricks, which Databricks doesn't recommend. They suggest using a no-cluster solution like job clusters. This can be overwhelming for us because we started smaller.

We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly. We need to stay in sync with the DVR versions, and migrations can pose challenges. For example, issues arose when we moved a cluster from a previous version to the latest one. We could use their job clusters, however, that increases costs, which is challenging for us as a startup. Maintaining this infrastructure can be a headache.

For how long have I used the solution?

I have worked at a couple of companies, not just the current one, and I have about 20 to 25 months of experience with Databricks.

What do I think about the stability of the solution?

They release patches that sometimes break our code. These patches are supposed to fix issues, but sometimes they cause disruptions.

What do I think about the scalability of the solution?

The patches have sometimes caused issues leading to our jobs being paused for about six hours. Fortunately, nothing important is currently running on Databricks, however, if there were, it would be a significant issue.

How are customer service and support?

They are good. My company has a contract with them that includes good support. Whenever we reach out, they respond promptly.

How would you rate customer service and support?

Neutral

What was our ROI?

With the benefits we receive, the price is reasonable. However, it's important to have good use cases. If it's just for data ingestion, it might not be the best solution price-wise. For a lot of different tasks, including machine learning, it is a nice solution.

What other advice do I have?

I would rate the solution seven out of ten. That rating also depends on how we have the contract with Databricks.

It's still a solid and good rating. I work as a data engineer and Databricks engineer.


    ShubhamSharma7

Capability to integrate diverse coding languages in a single notebook greatly enhances workflow

  • January 03, 2025
  • Review provided by PeerSpot

What is our primary use case?

I am working as a data engineer at Fractal. On a daily basis, I work on Azure Cloud, and I use Databricks frequently. We have EDF pipelines and utilize Synapse for our daily tasks.

What is most valuable?

Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant.

I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.

What needs improvement?

As a data engineer, I see cluster failure in our Databricks user databases as a major issue. I am unsure why, however, our flow, typically involving three to four notebooks, sometimes leads to cluster failure. Despite attempts to identify the problem, there are times when the reason remains unclear. Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.

For how long have I used the solution?

I have been using the solution for three years now.

What do I think about the stability of the solution?

Cluster failure is one of the biggest weaknesses I notice in our Databricks.

Which solution did I use previously and why did I switch?

Databricks is beneficial for cost-saving since clients I work for transitioned from AWS Cloud to Azure Cloud for this reason.

How was the initial setup?

The initial setup is very straightforward for us.

What's my experience with pricing, setup cost, and licensing?

I am not very aware of the pricing. We use three to four clusters in our project. Increasing the number or size of clusters, such as adding more workers, would result in higher costs. That's why we limit ourselves to four clusters for our business.

Which other solutions did I evaluate?

In terms of cost efficiency, it's very useful because our clients switched from AWS Cloud to Azure Databricks to save costs.

What other advice do I have?

I would rate the overall product eight out of ten.

Everything is probably good as far as I have used it, but there's room for improvement in cluster integration. Enhancing cluster capabilities while keeping costs lower would be beneficial.


    Monalisha Nayak

Transformative data analytics with enhanced AI functionalities and good value for money

  • November 12, 2024
  • Review provided by PeerSpot

What is our primary use case?

Databricks is used for transformations and streaming data processing. We utilize it primarily for data analytics, including the use of Delta Lake and Delta Life tables for ETL processes, dashboards for analysis, and the Unity catalog for role management.

How has it helped my organization?

Databricks improves our data analysis tasks with its powerful functionality, offering real-time analytics and machine learning features that help improve model accuracy. It is easy to use, which helps in saving time and, ultimately, costs.

What is most valuable?

The most valuable features of Databricks include the Delta Lake, a user-friendly interface, Delta Life tables for ETL, dashboard features for analysis, and the Unity catalog for role management. It also offers AI functionalities that assist with code management and machine learning processes.

What needs improvement?

While Databricks is generally a robust solution, I have noticed a limitation with debugging in the Delta Live Table, which could be improved. The issue with Delta type tables not loading into multiple places in a single pipeline has been fixed recently.

For how long have I used the solution?

I have been working with Databricks for four years.

How are customer service and support?

We regularly contact Databricks support and are satisfied with their service. I would rate them eight out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup was straightforward after the first week. Deployment processes became quick and efficient using Git.

What's my experience with pricing, setup cost, and licensing?

In terms of cost-effectiveness, Databricks is worth the money.

What other advice do I have?

I'd rate the solution nine out of ten.