Sign in
Categories
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Reviews from AWS customer

6 AWS reviews

External reviews

640 reviews
from and

External reviews are not included in the AWS star rating for the product.


    Avadhut Sawant

Ahead of the competition in building data ecosystems, but needs to improve ease-of-use

  • August 16, 2023
  • Review from a verified AWS customer

What is our primary use case?

I worked with Databricks pretty recently. The particular design processes involved in Databricks were also a part of that specific design/architectural process.

We have used the solution for the overall data foundation ecosystem for processing and storage on a Delta format. We have also seen use cases where we were trying to establish advanced analytics models and data sharing where we leverage the Delta Sharing capabilities from Databricks.

What is most valuable?

A very valuable feature is the data processing, and the solution is specifically good at using the Spark ecosystem.

What needs improvement?

There are some aspects of Databricks, like generative AI, where they are positioning things like DALL-E. They're a little bit late to the game, but I think there are some things that they are working on. Generative AI is catching up in areas like data governance and enterprise flavor. Hence, these are places where Databricks has to be faster, and even though they are fast, I'm not sure how they'll catch up and get adopted because there are strong players in the market.

Databricks is coming up with a few good things in terms of integration. But I have to put one point forward that covers multiple aspects, which is the ease of use for the end user while operating this particular tool. For example, a tool like ADS gives you a GUI-based development, which is good for the end user who does development or maintenance. Looking at the complexities of data integration, a GUI might not be easy, but Databricks should embrace something on the graphical user development front because it is currently notebook-driven. Also, in terms of accessing the data for the end user, Databricks has an SQL interface, similar to earlier tools like SQL Management Studio. Since people are mostly comfortable with SSMS already or not, Databricks can build integration to known tools for data access, and that also helps, apart from what they're doing. I would like to see improvements with respect to user enablement, which is a good part of enterprise strategy. I would like to see their integration with a broader ecosystem of products. If you have to do data governance in tools like Microsoft Purview, it's manual and difficult. Now, I'm unsure if that momentum must be from Databricks or Microsoft. But it would be good if Databricks had some open interfaces to share metadata, which could be viewed in tools enabling data governance like Collibra, Purview, or Informatica. The improvement has to do with user and metadata integration for tools.

For how long have I used the solution?

I've worked with Databricks for over five or six years, but it's been on and off.

What do I think about the scalability of the solution?

The solution is scalable. In this particular ecosystem, there is no one else who can catch up with Databricks for now.

How are customer service and support?

Databricks' customer support is very good. They have a lot of ways in which they interact with vendors and service partners across the globe. They have periodic touch-up sessions with vendors, where their engineers answer your questions.

How was the initial setup?

The implementation is not challenging because the solution integrates well with the platforms on which they are established, whether it's Azure, AWS, or GCP. The solution is not difficult to set up, but you'd probably need a technical user to operate it.

It's the same story with maintenance, where you'd need a technically proficient person with programming knowledge to maintain it.

What other advice do I have?

Databricks integrates many enterprise processes because data processing and AIML are a small part of a larger ecosystem. Databricks has been a part of other platforms, and they are trying to establish their platform, which is a good direction.

Most of the capabilities of the underlying platform can be leveraged there. But the setup isn't difficult if the database lacks some capability, you can't find it in the database, or you're not comfortable with a certain feature in the database. It integrates well with the underlying platform. For example, with scheduling, let's say you are uncomfortable with workflow management. You can utilize integrations with EDA for any other tool and probably perform scheduling. Even if what you're trying to do is not easy, it is enabled with integration. Either they build a required feature in their tool later on, like a GUI, or you perform integrations to make the features possible.

We did evaluate licensing costs, but it had more to do with the Azure ecosystem pricing since whatever we are doing has more to do with Azure Databricks. Many optimizations are recommended, but we haven't exercised those for now. But considering that the processing is a bit more efficient, the overall price won't be much different from what it could be for any other similar component or technology. We haven't had specific discussions with Databricks' folks on pricing.

My advice to users who would like to start working with Databricks is that it is a good solution to work with for data integration and machine learning. Databricks is maturing for other use cases, so there are two points to be considered. One is that you need to evaluate how they will mature, which will be on a case-to-case basis. Second, how will it align with the overall platform story? There will be many overlapping aspects over there as Databricks expands its capabilities. In that case, it must be considered that if those capabilities overlap, how will the underlying platform vendors handle it? How would that interplay happen if many of Databricks' new capabilities align with Microsoft Fabric? That has to be very carefully considered. Otherwise, if you utilize those new capabilities, there might be a discontinuity where you cannot use Databricks because the platform does not support that.

If I specifically talk about Spark-based processing transformations, the data integration story, and advanced stability, I would rate Databricks around eight out of ten. However, with respect to new capabilities like cataloging, data governance, and security integration, I rate Databricks around five because it has to establish these features. And since Databricks integrates with platforms, we must see the interplay with the platforms' capabilities.

I overall rate Databricks a seven out of ten.


    Rupal Sharma

Processes large data for data science and data analytics purposes

  • August 15, 2023
  • Review provided by PeerSpot

What is our primary use case?

It's mainly used for data science, data analytics, visualization, and industrial analytics.

What is most valuable?

Specifically for data science and data analytics purposes, it can handle large amounts of data in less time. I can compare it with Teradata. If a job takes five hours with Teradata databases, Databricks can complete it in around three to three and a half hours.

So that's why it's quite convenient to use for data science, for training machine learning models. By using more computing power, you can make it even faster.

What needs improvement?

There is room for improvement in visualization.

For how long have I used the solution?

I used it for two years. I worked with the latest update. 

What do I think about the stability of the solution?

I would rate the stability a nine out of ten. I didn't face performance drops.

What do I think about the scalability of the solution?

I would rate the scalability an eight out of ten.

How are customer service and support?

Databrick's support is great. If we need any support, they are very quick with it. And they genuinely want you to use Databricks. So, whatever we ask them, they come up with multiple solutions to problem statements. That's really good.

Overall, the customer service and support are very good.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

I personally prefer using Databricks. However, we also considered using Snowflake, but the pricing was different. It's  price per query.

So, as per your storage, a data scientist or a data analytics team needs to query again and again, which does not suit a data-heavy organization.

What was our ROI?

It's a good return on investment for Databricks from a delivery perspective. Delivered multiple dashboards. So, it's quite a good return on investment. And being a small organization, everyone can use Databricks, and cost-wise, it's also good for small organizations.

Which other solutions did I evaluate?

If the company is a startup, Databricks might be suitable. If a big company needs a lot of storage, Teradata might be best for them. It depends on the situation.

What other advice do I have?

Overall, I would rate the solution a eight out of ten. I would definitely recommend this solution for small organizations. 

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    Qasim H.

Data + AI should be unified

  • August 13, 2023
  • Review provided by G2

What do you like best about the product?
Before exploring Databricks I had worked on many tools & their integration to make Data flow simpler but Databricks is doing wonders in this space by unifying Data + AI at one platform & also giving smooth UI to Intereatc & get your job done in a simplified manner
What do you dislike about the product?
Although my experience is really good but what I have observed is that data bricks is doing wonders with Azure most of the time, I think they introduce Databricks as PAAS with other platforms like AWS especially.
What problems is the product solving and how is that benefiting you?
I think overall the Platform is really good but I loved their Data Governance support using Catalog is really a solution to most of the Data Governance problems.


    Hubert D.

The best lakehouse on the market

  • August 13, 2023
  • Review provided by G2

What do you like best about the product?
- SQL warehouse,
- MLflow,
- Delta,
- Ecosystem
What do you dislike about the product?
- Dashboards could be improved to match BI product's capabilities
What problems is the product solving and how is that benefiting you?
Data engineering and machine learning


    Karan Sharma

An easy to setup tool that provides its users with an insight into the metadata of the data they process

  • August 11, 2023
  • Review provided by PeerSpot

What is our primary use case?

My company uses Databricks to process real-time and batch data with its streaming analytics part. We use Databricks' Unified Data Analytics Platform, for which we have Azure as a solution to bring the unified architecture on top of that to handle the streaming load for our platform.

What is most valuable?

The most valuable feature of the solution stems from the fact that it is quite fast, especially regarding features like its computation and atomicity parts of reading data on any solution. We have a storage account, and we can read the data on the go and use that since we now have the unity catalog in Databricks, which is quite good for giving you an insight into the metadata of the data you're going to process. There are a lot of things that are quite nice with Databricks.

What needs improvement?

Scalability is an area with certain shortcomings. The solution's scalability needs improvement.

For how long have I used the solution?

I have been using Databricks for a few years. I use the solution's latest version. Though currently my company is a user of the solution, we are planning to enter into a partnership with Databricks.

What do I think about the stability of the solution?

It is a stable solution. Stability-wise, I rate the solution an eight to nine out of ten.

What do I think about the scalability of the solution?

It is a scalable solution. Scalability-wise, I rate the solution an eight to nine out of ten.

My company has a team of 50 to 60 people who use the solution.

How are customer service and support?

Sometimes, my company does need support from the technical team of Databricks. The technical team of Databricks has been good and helpful. I rate the technical support an eight out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup phase of Databricks was good. You can spin up clusters and integrate those with DevOps as well. Databricks it's quite nice owing to its user-friendly UI, DPP, and workspaces.

The solution is deployed on the cloud.

The time taken for the deployment depends on the workload.

What's my experience with pricing, setup cost, and licensing?

I cannot judge whether the product is expensive or cheap since I am unaware of the prices of the other products, which are competitors of Databricks. The licensing costs of Databricks depend on how many licenses we need, depending on which Databricks provides a lot of discounts.

What other advice do I have?

It is a state-of-the-art product revolutionizing data analytics and machine learning workspaces. Databricks are a complete solution when it comes to working with data.

I rate the overall product an eight out of ten.


    Samadhan K.

Embracing the Future : A Deep Dive into Databricks Lakehouse Platform

  • August 10, 2023
  • Review provided by G2

What do you like best about the product?
1. Advanced Analyrics and AI : Beyond traditional data management , Databricks Lakehouse platform offera a comprehesive suite of tools for advanced analytics and artificak Intelligence,

2. Data Quality and Goverance.
What do you dislike about the product?
Databrikcs platform offer numerous benefits and innovative features, but few aspects might find less favorable like
1. Complexity of New Users
2. Cost Considerations.
3.Limited Flexibility
What problems is the product solving and how is that benefiting you?
Databricks Lakehouse platform aims to address several challenges that organizations face in the realm of data management , analystics and decision making.
1. Unified Data Platform
2.Advanced Analytics and AI


    Ajay Kumar P.

Databricks - Unified solution for Data Engineering and AI

  • August 09, 2023
  • Review provided by G2

What do you like best about the product?
Databricks Intelliegence Platform provide a common platform for ETL,Reporting and AI. It's help user to monitor all the data lineage with the help of Unity Catalog and that can be used to create audit report at account level also databricks Genie help user to direct query the tables with simple english words.
What do you dislike about the product?
Overall it's good for Data Engineer ML Engineer and Analysts but need to work on workflow part that can be more robust and feature reach
What problems is the product solving and how is that benefiting you?
Databricks Lakehouse Platform provides multiple inbuilt fetaures that help developers and admin to manage the data efficiently. We had big issue related to data governance but with the help of Databricks Unity Catalog we can now govern the data and it's also help us to track the workspace access.
Databricks provide inbuilt ML models that help us to escalate the development process.


    Jacobus H.

Most advanced data platform

  • August 02, 2023
  • Review provided by G2

What do you like best about the product?
Databricks has been developing an excellent product with new features being added constantly. ETL/ELT, dbt integration, integration with external sources, and many more features make it a product that is versatile and highly effective. Managment of permissions is also extremely easy.
What do you dislike about the product?
Some automated recommendations on optimizing jobs are lacking, so you could end up spending quite a lot of money without the best performance. The platform is also not for small scale use-cases.
What problems is the product solving and how is that benefiting you?
Databricks is providing the fundamental building block of our data warehouse + data lake + data processing platform from which other systems receive analytics data.


    Ankur J.

Unified Still Federated Lakehouse Platform

  • August 02, 2023
  • Review provided by G2

What do you like best about the product?
A great platform to focus on industry challenges around data and ai. Good part is solutions for those challenges are quickly built, tested and released. Participation during private preview also make sure that these solutions are fit for purpose to industry challenges.
Best features floats around combining data lake and datawarehouse capability to help reduce cost and deliver faster with improved security
What do you dislike about the product?
Its integration with native cloud services is still weak, out of the box integration & use with org identify federation is still not mature. Along with capability to integrate with enterprise catalog and buillding a unified metric system for organization.
What problems is the product solving and how is that benefiting you?
Unified view for all our data sources, easy sharing of data with our product team, easy platform for data owners to democratise their data. and central place to apply security and governance.


    Marketing and Advertising

It is a one stop solution to collaborate for all Data Engineering and Data Science activities

  • August 02, 2023
  • Review provided by G2

What do you like best about the product?
Collaboration,Feature store,MLOps,scheduling
What do you dislike about the product?
Don't have a way to put dependencies between workflows
What problems is the product solving and how is that benefiting you?
It has helped me build the datalake for business intelligence usecases