I primarily use Databricks to process large-scale data sets with Apache Spark, for example data sets of 600 GB or 800 GB.

Databricks Data Intelligence Platform
Databricks, Inc.
External reviews
External reviews are not included in the AWS star rating for the product.
It has made integration with different sources hassle-free, so we can focus more on the data.
The AI assistant also gave incorrect suggestions a few times.
The new era of the Databricks Data Intelligence Platform for all AI solutions
Complete platform but a bit confusing
Processes large-scale data sets and integrates with Apache Spark in a notebook environment
What is our primary use case?
What is most valuable?
Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of its strengths. Another strength is that the platform makes it very easy to manage resources. For example, setting up a cluster of five or fifteen nodes is straightforward with Databricks. The notebook environment is also excellent, making it easy to perform various tasks.
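As a rough illustration of the cluster setup described above, the sketch below builds the kind of request body that the Databricks Clusters API accepts when creating a cluster. The helper name build_cluster_spec, the cluster names, the 30-minute auto-termination default, and the placeholder runtime version and node type are assumptions made for illustration. Only the five-node and fifteen-node sizes come from the review.

```python
import json


def build_cluster_spec(name, num_workers, spark_version, node_type_id,
                       autotermination_minutes=30):
    """Return a dict shaped like a Clusters API create request (illustrative helper)."""
    if num_workers < 0:
        raise ValueError("num_workers must not be negative")
    return {
        "cluster_name": name,
        # Placeholder values: pick a real runtime version and node type
        # from what your workspace offers.
        "spark_version": spark_version,
        "node_type_id": node_type_id,
        "num_workers": num_workers,
        # Shut the cluster down after this many idle minutes to limit cost.
        "autotermination_minutes": autotermination_minutes,
    }


# The two cluster sizes mentioned in the review.
small = build_cluster_spec("analytics-small", 5, "<runtime-version>", "<node-type>")
large = build_cluster_spec("analytics-large", 15, "<runtime-version>", "<node-type>")

print(json.dumps(small, indent=2))
```

Changing the cluster size is then a one-argument change, which roughly matches the ease of resource management the reviewer describes. Sending the request to a workspace would additionally need a workspace URL and an access token, which are not shown here.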
What needs improvement?
While Databricks allows you to upload your own packages, we ran into some limitations that come from Apache Spark itself and therefore also affect Databricks. In particular, we had issues working with spatial data. We had to go through many steps to find libraries that could process spatial data in a distributed fashion.
For how long have I used the solution?
I have been using Databricks since 2018.
What do I think about the scalability of the solution?
I might have a project that runs for one or two months, and then I might not use Databricks for six months. Self-service is one of its strengths. I can shut down everything and easily spin up resources when I need to use them again. We have a dedicated group of fifty people who consistently use Databricks for analytics.
How was the initial setup?
The initial setup was very easy and involved around 10 to 15 people. We have a data science infrastructure team that helps with this.
What was our ROI?
Databricks stands out among most data platforms mainly because of its ease of use. The learning curve is not as steep, making it accessible for anyone to handle large-scale data processing on Databricks. This ease of use contributes positively to our return on investment. However, in our line of work, converting this efficiency into direct monetary gains can be challenging, given our nonprofit nature.
What's my experience with pricing, setup cost, and licensing?
We purchased high-performance laptops to reduce our reliance on the cloud. The main issue was the cost. Internally, if I used Databricks, that cost would be charged back to my team. There was a time when my monthly cost was around ten thousand dollars, which was quite high. Due to these costs, several teams, including ours, moved away from using Databricks and other cloud providers. It became prohibitive, so we invested in our own high-performance computers instead.
What other advice do I have?
Databricks provides ease of use for me, particularly due to its seamless integration with Apache Spark. This integration simplifies the process of conducting machine learning on large-scale datasets.
I recommend this solution 100%. Overall, I rate the solution an eight out of ten.
Databricks Data Intelligence Platform Review
There is a community to help as well.
Best for beginners as well.
Excellent platform
Databricks as a Product
Single Source for my Data Engineering work
With the Data Intelligence Platform, Databricks has made data pipelines and ETL processes easier to implement than ever. Pipelines have become simpler, and we can build them quickly and easily.
With the Data Intelligence Platform, project delivery has improved and the frequency of pipeline builds has increased.
It is now easier to troubleshoot pipelines and Spark jobs, which reduces the heavy load on the team and on customer support.
Helps users with data processing and analytics
What is our primary use case?
I use Databricks to manage the setup of data lakes for SaaS.
What needs improvement?
The biggest problem with the product is that it is quite pricey. Even so, we cannot currently find a better solution than Databricks on the market.
For how long have I used the solution?
I have been using Databricks for a year.
What's my experience with pricing, setup cost, and licensing?
It is an expensive tool. The licensing model is pay-as-you-go.
What other advice do I have?
The tool helps with data processing and analytics on large-scale data, or big data, since it is built for managing data at scale.
I am not a technical person, so I cannot explain how the tool helps with data engineering tasks.
Another team in my company works with the machine learning and AI features in Databricks. My team mostly works in operations. The tool is used in a multi-country project.
For example, in my company, some purchasing decisions for solutions are based on which product the company as a whole has chosen.
I rate the tool an eight out of ten.