I have recently gotten into Databricks and trained on one model. I started using Databricks because of its hardware support and all the other things that it provides, and it is easier to get into. Earlier, when I had to test some part of my code or test if it was working or not, it was not just a fair, not a full production run, but just a fair testing; I had to get a machine, raise a request, get into the whole process. With Databricks, I can just simply create one myself. I could get the resources, whatever they are required, test it out all there, and then go ahead with that, and that is why I have been using it primarily.

Databricks Data Intelligence Platform
Databricks, Inc.External reviews
External reviews are not included in the AWS star rating for the product.
services are easy to implement at scale
Provides resources to users quickly without much hassle
What is our primary use case?
What is most valuable?
The most valuable features of the solution are the hardware and the resources it quickly provides without much hassle.
What needs improvement?
I think setting up the whole account for one person and giving access are areas that can be difficult to manage and should be made a little easier.
For how long have I used the solution?
I have experience with Databricks.
What do I think about the stability of the solution?
I think there's a duration after which our training without any activity would expire, which I think is a fair point, and that is the only place where I think this will stop. I haven't come across a lot of problems with Databricks.
What do I think about the scalability of the solution?
The tool is not used as frequently as PyTorch. I don't know why I am comparing Databricks to PyTorch, but I think around five people use it.
How are customer service and support?
I have not contacted the solution's technical support team.
Which solution did I use previously and why did I switch?
Before Databricks, I used to use a cloud support platform.
How was the initial setup?
The solution is deployed on the cloud.
Which other solutions did I evaluate?
I chose Databricks over other products, considering the hardware support it offers.
What other advice do I have?
A little bit of time will be needed to get comfortable with Databricks.
I rate the tool an eight out of ten.
Leveraged the Databricks Data Intelligence Platform for streamline data processing and analytics.
The seamless integration of big data and machine learning workflows, enabling efficient data processing, collaborative development, and scalable analytics in a unified environment
There are numerous upsides but I will go with my top THREE:
1. Scalability: Automatically scales resources to handle large data volumes efficiently.
2. Collaborative Environment: Supports collaborative development with shared notebooks and real-time co-authoring.
3. Unified Analytics: Combines data engineering, data science, and machine learning in a single platform.
1. Complex Data Processing: It simplifies the handling of large and complex datasets by leveraging Apache Spark, enabling faster data processing and analysis.
2. Unified Environment: It provides a unified platform for data engineers, data scientists, and analysts to collaborate seamlessly on data projects, reducing silos and improving productivity.
3. Scalability: The platform automatically scales resources based on workload demands, ensuring efficient resource utilization and performance even with varying data volumes.
it has made the integration with different sources hassel free and we can more focus on data
ai assistant also provide incorrect suggestion few times
Complete platform but a bit confuse
Process large-scale data sets and integrates with Apache Spark with notebook environment
What is our primary use case?
I primarily use Databricks to process large-scale data sets with Apache Spark. My main use case is processing large data sets, such as 600 GB or 800 GB.
What is most valuable?
Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of its strengths. Another strength is that the platform makes it very easy to manage resources. For example, setting up a cluster of five or fifteen nodes is straightforward with Databricks. The notebook environment is also excellent, making it easy to perform various tasks.
What needs improvement?
While Databricks allows you to upload your packages, we encountered some limitations with its capabilities, particularly with Apache Spark, which also affected Databricks. We had issues working with spatial data. You had to go through many steps to find libraries that could process spatial data in a distributed fashion.
For how long have I used the solution?
I have been using Databricks since 2018.
What do I think about the scalability of the solution?
I might have a project that runs for one or two months, and perhaps I won't use it for six months. Self-service is one of its strengths. I can shut down everything and easily spin up resources when I need to use them again. We have a dedicated group of fifty people who consistently use Databricks for analytics.
How was the initial setup?
The initial setup was very easy and took around 10-15 people. We have a data science infrastructure team helping with this.
What was our ROI?
Databricks stands out among most data platforms mainly because of its ease of use. The learning curve is not as steep, making it accessible for anyone to handle large-scale data processing on Databricks. This ease of use contributes positively to our return on investment. However, in our line of work, converting this efficiency into direct monetary gains can be challenging, given our nonprofit nature.
What's my experience with pricing, setup cost, and licensing?
We purchased high-performance laptops to reduce our reliance on the cloud. The main issue was the cost. Internally, if I used Databricks, that cost would return to my team. There was a time when my monthly cost was around ten thousand dollars, which was quite high. Due to these costs, several teams, including ours, move away from using Databricks and other cloud providers. It became prohibitive, so we invested in our high-performance computers internally instead.
What other advice do I have?
Databricks provides ease of use for me, particularly due to its seamless integration with Apache Spark. This integration simplifies the process of conducting machine learning on large-scale datasets.
I recommend this solution 100%. Overall, I rate the solution an eight out of ten.
DataBricks Data Intelligence Platform Review
community to help as well
best for begineer as well.
Excelent platform
Helps users with data processing and analytics
What is our primary use case?
I use Databricks to manage the setting up of data lakes for SaaS.
What needs improvement?
The biggest problem associated with the product is that it is quite pricey. We cannot find a better solution than Databricks in the market currently.
For how long have I used the solution?
I have been using Databricks for a year.
What's my experience with pricing, setup cost, and licensing?
It is an expensive tool. The licensing model is a pay-as-you-go one.
What other advice do I have?
The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale.
For my general use cases, I would say that I am not a technical person, so I cannot explain to you how the tool helps with the area of data engineering tasks.
There is another team in my company that is involved in the use of machine learning and AI features in Databricks. My team is mostly into operations. The tool is used in a multi-country project.
For example, in my company, they make some shopping decisions related to solutions based on what is the product chosen by the whole company.
I rate the tool an eight out of ten.