
Reviews from AWS Marketplace

2 AWS reviews

External reviews

299 reviews
from G2

External reviews are not included in the AWS star rating for the product.


    Somu S.

Excellent infrastructure, can scale clusters in no time

  • August 16, 2020
  • Review provided by G2

What do you like best about the product?
Interactive clusters, user friendly, excellent cluster management
What do you dislike about the product?
Clusters take some time to warm up on start. It should also support upserts without Delta, as businesses need pure upserts too.
What problems is the product solving and how is that benefiting you?
Can seamlessly use PySpark and Python to build a robust pipeline.
Recommendations to others considering the product:
It's the best infrastructure for building pipelines if you are planning to use Spark in production.


    Vivek P.

Databricks- Big Data processing tool

  • July 16, 2020
  • Review provided by G2

What do you like best about the product?
Very easy to use. No need to install and set up Spark manually.
Provides a notebook environment to write code.
Supports various languages like Python, Spark SQL, R, Scala, etc.
Easy to set up and use.
You can choose the cluster according to your needs.
Supports machine learning flows and streaming data.
Automatically suspends the cluster if it is inactive for more than a given time (cost-cutting).
Auto-scalable clusters.
Optimizes use of clusters (resources).
What do you dislike about the product?
No CI/CD features provided by default.
Costly for small enterprises.
Certification cost is high.
What problems is the product solving and how is that benefiting you?
We have to develop pipelines. We get data from different sources like AWS S3 and Redshift, process that large amount of data on Databricks, and put it back into our data warehouse.
Recommendations to others considering the product:
Splunk is one of the best tools when it comes to big data processing. It is easy to use and set up.


    Ramavtar M.

MLflow: One-stop solution for data science model tracking, versioning and deployment

  • June 23, 2020
  • Review verified by G2

What do you like best about the product?
1) A single format to support all major ML libraries such as Sklearn, TensorFlow, MXNet, Spark MLlib, PySpark, etc.
2) Capabilities to deploy on Amazon Sagemaker with just one API call
3) Flexibility to log all model parameters and metrics such as accuracy, recall, etc., along with hyperparameter tuning support.
4) A good GUI to compare and select the best models.
5) Model registry to track Staging, Production, and Archived models.
6) Python best API
7) REST APIs supported.
8) Available out of the box in Microsoft Azure.
What do you dislike about the product?
1) CI/CD pipeline is not supported in the open-source version
2) It is a recent framework, so the community is not very large
3) Dependent on many Python libraries. This can be a problem when resolving dependencies in your existing setup.
What problems is the product solving and how is that benefiting you?
I have used it for managing the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.
The same thing can be done in Amazon SageMaker, GCP AI Platform, Microsoft Azure, etc., but it would require monthly expenses. It can be a good fit for an early-stage startup data science team.
Recommendations to others considering the product:
It can't be a complete solution for the data science/ML engineering flow, but it is an essential part of the pipeline. It may be used with Apache Airflow to build an end-to-end MLOps solution. Also, it works best with Amazon SageMaker and Microsoft Azure. However, GCP AI Platform support is still in the development phase.
You would also need to take care of the CI/CD pipeline for ML models on your own.


    Vikrant B.

Lightning-Speed Analytics

  • April 29, 2020
  • Review provided by G2

What do you like best about the product?
Databricks is a great analytics tool that provides lightning-speed analytics and has given new abilities to data scientists. Additionally, our capacity for advanced analytics at scale has gone up 100 times.
What do you dislike about the product?
The learning curve is steep and people would need coding knowledge to work with Databricks. It can also be costly at times.
What problems is the product solving and how is that benefiting you?
Problems - Analytics problems

Benefits - Scale and Speed


    Alvaro R.

Great tool for distributed programming

  • October 31, 2019
  • Review verified by G2

What do you like best about the product?
The different languages used for implementation.
Great user experience.
Easy to understand and use.
Creation of different resources inside, such as clusters or databases.
Ease of integration with other software such as Azure services.
Great addition to your expertise if you manage to master it completely.
Integration of Spark with the different languages (Python, R, Scala).
What do you dislike about the product?
The documentation inside the portal isn't the best; I find better support outside with search engines.
What problems is the product solving and how is that benefiting you?
Currently, data transformation. It provides easy access to databases or blobs, and the ability to use a language such as Python to build the solution you need is great.
Recommendations to others considering the product:
Great tool for development when you are looking for fast results, as it uses distributed programming across different clusters.


    Internet

Databricks review

  • October 24, 2019
  • Review provided by G2

What do you like best about the product?
1. Good UI
2. Good integrations with other applications/services.
3. Fast and efficient.
4. Updates are good.
What do you dislike about the product?
1. Sometimes it takes a long time to load the Spark notebook.
2. Sometimes there are issues with interpreter settings while running the notebook.
What problems is the product solving and how is that benefiting you?
1. Big data - Analyzing large datasets.


    Douglas D.

Makes building Spark applications a lot easier

  • September 20, 2019
  • Review provided by G2

What do you like best about the product?
It's like a Jupyter notebook but a lot more powerful and flexible. You can easily switch from Python to SQL to Scala from one cell to the next. With the Spark framework, you can preview your data processing tasks without having to build large intermediate tables.
What do you dislike about the product?
It needs better support when it comes to troubleshooting Spark applications. It shows a lot of information, but gives you little sense of how to apply it.
What problems is the product solving and how is that benefiting you?
We build a lot of large-scale data processing applications. Previously we used databases, but this is more flexible and powerful (and cheap).
Recommendations to others considering the product:
It's great if you already understand Spark. Otherwise, Spark has quite a learning curve.


    Computer Software

It's the Databricks show!

  • April 06, 2019
  • Review provided by G2

What do you like best about the product?
It has significantly improved its performance with the Databricks Input and Output module. With better support for Spark, it combines well with Microsoft Azure and Amazon AWS. It has faster execution and faster read/write processes in version 5.
What do you dislike about the product?
A few schema-related queries are still on the slower side, considering the huge data clusters and the processing involved for those clusters.
What problems is the product solving and how is that benefiting you?
It runs on clusters of machines managed by Databricks, which gives us the assurance to manage data in a distributed manner. It includes Spark and adds a number of components and updates to perform big data analytics and data processing. Its parallel processing in RDDs is amazing.


    Computer Software

Great product for uncovering data insights but not made for team projects

  • November 05, 2018
  • Review provided by G2

What do you like best about the product?
You can sync data from different systems all onto this one platform and everything can be analyzed without switching programs since you can also use many different programming languages and reap the benefits of each such as SQL and Python. This makes it so much easier to work with large datasets. Very nice user interface too!
What do you dislike about the product?
It is very difficult to collaborate on projects using Databricks; this is its biggest downfall and in fact almost outweighs the benefits. I also don't think their customer support is the best; I have had some challenges with that. Otherwise a very good product.
What problems is the product solving and how is that benefiting you?
Great way to uncover data insights easily from large datasets.
Recommendations to others considering the product:
Keep in mind that you cannot collaborate on products. Technical support is also not the best!


    Srishti K.

Awesome Experience

  • September 25, 2018
  • Review provided by G2

What do you like best about the product?
This is a really nice, user-friendly platform.
What do you dislike about the product?
I have not found any glitches. It is really good.
What problems is the product solving and how is that benefiting you?
It's really simple to manage data.
Recommendations to others considering the product:
NA