Dataiku is an AI platform that we use for oil and gas exploration. Even though I can't provide specific details, this is the primary use case for us.
Dataiku Trial
DataikuExternal reviews
External reviews are not included in the AWS star rating for the product.
The platform organizes workflows visually and efficiently
What is our primary use case?
What is most valuable?
One of the valuable features of Dataiku is the workflow capability. It allows us to organize a workflow efficiently. The platform has a visual interface, making it much easier for educated professionals to organize their work. This feature is useful because it simplifies tasks and eliminates the need for a data scientist. If you are knowledgeable about AI, you can directly write using primitive tools like Pantera flow, PyTorch, and Scikit-learn. However, Dataiku makes this process much easier.
What needs improvement?
One area for improvement is the need for more capabilities similar to those provided by NVIDIA for parallel machine learning training. We still encounter some integration issues.
For how long have I used the solution?
We have been using Dataiku for three years.
How are customer service and support?
Customer service is somewhat different because Dataiku partners with local industry experts who understand the business better and provide support. It can be challenging to determine the provider of better support, however, overall, the support is good.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We are only using Dataiku.
What was our ROI?
I believe the return on investment looks positive.
Which other solutions did I evaluate?
I considered another option that excels in parallel processing. However, it falls short in other areas. No product is perfect. If these two solutions worked together, it would be advantageous. Unfortunately, one has strengths in certain areas while the other excels in another.
What other advice do I have?
Why not? BHP sold the energy part to a company called Woodside. It has changed because they are now part of Woodside.
Overall, I rate the product eight out of ten.
Dataiku for Grad Student
Advanced features might have a steep learning curve for beginners
Pricing can be high for students and small projects
ML
very good
Data okie for Project Management
Streamlining Data Science Workflows
Helpful for project collaboration
Gives different aspects of modeling approaches and good for multiple teams' collaboration
What is our primary use case?
My current client has Dataiku. We do sentiment analysis and some small large language models right now. We use Dataiku as a Jupyter Notebook.
We use it a lot for marketing and analytics. The marketing and sales team uses Dataiku.
What is most valuable?
It's got good feature selection and creation of feature stores, and it also gives different aspects of modeling approaches. There are a lot of similarities with DataRobot.
So feature selection, different modeling, and financial metrics are good aspects.
What needs improvement?
The no-code/low-code aspect, where DataRobot doesn't need much coding at all.
Dataiku still needs some coding, and that could be a difference where business data scientists would go for DataRobot more than Dataiku because you still have to code and use either Python or R, or Scala. However, with DataRobot, you don't have to do that at all.
For how long have I used the solution?
I've used Dataiku for about four years.
How are customer service and support?
The company is based in France. But they're more and more in America as well.
So, the support was okay.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I used DataRobot. Dataiku has a different kind of structure to it. It's not financially heavy like DataRobot, which caters more to financial companies, like banks. Dataiku doesn't have that yet. I think they are also working on that area. But yeah, there are some key differences between the two products.
DataRobot has an additional feature with financial firms that it creates all these financial metrics when you run a time series analysis. Those things I have not seen in Dataiku.
If any financial company is choosing between DataRobot and Dataiku, they will definitely go for DataRobot because it creates all these financial metrics. It creates deltas, time series, time difference fields, and things like that. So, that is an added feature that DataRobot has.
What's my experience with pricing, setup cost, and licensing?
Pricing is pretty steep. Dataiku is also not that cheap. It depends on the client and how much they want to spend towards a tool.
What other advice do I have?
Overall, I would rate it an eight out of ten, except for some coding things that are there, which some people may not want to do, like certain business data scientists.
Dataiku is good for multiple teams' collaboration. If many teams are collaborating and sharing Jupyter notebooks, it's very useful. It has a good data processing structure and includes most of the models. I haven't checked the large language models in it yet, but it's a pretty good tool. It does well with analytics and has a sound structure on the back end.
Some coding aspects are necessary, but it generates SQL code, which is an added feature. A lot of data engineers like Dataiku because it generates SQL code on the right side.
Best tool to Analysis and Presenting data in different Dimention
Dataiku is the best tool to present data different views and representation data.
It can made available with other tool where they can connect the date to analyze it.
All-in-one data pipeline management tool
Its recently added features like ML Notebooks are very useful for running Machine Learning classification or regression tasks on a dataset. Dataiku has complete support for training and evaluating Machine Learning models.