Overview

Product video
This trial version of Dataiku allows you to deploy into your AWS environment for prototyping, testing and evaluating the full extent of Dataiku capabilities.
Dataiku is The Universal AI Platform™, empowering teams to deliver AI and analytics projects faster - all within a secure, collaborative, and governed environment.
- Data Scientists use familiar tools to focus on high-impact work, with automation and streamlined collaboration.
- Business Analysts get faster insights with intuitive data prep and accessible machine learning.
- Data Teams scale projects with built-in governance and transparency.
Built for AWS:
- Connect securely to all data sources, including Amazon S3, Amazon Redshift, and Amazon RDS.
- Scale data and ML processing with Dataiku elastic compute powered by Amazon EKS for Python, R, Spark, and more.
- Accelerate AI development with pre-built workflows integrating AWS AI services, such as Amazon SageMaker and Amazon Comprehend.
- Distributed creation of advanced analytics through its visual platform, fostering greater collaboration between technical and non-technical teams.
- Leverage the Dataiku LLM Mesh to connect to Amazon Bedrock for Chat, RAG, and Agentic workflows.
AI at Scale, Supported Every Step
With expert services and a robust learning platform, Dataiku helps organizations of any size adopt AI at scale - quickly and confidently.
With Dataiku visual, end-to-end collaborative AI platform: - Data Scientists spend more time on high-impact AI projects, leveraging the languages and tools they already know, automating repetitive tasks and efficiently collaborating with stakeholders. - Business Analysts generate deeper intelligence, faster, thanks to comprehensive data access, smart data preparation and accessible machine learning. - Data Teams can deliver more projects and more value from analytics and AI all with built in transparency and governance. Dataiku and AWS innovate together to enable organizations of any size to deliver enterprise AI in a highly scalable environment. - Dataiku natively integrates with AWS Services and products to enable organizations of any size to deliver enterprise AI at scale. - Dataiku enables users to ingest and manipulate a wide variety of data including Athena, Redshift and more, from the AWS ecosystem and beyond. - Dataiku empowers analytic teams to extend data science collaboration through integrations with Amazon Sagemaker Get started today with Dataiku on AWS!
Highlights
- Take full advantage of your investment in the AWS platform with Dataiku's unique push down to Amazon's storage and compute.
- Empower more users to clean and enrich data, build advanced data pipelines, and create machine learning models in a visual interface.
- Accelerate deployment on AWS, leveraging Sagemaker and Bedrock, with a fully managed service (SaaS) hosted and managed by Dataiku.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Vendor refund policy
Refunds are not provided, but one can cancel at any time.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Please read at https://doc.dataiku.com/dss/latest/release_notes/index.html
Additional details
Usage instructions
Browse to http(s)://INSTANCE_PUBLIC_ADDRESS/
You might need to wait few minutes that the instance starts and initializes.
You will have a first authentication to prove that you're the owner of the instance (with a basic access authentication):
- login = instance id
- password = empty
Then, you will have access to Dataiku DSS visual interface. Note that only Chrome and Firefox are supported.
Administrative (command-line) access can be obtained through ssh centos@INSTANCE_PUBLIC_ADDRESS. A standard installation of Dataiku DSS runs under linux user account "dataiku".
For additional information, or any issue, please see our resources and Q & A pages.
Resources
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products
Customer reviews
Dataiku Streamlines End-to-End AI at Scale with Intuitive, Collaborative Workflows
Centralized, Organized Data Platform with Powerful AutoML and Integrations
The program supports both coders and non coders, allowing them to use data in their different levels
Dataiku has a successful data lifecycle, something that collects, ingest, prepare and even analyze data
The program consists of an inbuilt Auto ML tools that speed u most of the operations
Dataiku has extensible APIs and plugins, all supporting success integrations
The app demands extensive computer resources, something that amplifies the infrastructure costs
Most of complex data workflows are significantly supported by this app, ensuring that no manual code needed to conduct a specific task
The presence of machine learning and AI support s the effectiveness of data processing and analysis
The app accommodates both technical and non technical users due to it’s effectiveness and simplicity
End-to-End Data Science Platform That Makes Collaboration Easy
Low-code projects have empowered non-technical teams and now need better integration and visuals
What is our primary use case?
We used Dataiku for a demand forecasting project where the objective is to forecast the demand for each product for the next three months.
What is most valuable?
Dataiku has positively impacted my organization by allowing non-technical users to adapt a data science project and to maintain a part of a data science project.
What needs improvement?
To improve Dataiku, the company could enhance the capabilities related to integration and visualization.
For how long have I used the solution?
What do I think about the stability of the solution?
What do I think about the scalability of the solution?
How are customer service and support?
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
Before choosing Dataiku, I evaluated KNIME.
What was our ROI?
I have not seen a return on investment with Dataiku in terms of time saved, money saved, or fewer employees needed.
What's my experience with pricing, setup cost, and licensing?
What other advice do I have?
Dataiku is deployed in my organization on a public cloud on Amazon Web Services.
Amazon Web Services is our cloud provider.
I am not the person involved in the process of determining whether we purchased Dataiku through the AWS Marketplace.
My review rating for Dataiku is 7.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Flow-based demand forecasting has improved collaboration but still needs better visualization options
What is our primary use case?
My main use case for Dataiku is for data science and AI projects. I use Dataiku for a demand forecasting use case where the objective is to predict the demand for each product for the next four months. Demand forecasting is the primary focus where I use Dataiku.
What is most valuable?
The best features Dataiku offers that help me with my demand forecasting and data science projects include having a complete overview of the flow directly from the flowchart, allowing me to observe all the steps in a single overview, and the ability to use a no-code, low-code node.
Having that flow overview and the no-code, low-code nodes makes my work easier by allowing me to use a simple function without coding directly, meaning I can avoid using Python. In 80% of the project, we are using Python, but for very simple steps, we also use a low-code, no-code node, which can be simpler for users that are not technical and may want to do some preprocessing steps.
Dataiku has positively impacted my organization, but it is a tool that is very similar to others and it helps for what I mentioned before and not for other areas. The ability to use low-code or no-code nodes is more a convenience in that case, mainly for a non-technical user. We deliver this kind of solution for a client where the user is not so technical, and for this reason, it is better to have this kind of flow and tool.
What needs improvement?
To improve Dataiku, it could enhance its visualization features, as it is not possible in Dataiku to create direct visualizations or to integrate a web app directly or in a simpler way as it is possible for a preprocessing step. Visualization and integration are the main areas I would like to see enhanced.
In my experience, Dataiku can be more stable.
For how long have I used the solution?
I have been using Dataiku for two years.
What do I think about the stability of the solution?
In my experience, Dataiku can be more stable.
What do I think about the scalability of the solution?
Dataiku's scalability is not one of the best solutions to scale.
Which solution did I use previously and why did I switch?
We used a lot of other solutions before Dataiku and we switched only so that non-technical users can improve and maintain this kind of flow.
What other advice do I have?
My advice to others looking into using Dataiku is to use it for a simple flow in data science and to teach how to make a data science project or flow for non-technical users. I would rate this product a 7 out of 10.