Overview

Product video
This trial version of Dataiku allows you to deploy into your AWS environment for prototyping, testing and evaluating the full extent of Dataiku capabilities.
Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI.
- Data Scientists use familiar tools to focus on high-impact work, with automation and streamlined collaboration.
- Business Analysts get faster insights with intuitive data prep and accessible machine learning.
- Data Teams scale projects with built-in governance and transparency.
Built for AWS:
- Connect securely to all data sources, including Amazon S3, Amazon Redshift, and Amazon RDS.
- Scale data and ML processing with Dataiku elastic compute powered by Amazon EKS for Python, R, Spark, and more.
- Accelerate AI development with pre-built workflows integrating AWS AI services, such as Amazon SageMaker and Amazon Comprehend.
- Distributed creation of advanced analytics through its visual platform, fostering greater collaboration between technical and non-technical teams.
- Leverage the Dataiku LLM Mesh to connect to Amazon Bedrock for Chat, RAG, and Agentic workflows.
AI at Scale, Supported Every Step
With expert services and a robust learning platform, Dataiku helps organizations of any size adopt AI at scale - quickly and confidently.
With Dataiku visual, end-to-end collaborative AI platform: - Data Scientists spend more time on high-impact AI projects, leveraging the languages and tools they already know, automating repetitive tasks and efficiently collaborating with stakeholders. - Business Analysts generate deeper intelligence, faster, thanks to comprehensive data access, smart data preparation and accessible machine learning. - Data Teams can deliver more projects and more value from analytics and AI all with built in transparency and governance. Dataiku and AWS innovate together to enable organizations of any size to deliver enterprise AI in a highly scalable environment. - Dataiku natively integrates with AWS Services and products to enable organizations of any size to deliver enterprise AI at scale. - Dataiku enables users to ingest and manipulate a wide variety of data including Athena, Redshift and more, from the AWS ecosystem and beyond. - Dataiku empowers analytic teams to extend data science collaboration through integrations with Amazon Sagemaker Get started today with Dataiku on AWS!
Highlights
- Take full advantage of your investment in the AWS platform with Dataiku's unique push down to Amazon's storage and compute.
- Empower more users to clean and enrich data, build advanced data pipelines, and create machine learning models in a visual interface.
- Accelerate deployment on AWS, leveraging Sagemaker and Bedrock, with a fully managed service (SaaS) hosted and managed by Dataiku.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Buyer guide

Financing for AWS Marketplace purchases
Pricing
Vendor refund policy
Refunds are not provided, but one can cancel at any time.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
64-bit (x86) Amazon Machine Image (AMI)
Amazon Machine Image (AMI)
An AMI is a virtual image that provides the information required to launch an instance. Amazon EC2 (Elastic Compute Cloud) instances are virtual servers on which you can run your applications and workloads, offering varying combinations of CPU, memory, storage, and networking resources. You can launch as many instances from as many different AMIs as you need.
Version release notes
Please read at https://doc.dataiku.com/dss/latest/release_notes/index.html
Additional details
Usage instructions
Browse to http(s)://INSTANCE_PUBLIC_ADDRESS/
You might need to wait few minutes that the instance starts and initializes.
You will have a first authentication to prove that you're the owner of the instance (with a basic access authentication):
- login = instance id
- password = empty
Then, you will have access to Dataiku DSS visual interface. Note that only Chrome and Firefox are supported.
Administrative (command-line) access can be obtained through ssh centos@INSTANCE_PUBLIC_ADDRESS. A standard installation of Dataiku DSS runs under linux user account "dataiku".
For additional information, or any issue, please see our resources and Q & A pages.
Resources
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.
Similar products
Customer reviews
Dataiku Speeds Up Repeatable Marketing Data Workflows
I use the visual workflows regularly because they make the end-to-end process far more transparent. Rather than working only in spreadsheets or in isolated scripts, I can see every step of the data preparation flow and explain it clearly to clients or internal teams. This is especially helpful when I need to show exactly how a lead scoring model, a customer segmentation analysis, or a campaign performance dataset was created.
Another thing I really value is the balance between no-code and code options. For everyday consulting work, it’s practical: I can move quickly with visual recipes for common tasks, and then go deeper with SQL or Python when the analysis needs more flexibility. That saves time and makes it easier to adapt the workflow to the complexity of each project.
Dataiku also improves collaboration with non-technical stakeholders. When I’m working with marketing managers or sales teams, they don’t always need the technical details, but they do need to trust the output. Having a clear, documented workflow makes conversations smoother and helps translate analysis into concrete marketing decisions.
Overall, the biggest benefit for me is that Dataiku turns complex data preparation and analysis into a repeatable consulting workflow. It helps me spend less time on manual data cleaning and more time interpreting results, spotting opportunities, and recommending actions to improve campaign performance, customer targeting, and ROI.
The pricing can also be a limitation, especially for smaller clients or companies that are still at an early stage in their data maturity. Dataiku can deliver strong value when it is used regularly across multiple projects, teams and data sources, but for a smaller marketing team that only needs occasional analysis, it may feel like a significant investment. The ROI is much clearer when the company is ready to operationalize data workflows, not just run one-off reports.
In terms of onboarding, I think the platform requires a structured introduction to get the most out of it. There are many features, which is a strength, but it can also be overwhelming at first. For some clients, I need to spend extra time explaining not only how the tool works, but also how to think in terms of reusable data pipelines instead of simple spreadsheet-based analysis.
Regarding AI and machine learning, the capabilities are powerful, but they still require good data quality and a clear business objective. Dataiku can help a lot with automation and predictive models, but it does not replace the strategic work of defining the right question, selecting the right variables and interpreting the results correctly. In my daily work, I still need to guide clients carefully so they do not treat AI outputs as automatic answers without proper validation.
So overall, my main dislike is not about a single missing feature, but about the complexity that comes with such a complete platform. It is very useful, but it needs the right level of adoption, training and business commitment to fully justify the investment.
With Dataiku, I can create more repeatable workflows for tasks like campaign performance analysis, customer segmentation, lead scoring, churn analysis and ROI reporting. This benefits me because I do not have to start from zero every time a client sends updated data. Once the workflow is built, I can refresh the inputs, review the outputs and focus more on insights and recommendations.
It also helps me reduce manual errors. When working only with spreadsheets, it is easy to lose track of formulas, versions or manual changes. Dataiku makes the process more structured and transparent, so I can better control how the data is transformed and explain the logic behind the final results to clients.
Another important benefit is that it helps me move from simple reporting to more advanced decision support. Instead of only showing what happened in a campaign, I can help clients understand patterns, identify high-value segments, predict possible outcomes and prioritize marketing actions more effectively.
Overall, Dataiku saves time, improves the reliability of my analysis and helps me deliver more strategic value. It allows me to spend less energy on repetitive data preparation and more time advising clients on what to do next.
Powerful Collaboration, but Can Feel Bulky and Resource-Heavy
It tends to save/cache data in many places, which can make workflows harder to manage and slower to navigate.
For hardcore technical teams that prefer lightweight, code-first tooling, it may feel slower and more restrictive than working directly with native engineering stacks.
It benefits me by making collaboration between business and technical teams much smoother and reducing the time needed to move projects from experimentation to production.