Listing Thumbnail

    Dremio Enterprise

     Info
    Sold by: Dremio 
    Deployed on AWS
    Dremio Data Lake Engine

    Overview

    Dremio delivers lightning-fast queries and a self-service semantic layer directly on S3, so that you don't have to move the data into data warehouses, cubes or extracts.

    Highlights

    • Fast queries on S3 (4-100x faster & 10x more efficient than other SQL engines)
    • Join between S3 and other AWS/on-premise databases
    • Semantic layer to empower BI (Tableau, Power BI, etc.) users and govern data access

    Details

    Sold by

    Delivery method

    Delivery option
    Dremio Deployment

    Latest version

    Operating system
    AmazonLinux 2.0.20250915.0

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Dremio Enterprise

     Info
    Pricing is based on actual usage, with charges varying according to how much you consume. Subscriptions have no end date and may be canceled any time. Alternatively, you can pay upfront for a contract, which typically covers your anticipated usage for the contract duration. Any usage beyond contract will incur additional usage-based costs.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    Usage costs (7)

     Info
    Dimension
    Cost/hour
    m5d.2xlarge
    $0.78
    m5d.xlarge
    $0.39
    m5d.8xlarge
    $3.12
    m5d.4xlarge
    $1.56
    i3.4xlarge
    $2.15
    r5d.4xlarge
    $1.99
    c5d.18xlarge
    $5.96

    Vendor refund policy

    No refunds

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Dremio Deployment

    Launches a coordinator node of the product, with the ability to dynamically provision additional engines to execute queries

    CloudFormation Template (CFT)

    AWS CloudFormation templates are JSON or YAML-formatted text files that simplify provisioning and management on AWS. The templates describe the service or application architecture you want to deploy, and AWS CloudFormation uses those templates to provision and configure the required services (such as Amazon EC2 instances or Amazon RDS DB instances). The deployed application and associated resources are called a "stack."

    Additional details

    Usage instructions

    Quickstart Instructions:

    Support

    Vendor support

    Community Support

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Big Data, Business Intelligence
    Top
    10
    In Databases & Analytics Platforms, ML Solutions, Data Analytics
    Top
    10
    In Streaming solutions, ELT/ETL

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    13 reviews
    Insufficient data
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Data Lake Query Performance
    Enables high-speed SQL queries directly on S3 with performance improvements of 4-100x faster compared to traditional SQL engines
    Multi-Source Data Integration
    Supports cross-database joins between S3 and other AWS or on-premise database systems
    Semantic Layer Management
    Provides a self-service semantic layer for data governance and access control across business intelligence platforms
    Direct Query Optimization
    Eliminates data movement requirements by executing queries directly on source data storage without requiring data extraction or warehouse migration
    Database Virtualization
    Enables virtual data access and querying across heterogeneous data sources without physical data consolidation
    Data Platform Architecture
    Unified platform integrating data engineering, analytics, business intelligence, data science, and machine learning on a single architecture
    Open Source Foundation
    Built on open source data projects with support for open standards and data formats
    Lakehouse Infrastructure
    Provides a common data management approach using a lakehouse architecture running on Amazon S3
    Data Intelligence Engine
    Advanced engine capable of interpreting organizational data context and enabling broad data access across teams
    Collaborative Workflow
    Native collaboration capabilities enabling cross-functional data and AI workflow integration
    Data Ingestion
    Supports continuous ingestion of streaming and batch data from multiple sources including PostgreSQL, SQLServer, and Kinesis
    Data Transformation
    Enables data transformations using SQL with automated task orchestration, scaling, and data quality validation
    Table Management
    Provides Iceberg Live Tables with native schema evolution and file system optimization capabilities
    Performance Optimization
    Implements adaptive Iceberg data file optimizer that profiles data files and write patterns to improve query performance and reduce storage costs
    Operational Monitoring
    Offers built-in task monitoring and data observability features to track volume changes, data value drift, and schema evolution

    Contract

     Info
    Standard contract
    No
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    0 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    0%
    0%
    0%
    0%
    0%
    0 AWS reviews
    |
    76 external reviews
    Star ratings include only reviews from verified AWS customers. External reviews can also include a star rating, but star ratings from external reviews are not averaged in with the AWS customer star ratings.
    Corrr Moray

    Has simplified complex data integration workflows and supported consistent reporting across multiple sources

    Reviewed on Oct 29, 2025
    Review provided by PeerSpot

    What is our primary use case?

    As a BI developer, my main use case for Dremio  involves registering data zones and also doing inquiries, views, and making it persistent to use on BI tools, like Tableau reports. In a project, we create data zones, typically one that we have in our company to receive files. After doing the ingestion inside Dremio , what we do is set up permissions, and after setting up the permissions, the refresh session allows us to go over and start building the views and all those things to be possible to connect via JDBC driver and consume on Dremio dashboard.

    What is most valuable?

    The first feature that stands out for me in Dremio is the federated type of query, which allows the possibility to use multiple endpoints without worrying about writing custom SQL that runs only for SQL Server  or for Postgres and Redshift. We build a SQL on Dremio standard, and then Dremio goes over and rewrites the whole lineage to make it possible to use multiple sources of data.

    The feature impacts our daily work by reducing complexity because we don't need to worry about where the data comes from. We are slightly migrating to Snowflake , so Dremio is currently being replaced because Snowflake  is a more robust platform, but we are kind of happy with the work Dremio does.

    I think the reduction of complexity is a positive impact that Dremio has had on my organization. The main thing is that it is a place where you can write down simple SQLs and see the lineage, the way it integrates together; this is the best part of it.

    What needs improvement?

    We also have a close relationship with the team that does the Dremio maintenance for the database, like upgrading the versions and they know about some specific problems we had in the past, such as a memory leak. We had a memory leak on some versions, which sometimes stopped the service. Since we are using Dremio installed like a server, not a SaaS solution, many times we need to stop and restart the service to clear all the cache and all that, and this is the thing I should add.

    I see that many times the new versions of Dremio have not fixed old bugs, and in some new versions, old problems that were previously fixed come back again, so I think the upgrade part could use improvement.

    I remember using some features in the past, like pivot tables, which proved to be really difficult, but I know this is a fault also for other vendors. Pivoting, transposing, and unpivoting are often not so good. CTEs also many times prove to be not so good, so I think these two main items could be improved significantly if they standardize them.

    For how long have I used the solution?

    I have been using Dremio for approximately three years, since January of 2022.

    What do I think about the stability of the solution?

    We had a memory leak on some versions, which sometimes stopped the service. Many times, we need to stop and restart the service to clear all the cache and all that. I rate Dremio a nine in terms of stability. I think it is stable, but we need to restart it many times, and we need to monitor it regularly.

    What do I think about the scalability of the solution?

    Dremio's scalability can handle growing data and user demands easily.

    How are customer service and support?

    We have had to reach out for customer support many times, and they respond, so they are pretty supportive about some long-term issues. Many times it's with vendors where there is a missing feature or ongoing problem that they say will be fixed in the next release, but this happens not only with Dremio but with almost every vendor we have.

    How would you rate customer service and support?

    Neutral

    Which solution did I use previously and why did I switch?

    We haven't previously used a different solution before Dremio. Snowflake is the new solution we are currently using to replace Dremio.

    What was our ROI?

    I cannot share the exact metric itself, but Dremio surely saves time, reduces costs, and all those things because we don't have to worry so much about the infrastructure to make the different tools communicate. Dremio was a solution that made it possible for us to have different data sources with only one registration on our data lake.

    What's my experience with pricing, setup cost, and licensing?

    I don't have information about pricing, setup cost, and licensing for Dremio, so I am not entitled to discuss it.

    Which other solutions did I evaluate?

    I probably don't remember all the options we evaluated, but we talked a lot about Databricks  and Redshift, and Dremio was the best choice. Also, we don't use Tableau Prep because we use Dremio, which is an important thing.

    What other advice do I have?

    My advice to others looking into using Dremio is that it is a great tool because it keeps all your efforts together. It's a good thing to have if you want to have a unified catalog or metadata or something like this, so this is certainly a good tool to consider. I would rate this review an eight out of ten overall.

    Which deployment model are you using for this solution?

    On-premises
    reviewer2769915

    Has created a unified workspace for data teams and reduced storage costs through centralized access

    Reviewed on Oct 21, 2025
    Review provided by PeerSpot

    What is our primary use case?

    I have been using Dremio  on and off as a data warehouse for the past three years.

    My main use case for Dremio  is that we use it as a logical data warehouse where we use Dremio with VDSs as an alternative to AWS Glue  or Apache Hive . As we are working with our ETL at the end of all of it, after the data types and everything have been cast, we make that available on Dremio as VDS and then we move on to our further data warehousing schemes within Dremio.

    We use Dremio enterprise-wide now, and the key use case has been reducing our costs when it comes to data storage.

    Our main use case for Dremio is as a data warehouse, and the challenge that it helped us solve is that physical data warehouses such as Redshift have storage and hardware upscaling conflicts. Dremio helps us decouple those and lets us catalog more. We can manage everything under one system.

    What is most valuable?

    The best features Dremio offers include having a single system where we can manage all of our data cataloging and visualization or virtualization.

    The interface is a plus over the traditional warehousing solutions, which makes it easier to work with Dremio compared to other solutions I've used.

    Having everything under one system and an easier-to-work-with interface, along with having API integrations, adds significant value to working with Dremio.

    Dremio has positively impacted my organization by helping us create a single source of truth, a singular data warehouse where we can have access to all of the data sets. The fact that Dremio has a clear role-based access management system helps us significantly, as we can have roles segregating all of the data, while users with the appropriate roles can access everything.

    What needs improvement?

    Dremio could be improved by making it easier for data cataloging, especially when working with open table formats, as you have to choose a data format and then go into it. With the Dremio software version that we're using, all that requires a learning curve, and only when you go to the premium cloud version do you get Dremio Arctic. It should be easier to get Arctic or an open-source version of Arctic onto the software version so that development teams can experiment with it.

    For how long have I used the solution?

    I have been working in my current field for five years.

    What do I think about the stability of the solution?

    In my experience, Dremio is reasonably stable.

    What do I think about the scalability of the solution?

    I haven't pushed Dremio's scalability to its limit, so I cannot provide detailed information about that.

    How are customer service and support?

    I haven't had the need to interact with Dremio's support team yet.

    How would you rate customer service and support?

    Positive

    Which solution did I use previously and why did I switch?

    Before Dremio, we used Hive  and AWS Glue , and we wanted to shift to an open-source version that is newer than Hive and provides flexibility in moving between cloud and on-premises.

    How was the initial setup?

    We evaluated other options before choosing Dremio, including Presto  and Trino, but we did not find reasonable advantages to using them.

    What about the implementation team?

    I don't have the appropriate information regarding pricing, setup cost, and licensing as that is managed by a different team.

    What was our ROI?

    I cannot share return on investment information from using Dremio.

    Which other solutions did I evaluate?

    We evaluated other options before choosing Dremio, including Presto  and Trino, but we did not find reasonable advantages to using them.

    What other advice do I have?

    I would advise others looking into using Dremio to study the tool beforehand. Dremio has several different offerings, and the best way to get into it is to use the open-source on-premises version to experiment with it. To extract the complete power of the platform, organizations should educate themselves with the complete information and compare it with other solutions since a single solution cannot fit everywhere. Educating the team before adopting a technology is better than just adopting a suggested package. I rate Dremio 8 out of 10.

    Which deployment model are you using for this solution?

    On-premises

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Other
    Abhishek C.

    Dremio make daily work easy, but needs little polish

    Reviewed on Sep 10, 2025
    Review provided by G2
    What do you like best about the product?
    Its just how easy it is to use. When we first onboarded, I was surprised at how fast we could connect to, like, multiple data sources. Didn't have a huge setup headache, which was awesome.The implementation wasn't that bad, especially comparing to some other BI tools we used. I mean, it wasn't 100% smooth, had a few little hiccups, but overall we got it running way easier than I expected.
    It's got pretty rich feature set—the reflections and acceleration stuff is cool for performance, even if it feels a bit overwhelming at the start. Integrating it with our existing stuff, like our AWS S3 buckets and Snowflake, was pretty straightforward. No major drama there,Oh, and the SQL editor is way better than I thought it'd be..Overall, it just feels like a tool built for speed and flexibility. we use sometimes multiple times a day when I have to do ad-hoc analysis or explore big datasets Yeah, there's definitely a learning curve, no lie. But once you get past that, you realize how powerful it is.
    What do you dislike about the product?
    Their customer support is decent. Sometimes they take a bit to get back to you, but most of the time I've gotten a proper solution that actually fixes the problem. The performance is weird sometimes, like one day a query runs blazing fast, and then the next day the exact same query is just... slower. For no obvious reason, The UI also feels a little clunky at times, not gonna lie. Especially when you're trying to handle a really large dataset, it'll just freeze up for a second , laggy . Makes the whole experience feel less smooth than it should.And the documentation... yeah, it could definitely be better. A lot of times I've had to just google around on forums or actually reach out to support just to find some small configuration detail that should really be in the main docs. Wastes a bunch of time.
    Also it's not exactly cheap. When you start to really scale it up, especially running on our own cloud infra, the bills start to add up. I feel like for smaller teams, the admin side of things can feel too complex for what you need. Just setting up user permissions and everything is a whole thing.
    What problems is the product solving and how is that benefiting you?
    So Dremio's basically solved our whole issue with data being scattered everywhere. Before this, we were always having to copy and move data into some central system just to be able to run a query on it. Super time-consuming .We can now just query right on top of where the data lives. Like, directly on S3, or Snowflake, even some of our old legacy databases. We don't need to build these massive ETL pipelines just for a simple question, which is a game changer.It's also helped a ton with speed. Those reflections they have? They make a huge difference on heavy queries. Our reporting team used to have to wait like, hours for their results to come back, and now it's way faster. Saves a ton of times for our day-to-day analysis and helps us make decisions way quicker.
    Luca P.

    Unified lakehouse platform for Analytics and Al

    Reviewed on Jul 06, 2025
    Review provided by G2
    What do you like best about the product?
    I love the platform’s ability to connect to a wide array of data sources, including relational databases (PostgreSQL, MySQL, Oracle, MS SQL), NoSQL systems (MongoDB, Elasticsearch), and cloud or file-based storage like S3 and HDFS, without requiring complex ETL pipelines.

    This approach simplifies data integration and reduces engineering overhead.

    The SQL query engine is highly performant, delivering sub-second response times even on large datasets, and supports live data visualization and dynamic previews during query preparation.

    Data reflections feature acts as an intelligent caching layer, optimizing query performance and enabling low-latency dashboard refreshes for BI workloads.

    The platform’s virtual datasets allow for complex query logic to be encapsulated and reused, supporting data-as-code principles such as Git-like version control and experimentation.


    Cloud-native architecture offers elastic compute scaling and is available as a managed service on AWS and Azure, making it suitable for both on-premises and cloud deployments. It supports role-based access control and multitenancy, which is essential for enterprise environments with strong data governance requirements.
    What do you dislike about the product?
    The learning curve can be significant, especially when configuring advanced features like data reflections, multitenancy, and integrating with complex enterprise authentication systems.

    While the UI is functional, some administrative and monitoring functions feel less intuitive compared to other modern analytics platforms.

    I have also found that fine-grained access controls and tenant isolation require careful configuration to avoid inadvertent data exposure in multi-tenant scenarios.
    What problems is the product solving and how is that benefiting you?
    Dremio has eliminated the need for traditional ETL pipelines in my analytics workflows, allowing direct querying and data exploration across disparate sources without data movement.

    This has resulted in faster dataset creation cycles and reduced bottlenecks between data engineering and analytics teams.

    The platform’s autonomous performance optimization and use of data reflections have significantly improved query speeds, enabling real-time analytics and interactive BI dashboarding even on large, complex datasets.

    By adopting Dremio, I achieved unified access to both structured and semi-structured data in a single platform, which streamlined data governance and cataloging.

    The self-service model empowered business analysts to experiment and iterate on data products without constant engineering intervention, accelerating time-to-insight for AI and analytics projects.

    The platform’s open, standards-based approach has also made it easier to integrate with existing tools and future-proof my data infrastructure against vendor lock-in concerns.


    âś… My overall insight: Dremio has enabled a more agile, scalable, and cost-effective analytics environment, supporting both operational BI and advanced data science initiatives in a unified, governed, and performant manner.
    Information Technology and Services

    Easy Direct Access

    Reviewed on Jul 03, 2025
    Review provided by G2
    What do you like best about the product?
    I like the fact that you can query directly s3 and hdfs and it also support power bi as integration
    What do you dislike about the product?
    Its not etl friendly so I have to link it with apache ariflow
    What problems is the product solving and how is that benefiting you?
    The easy integration so i save time
    View all reviews