Listing Thumbnail

    Starburst Galaxy

     Info
    Sold by: Starburst 
    Deployed on AWS
    Starburst Galaxy offers a full-featured data lake analytics platform that allows you to discover, manage, and consume the data in and around your data lake.

    Overview

    Starburst Galaxy is a fully managed data lake analytics platform designed for large and complex data sets in and around your cloud data lake. It is the easiest and fastest way for you to start running queries at interactive speeds across data sources using the business intelligence and analytics tools you already know.

    Starburst Galaxy takes just minutes to set up and takes care of the heavy lifting of designing, provisioning, maintaining, and securing your Trino infrastructure. In addition, Galaxy offers proprietary features such as fully managed connectors, global search, schema discovery, monitoring and metrics, and data sharing with data products that allow your data teams to focus on generating unique insights from your data - not managing and building analytics infrastructure.

    Highlights

    • Simplicity - Starburst Galaxy lets you discover, govern, and prepare your data from a single, fully-managed platform. Future-proof your architecture with a single point of access and governance to all your data, including RBAC and ABAC capabilities.
    • Scalability - Built on top of a query engine designed to run at internet-scale, Starburst Galaxy automatically scales your infrastructure to the needs of your workload in just a few clicks.
    • Optionality - Starburst Galaxy works with any data storage and table format, so you never have to worry about locking yourself into a proprietary data ecosystem.

    Details

    Delivery method

    Deployed on AWS

    Unlock automation with AI agent solutions

    Fast-track AI initiatives with agents, tools, and solutions from AWS Partners.
    AI Agents

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Starburst Galaxy

     Info
    Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    1-month contract (1)

     Info
    Dimension
    Description
    Cost/month
    Standard Tier
    Pay as you go
    $0.00

    Additional usage costs (1)

     Info

    The following dimensions are not included in the contract terms, which will be charged based on your usage.

    Dimension
    Cost/unit
    Usage fee
    $0.01

    Vendor refund policy

    No refunds.

    Custom pricing options

    Request a private offer to receive a custom quote.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Get help directly from Starburst in the Starburst Galaxy UI by using our chat app. You can use the app to get answers to frequently asked questions, chat with a support agent, and search our knowledge base. For free, on-demand training, visit Starburst Academy. Docs: https://docs.starburst.io/starburst-galaxy/index.html  Support Packages:

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    25
    In Databases & Analytics Platforms, Business Intelligence & Advanced Analytics, Data Analytics
    Top
    100
    In Log Analysis, Analytic Platforms
    Top
    10
    In Data Warehouses

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    2 reviews
    Insufficient data
    Insufficient data
    Insufficient data
    Insufficient data
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Query Engine Performance
    Fully managed data lake analytics platform built on a query engine designed for internet-scale performance
    Data Source Connectivity
    Supports multiple data storage systems and table formats with flexible, universal data access capabilities
    Infrastructure Management
    Automated infrastructure design, provisioning, maintenance, and security for complex data environments
    Access Control
    Role-based and attribute-based access control (RBAC and ABAC) for comprehensive data governance
    Data Discovery
    Advanced schema discovery, global search, and monitoring capabilities for complex data ecosystems
    Data Indexing
    Indexes Amazon S3 data without transformation, optimizing for data size and performance
    Analytics Integration
    Supports search, SQL, and machine learning workloads through open APIs with tools like Kibana, Elastic, Looker, and Tableau
    Cloud Storage Transformation
    Converts Amazon S3 into a hot analytical data lake with native indexing capabilities
    Data Access Architecture
    Enables direct data access without complex data pipelines, parsing, or schema changes
    Scalability Mechanism
    Provides infinite scale data analysis with no administrative overhead for re-indexing, sharding, or load balancing
    Data Lake Query Performance
    Provides sub-second query response times using SQL query service on data lake platforms
    Open Standards Support
    Utilizes community-driven standards like Apache Iceberg and Apache Arrow for processing engines
    Multi-Source Data Integration
    Enables joining data from data lakes and external databases without data movement
    Compute Engine Management
    Automatically handles compute engine lifecycle including provisioning, scaling, pausing, and decommissioning
    VPC-Based Data Processing
    Deploys compute engines within customer's Amazon Virtual Private Cloud for secure data processing

    Security credentials

     Info
    Validated by AWS Marketplace
    FedRAMP
    GDPR
    HIPAA
    ISO/IEC 27001
    PCI DSS
    SOC 2 Type 2
    No security profile
    No security profile
    -
    -
    -
    -

    Contract

     Info
    Standard contract
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    4.7
    7 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    71%
    29%
    0%
    0%
    0%
    7 AWS reviews
    |
    93 external reviews
    Star ratings include only reviews from verified AWS customers. External reviews can also include a star rating, but star ratings from external reviews are not averaged in with the AWS customer star ratings.
    Financial Services

    Effortless Data Federation and Granular Governance Made Easy

    Reviewed on Nov 15, 2025
    Review provided by G2
    What do you like best about the product?
    Effortlessly federating data is a standout feature, and the ability to apply data governance with a high level of granularity is impressive. I also appreciate how easy it is to use overall.
    What do you dislike about the product?
    One drawback is the absence of a built-in data processing and orchestration mechanism.
    What problems is the product solving and how is that benefiting you?
    The platform enables rapid data integration from a variety of sources, spanning different regions and cloud environments. It simplifies feature engineering for AI datasets, making the process more efficient. With a single platform, it delivers governed data to multiple stakeholders, ensuring consistency and control. Automated user access is streamlined through OKTA SAML setup, enhancing security and ease of use. Additionally, curated data can be provided directly to client storage, eliminating the need to store or transfer data locally.
    Dev Saran S.

    Effortless AI Agent Creation with Robust Features

    Reviewed on Nov 06, 2025
    Review provided by G2
    What do you like best about the product?
    While creating AI agents its very easy to use starburst it already has most support and nice number of features. If you want to let AI agents use your data there are built in plugins controlling governance and ease.
    What do you dislike about the product?
    Lack of community support and reliability. Most of the companies prefer to go to Azure and have their own AI solution because main thing is about reliabilty.
    What problems is the product solving and how is that benefiting you?
    Creating data pipelines and ease of connecting AI agents with organisational data.
    reviewer2750097

    Unified data access improves analytics and simplifies complex processes

    Reviewed on Aug 14, 2025
    Review from a verified AWS customer

    What is our primary use case?

    I use Starburst Galaxy  on AWS  as a federated query engine to access our S3-based Iceberg data lake, Snowflake , and Redshift without duplicating data. This enables secure, high-performance analytics and machine learning workloads with consistent governance across all data sources.

    How has it helped my organization?

    Starburst Galaxy  has improved our organization by unifying access to all major data sources, reducing the need for complex ETL processes. In addition to our original use case, it has proven fast and reliable for Iceberg table maintenance, and it has enabled ingestion of Kafka feeds into our AWS  S3  data lake, further increasing its value to our data platform.

    What is most valuable?

    The features I value most are federated querying across S3  Iceberg, Snowflake , and Redshift; native Iceberg table management tools that make maintenance operations simple and performant; and the ability to connect directly to Kafka for streaming ingestion. The federated query capability has also enabled me to build a Sigma  Computing dashboard that pulls data from Postgres, BigQuery , and Snowflake through a single Starburst Galaxy connection, greatly simplifying data access and integration.

    What needs improvement?

    I would like to see better alerting integrations for failures and errors in scheduled tasks and maintenance jobs. I also want support for more connectors such as Kinesis  and Firehose, support for more file types such as Avro and JSON, and object storage message queue integration for object storage integrations. A single view of query execution and optimization details, rather than needing to toggle between the Galaxy  and Trino UI, would be helpful. Additionally, enhanced control over account and environment variables that would be available in the Enterprise edition would be beneficial.

    For how long have I used the solution?

    I have used the solution for 1.5 years.

    Which solution did I use previously and why did I switch?

    I previously used several query engines, including Athena , EMR, Redshift, Snowflake, and BigQuery . Starburst Galaxy’s federated query capabilities allowed me to join data across clouds and platforms, reducing complexity.

    What's my experience with pricing, setup cost, and licensing?

    I recommend tracking usage metrics from the start, focusing on data scanned and query concurrency, so you can right-size spend. If workloads are steady, you should explore commitment-based pricing for better rates and factor in the operational savings from not having to manage and scale your own Trino or query infrastructure.

    Which other solutions did I evaluate?

    I reviewed several options including Databricks  and Dremio . I was an early adopter of Snowflake and still use it as well. Starburst Galaxy was a better fit for my technology stack and developers.

    What other advice do I have?

    I have found that Starburst Galaxy’s flexibility makes it worth experimenting beyond the initial deployment plan. Features I originally viewed as secondary, such as Iceberg maintenance and Kafka ingestion, have become everyday tools. Building a strong relationship with the Starburst team has also helped me optimize configurations and discover new capabilities faster.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)
    reviewer2750082

    Platform reduces management overhead by deploying multiple clusters and tracking costs efficiently while enhancing performance with low-latency responses

    Reviewed on Aug 14, 2025
    Review from a verified AWS customer

    What is our primary use case?

    Starburst Galaxy  serves as our primary SQL-based data processing engine, a strategic decision driven by its seamless integration with our AWS  cloud infrastructure and its ability to deliver high performance with low-latency responses.

    The platform provides a comprehensive suite of functionalities that significantly enhance the daily operations of our data engineers and data analysts.

    How has it helped my organization?

    Starburst Galaxy  has been instrumental in reducing the maintenance effort and management overhead of our Trino cluster, which is particularly valuable given our lean platform team responsible for Kovi's data infrastructure.

    The platform has enabled us to deploy multiple clusters for different purposes while providing clear cost tracking and utilization monitoring capabilities.

    What is most valuable?

    The most relevant functionalities today are cluster autoscaling for intensive load periods and automated metadata management through cleaning, compression, and orphaned file deletion in Iceberg.

    These capabilities significantly reduce reading costs, storage expenses, and query processing overhead.

    What needs improvement?

    I maintain weekly conversations with Starburst's development and support teams, which provides me with visibility into the product roadmap and evolution.

    Currently, my primary need is the impersonation functionality for BI solutions within Starburst clusters, which would enable enhanced access control and data governance capabilities.

    For how long have I used the solution?

    I have used the solution for almost 2 years.

    Which solution did I use previously and why did I switch?

    Previously, I utilized the AWS  stack with Redshift and Athena .

    I chose to migrate to Starburst Galaxy due to their expertise with Trino, superior aggregate cost structure compared to my previous solutions, and the rapid product evolution with new functionalities, problem corrections, and performance improvements.

    What's my experience with pricing, setup cost, and licensing?

    Since Starburst Galaxy's pricing model is simple to understand and easy to predict, there are no major secrets.

    Everything is transparent and accessible through the product console.

    The only point of attention is the S3  and transfer costs that should also be included when calculating the total cost.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)
    reviewer2750055

    Guaranteed performance transforms complex queries and empowers focus on feature delivery

    Reviewed on Aug 14, 2025
    Review from a verified AWS customer

    What is our primary use case?

    I use the solution for processing large simulation datasets into aggregated datasets that can either be used for real-time data analysis or stored for later analysis.

    How has it helped my organization?

    Starburst has provided us with virtually guaranteed performance on complex queries across datasets that are in the tens of gigabytes which complete in seconds. This allows me to concentrate on the features I want to deliver to our end users rather than diagnosing performance issues.

    What is most valuable?

    The most valuable features include taking care of the minutiae of Trino management so that it is well-optimized for our use case out of the box. Additionally, the ability to write to Apache Iceberg tables enables complex queries to be written to S3 , avoiding the need for them to be re-run repeatedly.

    I also find attribute-based access control valuable, as it allows end users to access only their data in a multi-tenant environment.

    What needs improvement?

    Multi-tenancy could be improved. In order to have multiple environments for SSO , we maintain multiple tenants that are connected to different AWS  accounts via the Marketplace. On the AWS  side this setup works because all accounts belong to the same organization. However, on the Starburst side these tenants are disconnected from each other, and it would be great if they could be connected and managed centrally.

    Which solution did I use previously and why did I switch?

    I previously used Amazon Athena . I switched because the performance offered by Starburst was significantly better than that provided by Athena . Additionally, Starburst allowed for integrations with BI tools, which was difficult to achieve with the necessary level of security in Athena .

    What's my experience with pricing, setup cost, and licensing?

    I recommend experimenting with different cluster sizes to determine what works best for your particular use case.

    Which other solutions did I evaluate?

    I considered Amazon Athena  and Firebolt  as alternative solutions.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)
    View all reviews