Overview
Starburst Galaxy is a fully managed data lake analytics platform designed for large and complex data sets in and around your cloud data lake. It is the easiest and fastest way for you to start running queries at interactive speeds across data sources using the business intelligence and analytics tools you already know.
Starburst Galaxy takes just minutes to set up and takes care of the heavy lifting of designing, provisioning, maintaining, and securing your Trino infrastructure. In addition, Galaxy offers proprietary features such as fully managed connectors, global search, schema discovery, monitoring and metrics, and data sharing with data products that allow your data teams to focus on generating unique insights from your data - not managing and building analytics infrastructure.
Highlights
- Simplicity - Starburst Galaxy lets you discover, govern, and prepare your data from a single, fully-managed platform. Future-proof your architecture with a single point of access and governance to all your data, including RBAC and ABAC capabilities.
- Scalability - Built on top of a query engine designed to run at internet-scale, Starburst Galaxy automatically scales your infrastructure to the needs of your workload in just a few clicks.
- Optionality - Starburst Galaxy works with any data storage and table format, so you never have to worry about locking yourself into a proprietary data ecosystem.
Details
Unlock automation with AI agent solutions

Features and programs
Buyer guide

Financing for AWS Marketplace purchases
Pricing
Vendor refund policy
No refunds.
Custom pricing options
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
Get help directly from Starburst in the Starburst Galaxy UI by using our chat app. You can use the app to get answers to frequently asked questions, chat with a support agent, and search our knowledge base. For free, on-demand training, visit Starburst Academy. Docs: https://docs.starburst.io/starburst-galaxy/index.html Support Packages:
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.


Standard contract
Customer reviews
Unified data access improves analytics and simplifies complex processes
What is our primary use case?
I use Starburst Galaxy on AWS as a federated query engine to access our S3-based Iceberg data lake, Snowflake , and Redshift without duplicating data. This enables secure, high-performance analytics and machine learning workloads with consistent governance across all data sources.
How has it helped my organization?
Starburst Galaxy has improved our organization by unifying access to all major data sources, reducing the need for complex ETL processes. In addition to our original use case, it has proven fast and reliable for Iceberg table maintenance, and it has enabled ingestion of Kafka feeds into our AWS S3 data lake, further increasing its value to our data platform.
What is most valuable?
The features I value most are federated querying across S3 Iceberg, Snowflake , and Redshift; native Iceberg table management tools that make maintenance operations simple and performant; and the ability to connect directly to Kafka for streaming ingestion. The federated query capability has also enabled me to build a Sigma Computing dashboard that pulls data from Postgres, BigQuery , and Snowflake through a single Starburst Galaxy connection, greatly simplifying data access and integration.
What needs improvement?
I would like to see better alerting integrations for failures and errors in scheduled tasks and maintenance jobs. I also want support for more connectors such as Kinesis and Firehose, support for more file types such as Avro and JSON, and object storage message queue integration for object storage integrations. A single view of query execution and optimization details, rather than needing to toggle between the Galaxy and Trino UI, would be helpful. Additionally, enhanced control over account and environment variables that would be available in the Enterprise edition would be beneficial.
For how long have I used the solution?
Which solution did I use previously and why did I switch?
I previously used several query engines, including Athena , EMR, Redshift, Snowflake, and BigQuery . Starburst Galaxy’s federated query capabilities allowed me to join data across clouds and platforms, reducing complexity.
What's my experience with pricing, setup cost, and licensing?
I recommend tracking usage metrics from the start, focusing on data scanned and query concurrency, so you can right-size spend. If workloads are steady, you should explore commitment-based pricing for better rates and factor in the operational savings from not having to manage and scale your own Trino or query infrastructure.
Which other solutions did I evaluate?
I reviewed several options including Databricks and Dremio . I was an early adopter of Snowflake and still use it as well. Starburst Galaxy was a better fit for my technology stack and developers.
What other advice do I have?
I have found that Starburst Galaxy’s flexibility makes it worth experimenting beyond the initial deployment plan. Features I originally viewed as secondary, such as Iceberg maintenance and Kafka ingestion, have become everyday tools. Building a strong relationship with the Starburst team has also helped me optimize configurations and discover new capabilities faster.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Platform reduces management overhead by deploying multiple clusters and tracking costs efficiently while enhancing performance with low-latency responses
What is our primary use case?
Starburst Galaxy serves as our primary SQL-based data processing engine, a strategic decision driven by its seamless integration with our AWS cloud infrastructure and its ability to deliver high performance with low-latency responses.
The platform provides a comprehensive suite of functionalities that significantly enhance the daily operations of our data engineers and data analysts.
How has it helped my organization?
Starburst Galaxy has been instrumental in reducing the maintenance effort and management overhead of our Trino cluster, which is particularly valuable given our lean platform team responsible for Kovi's data infrastructure.
The platform has enabled us to deploy multiple clusters for different purposes while providing clear cost tracking and utilization monitoring capabilities.
What is most valuable?
The most relevant functionalities today are cluster autoscaling for intensive load periods and automated metadata management through cleaning, compression, and orphaned file deletion in Iceberg.
These capabilities significantly reduce reading costs, storage expenses, and query processing overhead.
What needs improvement?
I maintain weekly conversations with Starburst's development and support teams, which provides me with visibility into the product roadmap and evolution.
Currently, my primary need is the impersonation functionality for BI solutions within Starburst clusters, which would enable enhanced access control and data governance capabilities.
For how long have I used the solution?
I have used the solution for almost 2 years.
Which solution did I use previously and why did I switch?
Previously, I utilized the AWS stack with Redshift and Athena .
I chose to migrate to Starburst Galaxy due to their expertise with Trino, superior aggregate cost structure compared to my previous solutions, and the rapid product evolution with new functionalities, problem corrections, and performance improvements.
What's my experience with pricing, setup cost, and licensing?
Since Starburst Galaxy's pricing model is simple to understand and easy to predict, there are no major secrets.
Everything is transparent and accessible through the product console.
The only point of attention is the S3Â and transfer costs that should also be included when calculating the total cost.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Guaranteed performance transforms complex queries and empowers focus on feature delivery
What is our primary use case?
I use the solution for processing large simulation datasets into aggregated datasets that can either be used for real-time data analysis or stored for later analysis.
How has it helped my organization?
Starburst has provided us with virtually guaranteed performance on complex queries across datasets that are in the tens of gigabytes which complete in seconds. This allows me to concentrate on the features I want to deliver to our end users rather than diagnosing performance issues.
What is most valuable?
The most valuable features include taking care of the minutiae of Trino management so that it is well-optimized for our use case out of the box. Additionally, the ability to write to Apache Iceberg tables enables complex queries to be written to S3Â , avoiding the need for them to be re-run repeatedly.
I also find attribute-based access control valuable, as it allows end users to access only their data in a multi-tenant environment.
What needs improvement?
Multi-tenancy could be improved. In order to have multiple environments for SSOÂ , we maintain multiple tenants that are connected to different AWSÂ accounts via the Marketplace. On the AWSÂ side this setup works because all accounts belong to the same organization. However, on the Starburst side these tenants are disconnected from each other, and it would be great if they could be connected and managed centrally.
Which solution did I use previously and why did I switch?
I previously used Amazon Athena . I switched because the performance offered by Starburst was significantly better than that provided by Athena . Additionally, Starburst allowed for integrations with BI tools, which was difficult to achieve with the necessary level of security in Athena .
What's my experience with pricing, setup cost, and licensing?
I recommend experimenting with different cluster sizes to determine what works best for your particular use case.
Which other solutions did I evaluate?
I considered Amazon Athena and Firebolt as alternative solutions.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Significantly improved our data architecture flexibility and performance management
What is our primary use case?
My team uses Starburst Galaxy for cross-database querying, iceberg table management, and workload separation across multiple data sources. We implemented Starburst Galaxy to replace our self-hosted Trino setup, bridging gaps in our data warehousing situation where we need flexibility to read from various warehouses and write to different formats while maintaining clean compute separation.
How has it helped my organization?
Starburst Galaxy has significantly improved our data architecture flexibility and performance management. We have successfully solved cross-database query challenges by utilizing Starburst Galaxy's ability to read and write in iceberg format on Trino, making our iceberg tables usable externally across our entire data ecosystem.
The compute separation capabilities have been transformative. We can easily split workloads and prevent sporadic usage spikes from slowing down critical processes. This has resulted in much more predictable performance and better resource utilization across our data operations.
The clean entry point provided by the built-in query engine has streamlined our SQL development workflow, while the data products functionality gives us an excellent way to present our end-state warehouse-level tables to stakeholders.
What is most valuable?
The flexibility to connect to numerous different warehouses and write to various formats is Starburst Galaxy's standout feature. This adaptability allows it to mold itself perfectly to our specific needs rather than forcing us to conform to rigid constraints.
The compute-focused architecture makes workload management incredibly straightforward. Since Trino focuses primarily on compute, it is really easy to work with and optimize. The user interface for navigating, managing permissions, and viewing queries and clusters is excellent and makes administration tasks much more manageable.
Cross-database functionality combined with iceberg format support has been game-changing for our data integration workflows.
What needs improvement?
For teams heavily invested in cutting-edge dbt features, it is worth noting that Starburst Galaxy is not a tier 1 dbt partner, so it is typically slower to adopt the newest dbt capabilities such as the Fusion Engine and Semantic Layer. While these features would be nice to have, it was not significant enough to deter us from choosing Starburst Galaxy. The core functionality works well and the benefits far outweigh this limitation.
Cluster startup time is another pain point, typically 3 to 5 minutes, which is not the worst with proper planning but can be annoying for ad-hoc work. The lack of a Terraform provider is also a notable gap for infrastructure-as-code workflows. Additionally, integration between data products and the dbt Semantic Layer would significantly enhance the platform's value proposition.
For how long have I used the solution?
We have used Starburst Galaxy for a few months.
Which solution did I use previously and why did I switch?
We migrated our self-hosted Trino instance to Starburst Galaxy.
What's my experience with pricing, setup cost, and licensing?
Pricing is competitive and the value proposition depends on your specific use case and requirements. When evaluating against alternatives such as Snowflake , it is worth considering the unique flexibility and cross-database capabilities that Starburst Galaxy provides rather than focusing solely on compute costs.
Which other solutions did I evaluate?
We briefly explored other options, but given the one-to-one nature of Trino and Starburst Galaxy, it made for a more seamless transition.
What other advice do I have?
Starburst Galaxy excels as a flexible, adaptable solution for teams dealing with complex, multi-source data architectures. It may not be the absolute best at any single function, but its strength lies in being very good at many things while remaining highly malleable.
I would particularly recommend it for teams that need cross-database functionality and iceberg format support, though dbt-focused teams should be prepared to work around the slower adoption of cutting-edge dbt features. It is important to plan for cluster startup times in your workflows, and if infrastructure-as-code is important, factor in the current lack of Terraform support.
Overall, if you are looking for a solution that can bridge gaps in your data architecture rather than replace everything, Starburst Galaxy is an excellent choice that provides the flexibility to adapt to your specific needs.
Which deployment model are you using for this solution?
Supports interactive queries efficiently with fast query completion for better access to data
What is our primary use case?
I use Starburst Galaxy to support interactive queries and dashboards.
When comparing it to Databricks , which is also deployed to serve ETL pipelines, Starburst is much faster and much more friendly to non-technical employees.
How has it helped my organization?
Starburst is the most important portal for both technical and non-technical employees to access the data lake.
Starburst also provides a user-permission system which protects sensitive data.
What is most valuable?
The most fundamental feature is the query engine, which is much faster than any of the competitors.
Starburst is able to finish most queries within 10 seconds, which is especially important for many non-technical employees.
What needs improvement?
I would like Starburst to leverage AI to improve usability.
Data lakes are complicated and difficult for users to explore. AI would help a lot in this respect.
For how long have I used the solution?
I have used the solution for over three years.
Which solution did I use previously and why did I switch?
I used other tools before, and I switched because Starburst is faster.
What's my experience with pricing, setup cost, and licensing?
The price is reasonable and controllable.
For example, you have a fixed-size cluster and the cost is predictable. Queries may slow during rush hours, but there is no spike in billing.
Which other solutions did I evaluate?
I evaluated other solutions such as Databricks .
What other advice do I have?
I wish there were more products available in the ecosystem.