I have different data sources, including Oracle, DB2, and a MongoDB cluster, so I join all of these data sources using Starburst Galaxy with the federated querying feature. I transform that into Iceberg using Starburst Galaxy, land it in S3 storage, convert it into Iceberg tables, and then use them for dashboarding in Power BI or Tableau.
Starburst Galaxy
StarburstExternal reviews
External reviews are not included in the AWS star rating for the product.
Unified data querying has accelerated petabyte-scale analytics and simplified dashboard delivery
What is our primary use case?
What is most valuable?
I find myself relying most on querying from different databases as well as automatic indexing in my day-to-day work, as I am a data science architect who needs to get the queries in a very short period of time. Starburst Galaxy serves the best purpose for me because if my SLAs are not met with my customers, they will raise a case, and I have tried many other tools, but Starburst Galaxy fits the best.
Starburst Galaxy has positively impacted my organization since we were struggling with Denodo and Dremio, which had their own features but were not helpful in querying large amounts of data, especially semi-structured or unstructured data. Starburst Galaxy addresses this with many YAML files and manifest files for automated maintenance, and it helps reduce the small file problem in different HDFS systems. Additionally, Starburst Galaxy has an MCP server that connects to various agentic pipelines, reducing the time to market for data consumption.
What needs improvement?
For how long have I used the solution?
What do I think about the stability of the solution?
What do I think about the scalability of the solution?
How are customer service and support?
Which solution did I use previously and why did I switch?
How was the initial setup?
What was our ROI?
Which other solutions did I evaluate?
What other advice do I have?
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Unified data from diverse sources has created consistent client views and reshaped data strategy
What is our primary use case?
My main use case for Starburst Galaxy is to use it as a data federation tool, collect data from various data sources, and have a unified view of the data.
A quick specific example of how I use Starburst Galaxy for data federation in my daily work is that I assume I need data from five different data sources, and each data source is on a different database platform, and I have information that I need for my client profile. I can pull data from all those five different data sources and have a consolidated view of the client.
Those are the main use cases for Starburst Galaxy; basically, we are trying to build data products.
What is most valuable?
Starburst Galaxy is very SQL friendly, which stands out for me because I have used SQL in other platforms such as SQL Server, Teradata, and Oracle, so it is very portable with minor changes.
Another feature I appreciate in Starburst Galaxy is that it has object storage with Iceberg storage, which helps optimize data storage and also enables columnar search, which speeds up queries.
Starburst Galaxy has positively impacted my organization by allowing us to rethink the strategy for data and architect data differently; instead of having multiple data marts and siloed data marts, we have a unified vision, and that is how it is changing.
What needs improvement?
One way Starburst Galaxy can be improved is through AI enablement. I have not seen how the user interface is going to function or how users can interact with the data products on Starburst Galaxy using AI, so I am curious to know about that.
I chose a rating of eight because it has many good features, including data federation and the ability to write queries easily. I think there are areas of improvement with respect to AI adaptability, and also in general, the amount of connectors working with other tools are areas where it can be expanded.
For how long have I used the solution?
I have been using Starburst Galaxy for 18 months.
What do I think about the stability of the solution?
Starburst Galaxy is stable in my experience so far.
What do I think about the scalability of the solution?
I do not have enough visibility into the scalability of Starburst Galaxy, but I think we are adding more and more data sources into it, so I believe it is going to be scalable, though results are still pending.
How are customer service and support?
Starburst Galaxy customer support is good.
Which solution did I use previously and why did I switch?
Earlier, we were using traditional databases.
What was our ROI?
I am yet to see the hard numbers regarding return on investment, but I believe it will probably result in money saved and time saved.
What other advice do I have?
My advice to others looking into using Starburst Galaxy would be to first understand your current data environment and make sure that you have the right connectors that Starburst Galaxy can connect to those environments. Have a dedicated team from Starburst who can help you through all the installation and onboarding, and ensure all your personnel who are going to be working on that environment receive good training with proper use cases. I would also recommend using a sandbox in your environment and putting Starburst Galaxy in it so that you can get a taste of how it works with your data. I gave this product a rating of eight.
Outstanding Performance and Savings with Robust Governance
Streamlined Data Analytics with Excellent Support
Effortless Data Federation and Granular Governance Made Easy
Effortless AI Agent Creation with Robust Features
Significantly improved our data architecture flexibility and performance management
What is our primary use case?
My team uses Starburst Galaxy for cross-database querying, iceberg table management, and workload separation across multiple data sources. We implemented Starburst Galaxy to replace our self-hosted Trino setup, bridging gaps in our data warehousing situation where we need flexibility to read from various warehouses and write to different formats while maintaining clean compute separation.
How has it helped my organization?
Starburst Galaxy has significantly improved our data architecture flexibility and performance management. We have successfully solved cross-database query challenges by utilizing Starburst Galaxy's ability to read and write in iceberg format on Trino, making our iceberg tables usable externally across our entire data ecosystem.
The compute separation capabilities have been transformative. We can easily split workloads and prevent sporadic usage spikes from slowing down critical processes. This has resulted in much more predictable performance and better resource utilization across our data operations.
The clean entry point provided by the built-in query engine has streamlined our SQL development workflow, while the data products functionality gives us an excellent way to present our end-state warehouse-level tables to stakeholders.
What is most valuable?
The flexibility to connect to numerous different warehouses and write to various formats is Starburst Galaxy's standout feature. This adaptability allows it to mold itself perfectly to our specific needs rather than forcing us to conform to rigid constraints.
The compute-focused architecture makes workload management incredibly straightforward. Since Trino focuses primarily on compute, it is really easy to work with and optimize. The user interface for navigating, managing permissions, and viewing queries and clusters is excellent and makes administration tasks much more manageable.
Cross-database functionality combined with iceberg format support has been game-changing for our data integration workflows.
What needs improvement?
For teams heavily invested in cutting-edge dbt features, it is worth noting that Starburst Galaxy is not a tier 1 dbt partner, so it is typically slower to adopt the newest dbt capabilities such as the Fusion Engine and Semantic Layer. While these features would be nice to have, it was not significant enough to deter us from choosing Starburst Galaxy. The core functionality works well and the benefits far outweigh this limitation.
Cluster startup time is another pain point, typically 3 to 5 minutes, which is not the worst with proper planning but can be annoying for ad-hoc work. The lack of a Terraform provider is also a notable gap for infrastructure-as-code workflows. Additionally, integration between data products and the dbt Semantic Layer would significantly enhance the platform's value proposition.
For how long have I used the solution?
We have used Starburst Galaxy for a few months.
Which solution did I use previously and why did I switch?
We migrated our self-hosted Trino instance to Starburst Galaxy.
What's my experience with pricing, setup cost, and licensing?
Pricing is competitive and the value proposition depends on your specific use case and requirements. When evaluating against alternatives such as Snowflake, it is worth considering the unique flexibility and cross-database capabilities that Starburst Galaxy provides rather than focusing solely on compute costs.
Which other solutions did I evaluate?
We briefly explored other options, but given the one-to-one nature of Trino and Starburst Galaxy, it made for a more seamless transition.
What other advice do I have?
Starburst Galaxy excels as a flexible, adaptable solution for teams dealing with complex, multi-source data architectures. It may not be the absolute best at any single function, but its strength lies in being very good at many things while remaining highly malleable.
I would particularly recommend it for teams that need cross-database functionality and iceberg format support, though dbt-focused teams should be prepared to work around the slower adoption of cutting-edge dbt features. It is important to plan for cluster startup times in your workflows, and if infrastructure-as-code is important, factor in the current lack of Terraform support.
Overall, if you are looking for a solution that can bridge gaps in your data architecture rather than replace everything, Starburst Galaxy is an excellent choice that provides the flexibility to adapt to your specific needs.
Now we can work much more efficiently with data
More than meets the requirement
Fantastic product from the creators of Trino!
Starburst SSO integration and SCIM capabilities along with the IAM integration with AWS was very well implemented. RBAC and ABAC capabilities are also robust.
Starburst support team has also been very responsive so far.