Sign in Agent Mode
Categories
Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

Reviews from AWS customer

27 AWS reviews

External reviews

91 reviews
from and

External reviews are not included in the AWS star rating for the product.


    Ashutosh Parmar

AI-driven observability has reduced resolution times and improves real-time monitoring

  • April 17, 2026
  • Review provided by PeerSpot

What is our primary use case?

I mostly work with the performance metrics of the CPU, or host metrics, as well as application metrics and traces. Overall, I use these mostly for real-time monitoring based on the application to track application performance.

For the monitoring of infrastructure, it is quite insightful because in-depth, I can see what is going on in the infrastructure. If something goes down or some crons fail inside the infrastructure, the alerts are quite helpful for more visibility on the cloud-native side.

This is quite helpful for improving the application observability and the infrastructure side as well. I would rate observability above an eight.

I am not that much involved in the business side because I work as a DevOps engineer, so I do not know how much it helps on that front. However, it helps in tracking traces and metrics quite generously well and helps us improve the application side for more reliability on the business side.

What is most valuable?

It is very helpful and really enhances the AI-powered analytics, which helps us for troubleshooting the application and to get more insightful information while troubleshooting application error rates.

AI-powered guidance is really helpful because it provides more actionable insights and highlights anomalies automatically. I do not need to go through it manually, and it also helps us with smart alerting and recommendations.

It helped operationally because due to the insights of the applications, I get more insight for our application to enhance it further. It detects anomalies and correlates data while guiding us to the root causes, so we can enhance our application accordingly.

I have seen that mean time to resolution was reduced around 30 to 50 percent. The main reason for this combination is because of real-time monitoring and AI-powered anomaly detection and distributed tracing. Instead of manually checking the logs and metrics across multiple tools, the platform quickly highlights the issues, correlates data, and points us towards the root cause.

After implementing Splunk Observability Cloud, there was a deep learning curve for the new tool. It took one or two months to get proper insights from it. After configuring, I have seen that it is very useful for tracking traces and metrics of our application, servers, and clusters. Adoption time is usually after two months, or after a few weeks of getting Splunk Observability Cloud.

Splunk Observability Cloud is highly effective in improving digital resilience. Real-time visibility and proactive alerting and fast root cause analysis, distributed tracing, and AI-driven insights enable anomaly detection, which allows us to quickly understand failures and recover faster. This is critical for maintaining system availability and helps us handle failures in complex distributed environments since we can see how services interact and where breakdowns occur.

What needs improvement?

Regarding features, it helps us for better understanding of how the application works and in-depth tracking of application monitoring.

It can be more enhanced using additional AI power. I can get more reliability using AI because AI-driven guidance is more useful nowadays. It can really improve more on the AI side because it will help us to reduce manual intervention with the system and root cause analysis will be much better with AI over human analysis.

I would say that it is quite helpful, but for different kinds of applications, it could be improved because sometimes it might provide a cloud judgment of the root cause analysis. I need to do manual intervention using a dedicated human for root cause analysis for better understanding of the root cause. This is how the agentic side can be improved.

For how long have I used the solution?

I have been working with Splunk Observability Cloud for around a year.

What do I think about the scalability of the solution?

It is quite scalable. Right now, it is providing much better insights and can be more enhanced over several aspects. I would rate scalability an eight to eight point five.

Which solution did I use previously and why did I switch?

I have tried other solutions, but they were not that great in terms of functionalities and overall performance. Splunk Observability Cloud is much better than the others because it provides AI alongside the solution. This is very helpful due to the AI-driven solutions and guidance for root cause analysis. Splunk Observability Cloud goes through the details of application traces and metrics in depth, so I get better observability over the application. This is why I have preferred Splunk Observability Cloud over other monitoring tools.

I have tried SignalFx, but it was not quite insightful. I have tried Splunk Observability Cloud over SignalFx.

What other advice do I have?

Splunk Observability Cloud is quite insightful and helpful for improving the observability side. I provide this solution an overall rating of eight.


    Aman Dhanesha

Monitoring has reduced API latency and now predicts issues across our cloud infrastructures

  • April 16, 2026
  • Review provided by PeerSpot

What is our primary use case?

I mainly use Splunk Observability Cloud to monitor the performance of our cloud-native infrastructure. Because we have created multiple infrastructures, we use it to handle and monitor everything.

Splunk Observability Cloud helps us manage latency across any of our projects and APIs. It is particularly valuable for detecting issues before they occur. We can predict features and errors in advance. Recently, we discovered problems in seven of our APIs that we were able to solve because of this predictive capability.

What is most valuable?

The best feature of Splunk Observability Cloud is that I can identify the root cause of any problem, including API latency. The real-time alerts and smart alerting system are exceptional, allowing me to know what is happening in real-time.

Detectors in Splunk Observability Cloud are very useful, and I have recently used them with great results.

Regarding the no-sample tracing feature, we collect multiple data from various sources. This feature is very useful since we recently shifted to it, and it is working very well.

The AI-powered analytics that Splunk provides allows me to get a smart analyzed version of any report.

Splunk Observability Cloud has greatly impacted our operations by reducing timing requirements. We get smarter solutions and overall use cases in a smart way. I have reduced our manpower requirements and time commitment significantly. Splunk Observability Cloud reduces our mean time to detect by approximately one to two hours.

The LLM in Splunk Observability Cloud is very powerful, and the vector database infrastructure is excellent. This is why we switched from our previous tools, and I believe it was a very good decision that has resulted in better outcomes.

What needs improvement?

The AI-powered analytics that Splunk provides delivers a smart analyzed version of reports, and it is quite good, but it is very generic. The issues identified could be better addressed through deeper AI thinking to provide a more effective solution.

For how long have I used the solution?

I have been using Splunk Observability Cloud for more than eight or nine months.

What do I think about the stability of the solution?

Splunk Observability Cloud experienced a significant outage recently when it went down for approximately five to six hours. This impacted us considerably because we were actively working during that time.

How are customer service and support?

I would rate the technical support for Splunk Observability Cloud as 9.5 out of 10 because we received their support during our deployment. They were very helpful in assisting us to create a good infrastructure.

Which solution did I use previously and why did I switch?

I find Splunk Observability Cloud to be very good. I previously used DataDog for observing everything, but Splunk Observability Cloud is more accurate and a better solution.

What was our ROI?

Previously with other applications, analyzing and controlling our API latency required almost five to six hours a day of resources. With Splunk Observability Cloud, I only need to allocate one to two hours maximum per day to accomplish the same tasks.

Which other solutions did I evaluate?

I highly recommend Splunk Observability Cloud. If you are using any other third-party tool, Splunk Observability Cloud is significantly better than the alternatives.

What other advice do I have?

I highly recommend creating better documentation for Splunk Observability Cloud. This documentation could be integrated with AI to provide specific use case solutions so that users do not have to search through Splunk documentation every time. Instead, users could directly ask about the issues they are facing and receive targeted solutions. My overall review rating for Splunk Observability Cloud is 9 out of 10.


    Udit Parekh

End-to-end tracing has transformed how we detect failures and optimize critical transactions

  • April 08, 2026
  • Review from a verified AWS customer

What is our primary use case?

Our primary use case for Splunk Observability Cloud is to monitor our infrastructure and applications, and it helps us troubleshoot issues related to any failures.

What is most valuable?

The feature we appreciate most about Splunk Observability Cloud is their distributed tracing. We also value the ability to create real-time dashboards and their alerting system is exceptional. The main best feature of that observability is their distributed tracing.

We are very satisfied with the out-of-the-box dashboards and detectors in Splunk Observability Cloud. In distributed tracing, we have banks as our clients, so if anything goes wrong with transactions, we directly go to the trace and troubleshoot those issues faster.

The AI-powered analytics and guidance in Splunk Observability Cloud is very useful. You can observe your LLM models and monitor the usage of your APIs in that cloud.

Splunk helps improve our operational performance and resilience significantly. Before we used Splunk Observability Cloud, if any failures occurred, we had to go to servers and check all the log files to find the failure. Now in Splunk, we go to that single dashboard and filter with the timestamp of failure to directly find the log, allowing us to troubleshoot issues faster. In terms of optimization, before using Splunk, we could not measure why our API was taking 100 ms, but now through distributed tracing, we can see where the bottleneck of that API is. If that bottleneck is the database, we optimize our database queries, and our application is now optimized.

Splunk Observability Cloud has reduced our mean time to detect by approximately 25 to 30 percent because it offers real-time monitoring and intelligent alerting, allowing us to troubleshoot issues faster and enhancing detection by approximately 30 to 40 percent.

What needs improvement?

In terms of pricing, I have one issue with Splunk Observability Cloud. In a large-scale organization, it does not have features such as cost optimization or budgeting for observability spend. I think they need to improve that so that I can optimize our observability. For instance, if our thousands of server applications are running, I should be able to set a budget, such as only spending $100 per month for a specific environment. They need to introduce that feature because it is very important for budgeting.

In terms of areas for improvement in Splunk Observability Cloud, the first is cost budgeting. The second is that they have many integrations, but if you are new to Splunk or new to observability, you must dive deep into more concepts. They can improve user-friendly features so that new users can set up their observability in their environment more smoothly. I think they need to improve in that integration part so that end users can onboard their infrastructure or applications very effectively.

I would appreciate more simplicity in the platform.

For how long have I used the solution?

I have been using Splunk Observability Cloud for the past eight or nine months.

What do I think about the stability of the solution?

I rate the stability of Splunk Observability Cloud as ten out of ten because it is very stable, especially since we are using their cloud environment, and Splunk Observability Cloud is built for cloud-native systems.

What do I think about the scalability of the solution?

We have not explored enriching data with custom metrics in Splunk Observability Cloud because their ready-to-use dashboards are well designed, and every organization can benefit from them. However, if you have a very large organization with over ten thousand servers running applications, you may need to build a team to create custom metrics for your specific use case.

How are customer service and support?

I would rate their technical support in Splunk Observability Cloud a nine.

Which solution did I use previously and why did I switch?

I have used other vendors such as Elastic Stack and Grafana Stack, but in Splunk Observability Cloud, there are so many integrations and useful features that no other vendor can offer. In Grafana, the logs and tracing features are almost nonexistent. You can use Grafana only for monitoring your infrastructure, but Splunk provides end-to-end visibility with infrastructure monitoring, tracing, and overall observability of our application.

How was the initial setup?

Deploying Splunk Observability Cloud is an intermediate task for new users, but if you have been in this space for one or two years or longer, then it is easy to deploy their products.

It can take up to one week to deploy Splunk Observability Cloud.

What other advice do I have?

We are not using the NoSample tracing feature in Splunk Observability Cloud.

In our organization, we have approximately 25 to 30 users using the solution daily.

We do not require any maintenance for Splunk Observability Cloud since we are using their cloud solution, which means that all patching and updates are done by them.

I recommend Splunk Observability Cloud to other organizations because we are currently saving our engineers time by 20 to 30 percent, and for infrastructure alerting, we can use it to ensure that servers will not go down. Every organization should use this because it will reduce your engineering team's effort and the downtime of your application, and in terms of any failure or APIs, you can troubleshoot your issues faster.

End-to-end visibility into our cloud-native environment is very important. If an organization is building a SaaS or B2B software, then end-to-end visibility is crucial in terms of security, failures, and compliance. The end-to-end visibility of our infrastructure and applications is extremely important.

I recommend Splunk Observability Cloud to every user because they offer trials. If you do not just read the reviews, you should try it out. Understanding the biggest features and why others are using it can be beneficial, and I always recommend Splunk Observability Cloud for end-to-end visibility in your application.

I gave this review an overall rating of ten out of ten.


    Nishith Joshi

Real-time monitoring has improved performance tracking and has simplified analyzing complex metrics

  • March 30, 2026
  • Review from a verified AWS customer

What is our primary use case?

I work in data analytics with experience in monitoring systems and working with large-scale data. I have used Splunk Observability Cloud in the context of real-time monitoring and performance tracking.

Splunk Observability Cloud works well alongside Splunk Enterprise for logs and integrates with cloud platforms and monitoring tools. It is often used together with other observability solutions. The tracking metrics such as latency, error, and throughput are easily visible. I can also build dashboards for real-time visibility.

We use Splunk Observability Cloud to track latency metrics and identify where slowdowns are happening. We have visualized response time trends and quickly detected performance degradation. We have also used it for infrastructure monitoring. Over the past six months, we have been monitoring metrics such as CPU usage and memory. If there is unusual usage, we identify it quickly using this tool and take action before it impacts our performance.

What is most valuable?

Splunk Observability Cloud has optimized our solutions and helped us understand the metrics. The AI-powered guidance in Splunk Observability Cloud helps us identify patterns and anomalies in system performance data. Instead of manually going through a large volume of metrics, it highlights unusual behavior and potential issues automatically. This makes it easier to detect problems early and understand where to focus, especially in complex systems.

There is definitely log analysis and dashboards. Log monitoring and dashboards have been better using Splunk. Splunk Observability Cloud is the best tool for log monitoring and dashboards. Splunk Observability Cloud feels more focused on real-time metrics and performance tracking compared to some other traditional log-based tools.

What needs improvement?

The learning curve for understanding all features should be improved, and the cost can increase. Splunk Observability Cloud is very costly. Cost is one of the drawbacks.

Sometimes too many alerts, if not configured properly, is a major drawback that could be improved.

The prices are quite high. As I have mentioned earlier, we are Splunk partners, so this has been handled by my other team. However, for other companies and small startups, the prices are very high for them to use Splunk Observability Cloud. Price is a concern.

For how long have I used the solution?

I have been working with Splunk Observability Cloud for the past six to eight months.

What do I think about the scalability of the solution?

We have expanded our team and usage. We are scaling up right now from ten people to twenty-five or thirty. Over time, I expanded my usage by going through basic monitoring and exploring things like setting up custom dashboards. We have gradually expanded our usage from setting up dashboards and alerts.

How are customer service and support?

For customer service, I would rate them eight out of ten because whenever we raise a support case, they are always available for us.

For Splunk real user monitoring, implementation took time because our engineers tried very hard. In case of support, there should be more engineers specifically for this case.

Which solution did I use previously and why did I switch?

We have used different products like Palo Alto and Cribl before moving to Splunk Observability Cloud. As we got a partnership, we have shifted to Splunk Observability Cloud.

What was our ROI?

The information is confidential and I cannot share specific details. However, I can tell you in percentage that fifty to sixty percent of our work has been easy to identify in terms of performance metrics and performance using Splunk Observability Cloud.

It has saved us thirty to forty percent in cost because we used some other tools before that were more costly. As we are Splunk partners, we obtained Splunk Observability Cloud, and our costs have been reduced by thirty to forty percent using this solution.

What other advice do I have?

My overall impression of using Splunk Observability Cloud is that it is a strong tool for real-time monitoring. It does take some time to get fully comfortable with all the features. We have not explored everything right now, but in the future, we are looking forward to using more features.

A part of the implementation has been handled by my other team. I have explored using custom metrics to enrich observability data, mainly by adding application layer or business-related metrics alongside system metrics. I have used custom metrics in a limited way to add more context to monitoring, such as tracking application-specific metrics alongside system data.

Dashboard customization in Splunk Observability Cloud is quite flexible. We care about metrics in different types of visualization, and it helps us organize them in a way that makes sense for monitoring. It allows us to build dashboards tailored to specific use cases. This makes it easier to monitor system performance and quickly identify issues without going through unnecessary data.

The integration in real user monitoring from Splunk Observability Cloud is actually better than from some other tools. If you are looking for the best SIM tool, then Splunk Observability Cloud is for you. If you have funds and capability for the cost, then Splunk Observability Cloud is definitely the best tool you can use.

I have given this review an overall rating of nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?


    Jigar Hirani

End-to-end tracing has improved monitoring and now reduces downtime with proactive alerts

  • March 27, 2026
  • Review provided by PeerSpot

What is our primary use case?

My experience with Splunk Observability Cloud involves monitoring infrastructure, application performance monitoring, and real-time alerting. Although I am no longer working with Splunk Observability Cloud due to a recent position change that occurred approximately two months ago, I previously monitored servers, containers, Kubernetes, application performance, and Docker images. In terms of monitoring, I tracked response time, error rate, and latency. This capability helped in identifying performance issues or infrastructure issues before users were impacted. For instance, if Kafka failed, we knew about it before users experienced an impact and could resolve it before it caused maximum damage to our systems. I also used dashboards and alerts to monitor critical services and received notifications whenever issues arose.

The features of Splunk Observability Cloud that I found most valuable included application performance monitoring and distributed tracing, particularly when monitoring distributed systems or applications. Real-time alerting and Kubernetes monitoring were essential since Kubernetes is quite complex. I could effectively monitor Kubernetes using Splunk Observability Cloud. Additionally, the Smart Attack Detector, which I tried at the last moment, was a good feature, although I did not work extensively with it. The Log Observer was very fast and reliable, and the dashboards provided good visualization for troubleshooting and monitoring. If there was a network outage, I received notifications very quickly.

What is most valuable?

Splunk Observability Cloud helped me detect performance issues faster and reduce downtime in my organization. Earlier, I had limited visibility into my application performance. After implementing observability, I could see end-to-end transaction tracing and quickly identify where issues arose, which reduced troubleshooting time and improved overall application stability and availability for our customers and systems. This capability also helped in proactive detection.

What needs improvement?

I believe that areas of Splunk Observability Cloud that could be improved include the initial setup and instrumentation costs, which take more time for APM. Some dashboards and detectors require tuning, and I think the visualization needs enhancement. Additionally, alert noise remains an issue, and we need suppressions for when systems go down for short periods. Better integration with third-party tools and easier onboarding of data would also be beneficial.

What do I think about the stability of the solution?

When evaluating the stability and reliability of Splunk Observability Cloud, I can confirm it has been reliable. I would rate it eight out of ten for reliability.

What do I think about the scalability of the solution?

Splunk Observability Cloud scales very well with the growing needs of my organization. I can demonstrate the scalability of our system to our customers, which is advantageous for business. This capability helped us secure business as we provide real insights to customers who were happy to purchase our systems and applications. The ROI has been good for us.

How are customer service and support?

I communicated with the technical support of Splunk Observability Cloud regarding our issues, specifically when I was unable to monitor or set up Kubernetes to monitor our infrastructure. They were able to help us, and we purchased an on-demand call for assistance, which they provided.

How was the initial setup?

I did not participate significantly during the initial setup and deployment of Splunk Observability Cloud, but I was part of the team. I know the process is straightforward. We simply needed to ensure that all data was in the correct format, matched current dashboard setups, and included all necessary fields for insights.

What was our ROI?

My experience with lowering the cost of unplanned digital downtime using Splunk Observability Cloud has been positive, as it helped us significantly. Our system was bottlenecking and consuming excessive resources, but with the ability to detect and resolve that issue, overall system usage was reduced without further bottlenecking.

What's my experience with pricing, setup cost, and licensing?

Regarding metrics or data points confirming performance improvement and resilience, I found that during certain times, we experienced the most significant spike in our systems due to multiple users requesting the same service. We needed to change our overall architecture as we were not scaling adequately, and this was bottlenecking our systems. By observing this from the dashboards, I realized improvements could be made. After implementing the solution, our application's stability improved significantly. I can confidently say our availability improved by forty percent, and downtime was reduced by approximately seventy to eighty percent.

What other advice do I have?

My impression of the No-Sample Tracing feature in Splunk Observability Cloud is that it helped us detect key metrics and real use cases, particularly in tracking and monitoring. I primarily tracked server uptime, application response time, API latency, and similar metrics. Combining these parameters instead of relying on a single factor improved our system. Specifically, I used distributed tracing to understand how requests flowed through our network and how different systems responded, which helped determine if any particular system impacted all our systems.

Regarding the AI-powered analytics and guidance provided by Splunk Observability Cloud, I have not actually used the AI features, particularly with ITSI, as I did not utilize that aspect for observability.

My teams effectively utilized the ability to enrich data with custom metrics in Splunk Observability Cloud. They found valuable insights from our systems and created reports that the application and infrastructure teams used to decide their workarounds and solutions. They developed different solutions, experimenting and improving our systems by relying on observability to understand what happens when we adjust parameters or change configurations.

When evaluating the effectiveness of the out-of-the-box customizable dashboards provided by Splunk Observability Cloud, I note that we mostly used the default dashboards. While we created a custom dashboard to track our overall system flow, we relied on pre-built dashboards for monitoring and representing our business perspective. When we needed to showcase our environment to customers, we demonstrated our scalability and system performance, including response time and downtime, providing insightful details from the dashboards for business use cases.

I would rate Splunk Observability Cloud an eight out of ten, where ten is the best and one is the worst.


    RahulMhatre3

Observability has improved anomaly detection and dashboard flexibility but needs simpler licensing

  • March 11, 2026
  • Review provided by PeerSpot

What is our primary use case?

What is most valuable?

Splunk Observability Cloud is effective for detecting anomalies and preventing system outages.

There are pre-built dashboards where I can check service centers and monitor spikes in errors and traces. I can also check error logs, and everything is consolidated while providing anomaly alerts in case there is any deviation from the baseline.

The personalized dashboard helps my team. Splunk Observability Cloud has its own query language that can be used to build easy dashboards. Multiple teams can build their own, replicate them, and also have role-based access control, which is beneficial.

The application management feature helps with end-user experiences because front-end monitoring helps track user issues and any back-end issues that may be causing them. It shows how the user experience is overall and identifies any outages. Front-end monitoring is very useful.

What needs improvement?

As an integrator, I think the biggest advantage of Splunk Observability Cloud is because it is part of the Splunk ecosystem, it is good to correlate logs with application data through traces and metrics. Overall, it is an evolving product, not top class, but it is getting there.

I see many good things about the product and many advantages. Regarding the negative side, I think the licensing can be much better because it is based upon host units and there is additional licensing for the number of traces that I can bring in. A simplified licensing model would be much better, similar to what other tools offer. Pricing could be either based upon ingestion or directly based upon host units, rather than multiple different trackers. There are licenses for custom metrics, licenses for the number of traces that I can ingest, and host unit licensing. A better licensing plan would be beneficial.

For how long have I used the solution?

I have been using Splunk Observability Cloud for more than two years.

What do I think about the stability of the solution?

I have not seen any issues with stability. The solution is very stable.

What do I think about the scalability of the solution?

Regarding scalability, I do not think there is an issue with scaling. I have never encountered any issues with that.

How are customer service and support?

Support is good.

Which solution did I use previously and why did I switch?

I have worked on Coralogix, which is also an observability tool. I worked in the product company itself. I have also worked on Dynatrace, and now I am working on Cribl.

How was the initial setup?

The installation and deployment process is somewhat challenging, but there are multiple ways of deployment that give me a lot of options. I would say it is acceptable and not that complicated. I can deploy agents with Splunk deployment server, which is beneficial. However, there is some dependency on the deployment server.

What about the implementation team?

As an integrator, I deployed it and made it workable with OpenTelemetry.

What was our ROI?

I am able to observe significant ROI with Splunk Observability Cloud. When I worked with a previous solution, it was one-third of the cost of Dynatrace, so there was definitely an exceptional return on investment. It helped reduce costs by almost 50%.

What's my experience with pricing, setup cost, and licensing?

Splunk Observability Cloud is affordable. I have visited the PeerSpot website and downloaded reports on Azure, Grafana, and Splunk Observability Cloud.

Which other solutions did I evaluate?

When I compare Splunk Observability Cloud to other vendors, the good part is the branding because the support is good. There is a large community where I can look for known issues. However, experience-wise, DataDog is far more superior and easier to use. DataDog has its own agent for tracing, so I just deploy one trace. With Splunk Observability Cloud, they are dependent upon OpenTelemetry, and there is a learning curve because it is open source. The onboarding is not as smooth as DataDog or Dynatrace.

What other advice do I have?

I deploy both on-cloud and on-premise options for clients. I have deployed Splunk Observability Cloud on Splunk Cloud. I have not used threat detection because there is a separate tool for it. I have not deployed a solution on AWS Cloud or purchased it from AWS Marketplace in my career. I would rate this review 7.5 out of 10.


    Isaac Ogbonnaya

Monitoring has transformed incident response and cost management while making data fully visible

  • March 11, 2026
  • Review provided by PeerSpot

What is our primary use case?

My main use case for Splunk Observability Cloud involves performing visualized performance metrics and tracing capability, making sure that all troubleshooting is faster during incident response. We also integrate it to ensure that every data point and operational data is monitored.

A specific example of how I have used Splunk Observability Cloud in a real situation is that we make use of it to ensure that every operational data point is being monitored, traceable, and visible.

Regarding my main use case for Splunk Observability Cloud, I would add that we really utilize it in the area of cost management, along with the smarter alerting system and the log search performance.

What is most valuable?

The dashboard and lead time metrics from Splunk Observability Cloud really improve our workflow, making every workflow more visible and understandable for our stakeholders as well.

Splunk Observability Cloud has positively impacted my organization. Although we have not noticed any specific outcomes, we really recommend it for handling higher data volumes effectively, especially its scalability, which is suitable for us during enterprise environments, monitoring, and alerting.

The best features that Splunk Observability Cloud offers include APM monitoring, the fast alerting system during incident response, and the dashboard that provides real-time metrics.

What needs improvement?

To improve Splunk Observability Cloud, I wish they could develop more in the area of pricing and cost transparency, provide a smoother learning curve, and enhance the log management experience, ensuring that log navigation is not solely focused on metrics and tracing but also has good search performance to understand larger data sets.

I would also like to see a very good user interface and onboarding experience that is smoother for new users.

Before we wrap up, I want to emphasize the need for improvements in the log search performance and the smarter alerting system.

For how long have I used the solution?

I have been using Splunk Observability Cloud for over a year.

What do I think about the stability of the solution?

In my experience, Splunk Observability Cloud is very stable in the area of real-time monitoring and analytics.

What do I think about the scalability of the solution?

When handling higher data volume and scalability, I can say that we have over 70% efficiency now.

For scalability, I would rate it an eight, as it is very good in responding faster and monitoring larger data sets.

How are customer service and support?

We have great feedback from the customer support of Splunk Observability Cloud, as they help solve and make bug alert management easier, respond quickly to incidents, and monitor data sets effectively.

Which solution did I use previously and why did I switch?

I have not used any different solution before Splunk Observability Cloud.

What was our ROI?

I have seen a return on investment with Splunk Observability Cloud, with current metrics showing over 75% efficiency. It has really helped our workflow, saved time, reduced costs, and also saved employees' time.

What's my experience with pricing, setup cost, and licensing?

My experience with the pricing, setup cost, and licensing for Splunk Observability Cloud was acceptable at first, though I believe they need to improve more in this area. I would rate it a seven, but overall, the costing and licensing were fine for me.

Which other solutions did I evaluate?

Before choosing Splunk Observability Cloud, I was recommended to it specifically.

What other advice do I have?

Splunk Observability Cloud deserves an eight out of ten rating. I choose an eight because of their fast response and the monitoring of strong infrastructures.

I would advise others looking into using Splunk Observability Cloud because I am a witness to its effectiveness. It is very beneficial for workflow, making tasks easier and flexible while being able to track and monitor all data sets.


    Airlines/Aviation

Powerful Real-Time Insights, But Pricing Can Spiral Without Log Filtering

  • February 13, 2026
  • Review provided by G2

What do you like best about the product?
Real-time visibility and powerful SPL queries for rapid root cause analysis.
What do you dislike about the product?
High and Unpredictable Costs: The pricing (whether based on data ingestion volume or "Workload" compute units) scales rapidly. If you don't aggressively filter logs before they hit the cloud, your bill can spiral quickly
What problems is the product solving and how is that benefiting you?
Splunk IT Cloud (comprising Splunk Cloud Platform and the Observability suite) is designed to solve the problem of "Data Sprawl"—the overwhelming amount of fragmented information generated by modern, multi-cloud environments.


    HrishikeshNavkar

Metric-based monitoring has simplified alerting and currently supports our cloud migration

  • February 04, 2026
  • Review provided by PeerSpot

What is our primary use case?

Currently, we are in the process of migrating from on-premises to Splunk Cloud as well as Observability. For metric-based monitoring, we can monitor via Observability and are migrating it there. We are setting up private locations to monitor synthetic tests, such as ping checks, port checks, and URL monitoring. The rest is metric-based monitoring, which is being done by Splunk using Splunk OTeL, which is an OpenTelemetry agent for Observability. This agent brings metrics from end devices to Observability. Based on these metrics, we set detectors and rules to trigger alerts.

Our observability is not yet live in production with Splunk Observability Cloud. It is currently being built, and we are adding new components, but it is not yet fully ready.

What is most valuable?

Comparing to Cloud, Splunk Cloud, or any other solution, the most valuable feature of Splunk Observability Cloud is that it is entirely based on metrics. The agent is also very lightweight compared to Splunk UF and does not consume much compute resources on the end server or host from which we are pulling data. However, it can only monitor metrics and cannot monitor logs.

Regarding how Splunk Observability Cloud has benefited our organization, we are yet to go live, but most of the configuration that requires conditions and triggers on Splunk Cloud involves writing queries. With Splunk Observability Cloud, the process is quite simple. We can directly get metrics flowing, set thresholds, and everything is UI-based. This requires less time to set up and use. I do not have that much visibility with Splunk Observability Cloud at this time as I am working as an administrator. It has helped us create dashboards for visualization purposes.

What needs improvement?

There is one thing that could be improved in Splunk Observability Cloud. We have the capability in Splunk to connect to Splunk agents such as Splunk forwarders from a deployment server and update the end agents and forwarders using server classes. We can push and update configurations from our own hosted servers without needing to access the end device. In Splunk Observability, the OTeL agent cannot be updated from our end. Every time we need to update, we have to reach out to users or gain access to the host to update the configurations. There should be a solution to update OTeL agents from Splunk Observability Cloud itself.

For how long have I used the solution?

I have been working with Splunk Observability Cloud for approximately five to six months.

What do I think about the stability of the solution?

Splunk Observability Cloud is reliable based on my experience with stability and reliability so far.

We were facing some challenges with the stability of Splunk Observability Cloud regarding the login page. It was not working several times and was not accepting SSO authentication. The observability team found a solution for this issue, though I am not fully aware of the details. There were several times when opening the page did not directly log in and showed some errors.

What do I think about the scalability of the solution?

I have not encountered any scenarios regarding the scalability of Splunk Observability Cloud. It should be good because it is cloud-based. I am not aware of the licensing model and how it scales or what the rules are for scaling.

How are customer service and support?

I was not directly involved with technical support for Splunk Observability Cloud, but I am aware that my teammates reached out to support. They were finding issues regarding configuration, installation, and deployment of Observability for specific components. Since Observability is cloud-based and hosted by Splunk, the components we own on-premises are the OTeL gateways, agents, and private locations. They reached out to the vendor regarding these components, and the support was quite smooth. They have raised some bugs as well for the vendor to fix. I would rate the technical support from Splunk an eight out of ten.

How was the initial setup?

Since it is cloud-based, Splunk Observability Cloud was ready to use upon deployment. The OTeL gateways were built by our team and required configuration. I was not part of that process but am aware that we needed to configure the OTeL gateways to route data to them as an endpoint and from there it would be ingested to Observability or forwarded to Observability. There were no significant issues with this process and it was quite smooth. However, configuring private locations on a few gateways was quite difficult to set up and maintain because Docker was going down at times. There were some issues that were discussed with Splunk vendor, and they provided guidance on how to fix them.


    reviewer2800332

Observability has exposed tracing gaps and inconsistent metrics while still mapping complex services

  • January 30, 2026
  • Review from a verified AWS customer

What is our primary use case?

In my organization, we have 150 to 160 applications yearly with different frameworks including .NET, Java, and Python based applications. All of them are hosted on different types of servers such as Windows, Linux, ECS, and EKS. With respect to deployments, we integrated Splunk Observability Cloud. Previously, we used Prometheus and Grafana. My organization considered Splunk Observability Cloud to be a premium side of observability, so they switched from our previous solution.

We use the tracing feature in Splunk Observability Cloud.

What is most valuable?

I appreciate the service map and APM in Splunk Observability Cloud the most. This is the main feature I value. The interface is completely UI based, so I can see the complete service map, observe the latency present, and view complete metadata for a particular service or any database-related service. The service map enables a 3D view of the complete application architecture.

With respect to the effectiveness of Splunk Observability Cloud in improving digital resilience within the organization, it was quite similar to other third-party tools. The main distinction is that it has some improved security. We use SignalFlow queries, and with respect to those queries, we work with alerts and the dashboarding part. I can say it provides efficiency with improved security compared to other third-party tools, but in terms of usage, it is quite similar to Prometheus and Grafana.

What needs improvement?

I want to address a disadvantage regarding the service map showing misinformation with respect to latency, which relates to data reliability pulled from AWS cloud or on-premise servers. We saw issues with latency because Splunk APM app shows different data than Prometheus and Grafana. We tried to get premium support and on-call support with Splunk, and they were helpful in troubleshooting, but they ended up with no solution.

Performance with Splunk Observability Cloud is acceptable to me, but the modifications required by users are problematic. I had to build the complete alerting system and monitoring system, which had to be changed. The way they designed this is not optimal. If I compare with Prometheus, we can import and export dashboards, but here we face errors with dialogue boxes. We tried with technical support calls about this, but they were unable to solve it, so I do not understand why export and imports are not functioning.

The overall impression of the no-sample tracing feature in Splunk Observability Cloud, specifically in terms of eliminating blind spots in data collection, is that it needs improvement because the data is not adequate compared to other third parties. We get disturbance in the dashboards and charts while trying to correlate data. The mechanism functions differently manually than it does with a SignalFlow query, and both should be equal. We are unable to replicate from manual processes to the automation method, which is the issue.

The SignalFlow query feature in Splunk Observability Cloud needs improvement because it should function the same as manual processes. When we configure manual queries and then configure them via SignalFlow, they give different outputs. We tried with on-call support about this, but they were unable to address it, indicating there is a bug with the queries that needs improvement.

For enhancements, I would like to see improvements in the OTEL agents, OTEL collectors, and other features in Splunk Observability Cloud. The guidelines in the official documentation are not working at all. We have to deploy processes in our own way, and the documentation works only in 60 percent of the conditions, leaving the remaining 40 percent as problematic and needing improvement.

For how long have I used the solution?

I have used Splunk Observability Cloud for nearly one to one and a half years.

What do I think about the stability of the solution?

I experienced a downtime with Splunk Observability Cloud one time. We were unable to access it for nearly one day, which took a lot of time to resolve. Normally, other tools do not take as much time, and I do not understand why Splunk took so long. From the vendor's end, they should address such issues in a much shorter timeframe. When downtime occurs, it raises concerns about how we measure and receive alerts, as everything needs to be in place.

What do I think about the scalability of the solution?

In terms of lowering the cost of unplanned digital downtime using Splunk Observability Cloud, I found that many users report it is expensive, especially at a large scale, which can be a concern for organizations with tight budgets. At a large scale it is good, but for start-ups and some medium-range companies, it is expensive and they cannot afford it, especially as the cost increases with respect to data volume and retention needs.

How are customer service and support?

Support wise, there are two kinds of support for Splunk Observability Cloud: bi-weekly support and on-call support, with one more being premium support. They need to decrease the price of premium on-call support because as an employee, we require credits to get premium support, and our organization does not have many credits. That is a point where it lagged, but with respect to the bi-weekly calls and on-call support, it was acceptable. Out of five, I can give three for normal support, and four for premium call support.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

Previously, we used Prometheus and Grafana.

Which other solutions did I evaluate?

In comparing Splunk Observability Cloud to other observability platforms I have worked with, I find no key differences in both pros and cons. The integration process is the same across the board, and I feel there is not a real differentiator, as everything is similar in terms of custom dashboards and APM features.

What other advice do I have?

We miss the synthetic monitoring and AI-related features in Splunk Observability Cloud, which I think means front-end monitoring. We touch only the main AWS monitoring and service map, APM, and that is what we are using.

Regarding the ability to enrich data with custom metrics in Splunk Observability Cloud, we configured our breaches based on application performance only. Every application has different SLAs and SLOs, and according to each application, we have configured alerts using baselines that got triggered. We correlate this with multiple factors, such as Java-based memory leaks or garbage collections, and we generate custom metrics with alerts for notification purposes, employing the Webhook URL of Microsoft Teams and Outlook.

The out-of-the-box customizable dashboards provided by Splunk Observability Cloud are effective in showcasing IT performance to business leaders. It offers a nice point, as when we correlate different charts, I get so many x-axis and y-axis options, and we can correlate with other metrics. We have formulas there to find ratios and averages, which was a nice experience offering so many options. We are using the f(x) functions with respect to maximum, minimum, and averages, which are quite good.

On a scale of one to ten where ten is the best, I would rate Splunk Observability Cloud differently. For the UI part, I would rate it an eight, but for the configuration part, I would rate it three to four, as the configuration and integration aspects are not good at all. Overall, I would rate Splunk Observability Cloud a three out of ten.

Which deployment model are you using for this solution?

On-premises

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)