Listing Thumbnail

    Splunk Observability Cloud

     Info
    Sold by: Splunk 
    Deployed on AWS
    Splunk Observability Cloud is the only fully integrated, turn-key solution for DevOps teams to conquer the complexity caused by modern applications and infrastructure. It powers high performing applications to deliver world-class customer experiences by eliminating operational blindspots. You can quickly find, analyze and resolve incidents anywhere in your stack with all the answers in one place. Unlike other vendors, with Splunk Observability Cloud you only need to instrument once with OpenTelemetry to get unified metrics, traces and logs collected in real-time, without sampling for full-stack, end-to-end visibility. AI-driven pattern detection proactively identifies and alerts on issues in seconds, drastically lowering MTTR. One tightly integrated modern UI powered by the most advanced capabilities means reduced tool sprawl, centralized management, cost control, and one seamless and streamlined workflow for monitoring, troubleshooting, investigation and resolution.
    4.2

    Overview

    Splunk Observability Cloud is the only fully integrated, turn-key solution of all the tools DevOps teams need to monitor any stack at any scale. One seamless UI provides end-to-end visibility, context rich workflows and lets you drill down to root cause in seconds.

    Splunk Observability Cloud includes:

    Infrastructure Monitoring - Splunk Cloud Infrastructure Monitoring provides DevOps, CloudOps, and SRE teams with real-time, full-stack visibility across all layers of their environment. With hundreds of out-of-the-box integrations, streaming analytics, pre-built dashboards, intelligent problem detection, programmability, and Service Bureau capabilities, Splunk Infrastructure Monitoring provides the fastest, most flexible visualization and accurate alerting for enterprise DevOps teams to meet or exceed Service Level Objectives (SLOs) by quickly detecting, triaging and resolving performance issues.

    Synthetic Monitoring - Splunk Synthetic Monitoring helps teams proactively eliminate customer-facing issues and optimize web and API performance to deliver better digital experiences. Our solution goes beyond basic uptime monitoring and incorporates filmstrips and screen recordings of user experience, OOTB benchmarks and customizable performance metrics, and seamless connectivity to a suite of observability solutions to help teams quickly understand and prioritize performance defects wherever they originate, and collaborate to quickly resolve these issues to deliver digital experiences that delight customers.

    APM (incl. Always On Profiling) - Splunk APM is the industry's most advanced Observability solution to troubleshoot issues and optimize performance for modern applications. It includes:

    100% data capture: Never miss an issue or anomaly across in your code or traces with Full-Fidelity, NoSample data capture to ingest and contextualize all your telemetry data, and code profiling to identify performance bottlenecks Directed troubleshooting: easily identify, scope, and resolve issues with guided troubleshooting that automatically correlates and contextualizes system performance to find root cause faster OpenTelemetry standardization: engineering teams receive flexible instrumentation to build and measure data from new code in services, with no proprietary vendor lock in

    Real User Monitoring - Splunk RUM connects ALL front-end traces with their backend tracing, providing unmatched visibility that enables DevOps teams to understand exactly how their backend services impact user experience, thereby simplifying troubleshooting and resource allocation. With streaming analytics, SREs and developers are alerted within seconds of any user issue, and powerful AI capabilities coupled with high cardinality analysis enable them to bring the issue to speedy resolution. OpenTelemtry-based instrumentation provides maximum flexibility and ensures customers are never locked in.

    Log Observer Connect - Consolidate your tools by unifying the logs from Splunk Enterprise and Splunk Cloud, with our best-in-class metrics and traces. Log Observer Connect lets observability users explore the data you're already sending to your existing Splunk instances with Splunk Log Observer's intuitive no-code interface for faster troubleshooting, root-cause analysis and better cross-team collaboration.

    Splunk Observability Cloud suite starts at $15 per host, per month, billed annually. Minimum host quantities pricing apply.

    Highlights

    • Full-stack, end-to-end visibility: with a tightly integrated modern UI and seamless, context- rich workflows for full stack monitoring, troubleshooting and investigation of the unknown unknowns. Splunk Splunk Observability Cloud lets you drill down to root cause in seconds. You can easily integrate your existing monitoring tools to bring full context to alerts behind every incident.
    • NoSample™ full fidelity tracing: no more dead end investigations using a NoSample™ full fidelity approach to capture and visualize all data, in context, making sure no anomalies get missed. Find the backend root cause of any front-end issue. When troubleshooting backend issues, full fidelity tracing helps finding any issue, even those that do not result in system errors, and issues that no one anticipated.
    • Monitor any stack at any scale: Great for on-prem, hybrid and multicloud environments. Splunk Observability Cloud is a future-proof observability investment with a solution that will scale with customers and can meet the needs of any cloud-native environment, no matter how large (up to petabytes of ingest per day) or how complex (multiple cloud environments all integrated into one system of record), without compromising performance.

    Details

    Sold by

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Splunk Observability Cloud

     Info
    Pricing is based on the duration and terms of your contract with the vendor. This entitles you to a specified quantity of use for the contract duration. If you choose not to renew or replace your contract before it ends, access to these entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    12-month contract (3)

     Info
    Dimension
    Description
    Cost/12 months
    OBSCloud: Infrastructure
    Real-time visibility for infrastructure health. 200 hosts included
    $36,000.00
    OBSCloud: App & Infra
    All the data you need to adopt microservices. 100 hosts included
    $72,000.00
    OBSCloud: End-to-End
    Troubleshoot O11y with ease to create the best UX. 100 hosts included
    $90,000.00

    Vendor refund policy

    All purchases are final, no returns or refunds.

    Custom pricing options

    Request a private offer to receive a custom quote.

    How can we make this page better?

    Tell us how we can improve this page, or report an issue with this product.
    Tell us how we can improve this page, or report an issue with this product.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Splunk offers a variety of support options to help ensure your success. support@splunk.com 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Data Anonymization, Data Security and Governance

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    0 reviews
    Insufficient data
    Insufficient data
    Insufficient data
    Insufficient data
    7 reviews
    Insufficient data
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Full-Fidelity Data Capture
    NoSample full fidelity tracing approach that captures and visualizes all telemetry data without sampling, ensuring no anomalies are missed across metrics, traces, and logs.
    OpenTelemetry-Based Instrumentation
    Standardized instrumentation using OpenTelemetry framework for flexible data collection from services and applications without vendor lock-in.
    AI-Driven Anomaly Detection
    Machine learning-powered pattern detection that proactively identifies and alerts on performance issues and anomalies within seconds to reduce mean time to resolution.
    Integrated Monitoring Capabilities
    Unified platform combining infrastructure monitoring, synthetic monitoring, application performance monitoring with always-on profiling, real user monitoring, and log analysis in a single interface.
    Multi-Environment Scalability
    Support for on-premises, hybrid, and multi-cloud environments with capability to handle petabytes of data ingestion per day across complex distributed architectures.
    Real-time Data Collection and Indexing
    Collects and indexes machine-generated data from virtually any source or location in real time with automatic indexing upon data ingestion.
    Complex Event Correlation
    Correlates complex events spanning multiple diverse data sources using time-based correlations, transaction-based correlations, sub-searches, lookups, and joins.
    Scalable Data Processing
    Scales to collect and index tens of terabytes of data per day with distributed computing architecture.
    High Availability Clustering
    Provides clustering technology for availability and fault tolerance across distributed computing environments.
    Machine Data Search and Analysis
    Enables searching, analyzing, and visualization of machine data generated by IT systems and technology infrastructure across physical, virtual, and cloud environments.
    Data Routing and Destination Management
    Routes data to multiple destinations with capability to deliver specific data to targeted tools while archiving full fidelity data to cost-effective storage
    Data Optimization and Reduction
    Reduces data streams by up to 50% through removal of unused log and metric data
    Event Processing and Transformation
    Processes event data through centralized parsing with capabilities to route, optimize, reformat, and enrich data in flight
    Role-Based Access Control
    Implements role-based access control with support for external authentication via LDAP, Splunk, and OpenID Connect identity providers
    Real-Time Monitoring and Configuration
    Provides GUI-based configuration and testing interface with live data capture and real-time observability pipeline monitoring

    Contract

     Info
    Standard contract
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    4.2
    119 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    53%
    41%
    5%
    1%
    0%
    28 AWS reviews
    |
    91 external reviews
    External reviews are from G2  and PeerSpot .
    Dhaval Bhalgamadiya

    Real-time dashboards and AI-driven insights have reduced incident resolution time significantly

    Reviewed on Apr 30, 2026
    Review from a verified AWS customer

    What is our primary use case?

    In our organization, we are using Splunk Observability Cloud  for real-time monitoring and troubleshooting of our applications and the infrastructure performance, tracking metrics such as CPU usage, memory, latency, and the services of different microservices which we run for our applications and products.

    What is most valuable?

    The best features from Splunk Observability Cloud  include the high-level dashboard for clear visibility of our infrastructure and the product, as well as the detailed traces for the request flow of our APIs and the in-between application communication. From the detailed traces, we can know where our application fails, allowing us to solve incidents very easily, which has drastically reduced the MTTR of our application.

    I find the out-of-the-box dashboards very helpful. Although we have not done much customization yet, the out-of-the-box dashboards and detection capabilities include pre-built dashboards for common services and infrastructure components. We have not used them extensively, but we customize them for our organization's needs, and we also adapt the detectors for alerting purposes.

    I find the AI-powered analytics very helpful because we have also used other observability platforms such as SignalFX, where the AI-powered analytics is not built into the application. Here , the AI provides intelligent insights and very early anomaly detection and pattern recognition, automatically informing us of highly unusual behavior in the application before any incident or outage occurs during production.

    What needs improvement?

    One area that has room for improvement is the pricing; as I mentioned, it can be expensive due to large data volumes. Also, the pricing can be unpredictable, and if it were more predictable, the organization would be more comfortable with it. Additionally, I found the learning curve quite steep when I started using Splunk Observability  Cloud; it took me some time to learn it. I also think that while our team is large enough to utilize it, smaller teams might not prefer this solution.

    We have not started customizing Splunk Observability  Cloud yet according to our needs, but we plan to in the next weeks. We have used the basic customization features, and I believe it is customizable.

    For how long have I used the solution?

    I have been using Splunk Observability Cloud for the last one year; I have joined my recent organization from the last three to four months, where I have been using it from the last three to four months.

    What do I think about the stability of the solution?

    The stability and reliability of Splunk Observability Cloud is top-notch, as we have not faced much downtime, so I would rate it nine.

    What do I think about the scalability of the solution?

    The scalability of Splunk Observability Cloud is also very good; we can ingest any data we desire, so I would rate that nine as well.

    How are customer service and support?

    I rate the technical support as very proactive, and our doubts and queries are resolved properly, so I would give it a rating of five.

    Which solution did I use previously and why did I switch?

    Before using Splunk Observability Cloud, we had used SignalFX and considered vendors such as Datadog  and New Relic . We chose Splunk Observability Cloud because of its vast features, the visibility we gain from the dashboard, the AI integrated into the platform, detailed traces, and logging capabilities. While Datadog  and New Relic  are also good, Splunk Observability Cloud is better in certain areas.

    How was the initial setup?

    The deployment part was handled by the other developers and ops engineers in my organization, but I know the initial setup for Splunk Observability Cloud is simple and very easy.

    What about the implementation team?

    The deployment part was handled by the other developers and ops engineers in my organization.

    What was our ROI?

    From an ROI perspective, Splunk Observability Cloud offers much higher value because, as I mentioned earlier, our MTTR has reduced by more than 50%, which decreases the overall downtime for our application. When there is an outage, the time to resolve is shorter, and application uptime has also increased because of it. This improvement is the main reason for using Splunk Observability Cloud; we wanted to decrease our application downtime. Additionally, the visibility provided by the dashboard helps us understand where our application has failed.

    Which other solutions did I evaluate?

    Before using Splunk Observability Cloud, we had used SignalFX and considered vendors such as Datadog and New Relic. We chose Splunk Observability Cloud because of its vast features, the visibility we gain from the dashboard, the AI integrated into the platform, detailed traces, and logging capabilities. While Datadog and New Relic are also good, Splunk Observability Cloud is better in certain areas.

    What other advice do I have?

    I have not used the no-sample tracing feature yet, so I am not sure about that.

    I would say it takes around one month to learn Splunk Observability Cloud; it varies from person to person, but that was my experience in learning all the features and use cases our organization employs.

    Our company is not deeply involved in LLMs and GPUs for AI applications; our applications mainly run on normal Java processes on standard servers, not on GPUs or LLMs yet. We are in the process of developing our capabilities in AI later on.

    We are using normal servers as a cloud-based solution, but we still have some drawbacks, mainly the pricing part, as smaller teams may not find it suitable, and the pricing model is complex while the learning curve is steep, particularly for the SignalFlow query language.

    My advice for anyone considering this solution is to opt for Splunk Observability Cloud without any hesitation, as it can drastically decrease the mean time to resolution and mean time to detect any issues in their applications. The overall visibility of the organization, including application usage and memory metrics, is clearly presented on the dashboard, allowing insights into what went wrong and when. Although the learning curve can be challenging initially, users will adapt and find it very beneficial for their organization.

    I would describe the pricing as neither too high nor too low; however, if it could be cheaper, it would be beneficial for us since sometimes due to large data volumes, it can be expensive for the organization to track large datasets, as it charges for large volumes of data. Sometimes it can be costly if the data we are receiving is irrelevant.

    Our organization has between 200 to 500 people, and I believe that more than 100 people are using Splunk Observability Cloud, including developers, ops engineers, security engineers, and others. I am not certain of the exact number, but it is definitely more than 50.

    I would rate this product overall at a nine.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Ashutosh Parmar

    AI-driven observability has reduced resolution times and improves real-time monitoring

    Reviewed on Apr 17, 2026
    Review provided by PeerSpot

    What is our primary use case?

    I mostly work with the performance metrics of the CPU, or host metrics, as well as application metrics and traces. Overall, I use these mostly for real-time monitoring based on the application to track application performance.

    For the monitoring of infrastructure, it is quite insightful because in-depth, I can see what is going on in the infrastructure. If something goes down or some crons fail inside the infrastructure, the alerts are quite helpful for more visibility on the cloud-native side.

    This is quite helpful for improving the application observability and the infrastructure side as well. I would rate observability above an eight.

    I am not that much involved in the business side because I work as a DevOps engineer, so I do not know how much it helps on that front. However, it helps in tracking traces and metrics quite generously well and helps us improve the application side for more reliability on the business side.

    What is most valuable?

    It is very helpful and really enhances the AI-powered analytics, which helps us for troubleshooting the application and to get more insightful information while troubleshooting application error rates.

    AI-powered guidance is really helpful because it provides more actionable insights and highlights anomalies automatically. I do not need to go through it manually, and it also helps us with smart alerting and recommendations.

    It helped operationally because due to the insights of the applications, I get more insight for our application to enhance it further. It detects anomalies and correlates data while guiding us to the root causes, so we can enhance our application accordingly.

    I have seen that mean time to resolution was reduced around 30 to 50 percent. The main reason for this combination is because of real-time monitoring and AI-powered anomaly detection and distributed tracing. Instead of manually checking the logs and metrics across multiple tools, the platform quickly highlights the issues, correlates data, and points us towards the root cause.

    After implementing Splunk Observability Cloud , there was a deep learning curve for the new tool. It took one or two months to get proper insights from it. After configuring, I have seen that it is very useful for tracking traces and metrics of our application, servers, and clusters. Adoption time is usually after two months, or after a few weeks of getting Splunk Observability Cloud .

    Splunk Observability  Cloud is highly effective in improving digital resilience. Real-time visibility and proactive alerting and fast root cause analysis, distributed tracing, and AI-driven insights enable anomaly detection, which allows us to quickly understand failures and recover faster. This is critical for maintaining system availability and helps us handle failures in complex distributed environments since we can see how services interact and where breakdowns occur.

    What needs improvement?

    Regarding features, it helps us for better understanding of how the application works and in-depth tracking of application monitoring.

    It can be more enhanced using additional AI power. I can get more reliability using AI because AI-driven guidance is more useful nowadays. It can really improve more on the AI side because it will help us to reduce manual intervention with the system and root cause analysis will be much better with AI over human analysis.

    I would say that it is quite helpful, but for different kinds of applications, it could be improved because sometimes it might provide a cloud judgment of the root cause analysis. I need to do manual intervention using a dedicated human for root cause analysis for better understanding of the root cause. This is how the agentic side can be improved.

    For how long have I used the solution?

    I have been working with Splunk Observability  Cloud for around a year.

    What do I think about the scalability of the solution?

    It is quite scalable. Right now, it is providing much better insights and can be more enhanced over several aspects. I would rate scalability an eight to eight point five.

    Which solution did I use previously and why did I switch?

    I have tried other solutions, but they were not that great in terms of functionalities and overall performance. Splunk Observability Cloud is much better than the others because it provides AI alongside the solution. This is very helpful due to the AI-driven solutions and guidance for root cause analysis. Splunk Observability Cloud goes through the details of application traces and metrics in depth, so I get better observability over the application. This is why I have preferred Splunk Observability Cloud over other monitoring tools.

    I have tried SignalFx, but it was not quite insightful. I have tried Splunk Observability Cloud over SignalFx.

    What other advice do I have?

    Splunk Observability Cloud is quite insightful and helpful for improving the observability side. I provide this solution an overall rating of eight.

    Aman Dhanesha

    Monitoring has reduced API latency and now predicts issues across our cloud infrastructures

    Reviewed on Apr 16, 2026
    Review provided by PeerSpot

    What is our primary use case?

    I mainly use Splunk Observability Cloud  to monitor the performance of our cloud-native infrastructure. Because we have created multiple infrastructures, we use it to handle and monitor everything.

    Splunk Observability Cloud  helps us manage latency across any of our projects and APIs. It is particularly valuable for detecting issues before they occur. We can predict features and errors in advance. Recently, we discovered problems in seven of our APIs that we were able to solve because of this predictive capability.

    What is most valuable?

    The best feature of Splunk Observability  Cloud is that I can identify the root cause of any problem, including API latency. The real-time alerts and smart alerting system are exceptional, allowing me to know what is happening in real-time.

    Detectors in Splunk Observability  Cloud are very useful, and I have recently used them with great results.

    Regarding the no-sample tracing feature, we collect multiple data from various sources. This feature is very useful since we recently shifted to it, and it is working very well.

    The AI-powered analytics that Splunk provides allows me to get a smart analyzed version of any report.

    Splunk Observability Cloud has greatly impacted our operations by reducing timing requirements. We get smarter solutions and overall use cases in a smart way. I have reduced our manpower requirements and time commitment significantly. Splunk Observability Cloud reduces our mean time to detect by approximately one to two hours.

    The LLM in Splunk Observability Cloud is very powerful, and the vector database infrastructure is excellent. This is why we switched from our previous tools, and I believe it was a very good decision that has resulted in better outcomes.

    What needs improvement?

    The AI-powered analytics that Splunk provides delivers a smart analyzed version of reports, and it is quite good, but it is very generic. The issues identified could be better addressed through deeper AI thinking to provide a more effective solution.

    For how long have I used the solution?

    I have been using Splunk Observability Cloud for more than eight or nine months.

    What do I think about the stability of the solution?

    Splunk Observability Cloud experienced a significant outage recently when it went down for approximately five to six hours. This impacted us considerably because we were actively working during that time.

    How are customer service and support?

    I would rate the technical support for Splunk Observability Cloud as 9.5 out of 10 because we received their support during our deployment. They were very helpful in assisting us to create a good infrastructure.

    Which solution did I use previously and why did I switch?

    I find Splunk Observability Cloud to be very good. I previously used DataDog for observing everything, but Splunk Observability Cloud is more accurate and a better solution.

    What was our ROI?

    Previously with other applications, analyzing and controlling our API latency required almost five to six hours a day of resources. With Splunk Observability Cloud, I only need to allocate one to two hours maximum per day to accomplish the same tasks.

    Which other solutions did I evaluate?

    I highly recommend Splunk Observability Cloud. If you are using any other third-party tool, Splunk Observability Cloud is significantly better than the alternatives.

    What other advice do I have?

    I highly recommend creating better documentation for Splunk Observability Cloud. This documentation could be integrated with AI to provide specific use case solutions so that users do not have to search through Splunk documentation every time. Instead, users could directly ask about the issues they are facing and receive targeted solutions. My overall review rating for Splunk Observability Cloud is 9 out of 10.

    Udit Parekh

    End-to-end tracing has transformed how we detect failures and optimize critical transactions

    Reviewed on Apr 08, 2026
    Review from a verified AWS customer

    What is our primary use case?

    Our primary use case for Splunk Observability Cloud  is to monitor our infrastructure and applications, and it helps us troubleshoot issues related to any failures.

    What is most valuable?

    The feature we appreciate most about Splunk Observability Cloud  is their distributed tracing. We also value the ability to create real-time dashboards and their alerting system is exceptional. The main best feature of that observability is their distributed tracing.

    We are very satisfied with the out-of-the-box dashboards and detectors in Splunk Observability  Cloud. In distributed tracing, we have banks as our clients, so if anything goes wrong with transactions, we directly go to the trace and troubleshoot those issues faster.

    The AI-powered analytics and guidance in Splunk Observability  Cloud is very useful. You can observe your LLM models and monitor the usage of your APIs in that cloud.

    Splunk helps improve our operational performance and resilience significantly. Before we used Splunk Observability Cloud, if any failures occurred, we had to go to servers and check all the log files to find the failure. Now in Splunk, we go to that single dashboard and filter with the timestamp of failure to directly find the log, allowing us to troubleshoot issues faster. In terms of optimization, before using Splunk, we could not measure why our API was taking 100 ms, but now through distributed tracing, we can see where the bottleneck of that API is. If that bottleneck is the database, we optimize our database queries, and our application is now optimized.

    Splunk Observability Cloud has reduced our mean time to detect by approximately 25 to 30 percent because it offers real-time monitoring and intelligent alerting, allowing us to troubleshoot issues faster and enhancing detection by approximately 30 to 40 percent.

    What needs improvement?

    In terms of pricing, I have one issue with Splunk Observability Cloud. In a large-scale organization, it does not have features such as cost optimization or budgeting for observability spend. I think they need to improve that so that I can optimize our observability. For instance, if our thousands of server applications are running, I should be able to set a budget, such as only spending $100 per month for a specific environment. They need to introduce that feature because it is very important for budgeting.

    In terms of areas for improvement in Splunk Observability Cloud, the first is cost budgeting. The second is that they have many integrations, but if you are new to Splunk or new to observability, you must dive deep into more concepts. They can improve user-friendly features so that new users can set up their observability in their environment more smoothly. I think they need to improve in that integration part so that end users can onboard their infrastructure or applications very effectively.

    I would appreciate more simplicity in the platform.

    For how long have I used the solution?

    I have been using Splunk Observability Cloud for the past eight or nine months.

    What do I think about the stability of the solution?

    I rate the stability of Splunk Observability Cloud as ten out of ten because it is very stable, especially since we are using their cloud environment, and Splunk Observability Cloud is built for cloud-native systems.

    What do I think about the scalability of the solution?

    We have not explored enriching data with custom metrics in Splunk Observability Cloud because their ready-to-use dashboards are well designed, and every organization can benefit from them. However, if you have a very large organization with over ten thousand servers running applications, you may need to build a team to create custom metrics for your specific use case.

    How are customer service and support?

    I would rate their technical support in Splunk Observability Cloud a nine.

    Which solution did I use previously and why did I switch?

    I have used other vendors such as Elastic Stack  and Grafana  Stack, but in Splunk Observability Cloud, there are so many integrations and useful features that no other vendor can offer. In Grafana , the logs and tracing features are almost nonexistent. You can use Grafana only for monitoring your infrastructure, but Splunk provides end-to-end visibility with infrastructure monitoring, tracing, and overall observability of our application.

    How was the initial setup?

    Deploying Splunk Observability Cloud is an intermediate task for new users, but if you have been in this space for one or two years or longer, then it is easy to deploy their products.

    It can take up to one week to deploy Splunk Observability Cloud.

    What other advice do I have?

    We are not using the NoSample tracing feature in Splunk Observability Cloud.

    In our organization, we have approximately 25 to 30 users using the solution daily.

    We do not require any maintenance for Splunk Observability Cloud since we are using their cloud solution, which means that all patching and updates are done by them.

    I recommend Splunk Observability Cloud to other organizations because we are currently saving our engineers time by 20 to 30 percent, and for infrastructure alerting, we can use it to ensure that servers will not go down. Every organization should use this because it will reduce your engineering team's effort and the downtime of your application, and in terms of any failure or APIs, you can troubleshoot your issues faster.

    End-to-end visibility into our cloud-native environment is very important. If an organization is building a SaaS or B2B software, then end-to-end visibility is crucial in terms of security, failures, and compliance. The end-to-end visibility of our infrastructure and applications is extremely important.

    I recommend Splunk Observability Cloud to every user because they offer trials. If you do not just read the reviews, you should try it out. Understanding the biggest features and why others are using it can be beneficial, and I always recommend Splunk Observability Cloud for end-to-end visibility in your application.

    I gave this review an overall rating of ten out of ten.

    Nishith Joshi

    Real-time monitoring has improved performance tracking and has simplified analyzing complex metrics

    Reviewed on Mar 30, 2026
    Review from a verified AWS customer

    What is our primary use case?

    I work in data analytics with experience in monitoring systems and working with large-scale data. I have used Splunk Observability Cloud  in the context of real-time monitoring and performance tracking.

    Splunk Observability Cloud  works well alongside Splunk Enterprise for logs and integrates with cloud platforms and monitoring tools. It is often used together with other observability solutions. The tracking metrics such as latency, error, and throughput are easily visible. I can also build dashboards for real-time visibility.

    We use Splunk Observability  Cloud to track latency metrics and identify where slowdowns are happening. We have visualized response time trends and quickly detected performance degradation. We have also used it for infrastructure monitoring. Over the past six months, we have been monitoring metrics such as CPU usage and memory. If there is unusual usage, we identify it quickly using this tool and take action before it impacts our performance.

    What is most valuable?

    Splunk Observability  Cloud has optimized our solutions and helped us understand the metrics. The AI-powered guidance in Splunk Observability Cloud helps us identify patterns and anomalies in system performance data. Instead of manually going through a large volume of metrics, it highlights unusual behavior and potential issues automatically. This makes it easier to detect problems early and understand where to focus, especially in complex systems.

    There is definitely log analysis and dashboards. Log monitoring and dashboards have been better using Splunk. Splunk Observability Cloud is the best tool for log monitoring and dashboards. Splunk Observability Cloud feels more focused on real-time metrics and performance tracking compared to some other traditional log-based tools.

    What needs improvement?

    The learning curve for understanding all features should be improved, and the cost can increase. Splunk Observability Cloud is very costly. Cost is one of the drawbacks.

    Sometimes too many alerts, if not configured properly, is a major drawback that could be improved.

    The prices are quite high. As I have mentioned earlier, we are Splunk partners, so this has been handled by my other team. However, for other companies and small startups, the prices are very high for them to use Splunk Observability Cloud. Price is a concern.

    For how long have I used the solution?

    I have been working with Splunk Observability Cloud for the past six to eight months.

    What do I think about the scalability of the solution?

    We have expanded our team and usage. We are scaling up right now from ten people to twenty-five or thirty. Over time, I expanded my usage by going through basic monitoring and exploring things like setting up custom dashboards. We have gradually expanded our usage from setting up dashboards and alerts.

    How are customer service and support?

    For customer service, I would rate them eight out of ten because whenever we raise a support case, they are always available for us.

    For Splunk real user monitoring, implementation took time because our engineers tried very hard. In case of support, there should be more engineers specifically for this case.

    Which solution did I use previously and why did I switch?

    We have used different products like Palo Alto and Cribl  before moving to Splunk Observability Cloud. As we got a partnership, we have shifted to Splunk Observability Cloud.

    What was our ROI?

    The information is confidential and I cannot share specific details. However, I can tell you in percentage that fifty to sixty percent of our work has been easy to identify in terms of performance metrics and performance using Splunk Observability Cloud.

    It has saved us thirty to forty percent in cost because we used some other tools before that were more costly. As we are Splunk partners, we obtained Splunk Observability Cloud, and our costs have been reduced by thirty to forty percent using this solution.

    What other advice do I have?

    My overall impression of using Splunk Observability Cloud is that it is a strong tool for real-time monitoring. It does take some time to get fully comfortable with all the features. We have not explored everything right now, but in the future, we are looking forward to using more features.

    A part of the implementation has been handled by my other team. I have explored using custom metrics to enrich observability data, mainly by adding application layer or business-related metrics alongside system metrics. I have used custom metrics in a limited way to add more context to monitoring, such as tracking application-specific metrics alongside system data.

    Dashboard customization in Splunk Observability Cloud is quite flexible. We care about metrics in different types of visualization, and it helps us organize them in a way that makes sense for monitoring. It allows us to build dashboards tailored to specific use cases. This makes it easier to monitor system performance and quickly identify issues without going through unnecessary data.

    The integration in real user monitoring from Splunk Observability Cloud is actually better than from some other tools. If you are looking for the best SIM tool, then Splunk Observability Cloud is for you. If you have funds and capability for the cost, then Splunk Observability Cloud is definitely the best tool you can use.

    I have given this review an overall rating of nine out of ten.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    View all reviews