Listing Thumbnail

    Splunk Observability Cloud

     Info
    Sold by: Splunk 
    Deployed on AWS
    Splunk Observability Cloud is the only fully integrated, turn-key solution for DevOps teams to conquer the complexity caused by modern applications and infrastructure. It powers high performing applications to deliver world-class customer experiences by eliminating operational blindspots. You can quickly find, analyze and resolve incidents anywhere in your stack with all the answers in one place. Unlike other vendors, with Splunk Observability Cloud you only need to instrument once with OpenTelemetry to get unified metrics, traces and logs collected in real-time, without sampling for full-stack, end-to-end visibility. AI-driven pattern detection proactively identifies and alerts on issues in seconds, drastically lowering MTTR. One tightly integrated modern UI powered by the most advanced capabilities means reduced tool sprawl, centralized management, cost control, and one seamless and streamlined workflow for monitoring, troubleshooting, investigation and resolution.
    4.2

    Overview

    Splunk Observability Cloud is the only fully integrated, turn-key solution of all the tools DevOps teams need to monitor any stack at any scale. One seamless UI provides end-to-end visibility, context rich workflows and lets you drill down to root cause in seconds.

    Splunk Observability Cloud includes:

    Infrastructure Monitoring - Splunk Cloud Infrastructure Monitoring provides DevOps, CloudOps, and SRE teams with real-time, full-stack visibility across all layers of their environment. With hundreds of out-of-the-box integrations, streaming analytics, pre-built dashboards, intelligent problem detection, programmability, and Service Bureau capabilities, Splunk Infrastructure Monitoring provides the fastest, most flexible visualization and accurate alerting for enterprise DevOps teams to meet or exceed Service Level Objectives (SLOs) by quickly detecting, triaging and resolving performance issues.

    Synthetic Monitoring - Splunk Synthetic Monitoring helps teams proactively eliminate customer-facing issues and optimize web and API performance to deliver better digital experiences. Our solution goes beyond basic uptime monitoring and incorporates filmstrips and screen recordings of user experience, OOTB benchmarks and customizable performance metrics, and seamless connectivity to a suite of observability solutions to help teams quickly understand and prioritize performance defects wherever they originate, and collaborate to quickly resolve these issues to deliver digital experiences that delight customers.

    APM (incl. Always On Profiling) - Splunk APM is the industry's most advanced Observability solution to troubleshoot issues and optimize performance for modern applications. It includes:

    100% data capture: Never miss an issue or anomaly across in your code or traces with Full-Fidelity, NoSample data capture to ingest and contextualize all your telemetry data, and code profiling to identify performance bottlenecks Directed troubleshooting: easily identify, scope, and resolve issues with guided troubleshooting that automatically correlates and contextualizes system performance to find root cause faster OpenTelemetry standardization: engineering teams receive flexible instrumentation to build and measure data from new code in services, with no proprietary vendor lock in

    Real User Monitoring - Splunk RUM connects ALL front-end traces with their backend tracing, providing unmatched visibility that enables DevOps teams to understand exactly how their backend services impact user experience, thereby simplifying troubleshooting and resource allocation. With streaming analytics, SREs and developers are alerted within seconds of any user issue, and powerful AI capabilities coupled with high cardinality analysis enable them to bring the issue to speedy resolution. OpenTelemtry-based instrumentation provides maximum flexibility and ensures customers are never locked in.

    Log Observer Connect - Consolidate your tools by unifying the logs from Splunk Enterprise and Splunk Cloud, with our best-in-class metrics and traces. Log Observer Connect lets observability users explore the data you're already sending to your existing Splunk instances with Splunk Log Observer's intuitive no-code interface for faster troubleshooting, root-cause analysis and better cross-team collaboration.

    Splunk Observability Cloud suite starts at $15 per host, per month, billed annually. Minimum host quantities pricing apply.

    Highlights

    • Full-stack, end-to-end visibility: with a tightly integrated modern UI and seamless, context- rich workflows for full stack monitoring, troubleshooting and investigation of the unknown unknowns. Splunk Splunk Observability Cloud lets you drill down to root cause in seconds. You can easily integrate your existing monitoring tools to bring full context to alerts behind every incident.
    • NoSample™ full fidelity tracing: no more dead end investigations using a NoSample™ full fidelity approach to capture and visualize all data, in context, making sure no anomalies get missed. Find the backend root cause of any front-end issue. When troubleshooting backend issues, full fidelity tracing helps finding any issue, even those that do not result in system errors, and issues that no one anticipated.
    • Monitor any stack at any scale: Great for on-prem, hybrid and multicloud environments. Splunk Observability Cloud is a future-proof observability investment with a solution that will scale with customers and can meet the needs of any cloud-native environment, no matter how large (up to petabytes of ingest per day) or how complex (multiple cloud environments all integrated into one system of record), without compromising performance.

    Details

    Sold by

    Delivery method

    Deployed on AWS
    New

    Introducing multi-product solutions

    You can now purchase comprehensive solutions tailored to use cases and industries.

    Multi-product solutions

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.
    Buyer guide

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
    Financing for AWS Marketplace purchases

    Pricing

    Splunk Observability Cloud

     Info
    Pricing is based on the duration and terms of your contract with the vendor. This entitles you to a specified quantity of use for the contract duration. If you choose not to renew or replace your contract before it ends, access to these entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator  to estimate your infrastructure costs.

    12-month contract (3)

     Info
    Dimension
    Description
    Cost/12 months
    OBSCloud: Infrastructure
    Real-time visibility for infrastructure health. 200 hosts included
    $36,000.00
    OBSCloud: App & Infra
    All the data you need to adopt microservices. 100 hosts included
    $72,000.00
    OBSCloud: End-to-End
    Troubleshoot O11y with ease to create the best UX. 100 hosts included
    $90,000.00

    Vendor refund policy

    All purchases are final, no returns or refunds.

    Custom pricing options

    Request a private offer to receive a custom quote.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Splunk offers a variety of support options to help ensure your success. support@splunk.com 

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

     Info
    Updated weekly

    Accolades

     Info
    Top
    10
    In Data Anonymization, Data Security and Governance

    Customer reviews

     Info
    Sentiment is AI generated from actual customer reviews on AWS and G2
    Reviews
    Functionality
    Ease of use
    Customer service
    Cost effectiveness
    0 reviews
    Insufficient data
    Insufficient data
    Insufficient data
    Insufficient data
    7 reviews
    Insufficient data
    Positive reviews
    Mixed reviews
    Negative reviews

    Overview

     Info
    AI generated from product descriptions
    Full-Fidelity Data Capture
    NoSample full fidelity tracing approach that captures and visualizes all telemetry data without sampling, ensuring no anomalies are missed across metrics, traces, and logs.
    OpenTelemetry-Based Instrumentation
    Standardized instrumentation using OpenTelemetry framework for flexible data collection from services and applications without vendor lock-in.
    AI-Driven Anomaly Detection
    Machine learning-powered pattern detection that proactively identifies and alerts on performance issues and anomalies within seconds to reduce mean time to resolution.
    Integrated Monitoring Capabilities
    Unified platform combining infrastructure monitoring, synthetic monitoring, application performance monitoring with always-on profiling, real user monitoring, and log analysis in a single interface.
    Multi-Environment Scalability
    Support for on-premises, hybrid, and multi-cloud environments with capability to handle petabytes of data ingestion per day across complex distributed architectures.
    Real-time Data Collection and Indexing
    Collects and indexes machine-generated data from virtually any source or location in real time with automatic indexing upon data ingestion.
    Complex Event Correlation
    Correlates complex events spanning multiple diverse data sources using time-based correlations, transaction-based correlations, sub-searches, lookups, and joins.
    Scalable Data Processing
    Scales to collect and index tens of terabytes of data per day with distributed computing architecture.
    High Availability Clustering
    Provides clustering technology for availability and fault tolerance across distributed computing environments.
    Machine Data Search and Analysis
    Enables searching, analyzing, and visualization of machine data generated by IT systems and technology infrastructure across physical, virtual, and cloud environments.
    Data Routing and Destination Management
    Routes data to multiple destinations with capability to deliver specific data to targeted tools while archiving full fidelity data to cost-effective storage
    Data Optimization and Reduction
    Reduces data streams by up to 50% through removal of unused log and metric data
    Event Processing and Transformation
    Processes event data through centralized parsing with capabilities to route, optimize, reformat, and enrich data in flight
    Role-Based Access Control
    Implements role-based access control with support for external authentication via LDAP, Splunk, and OpenID Connect identity providers
    Real-Time Monitoring and Configuration
    Provides GUI-based configuration and testing interface with live data capture and real-time observability pipeline monitoring

    Contract

     Info
    Standard contract
    No
    No

    Customer reviews

    Ratings and reviews

     Info
    4.2
    82 ratings
    5 star
    4 star
    3 star
    2 star
    1 star
    50%
    44%
    6%
    0%
    0%
    25 AWS reviews
    |
    57 external reviews
    External reviews are from G2  and PeerSpot .
    HrishikeshNavkar

    Metric-based monitoring has simplified alerting and currently supports our cloud migration

    Reviewed on Feb 04, 2026
    Review provided by PeerSpot

    What is our primary use case?

    Currently, we are in the process of migrating from on-premises to Splunk Cloud as well as Observability . For metric-based monitoring, we can monitor via Observability  and are migrating it there. We are setting up private locations to monitor synthetic tests, such as ping checks, port checks, and URL monitoring. The rest is metric-based monitoring, which is being done by Splunk using Splunk OTeL, which is an OpenTelemetry  agent for Observability. This agent brings metrics from end devices to Observability. Based on these metrics, we set detectors and rules to trigger alerts.

    Our observability is not yet live in production with Splunk Observability Cloud . It is currently being built, and we are adding new components, but it is not yet fully ready.

    What is most valuable?

    Comparing to Cloud, Splunk Cloud, or any other solution, the most valuable feature of Splunk Observability Cloud  is that it is entirely based on metrics. The agent is also very lightweight compared to Splunk UF and does not consume much compute resources on the end server or host from which we are pulling data. However, it can only monitor metrics and cannot monitor logs.

    Regarding how Splunk Observability Cloud has benefited our organization, we are yet to go live, but most of the configuration that requires conditions and triggers on Splunk Cloud involves writing queries. With Splunk Observability Cloud, the process is quite simple. We can directly get metrics flowing, set thresholds, and everything is UI-based. This requires less time to set up and use. I do not have that much visibility with Splunk Observability Cloud at this time as I am working as an administrator. It has helped us create dashboards for visualization purposes.

    What needs improvement?

    There is one thing that could be improved in Splunk Observability Cloud. We have the capability in Splunk to connect to Splunk agents such as Splunk forwarders from a deployment server and update the end agents and forwarders using server classes. We can push and update configurations from our own hosted servers without needing to access the end device. In Splunk Observability, the OTeL agent cannot be updated from our end. Every time we need to update, we have to reach out to users or gain access to the host to update the configurations. There should be a solution to update OTeL agents from Splunk Observability Cloud itself.

    For how long have I used the solution?

    I have been working with Splunk Observability Cloud for approximately five to six months.

    What do I think about the stability of the solution?

    Splunk Observability Cloud is reliable based on my experience with stability and reliability so far.

    We were facing some challenges with the stability of Splunk Observability Cloud regarding the login page. It was not working several times and was not accepting SSO  authentication. The observability team found a solution for this issue, though I am not fully aware of the details. There were several times when opening the page did not directly log in and showed some errors.

    What do I think about the scalability of the solution?

    I have not encountered any scenarios regarding the scalability of Splunk Observability Cloud. It should be good because it is cloud-based. I am not aware of the licensing model and how it scales or what the rules are for scaling.

    How are customer service and support?

    I was not directly involved with technical support for Splunk Observability Cloud, but I am aware that my teammates reached out to support. They were finding issues regarding configuration, installation, and deployment of Observability for specific components. Since Observability is cloud-based and hosted by Splunk, the components we own on-premises are the OTeL gateways, agents, and private locations. They reached out to the vendor regarding these components, and the support was quite smooth. They have raised some bugs as well for the vendor to fix. I would rate the technical support from Splunk an eight out of ten.

    How would you rate customer service and support?

    Positive

    How was the initial setup?

    Since it is cloud-based, Splunk Observability Cloud was ready to use upon deployment. The OTeL gateways were built by our team and required configuration. I was not part of that process but am aware that we needed to configure the OTeL gateways to route data to them as an endpoint and from there it would be ingested to Observability or forwarded to Observability. There were no significant issues with this process and it was quite smooth. However, configuring private locations on a few gateways was quite difficult to set up and maintain because Docker  was going down at times. There were some issues that were discussed with Splunk vendor, and they provided guidance on how to fix them.

    reviewer2800332

    Observability has exposed tracing gaps and inconsistent metrics while still mapping complex services

    Reviewed on Jan 30, 2026
    Review from a verified AWS customer

    What is our primary use case?

    In my organization, we have 150 to 160 applications yearly with different frameworks including .NET, Java, and Python based applications. All of them are hosted on different types of servers such as Windows, Linux, ECS, and EKS. With respect to deployments, we integrated Splunk Observability Cloud . Previously, we used Prometheus and Grafana . My organization considered Splunk Observability Cloud  to be a premium side of observability, so they switched from our previous solution.

    We use the tracing feature in Splunk Observability  Cloud.

    What is most valuable?

    I appreciate the service map and APM  in Splunk Observability Cloud the most. This is the main feature I value. The interface is completely UI based, so I can see the complete service map, observe the latency present, and view complete metadata for a particular service or any database-related service. The service map enables a 3D view of the complete application architecture.

    With respect to the effectiveness of Splunk Observability Cloud in improving digital resilience within the organization, it was quite similar to other third-party tools. The main distinction is that it has some improved security. We use SignalFlow queries, and with respect to those queries, we work with alerts and the dashboarding part. I can say it provides efficiency with improved security compared to other third-party tools, but in terms of usage, it is quite similar to Prometheus and Grafana .

    What needs improvement?

    I want to address a disadvantage regarding the service map showing misinformation with respect to latency, which relates to data reliability pulled from AWS  cloud or on-premise servers. We saw issues with latency because Splunk APM app shows different data than Prometheus and Grafana. We tried to get premium support and on-call support with Splunk, and they were helpful in troubleshooting, but they ended up with no solution.

    Performance with Splunk Observability Cloud is acceptable to me, but the modifications required by users are problematic. I had to build the complete alerting system and monitoring system, which had to be changed. The way they designed this is not optimal. If I compare with Prometheus, we can import and export dashboards, but here we face errors with dialogue boxes. We tried with technical support calls about this, but they were unable to solve it, so I do not understand why export and imports are not functioning.

    The overall impression of the no-sample tracing feature in Splunk Observability Cloud, specifically in terms of eliminating blind spots in data collection, is that it needs improvement because the data is not adequate compared to other third parties. We get disturbance in the dashboards and charts while trying to correlate data. The mechanism functions differently manually than it does with a SignalFlow query, and both should be equal. We are unable to replicate from manual processes to the automation method, which is the issue.

    The SignalFlow query feature in Splunk Observability Cloud needs improvement because it should function the same as manual processes. When we configure manual queries and then configure them via SignalFlow, they give different outputs. We tried with on-call support about this, but they were unable to address it, indicating there is a bug with the queries that needs improvement.

    For enhancements, I would like to see improvements in the OTEL agents, OTEL collectors, and other features in Splunk Observability Cloud. The guidelines in the official documentation are not working at all. We have to deploy processes in our own way, and the documentation works only in 60 percent of the conditions, leaving the remaining 40 percent as problematic and needing improvement.

    For how long have I used the solution?

    I have used Splunk Observability Cloud for nearly one to one and a half years.

    What do I think about the stability of the solution?

    I experienced a downtime with Splunk Observability Cloud one time. We were unable to access it for nearly one day, which took a lot of time to resolve. Normally, other tools do not take as much time, and I do not understand why Splunk took so long. From the vendor's end, they should address such issues in a much shorter timeframe. When downtime occurs, it raises concerns about how we measure and receive alerts, as everything needs to be in place.

    What do I think about the scalability of the solution?

    In terms of lowering the cost of unplanned digital downtime using Splunk Observability Cloud, I found that many users report it is expensive, especially at a large scale, which can be a concern for organizations with tight budgets. At a large scale it is good, but for start-ups and some medium-range companies, it is expensive and they cannot afford it, especially as the cost increases with respect to data volume and retention needs.

    How are customer service and support?

    Support wise, there are two kinds of support for Splunk Observability Cloud: bi-weekly support and on-call support, with one more being premium support. They need to decrease the price of premium on-call support because as an employee, we require credits to get premium support, and our organization does not have many credits. That is a point where it lagged, but with respect to the bi-weekly calls and on-call support, it was acceptable. Out of five, I can give three for normal support, and four for premium call support.

    How would you rate customer service and support?

    Neutral

    Which solution did I use previously and why did I switch?

    Previously, we used Prometheus and Grafana.

    Which other solutions did I evaluate?

    In comparing Splunk Observability Cloud to other observability platforms I have worked with, I find no key differences in both pros and cons. The integration process is the same across the board, and I feel there is not a real differentiator, as everything is similar in terms of custom dashboards and APM features.

    What other advice do I have?

    We miss the synthetic monitoring and AI-related features in Splunk Observability Cloud, which I think means front-end monitoring. We touch only the main AWS  monitoring and service map, APM, and that is what we are using.

    Regarding the ability to enrich data with custom metrics in Splunk Observability Cloud, we configured our breaches based on application performance only. Every application has different SLAs and SLOs, and according to each application, we have configured alerts using baselines that got triggered. We correlate this with multiple factors, such as Java-based memory leaks or garbage collections, and we generate custom metrics with alerts for notification purposes, employing the Webhook URL of Microsoft Teams  and Outlook.

    The out-of-the-box customizable dashboards provided by Splunk Observability Cloud are effective in showcasing IT performance to business leaders. It offers a nice point, as when we correlate different charts, I get so many x-axis and y-axis options, and we can correlate with other metrics. We have formulas there to find ratios and averages, which was a nice experience offering so many options. We are using the f(x) functions with respect to maximum, minimum, and averages, which are quite good.

    On a scale of one to ten where ten is the best, I would rate Splunk Observability Cloud differently. For the UI part, I would rate it an eight, but for the configuration part, I would rate it three to four, as the configuration and integration aspects are not good at all. Overall, I would rate Splunk Observability Cloud a three out of ten.

    Which deployment model are you using for this solution?

    On-premises

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)
    Pradeep Hiremath

    Monitoring has become more proactive and cloud operations are managed with real-time insights

    Reviewed on Jan 28, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My use case for Splunk Observability Cloud  is primarily for monitoring and cloud management, and it serves us well.

    What is most valuable?

    The best features in Splunk Observability Cloud  that I appreciate the most include its comprehensive monitoring capabilities and its user-friendly interface.

    The solution has significantly helped improve my operational performance and my company's resilience by providing real-time insights. The enhancements to my operational performance and resilience are noticeable.

    It has saved me a considerable amount of time and resources by streamlining our monitoring processes.

    My impression of the AI-powered analytics and guidance provided by Splunk Observability  Cloud is that they are very effective and enhance our decision-making.

    I do use the no-sample tracing feature to eliminate blind spots in data collection, and it is quite helpful.

    My team has effectively utilized the ability to enrich data with custom metrics to improve our analytical capabilities.

    The out-of-the-box customizable dashboards are effective, and they help showcase IT performance to business leaders quite effectively.

    What needs improvement?

    In Splunk Observability  Cloud, the areas that have room for improvement include usability enhancements to make it even better.

    For how long have I used the solution?

    I have been using Splunk Observability Cloud for a considerable time, and I can share my experience with it.

    What do I think about the stability of the solution?

    Regarding stability, I would rate the stability of Splunk Observability Cloud as a 9, indicating it is very reliable. Splunk Observability Cloud performs exceptionally in terms of stability under varying conditions.

    How are customer service and support?

    From 1 to 10, I would rate the technical support as an 8 since it is generally responsive and helpful.

    How would you rate customer service and support?

    Positive

    What about the implementation team?

    The solution was purchased through a partner, and my experience with the partner has been generally positive. My experience with the partner has been satisfactory as they provided the needed support throughout the process.

    What was our ROI?

    My experience with lowering the cost of unplanned digital downtime has been positive as it has indeed reduced downtime.

    What's my experience with pricing, setup cost, and licensing?

    Regarding the pricing of Splunk Observability Cloud, while I believe it can be improved, I would rate it around 7, leaning towards being expensive.

    Which other solutions did I evaluate?

    I would compare Splunk Observability Cloud with other solutions as more feature-rich and user-friendly based on my concerns.

    What other advice do I have?

    For others looking into this product, I would recommend trying it out with a proof of concept to see its benefits firsthand.

    Approximately 50 users in my company use Splunk Observability Cloud to leverage its capabilities effectively.

    The solution does require some maintenance, but it is quite straightforward in managing it.

    In terms of my company's relationship with Splunk, we are currently a customer making the most of their offerings.

    I would rate Splunk Observability Cloud a solid 8 from 1 to 10 based on my experience and satisfaction with its performance.

    Dhananjay Dileep

    Unified monitoring has improved end-to-end visibility and reduced detection time across apps

    Reviewed on Jan 12, 2026
    Review from a verified AWS customer

    What is our primary use case?

    I have been using Splunk Observability Cloud  for the past one year in my career. Splunk Observability Cloud  has been introduced to our project for end-to-end monitoring for applications, providing complete visibility of applications, services, tech stacks, and CIs, which constitutes the whole monitoring solution for an entire application.

    Previously, we were using different monitoring systems such as Dynatrace , the competitors of Splunk, and even Splunk Cloud Platform  or Enterprise platforms for logging alone. Now we have the entire solution under one name and one platform, which is Splunk Observability  Cloud, and that is why we mainly introduced Splunk Observability  Cloud to our project.

    What is most valuable?

    The UI is quite understandable, making it not as complex when compared to the other previous platforms I have worked on. Another thing I could specifically point out is that we can have entire visibility for the entire application performance when we look into Splunk Observability Cloud, and it is much easier to navigate across various aspects such as real user monitoring, application performance monitoring, or synthetic tests, making it stand above the other previous applications I have worked on previously.

    One thing I should point out is that there are some auto-detectors which are defaultly present in Splunk. For example, if you are configuring a detector for AWS  RDS  service, you have an auto-detector which detects what the technology is, and you will have a readily available detector, needing only to configure your specific metrics on that, which is one advantage. The dashboards especially stand out, being different compared to the other platforms. Even previously, Splunk Enterprise also had dashboards, but this is different as we can have live metrics through the dashboards, which is quite impressive with how Splunk Observability Cloud has been introduced and it is performing better than the previous Splunk versions.

    What needs improvement?

    When we have too many detectors in place for one particular app, such as when I have created 50+ detectors through my account, the entire page becomes a bit loaded when creating the 51st detector, feeling heavy and taking time to load. Additionally, it throws random errors; for example, when we try to save one detector, it might throw some random error which is not even related, with something else being wrong, not that particular error, but the underlying root cause might be different. Sometimes the error is just "some problem occurred," and we are not able to point out what the real cause is.

    This mainly happens when we have too many detectors or too many alerts in place rather than a standard number. One more thing is in the alert rules; if we have a main general alert, and instead of creating a new detector, we are adding a new rule under one detector, when the number of rules also increases, such as when we have 10 or 15 rules under one generic detector, that again creates the same kind of problem, taking some time to save that particular newly added rule, and it might not save at times, just keeps on spinning. Those are the two drawbacks which I spotted recently; other than that, everything looks perfect.

    What do I think about the stability of the solution?

    There was an outage which occurred about three or four months ago; that was the only outage I faced entirely in one year, and I believe that was a global outage from Splunk's side, which prevented us from logging in for a couple of hours. The Splunk team was working on it, and they resolved it within five or six hours, which was the only outage I faced in one year. Other than that, everything was smooth.

    What do I think about the scalability of the solution?

    Splunk Observability Cloud is quite scalable compared to the other platforms I have worked on, and I do not find any difficulty in scaling up or even scaling down.

    How are customer service and support?

    For particular kinds of issues which we were not able to resolve, we have raised Splunk tickets a couple of times before contacting the technical support or customer support. The support is an eight out of ten. The speed is actually quite good; they would respond within 48 hours, and the solutions they are giving are quite good, as we were able to solve most of the issues with their solutions.

    How would you rate customer service and support?

    Positive

    Which solution did I use previously and why did I switch?

    I have used Dynatrace  and DataDog as alternatives to Splunk Observability Cloud.

    How was the initial setup?

    The initial deployment of Splunk Observability Cloud is actually easy. With the clear documentation we have in place, it is quite straightforward. We even have examples of code snippets in the documentation, making it quite straightforward.

    What about the implementation team?

    One or two people can manage the deployment; you do not need a team of five. I have myself worked on an entire project, and with one of my colleagues, I have worked on a much bigger project. I believe one or two people can easily manage the deployment process.

    What was our ROI?

    Splunk Observability Cloud has helped me reduce my mean time to detect. We have worked on around 80 applications last year for one particular client, and since the MTTR has improved drastically, they have given us 245 applications, which is around 150 applications added to the previous number of applications. This is definitely a performance improvement.

    What's my experience with pricing, setup cost, and licensing?

    The pricing area I am not particularly aware of because that is centrally managed by the company which I work for. However, I feel that we are not spending too much on the licensing cost; it is manageable for how much we are working with currently for the number of applications we have at present, which is what I believe is not too much.

    Which other solutions did I evaluate?

    I would prefer Splunk Observability Cloud any day when comparing these solutions to Dynatrace and DataDog because the first thing is that the documentation Splunk has is perfect, and anybody who is new, even new to the platform, can gain knowledge reading through the documents, which are perfectly explained for configuring various kinds of technologies and integrating various kinds of technologies with Splunk Observability Cloud. The second thing is the UI, which is much more user-friendly compared to Dynatrace and DataDog.

    What other advice do I have?

    The No-Sample Tracing helps me eliminate any blind spots in my data collection because we have particularly many services, and for example, using Mule, it is an added advantage to use no-sampling traces provided by Splunk Observability Cloud, giving us the exact points where the service is emitted and the exact spans between the two endpoints. It helps us break down where the actual issue is rather than just getting sample trace data and looking into each point entirely, which takes much time, providing a particular breakdown of that span and how it navigates across endpoints and pointing out the particular error which occurs when we access a service or when a service travels across two endpoints.

    As of now, we are not using any AI tools in Splunk Observability Cloud, but we are planning to onboard them, considering that the number of applications we have is increasing day by day. We are planning to automate a few applications to generate the detectors and synthetics automatically as soon as we have the metrics in place. For that, we are developing code that can integrate with Splunk Observability Cloud platform and generate the results, which will be a time saver for us.

    The recent UI changes have been more streamlined. Initially, the UI was a bit different, but later, I believe a couple of months ago, maybe one month before, they re-modified the menu options, and that has also resulted well for us in navigating across the panes.

    I have utilized the ability to enrich data with custom metrics in Splunk Observability Cloud; I did it for custom metrics for AWS  services, and for a couple of MQ , IBM-based MQs, we have worked on custom metrics, integrating easily in both cases. My company may have partnerships with Splunk, but I am not sure of that. I would rate this review as a nine out of ten.

    reviewer2787105

    Log insights have boosted uptime and now drive automated remediation and pattern-based alerts

    Reviewed on Dec 17, 2025
    Review provided by PeerSpot

    What is our primary use case?

    I am using Splunk Observability Cloud  as a log-based monitoring tool for my databases. We have ingested our database logs and OS system logs into Splunk Observability Cloud  and are creating dashboards and alerting features over those alerts. One of my major use cases is that all kinds of databases I am currently working with have database logs that capture all information, warnings, and error messages. These database logs are moving to Splunk Observability  Cloud. The first use case is that I no longer need to maintain a long list of flat files on my server for all those logs. Those can be directly ingested into Splunk Observability  Cloud. The benefit I am seeing from here is that I can get pattern-based analysis of what kind of errors I am commonly getting and what the date patterns of those errors are. I can get dashboards over that and I can also create alerts. I can also incorporate those alerts with some back-end Git  workflow for automatic remediation. This is one of the solutions.

    Another use case for Splunk Observability Cloud that we are seeing is that there are multiple times when there is a requirement to publish some kind of data. So instead of publishing an alert if those data breaches occur or if some kind of dashboard needs to be created, instead of sending data directly to the users, if that data is not PII, we are also ingesting that into Splunk Observability Cloud in a JSON format and then again, dashboards and other alerting can be created. These two are the main major use cases for which I am using Splunk Observability Cloud.

    How has it helped my organization?

    With the help of the alerting and observability mechanism, resiliency, and automatic automation of issue remediation based on alerts and workflows, it actually reduces the cost and increases the uptime of my system and customer satisfaction. There are multiple indirect benefits I am getting when using Splunk Observability Cloud.

    Currently, with the growth of the organization, I am seeing an increasing use of Splunk Observability Cloud in a more dynamic way. We are continuously creating new dashboards, ingesting logs in JSON, and trying to bring the best value out of it. I am seeing a dynamic and drastic increase in the use of Splunk logs and the Splunk data we are ingesting.

    There are two aspects to expanding the usage. Organic growth of the environment actually puts new systems into Splunk Observability Cloud, and exploring new opportunities for what all can also be ingested into Splunk Observability Cloud. Previously, I can see that memory dumps are there. We are also looking at whether we can ingest memory dumps so that if the system is about to crash, those memory dumps can be captured into Splunk Observability Cloud so that it can create alerts over that and I can also perform analysis. I can also see if any other system is facing the same kind of memory dump issues. So that maybe it is one alert for one system for me, but for the complete farm, there may be different servers with different teams or business units facing the same issues. When I have Splunk Observability Cloud on all systems, I can actually create a consolidated report and see that this is the pattern which particular farms are having this kind of issues, and maybe something is broken. This is the way the plan is to increase the availability or the usage of Splunk Observability Cloud.

    What is most valuable?

    The performance and speed are valuable. Previously when Splunk offered the enterprise solution, I needed to install Splunk and maintain my local server. There was a limitation that only a certain number of servers could be supported in one instance and I would need to have multiple instances if I was in an enterprise system setup. When I am in the cloud, a single instance can support N number of systems. It is pretty fast, no matter how much data is there. Dashboards are pretty good with multiple functions available. The alignment or integration that can trigger automatic solutions with the workflow for automatic remediation of the alerts is the best thing. These three or four things are the best Splunk Observability Cloud features that I am seeing.

    The point in time alerting, the point in time data capture, and automatic remediation with the integration of good workflows or Ansible  workflows is definitely the key to any resiliency and increasing the uptime of any system.

    After moving to Splunk Observability Cloud, it is almost zero downtime. We never face downtime because when I was in the enterprise setup, I needed to maintain my servers and maintain hygiene of vulnerabilities, patches, and all. Now when I am in the cloud, everything is automatic. Almost zero downtime plus the perfect alerting feature and log-based analysis are available. Metrics alerting is also there in Splunk Observability Cloud through queries. This is one of the features that keeps me updated with the current health of my system and helps me to keep my system up and running fine and available for my customers.

    Splunk Observability Cloud incorporated a new AI agent feature that is really good. Sometimes I need to create queries and Splunk queries for filtering the data and some pattern-based analysis. This agent is really good in helping me and suggesting the queries. This means I do not need to have a Splunk expert or Splunk query expert. I can just ask that agent that I need pattern-based analysis or I need to create this kind of filters for this kind of data and it can suggest to me. Once it suggests a sample query to me, I can do the tweaking and I can have my data ready. It actually reduces my time to perform my analysis and to reach the conclusion about what exactly is causing issues in my system and what are the repetitive issues in my system. This AI feature really helps for newcomers to Splunk Observability Cloud to perform deep diving analysis with the data captured by it.

    Custom metrics are valuable. In Splunk Observability Cloud, some infra-level metrics are not available, but through custom metrics, I can achieve it. This is an add-on feature that Splunk Observability Cloud is providing and without any additional monitoring tool. If that feature was not there, then I would need to plan some other monitoring tool for metrics-based alerting, but this custom one helps me to achieve it in the same monitoring tool. The consolidation and integration of metrics-based alerting and log-based alerting in a single tool is actually the lovable feature. I do not need to worry about or look for multiple tools. I can have my own data and own health available in a single tool, in a single view.

    What needs improvement?

    The dashboards are good, but the only limitation I see currently is that they need particular formats only to create a dashboard. They need to have a particular JSON format or time series format. This sometimes creates additional work for me so that when I am ingesting logs in Splunk Observability Cloud, it should be in a specific format. Either Splunk Observability Cloud should have multiple formats available or multiple dashboards available for different kinds of formats. At least Splunk Observability Cloud has everything available at a Splunk level. They can do some kind of analysis and see what are the major top ten or top twenty types of logs they are getting and they can have dashboards according to those logs. Instead of forcing customers to design their logs in the way of Splunk Observability Cloud, Splunk Observability Cloud can create dashboards based on the customer requirement. This will actually ease things up for the end users.

    The current dashboards are good. The feedback is that Splunk Observability Cloud is forcing me to modify my logs that I am ingesting in Splunk Observability Cloud in a specific format. If Splunk Observability Cloud can leverage it and make it open for any format, that would be great. If that is not feasible, at least the top ten or top twenty logs that Splunk Observability Cloud is getting should be readable by Splunk Observability Cloud without any changes. That actually is one of the major feedback items I can provide which can actually ease the life of the end users or any layman. As a newcomer to Splunk Observability Cloud, I may not know JSON. I now need to hire someone or I need to look for someone who knows JSON and who can convert my logs into JSON format and then I will ingest them into the logs if I want to create a dashboard. If I do not want to create a dashboard, that is okay. On the other hand, Splunk Observability Cloud is giving me a usability and easy to go interface, but for a dashboard, I need to have an understanding of JSON so that I can ingest the log in JSON format. That is a dilemma that they have and they should work on.

    Currently, Splunk Observability Cloud is not the only solution which any organization is using. There is also Grafana  and PagerDuty. If Splunk Observability Cloud can plan some kind of integration with PagerDuty and Grafana , then those things can be controlled from a single position and if something else is happening at one location, it can update things at all levels. That can also bring great value to the users. Currently, I have to maintain three systems separately, but if some kind of integrations can be developed with these three vendors, then that can be a great thing because all these three have now become the industry pillars or industry standards for observability and resiliency.

    For how long have I used the solution?

    I have been working with it for the last two years. Before that, it was an enterprise solution. Now it is cloud-based.

    What do I think about the stability of the solution?

    I cannot relate any stability issues to my experience with Splunk Observability Cloud.

    What do I think about the scalability of the solution?

    Scalability is pretty smooth. I just need to deploy the Splunk forwarder and the config file that specifies which servers it should connect to and it will get connected. My data will start populating. It is pretty straightforward. I do not see any challenges there, even when it was in enterprise and now when it is in the cloud. The deployment and onboarding of new servers and ingesting the logs is pretty straightforward. Anybody can learn it within a day without having any prior knowledge.

    How are customer service and support?

    We have raised multiple questions when we face any issues. Our support is prompt and usually within a day, I will get my answers.

    How would you rate customer service and support?

    Positive

    Which solution did I use previously and why did I switch?

    Previously I was on Splunk Enterprise. I have been using Splunk for seven to eight years before we moved to the cloud in the last eighteen months.

    How was the initial setup?

    The initial setup is pretty smooth. I just need to deploy the Splunk forwarder and the config file that specifies which servers it should connect to and it will get connected. My data will start populating. It is pretty straightforward. I do not see any challenges there, even when it was in enterprise and now when it is in the cloud. The deployment and onboarding of new servers and ingesting the logs is pretty straightforward. Anybody can learn it within a day without having any prior knowledge.

    What other advice do I have?

    I appreciate that your organization collects reviews about the product so that it can be shared with the vendor or the product owner as appreciation or as feedback for improvement. Everything has been smooth in my experience. I would rate this product a ten out of ten.

    View all reviews