My main use cases for Splunk Observability Cloud include Application Performance Monitoring, Real User Monitoring, and Synthetic Monitoring.
Splunk Observability Cloud
SplunkExternal reviews
External reviews are not included in the AWS star rating for the product.
Has reduced digital downtime and supported operational performance through effective monitoring
What is our primary use case?
What is most valuable?
Splunk Observability Cloud has proven to be beneficial for our organization. In evaluating its effectiveness in improving digital resilience within my organization, I have experienced lower costs of unplanned digital downtime.
The solution has helped improve operational performance and company resilience.
What needs improvement?
Splunk Observability Cloud can be optimized to its full potential.
For how long have I used the solution?
I have been using Splunk Observability Cloud since this year.
What do I think about the stability of the solution?
The stability and reliability of Splunk Observability Cloud has been satisfactory. Customer service and technical support have been evaluated positively.
What do I think about the scalability of the solution?
Splunk Observability Cloud scales effectively with the growing needs of my organization.
How are customer service and support?
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I have always used Splunk products.
What was our ROI?
My experience with the pricing, setup costs, and licensing has shown a return on investment with Splunk Observability Cloud.
What other advice do I have?
I would recommend organizations to consider implementing Splunk Observability Cloud.
On a scale of one to 10, I would rate Splunk Observability Cloud overall as nine.
Alerting improves incident response across teams and enables faster awareness before customer impact
What is our primary use case?
My primary use cases for Splunk Observability Cloud include alerting the business and the integrations app team, which are the largest users of Splunk within our company. They take up most of our ingest and have many alerts set up, along with log analysis and events analysis.
Those are our biggest team users, and alerting in general plays a crucial role for incident creation across multiple teams, regardless of who the shareholders are, cutting across multiple teams.
How has it helped my organization?
Splunk Observability Cloud has significantly improved my operational performance and my company's resilience. I would evaluate the effectiveness of Splunk Observability Cloud in improving digital resilience within my organization as very positive. It's easier to access for both Splunk experts and users who may not be familiar with Splunk, allowing even non-technical people to navigate it.
What is most valuable?
The features I appreciate the most about Splunk Observability Cloud are the ServiceNow integration feature, which is very seamless, and that's probably my favorite since ServiceNow is big in the observability industry. Being able to seamlessly create those incidents is the biggest plus. Without that, we probably wouldn't be using Splunk.
Just alerting with Splunk Observability Cloud provides significant benefits for my organization, particularly since this is a food and manufacturing company. Alerting the business side before the customer reports an issue helps us to be one step ahead, which is a big feature.
Dashboards for the IT executives are also beneficial since they might not be as technical yet can easily read a dashboard. The ability to create something easily interpretable, using color coding and heat maps, allows directors to see value in it.
The ability to enrich data with custom metrics in Splunk Observability Cloud has significantly impacted our integrations and app team, the biggest users of Splunk in our environment. The visibility provided and the remediation they gain from reading events have led to increased log ingestion, which is a good problem to have. They can confidently rely on Splunk for monitoring app creations, APIs, and more.
What needs improvement?
Splunk Observability Cloud could be improved in terms of integrations with more technical add-ons, such as Zoom. Although they have one with Zoom, it's not available in the cloud, so having that feature would be beneficial. Essentially, Splunk should continue expanding to create easier ways to ingest logs from different products.
The out-of-the-box customizable dashboards in Splunk Observability Cloud are very effective in showcasing IT performance to business leaders. However, there are aspects that could be improved, such as linking dashboards to one another. While IT leaders may not drill down, it's crucial to create levels of dashboards for technical users to find root causes, making it effective for stakeholders.
For how long have I used the solution?
I have been using Splunk Observability Cloud for two years, as my first cloud was with ABC.
What do I think about the stability of the solution?
I have experienced slowness from Splunk Observability Cloud occasionally, yet we have not faced crashes or major performance issues. When slowdowns occur, we reach out to support for explanations and have received adequate responses.
What do I think about the scalability of the solution?
Splunk Observability Cloud scales efficiently with the growing needs of my organization, even requiring occasional license increases. Maintenance is done by Splunk, and we receive alerts about maintenance windows to ensure we stay current. So far, we've experienced seamless operations without any breakdowns.
How are customer service and support?
I find Splunk's customer service to be great compared to many other products. They do a good job of responding within 24 hours or less, even with P4 issues. They may reroute you a couple of times. Overall their support is commendable. I would rate technical support and customer service a solid eight.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
Prior to adopting Splunk Observability Cloud, I was aware that our company used another solution. I don't recall what it was since I was brought in specifically for Splunk.
How was the initial setup?
I wasn't a part of the deployment process.
What was our ROI?
I have definitely seen a return on investment with Splunk Observability Cloud, particularly through how fast it has grown and how comfortable other teams are in relying on its outputs for monitoring and observability.
I don't directly deal with costs, so I can't comment on the return on investment regarding lowering unplanned digital downtime using Splunk Observability Cloud.
What other advice do I have?
I haven't explored the no-sample tracing feature in Splunk to eliminate blind spots in data collection.
AI-powered analytics and guidance provided by Splunk Observability Cloud will be very beneficial. We just initiated a response to get those AI functionalities into our cloud environments, so we haven't fully explored it yet.
My advice to other organizations considering Splunk Observability Cloud is to get it, especially for log monitoring and alerting. There aren't too many observability tools that match its ease of use, whether for IT-oriented users or not. Its graphical user interface is brilliant and very seamless, making it easy for anyone to navigate. I'm confident big companies considering Splunk should choose it, as it delivers in usability and integration with other tools.
On a scale of one to ten, I rate Splunk Observability Cloud an eight.
Enables faster issue resolution by pinpointing problem areas through custom metrics and agent data
What is our primary use case?
My main use case for Splunk Observability Cloud is application monitoring.
What is most valuable?
The features of Splunk Observability Cloud that I appreciate the most are ops intel and the community support. These features have benefited my organization because they help us find the root cause of any issue quickly and pinpoint the exact location where the issue exists.
We have not yet completely gone into production, so I do not have any metrics or data points to share. To evaluate the effectiveness of Splunk Observability Cloud in improving digital resilience within my organization, we have various client applications, such as the teller application and our online banking applications.
Initially, before Splunk, we had a long time to resolve issues. Now, with Splunk Observability Cloud, we will be able to solve them quickly and know exactly where the issue is. Previously, we needed to go to the war room to find where the issue was. Now, with Splunk Observability Cloud and all its agents and data, we know exactly where the issue is located.
Regarding the no-sample tracing feature, all the data fed by the agents to Splunk Observability Cloud means we do not have to worry about missing any issues during sampling. We have not yet explored the AI-powered analytics feature, but we have partially explored MLTK.
My teams have utilized the ability to enrich data with custom metrics by writing custom agents in Java and Python to collect those custom metrics and feed them into Splunk Observability Cloud. This is particularly useful for applications without direct Splunk agents.
The out-of-the-box customizable dashboards are helpful in showcasing IT performance to business leaders. They provide guidance on requirements we may not have visualized and help us build custom dashboards to include our company-specific metrics. We have not yet expanded usage since we haven't started using it extensively.
What needs improvement?
To improve Splunk Observability Cloud, we need more applications to be included in the observability so that more applications can have agents to monitor them and bring that information to the cloud.
Splunk Observability Cloud has not yet completely improved our operational performance for our company's resilience as we are just starting out, however, it will help us ultimately to reduce incident time.
For how long have I used the solution?
I have been using Splunk Observability Cloud for one year now.
What do I think about the stability of the solution?
In my experience until now, I have not experienced any stability issues with Splunk Observability Cloud.
What do I think about the scalability of the solution?
Splunk Observability Cloud scales effectively with the growing needs of my organization. As we are a growing company transitioning all our applications to the cloud, and with the increasing number of cloud-native applications, Splunk Observability Cloud will help us achieve digital resiliency and reduce our mean time to resolution.
How are customer service and support?
I would evaluate customer service and technical support as excellent, as Splunk has been quite responsive to our service requests, with their team providing good support.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
Prior to adopting Splunk Observability Cloud, we were using Splunk Enterprise, and we had custom monitoring tools developed in-house.
How was the initial setup?
The installation of Splunk Observability Cloud worked smoothly once we figured out the initial issues. The agents do not consume many resources, and the type of metrics they collect is helpful.
What was our ROI?
Since we have not progressed far into the implementation of Splunk Observability Cloud, I cannot comment on the return on investment at this time.
What's my experience with pricing, setup cost, and licensing?
I am not involved in the experience with pricing, setup cost, and licensing.
What other advice do I have?
I rate Splunk Observability Cloud eight out of ten.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
RUM data has improved visibility into user paths and strengthened operational performance
What is our primary use case?
Our main use cases include synthetic monitoring, APM, RUM, alerting, detectors, dashboards, and all related functionality.
What is most valuable?
My favorite feature of Splunk Observability Cloud is the RUM because I like the RUM data. Splunk Observability Cloud has helped improve our operational performance and company's resilience, which was their initial offering. Without it, we wouldn't have any exposure and would be looking at raw data.
What needs improvement?
It can be improved through the integration of AI, which is either coming or already available.
For how long have I used the solution?
I've been using Splunk Observability Cloud for two and a half years.
What do I think about the stability of the solution?
I've only experienced downtime, crashes, or performance issues when it isn't configured correctly.
What do I think about the scalability of the solution?
Splunk Observability Cloud scales effectively with the growing needs of our organization; we simply need to pay more and ingest more data.
Which solution did I use previously and why did I switch?
Prior to this, I wasn't using another solution to address similar needs.
What was our ROI?
I've seen ROI with Splunk Observability Cloud, though I cannot specify the exact amount. We would be unable to track user access issues without it, which would result in significant losses.
What other advice do I have?
I use the cloud almost exclusively and am still learning some features. I handle synthetic monitoring, but don't manage all integrations or usage aspects. I need to explore the AI-powered analytics and guidance, as we haven't implemented it yet. The out-of-the-box customizable dashboards are effective because they contain all the necessary base components.
On a scale of one to 10, I would rate Splunk Observability Cloud as an eight. I appreciate the cloud because it provides more visibility into the user's path. It's quite good, though the observability aspect is somewhat complicated, primarily due to my limited experience with it.
Provides real-time visibility for improved operational performance
What is our primary use case?
We are using the Splunk Observability Cloud for monitoring purposes and troubleshooting, and we are using that infrastructure in real time, in which we have infrastructure monitoring, application monitoring, log observer, and RUM synthetic monitoring. For troubleshooting purposes, we are installing the open telemetry collector agent on some of the servers, including Intel, Windows, and UNIX servers.
I have also worked on the agent upgrade from version 0.103 to 0.1113, which is ongoing right now.
How has it helped my organization?
We are also using the dashboards and detectors in Splunk Observability Cloud. For client needs, we are creating dashboards, reports, and detectors as well. For the detectors, we mostly work on host-down situations. When a server is down, we troubleshoot using the detector infra host down and identify the root cause of the failure, such as why it was down or not reporting to Splunk Observability Cloud. We find out the root cause by using that detector when the alert gets triggered and cleared.
We use the tracing features in the Splunk Observability Cloud, primarily for application performance monitoring. It helps us figure out service maps for root cause analysis. It provides visibility and helps address blind spots in data collection.
Splunk Observability Cloud offers a transparent, customized tool with real-time visibility. We use AWS, ReactJS, Python, and Java for tracing. It helps create customized dashboards and service maps based on customer requirements. It has AI that automatically generates visualizations, allowing us to create more reports based on customer needs. My seniors are primarily working on creating dashboards, reports, and for monitoring purposes.
Their technical team is performing well. About a year ago, Splunk Observability Cloud was slow and lacked features compared to now. It didn't provide exact details for any searched server in the metrics, but the situation has improved significantly, and we can now retrieve complete data on when servers were down or up.
What is most valuable?
The best features in Splunk Observability Cloud are the metrics; I can see any logs or anything related to the server or services we want to monitor, and the metrics are a good function. It provides exact details. It offers unified visibility for logs, metrics, and traces.
What needs improvement?
In Splunk Observability Cloud, I notice room for improvement in synthetic monitoring. It does not provide output based on server names. It only gives a response when we input a URL. I'm not sure if this issue is specific to my organization, but it would be beneficial if server details could be retrieved directly in synthetic monitoring.
For how long have I used the solution?
I have been using this solution for two years and two months.
What do I think about the stability of the solution?
I would rate its stability an eight out of ten.
What do I think about the scalability of the solution?
I would rate its scalability an eight out of ten.
Around 100+ users access Splunk Observability Cloud in my organization, including the cloud SRE team, Windows Intel team, Linux team, and AD team.
My client base primarily consists of enterprise financial services.
How are customer service and support?
If any issues arise, we can raise a vendor case, and resolutions are provided in a timely and accurate manner.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
In my organization, we also work with Sentry, Datadog, PagerDuty, and Dynatrace. Splunk Observability Cloud offers more features than Datadog, which also provides APM monitoring, log observer, and metrics, but does not match the feature set of Splunk Observability Cloud.
How was the initial setup?
It is a bit complicated. For deploying Splunk Observability Cloud, we first need an access token, after which we connect to our AWS Cloud account and provide the access token. We must set up CloudWatch or AWS Lambda and forward the metrics or logs from all sources to AWS.
The implementation took about 45 days.
What was our ROI?
The return on investment varies based on requirements; for smaller tasks, we can leverage our team's capabilities effectively, so I can estimate around a 20% efficiency gain.
Currently, we are providing outputs to clients within the required time frames. If a client requests any dashboard, logs, APM monitoring, or synthetic monitoring, we have been able to deliver output on time, achieving approximately an 80% efficiency in response.
What's my experience with pricing, setup cost, and licensing?
Splunk Observability Cloud is expensive.
What other advice do I have?
For operational performance, we created monitoring within the Splunk Observability Cloud for most servers with agent installation. We upgraded the open telemetry collector from version 0.82 to 0.103, then again to a newer version, enhancing visibility and use cases, especially after the upgrade, which has improved operational purposes.
My impressions of Splunk Observability Cloud for focusing on business-critical initiatives are positive. I manage six tools, but Splunk Observability Cloud is one of my favorites, and I aspire to build my career specializing in it because it has great features, more attention in the market, and is a relatively new tool with promising growth.
I would recommend Splunk Observability Cloud to other users for its accurate data fetching, dashboard creation, report generation, and synthetic monitoring capabilities.
I would rate Splunk Observability Cloud a nine out of ten.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Adopted global standards enhances data collection and simplifies monitoring
What is our primary use case?
The solution involves observability in general, such as Application Performance Monitoring, and generally addresses digital applications, web applications, sites, and mobile applications. I worked with it in two companies: one in the energy sector and one in the hotel sector.
The Splunk teams helped us with data collection, instrumentation, and many other options.
How has it helped my organization?
The testing and monitoring of infrastructure is useful. We also use it for many metrics and can use it effectively for troubleshooting and for detection. It's very helpful.
What is most valuable?
With Splunk Observability Cloud, I appreciate working with open telemetry. The standards of open telemetry are especially useful for collecting data such as traces, matrices, and logs. Splunk respects the standards of open telemetry. This is beneficial. Many clients work with AWS and the cloud in general with multiple solutions such as Datadog, Dynatrace, and Splunk. Working with the standard open telemetry is very advantageous. Splunk Observability Cloud is very simple for users in general, including developers, DevOps, and data teams. It's more straightforward compared to Dynatrace.
There are many out-of-the-box solutions proposed by Splunk, such as dashboards for AWS instances, EC2, Fargate, and Lambda. It's very helpful for beginning, especially for monitoring, and the detectors for alerting help understand how the platforms work.
The no-sample feature is great. It eliminates blind spots.
After completing the instrumentations, we have many dashboards and tests for monitoring infrastructure, particularly CPU and memory. We also use applicative metrics such as JVM, Java Runtime, and many other applicative metrics and testing. For troubleshooting, we can detect problems in seconds, which is particularly helpful for digital teams.
AI analytics have the potential for a lot of functionality. The detectors for alerting may prove useful.
When we deploy the instrumentation in the application, we can start using the dashboards immediately. The dashboard building is very helpful for starting work.
It's beneficial for monitoring performance and infrastructure, especially when deploying applications with multiple versions with Git. It's important to detect performance issues, such as CPU consumption or memory consumption, particularly over time in Java and Python.
For other teams, they need help and guidance to use custom metrics. For observability engineers and specialists, it's straightforward, but for others, it can be challenging.
The solution overall is very valuable for me.
The time to value was immediate. Once we deployed, we started to use the dashboard directly and began detecting issues.
Saving time with automation can save us weeks. It's improving our resilience. It helps us detect issues and increase performance.
The solution has been very useful for helping us focus on business-critical initiatives.
What needs improvement?
Regarding dashboard customization, while Splunk has many dashboard building options, customers sometimes need to create specific dashboards, particularly for applicative metrics such as Java and process terms. These categories of dashboards would be very helpful for customers.
For how long have I used the solution?
I started working with Splunk Observability Cloud in 2023.
What do I think about the stability of the solution?
The system is relatively stable. We rarely have problems accessing the dashboard or the page. We encounter problems in the Splunk platform very rarely.
What do I think about the scalability of the solution?
It's very scalable. We haven't experienced any problems with the instrumentation or scalability. On a scale of one to ten, I'd rate it a ten.
We've used the solution across more than 250 people, including engineers.
How are customer service and support?
I would rate Splunk technical support at six out of ten.
When we have a problem and need to create a case, the response isn't quick. They often require multiple questions, with five or six emails to get a response. Problem resolution typically takes between two and five days, which isn't very helpful. However, sometimes we do receive quicker solutions.
How would you rate customer service and support?
Neutral
Which solution did I use previously and why did I switch?
We used legacy solutions such as Grafana and Prometheus. There are several differences between Splunk Observability Cloud and these solutions. We used Grafana as a monitoring solution, however, it's not truly observability. We used OpenSearch for logs, Prometheus for metrics, and Grafana to work with Prometheus. That said, it's not equivalent. Observability is different.
We're also familiar with Datadog and Dynatrace.
How was the initial setup?
The implementation took between two and three weeks.
For cloud deployment, it's straightforward. We can use GitLab and DevOps CI/CD. For on-premise deployment, such as Linux and deployment with satellite, it's easy yet requires some work to configure the configuration files.
Updates are generally needed, especially for the open telemetry version or SDK. However, regarding the platform itself, we don't need to do anything.
What was our ROI?
I worked with my company when they used the solution, so I'm not certain about the history of how long it took to detect problems. However, for mean time to detect, and mean time to respond, I'm sure it's very helpful, and we can estimate a minimum improvement of 20%.
What other advice do I have?
We're a customer and end-user.
Currently, in France, we cannot use the artificial intelligence option. While this option is enabled for the United States and many countries, it's not yet available in France. However, the solution with detectors, especially for alerting, is important for us.
I recommend it, especially for teams using legacy monitoring.
I would rate Splunk Observability Cloud nine to ten out of ten.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Seamless issue detection with user time tracking and application load analysis
What is our primary use case?
We primarily use Splunk Real User Monitoring to analyze performance bottlenecks and application transactions. It allows us to see how applications are experienced on the user side, making it easy to capture any bottlenecks or performance issues.
What is most valuable?
The most valuable features include user time tracking and the ability to analyze application load times. Splunk provides advanced notifications of roadblocks in the application, which helps us to improve and avoid impacts during high-volume days. It is very useful for identifying performance bottlenecks.
What needs improvement?
It would be beneficial to have more enhanced features with capabilities to adapt more integrated applications. Improvements in dashboard configuration, customization, and artificial intelligence functionalities are desired. There is room for improvement in customer support due to delays and standard feedback responses.
For how long have I used the solution?
I have been working with Splunk Real User Monitoring for almost two years.
What do I think about the stability of the solution?
In terms of stability, I would rate it a nine out of ten. It is a very stable solution.
What do I think about the scalability of the solution?
Splunk Real User Monitoring is definitely scalable. I would rate its scalability a nine out of ten.
How are customer service and support?
Technical support is rated an eight. There is some delay in their in-depth responses and standard answers to questions.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I worked with Splunk alongside Dynatrace. Before Splunk, I did not use any other services.
How was the initial setup?
It takes about an hour to set up the client for real-time monitoring.
What about the implementation team?
We have a separate team for deployment, consisting of about three to four people.
What was our ROI?
We have achieved a return on investment between 10% to 20% as it helped in removing roadblocks, which could lead to more savings with wider usage.
What's my experience with pricing, setup cost, and licensing?
Splunk is a little expensive, however, it is in line with the current market pricing. I would rate the pricing an eight on a scale of one to ten, as it reflects the going rate in the market.
What other advice do I have?
I would recommend this product to other users because of its capabilities in monitoring and analytics.
I rate the overall solution eight out of ten, considering the comparison with other products like Dynatrace.
Customized dashboards streamline log monitoring needs
What is our primary use case?
Splunk is primarily used for log monitoring, where I collect all my security logs, system logs, and application logs into a centralized place. This helps me customize my monitoring models.
How has it helped my organization?
Splunk has provided me with a centralized platform to manage multiple features. Instead of using various products, Splunk offers everything in one solution, which adds value to my organization.
What is most valuable?
The most valuable feature is the ability to customize dashboards based on my queries or any other customization I may need.
What needs improvement?
I'm still experiencing some features of the product. However, in future updates, I would like to see more predefined monitoring query solutions, which could be more effective.
For how long have I used the solution?
I have been using Splunk Synthetic Monitoring for almost five years, primarily focusing on log monitoring.
What do I think about the stability of the solution?
Overall, the product is stable, and I would rate it an eight out of ten.
What do I think about the scalability of the solution?
For scalability, I would give it a nine out of ten.
How are customer service and support?
Technical support is good but could be improved, particularly concerning the time taken for ticket resolution.
How would you rate customer service and support?
Neutral
Which solution did I use previously and why did I switch?
The main reason for choosing Splunk over other products is its comprehensive capabilities and flexible customization options. It is widely used and provides cloud solutions.
How was the initial setup?
The initial setup was quite straightforward, and agent installation can be done quickly. However, the entire setup process might involve multiple people due to organizational policies.
What about the implementation team?
The implementation process involved around five to ten people due to our organization's processes and need for multiple approvals.
What was our ROI?
Using Splunk has saved my organization about 30% of our budget compared to using multiple different monitoring products.
What's my experience with pricing, setup cost, and licensing?
Splunk is a bit expensive since it charges based on the indexing rate of data. However, considering the features it provides, the pricing is quite affordable compared to other monitoring solutions.
What other advice do I have?
Overall, I would recommend Splunk to anyone seeking a monitoring solution, thanks to its extensive capabilities and features.
I'd rate the solution nine out of ten.
Optimizes application performance and has an effective service map
What is our primary use case?
The main purpose of using Splunk APM is to optimize our application. We use Splunk APM primarily to understand how the application works, how it uses resources, and its response time in connection with different infra services. It is mainly used for application optimization and reviewing third-party application dependency response times.
How has it helped my organization?
Splunk APM helps us to identify long-running queries and long-running functions or methods, as well as third-party dependencies that are not responding on time. We are easily able to see the error or trace it. A developer can easily find out the issue without having to dig into the application.
We normally do not use the Tag Spotlight functionality, but our developers use this functionality when we are trying to dig into the logs. It helps to search the data that we want to see. It helps to troubleshoot the actual problem and visualize the data. We can see how the error is coming and how many reports are coming.
Splunk APM has helped us to optimize the application performance, find out when third-party services go down, and monitor our application within our SLA. It allows us to minimize our downtime. We can send timely notifications to our users. It mainly helps us to optimize application performance, and secondly, we are able to generate alerts based on the data that we receive from Splunk.
Splunk APM helps us to find errors immediately and resolve them. We are able to find some of the errors within five minutes. It minimizes the time to identify errors. There are about 30% to 40% time savings.
What is most valuable?
The best feature is the service map that they have. I have used multiple APM solutions such as Datadog and Elastic. They have a service map, but it does not work like Splunk APM. Splunk APM provides a holistic view of the application. Unlike other APMs, Splunk's service map is quite effective.
We suggested they provide an alert based on insert services. We told them that they have all the data, so why not have an alert on the insert service? They took feedback from us and added that feature. That feature helps us identify if any third-party dependent is down.
What needs improvement?
There is room for improvement in the alerting system, which is complicated and has less documentation available. We sometimes encountered issues in setting up alerts. The custom detector could be more simplified to assist system engineers in setting up alerts with ease.
For how long have I used the solution?
We tested Splunk APM last year and officially started using it this year. It has been about a year.
What do I think about the stability of the solution?
Splunk APM is stable. I would rate its stability a nine out of ten, as it delivers on its promises.
What do I think about the scalability of the solution?
We have not had to scale it. Our clients are medium enterprises.
How are customer service and support?
The support is responsive, though it could use some improvement. In the past, we contacted their support about a feature. They did respond to us, but they did not explicitly inform us about the feature's absence. Instead, they directed us to try various resources or articles. They did not have a clear answer. I would rate them a five out of ten for customer service.
How would you rate customer service and support?
Neutral
Which solution did I use previously and why did I switch?
Before using Splunk APM, I used Elastic APM and Datadog. Splunk APM is better than them. Splunk's service map and support for our existing libraries were significant reasons for the switch. The previous vendor required library updates that we could not accommodate, but Splunk supported our existing setups.
How was the initial setup?
The initial setup of Splunk APM was easy and straightforward. It took around a week.
What's my experience with pricing, setup cost, and licensing?
It appears to be expensive compared to competitors.
What other advice do I have?
Splunk APM is suitable for enterprise solutions, particularly for those deeply involved in technical business. The service map and overall stability make it a robust choice for such needs.
I would rate Splunk APM a nine out of ten.
Collaborates performance metrics with log data to pinpoint the exact cause of issues and offers error detection
What is our primary use case?
We use Splunk in APM to monitor our applications. So, we integrated it into our systems to enhance our monitoring and observing capabilities, especially for our microservices.
So, we have used Splunk APM for this.
How has it helped my organization?
APM integrates well with Splunk’s other observability solutions. These logs with application performance monitoring can significantly impact our business in several positive ways, like troubleshooting and root cause analysis.
Using these logs with APM, we can collaborate performance metrics with log data. It allows us to pinpoint the exact cause of issues, such as identifying specific errors in the logs. Because of this, we have access to faster resolution and detailed logs alongside performance metrics, enabling quicker diagnosis and resolution of problems. It also helps us minimize downtime and improve system reliability.
Additionally, it improves our performance optimization with detailed insights and analyzing historical log data along with APM metrics. This allows us to understand long-term trends and make informed decisions about performance improvements and better user experience, like error reduction and proactive monitoring.
Splunk has reduced our mean time to resolution by 30%.
If there is any issue in Splunk; we’ll identify the issue first and look for the error messages, like alerting with the Splunk user interface or in the logs that might indicate what the issue is and then determine which part of Splunk is affected.
Then, we’ll refer to the Splunk official documentation and check the system's health. We’ll review the logs. By following these steps, I can resolve the issues with Splunk, ensuring that our monitoring and analytics capabilities remain effective.
What is most valuable?
Mainly, I like Splunk APM because it will show the errors compared to other tools. We use the dashboards to monitor our applications. It will tell us the errors, and we can solve them quickly.
I have used APM but haven’t used Trace Analyzer, though I have some knowledge of it. We are able to implement it. We have some Trace Log Points in Splunk APM to catch the errors. We have a special graph for it where we can see the red points.
We use OpenTelemetry. OpenTelemetry and Splunk APM are similar in terms of observability and monitoring. We use it for observability standardization, which allows us to collect traces and metrics, making it easier to work with different monitoring tools, including Splunk APM. It is more flexible because it allows us to instrument our applications without being locked into a specific monitoring vendor.
It supports collecting traces, metrics, and logs from our applications, providing a comprehensive view of our performance and health endpoints. This data can be fed into Splunk APM, giving us in-depth analysis and insights about our application.
What needs improvement?
Splunk APM is a robust tool with many capabilities. There are always areas for potential improvement to enhance its functionality and user experience.
For Splunk APM, there could be simplified navigation, like streamlining the user interface to make navigation more intuitive for our users, especially those new to APM, which can enhance usability. We can provide more customization options for dashboards and visualizations to help users tailor the platform to their specific needs.
There could be more integration capabilities with a wider range of third-party tools and platforms would also be beneficial. By focusing on these areas, Splunk APM can enhance its value proposition, improve user satisfaction, and better meet the evolving needs of organizations monitoring their application performance.
For how long have I used the solution?
I have been using it for a year.
What do I think about the stability of the solution?
I never had an issue with the stability. It worked fine.
Which solution did I use previously and why did I switch?
My team has used alternatives to Splunk APM, like Datadog and New Relic.
How was the initial setup?
The initial setup was easy. To fully deploy it, we had to add some signal effects into our applications and just deploy it. It took like 20 minutes. That’s it.
What about the implementation team?
We took some help from our teams and my senior manager and also from other teams across our company. We connected and did all this together.
For deployment, one person can do it, actually, but as we are junior developers, we took help from our senior manager, like three to four people.
Splunk is good like this now. I don’t think any updates would be required, but there are some regular updates and upgrades of Splunk APM, like software updates, version upgrades, and all.
These provide more powerful monitoring capabilities and help ensure the system remains reliable, secure, and aligned with organizational needs. Regular updates, performance tuning, and proactive management help in maximizing these benefits of the Splunk solution.
What was our ROI?
We’ll see the results after the deployment. It’s not that late, and that’s the reason we are using Splunk APM.
Splunk made our job easier in a way. It will give the points when we use any dashboards, and there are no delays in everything, like performance. It will give the error issues very clearly, and it will monitor 24/7. It will show the issues, and it is very effective. It will pinpoint the exact cause of the issues, and it will help us troubleshoot the issues very fast.
It benefits the IT staff in other teams, like operations, improves efficiency, and manages the IT environments more effectively. When it centralizes the logs and search analytics, the powerful capabilities allow IT teams to perform in-depth troubleshooting, identify root causes, and analyze complex issues with ease.
Splunk also provides real-time visibility into IT infrastructure, and we have connected with cross-functional teams around our team to work with Splunk APM. It supports proactive management, enhances security, and improves operational efficiency. It facilitates better collaboration across the team.
What's my experience with pricing, setup cost, and licensing?
The pricing is based on several factors, including the scale of deployment. The pricing model typically includes considerations like the number of hosts, features, and capabilities.
What other advice do I have?
Overall, I would rate the solution a nine out of ten.