In my organization, particularly in Ericsson's telecom BSS domain, the primary use case of Coralogix is centralized log management and real-time monitoring of telecom applications, such as the BSS software products and infrastructure. We have multiple SDP and AIR components in our telecom stack, including application servers, network elements, microservices, and APIs. Coralogix acts as a single platform where all logs are aggregated, which makes troubleshooting much faster compared to using different systems. The platform also helps with monitoring live traffic and system behavior. We have configured alerts for critical scenarios such as service downtime, API failures, error rates, and spikes.
Using Coralogix has significantly improved the efficiency and structure of my daily work, especially in monitoring and troubleshooting. Previously, we relied on manual checks and customer complaints to identify different issues. Now, with real-time alerts in Coralogix, we get notified immediately when something goes wrong, such as API failures or abnormal error spikes. The approach has shifted from reactive troubleshooting to proactive monitoring. Previously, we had to log into multiple servers and manually check logs, which was time-consuming. Now, everything is centralized in Coralogix, and with powerful search and filtering, we can quickly pinpoint issues within minutes instead of hours.
One recent incident involved a production issue where one of our SDP application services started showing intermittent API failures during peak traffic hours. Subscribers in Singapore were experiencing delayed responses, and some transactions were failing. Initially, it was difficult to identify the root cause because multiple microservices and backend systems were involved. Using Coralogix, we were able to quickly narrow down the issue through real-time log monitoring, centralized log correlation, advanced filtering and search, and alerting dashboards. We observed through Coralogix dashboards that error rates suddenly increased for a particular service after a deployment. By filtering logs based on transaction IDs and timestamps, we traced the issue to a backend service timeout caused by a misconfigured connection pool. The biggest advantage was that we did not need to manually log into multiple servers to investigate. Everything was visible from a single platform, which saved considerable troubleshooting time. Because of Coralogix alerts, the operations team was informed immediately, and we resolved the issue much faster than our general earlier processes, significantly reducing troubleshooting time and providing faster root cause identification.