Key Takeaways
|
|
DX Operational Observability (DX O2), a next-generation AIOps and Observability solution from Broadcom, offers two powerful capabilities that generate valuable insights from complex log data. Since DX O2 supports ingestion of logs from a wide variety of sources, the solution offers an enormous opportunity to improve observability and power AIOps.
DX O2 uses the same log data to support log-based alerts and log-based metrics. The insights DX O2 generates, the use cases, and, to some extent, the primary beneficiaries of these two capabilities are fundamentally different. Understanding these differences can help teams shorten the path to value when adopting these capabilities and it can help them take full advantage of each capability.
Log-based alerts |
Log-based metrics |
These are alarms that are triggered when specific patterns, keywords, or conditions are detected in log entries. They monitor raw log data for pre-defined events, such as error messages, security incidents, and performance issues. When these conditions occur, teams receive real-time alerts. Log-based alerts are useful for immediate issue detection and response. |
Derived from log data, such as error rates, request counts, or response times, these are metrics that show time-series data to illustrate trends and highlight anomalies. Log-based metrics transform raw log entries into structured, quantifiable insights that enable trend analysis, real-time monitoring, and threshold-based alerting. This enables better observability and decision-making. |
In this blog, I take a deeper look into how log-based metrics can provide a more actionable approach to observability and how to generate these metrics.
To define log-based metrics in DX O2, you must configure these three parameters:
In addition to the parameters above, a Group By field is also available, which, when selected, generates multiple time-series data. This additional criterion works on the Keyword field only. This is shown in the example below.
Multiple time series are created, one for each host, as HOST is the Group By field.
These metrics can also be accessed easily from the Metric Browser as illustrated below:
These metrics can now be used in DX Dashboards for visualization purposes as well as for analyzing trends over time. They can also be used for threshold-based alerting. Moreover, since these metrics can be grouped by a host, container, and more, they retain helpful context. Context enrichment is a significant benefit for practitioners working with complex log data, and for organizations where IT operations data is shared across teams during triage, remediation, performance analysis, or other work.
Log-based alarms help the team get started and add logs to the monitoring mix. Log-based metrics help make systems truly observable, enable additional analysis, and generate powerful insights, while keeping costs in check. Get started! Try these log-based metric capabilities in your environment.