6th International Conference on Autonomic Computing and Communications, 2009

Ranking the Importance of Alerts for Problem Determination in Large Computer Systems

Abstract

The complexity of large computer systems poses unprecedented challenges for system management. In practice, operators often collect large volumes of monitoring data from system components and set up many rules to check the data and trigger alerts. However, alerts from different rules usually differ in how accurately they report problems, because their thresholds are often set manually based on operators' experience and intuition. Meanwhile, due to system dependencies, a single problem may trigger many alerts at the same time in a large system, and the critical question is which alert should be analyzed first in the subsequent problem determination process. In this paper, we propose a novel peer review mechanism to rank the importance of alerts, so that the top-ranked alerts are more likely to be true positives. After comparing a metric value against its threshold to generate alerts, we also compare the value with the equivalent thresholds from many other rules to determine the importance of alerts. Our approach is evaluated on a real test bed system, and experimental results are included to demonstrate its effectiveness.
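The two-step comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not say how equivalent thresholds are derived or how the comparisons are combined into a rank, so everything here is an assumption made for illustration: the function names `peer_review_score` and `rank_alerts` are hypothetical, the equivalent thresholds are taken as given inputs, and the score is simply the fraction of peer thresholds that the metric value also exceeds. The paper's actual mechanism may differ.

```python
from typing import Dict, List, Tuple


def peer_review_score(value: float, equivalent_thresholds: List[float]) -> float:
    """Fraction of peer-derived equivalent thresholds that the value exceeds.

    Assumption: each peer rule casts one equal-weight "vote"; the source
    does not specify the scoring function.
    """
    if not equivalent_thresholds:
        return 0.0
    votes = sum(1 for t in equivalent_thresholds if value > t)
    return votes / len(equivalent_thresholds)


def rank_alerts(
    observations: Dict[str, float],
    rules: Dict[str, float],
    equivalents: Dict[str, List[float]],
) -> List[Tuple[str, float]]:
    """Return (metric, score) pairs for alerting metrics, highest score first.

    observations: metric name -> current value
    rules:        metric name -> the rule's own threshold
    equivalents:  metric name -> thresholds of other rules, already mapped
                  onto this metric's scale (hypothetical input)
    """
    ranked = []
    for metric, threshold in rules.items():
        value = observations[metric]
        # Step 1: the rule's own threshold decides whether an alert fires.
        if value > threshold:
            # Step 2: peers "review" the alert via their equivalent thresholds.
            score = peer_review_score(value, equivalents.get(metric, []))
            ranked.append((metric, score))
    # Stable sort: ties keep the order in which the rules were listed.
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


# Illustrative values only; they are not taken from the paper.
rules = {"cpu": 80.0, "mem": 70.0, "disk_io": 50.0}
observations = {"cpu": 95.0, "mem": 72.0, "disk_io": 40.0}
equivalents = {
    "cpu": [85.0, 90.0, 97.0],
    "mem": [75.0, 78.0, 71.0],
    "disk_io": [45.0, 55.0],
}
for metric, score in rank_alerts(observations, rules, equivalents):
    print(f"{metric}: {score:.2f}")
```

In this toy input, both `cpu` and `mem` fire alerts, but `cpu` exceeds more of its peers' equivalent thresholds and is therefore ranked first, which mirrors the idea that alerts confirmed by many other rules are more likely to be true positives.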
