Conference paper

Combining visualization and statistical analysis to improve operator confidence and efficiency for failure detection and localization

Web applications suffer from software and configuration faults that lower their availability. Recovering from failure is dominated by the time interval between when these faults appear and when they are detected by site operators. We introduce a set of tools that augment the ability of operators to perceive the presence of failure: an automatic anomaly detector scours HTTP access logs to find changes in user behavior that are indicative of site failures, and a visualizer helps operators rapidly detect and diagnose problems. Visualization addresses a key question of autonomic computing of how to win operators' confidence so that new tools will be embraced. Evaluation performed using HTTP logs from demonstrates that these tools can enhance the detection of failure as well as shorten detection time. Our approach is application-generic and can be applied to any Web application without the need for instrumentation

    Keywords: fault location;Internet;software fault tolerance;statistical analysis;system recovery;


    visualization;statistical analysis;operator confidence;failure detection;failure localization;Web applications;software fault;configuration fault;software availability;failure recovery;automatic anomaly detector;HTTP access logs;user behavior;site failures;failure diagnosis;autonomic computing;;


    • DSLAB-CONF-2005-001

    Record created on 2006-12-22, modified on 2017-05-12


  • There is no available fulltext. Please contact the lab or the authors.

Related material