Abstract
Fault resolution in communication networks and distributed systems is a challenge that demands the expertise of system administrators and the support of multiple systems, such as monitoring and event correlation systems. Trouble ticket systems are frequently used to organize the workflow of the fault resolution process. In this context, we introduce DisCaRia, a distributed case-based reasoning system that assists system administrators and network operators in resolving faults. DisCaRia integrates various fault knowledge resources that are already available in the Internet, and it exploits them by applying a distributed case-based reasoning methodology, which is based on scalable peer-to-peer technology. We present the architecture of DisCaRia, the key algorithms used by DisCaRia, and provide an evaluation of a prototype implementation of the system.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have