
In this paper we consider a root-cause analysis framework for NFV infrastructure. As monitoring machinery for NFV has matured the next step is to leverage on such data to automatically optimize failure detection, analysis, and overall resiliency. The complex architecture and dynamics of NFV poses significant challenges from the point of view of causality inference. In particular, the need for an approach that does not depend on domain knowledge or human intervention is of high importance. We propose in this context a step-wise data-driven root-case analysis approach based on correlation clustering, and time sensitivity analysis of alarms data. Our approach recovers templates of causality relationship between network resources alarms, which in turn allows to determine rules for performing root cause analysis. We demonstrate our approach on real data collected from NFV, where our algorithm computes causality templates. These templates were verified by system experts, while most of them were confirmed to be known and others were new.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 8 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
