
Modern particle physics experiments that observe collisions of particle beams generate large amounts of data. Complex trigger and data acquisition systems are built to select the most interesting events online and write them to persistent storage. The final stage of this selection process now often runs on large computer clusters. Stable and reliable operation of such event filter clusters is critical to the success of these experiments. Operating an event filter cluster means ensuring dead-time-free processing of large amounts of data, which requires round-the-clock status monitoring of every processing node as well as fast detection and resolution of problems. Ideally, problems should be recognized before they degrade system performance. Process control of the event filter cluster is performed exclusively by a human operator, which places demands on the operator that are difficult to meet. In this paper, a hybrid system based on expert system technology and statistical tools and methods is proposed to address this issue. The system is built on a scalable modular architecture, and an overview of its design is given. The proposed hybrid system is designed and tested in a real environment, using an event filter cluster prototype based on the architecture of the Compact Muon Solenoid experiment at CERN. Test results and their analysis are presented. Finally, future possibilities are discussed.
expert systems; decision support; fault detection and recovery; intelligent problem solving; process control; fault prediction
