Frontiers of Data and Computing ›› 2022, Vol. 4 ›› Issue (1): 30-41.

doi: 10.11871/jfdc.issn.2096-742X.2022.01.003

• Special Issue: Union of National Scientific Data Center

Intelligent Operation and Maintenance System for High Energy Physics Science Data Center

HU Qingbao1,2,*, ZHENG Wei1,2, WANG Jiarong1,2, WANG Lu1,2, YAN Tian1,2

  1. High Energy Physics Institute, Chinese Academy of Sciences, Beijing 100049, China
  2. National High Energy Physics Science Data Center, Beijing 100049, China
  • Received: 2021-09-28 Online: 2022-02-20 Published: 2022-03-04
  • Contact: HU Qingbao E-mail: huqb@ihep.ac.cn; zhengw@ihep.ac.cn; wangjr@ihep.ac.cn; lu.wang@ihep.ac.cn; yant@ihep.ac.cn

Abstract:

[Objective] The High Energy Physics Science Data Center has a complex operation and maintenance environment. Its monitoring tools are numerous, their functions overlap, and their monitoring data are not interoperable, so daily operation and maintenance faces many challenges. To make full use of the monitoring data and improve the data center's operation and maintenance capabilities, this paper implements an intelligent operation and maintenance system for the High Energy Physics Science Data Center. [Methods] Combining industrial big data technology, machine learning technology, and the operation and maintenance requirements of data centers, this paper designs a general technical architecture for data center operation and maintenance. It describes the system's core functions, including the collection, analysis, storage, sharing, and visualization of monitoring data, and how each is implemented. It also presents the system's application to data storage, computing services, and network security operation and maintenance in the data center. [Results] The operation and maintenance framework designed in this paper is in mature use in the daily operation and maintenance of the High Energy Physics Science Data Center, where it has improved operation and maintenance management capabilities. [Conclusions] Applying the intelligent operation and maintenance system in the High Energy Physics Science Data Center has increased the value of operation and maintenance data and established a data-driven intelligent operation and maintenance ecosystem for the data center.

Key words: big data, data center operation and maintenance, intelligent operation and maintenance system