Frontiers of Data and Computing ›› 2023, Vol. 5 ›› Issue (3): 13-28.

CSTR: 32002.14.jfdc.CN10-1649/TP.2023.03.002

doi: 10.11871/jfdc.issn.2096-742X.2023.03.002

• Special Issue: AI for Science • Previous Articles     Next Articles

An AI-for-Science Platform of Molecular Dynamics with Ab initio Accuracy

LIU Tao1(),ZHAO Tong1,TAN Guangming1,2,JIA Weile1,2,*()   

  1. 1. State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
    2. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2023-05-04 Online:2023-06-20 Published:2023-06-21

Abstract:

[Objective] AI for Science is changing the landscape of the traditional scientific computing by combining physical models, artificial intelligence, and high-performance computing to address challenging problems such as molecular dynamics with ab initio accuracy. This approach adapts neural networks to fitting high-dimensional functions, achieving orders of magnitude increases in the temporal and spatial scales, leading to a paradigm shift in scientific research. [Methods] This paper proposes an HPC+AI-driven computing platform for molecular dynamics with ab initio accuracy. Aiming at the changes and challenges brought by the workflow, the key technologies and processes for building an AI for Science computing platform are described from four aspects: generating scientific data and preparing datasets, exploring configuration space and labeling training samples, efficiently training AI for Science models, and performing large scale efficient inference (MD simulation). [Results] Based on the computational platform proposed in this paper and AI for Science computing workflows, this paper proposes an active learning strategy based on Kalman filtering for the typical application of HPC+AI-driven first-principles accuracy molecular dynamics. The training method for the quasi-second-order AI model is improved, achieving a training time acceleration from days to minutes. A fifth-order polynomial model compression technology increases the system scale by one order of magnitude for model inference and accelerates time-to-solution by 3-9 times. [Conclusions] All of the above work is combined to form an AI for Science computing platform for first-principles accuracy molecular dynamics calculations. [Limitations and Prospects] The AI for Science computing approach and workflows are still in a vigorous stage of development and facing significant challenges in high-precision data, more general AI models, and efficient computing methods. These challenges will also be important directions for future exploration in this work.

Key words: AI for Science, first-principles calculation, molecular dynamics, active learning, Kalman Filtering, Model compression