Frontiers of Data and Computing ›› 2025, Vol. 7 ›› Issue (4): 182-195.

CSTR: 32002.14.jfdc.CN10-1649/TP.2025.04.015

doi: 10.11871/jfdc.issn.2096-742X.2025.04.015

• Technology and Application • Previous Articles     Next Articles

Computation Resource Assessment Methodology for Large Meteorological AI Models

SHI Yiheng(),WANG Qiyi,SUN Jing,ZHAO Chunyan*(),DENG Shuai,WU Peng,YAO Wang   

  1. National Meteorological Information Center, Beijing 100081, China
  • Received:2025-03-20 Online:2025-08-20 Published:2025-08-21
  • Contact: ZHAO Chunyan E-mail:shiyiheng@cma.gov.cn;zhaocy@cma.gov.cn

Abstract:

[Objective] In recent years, large meteorological AI models have demonstrated the potential to surpass traditional numerical methods in weather forecasting. However, the model training and deployment require significant computational resources. The existing resource assessment methods, primarily designed for large-scale models in natural language processing (NLP), are struggling to accommodate the dynamic computational demands of meteorological tasks, such as spatiotemporal multidimensionality, and the unique architectures of meteorological models. This results in inefficient resource utilization and high computational costs. To address these challenges, this study proposes a computational resource assessment framework for large meteorological models. By quantifying parameters, computational load, memory usage, and communication overhead, the framework provides a theoretical foundation for hardware configuration and resource allocation, aiming to reduce computational costs and ensure efficient and stable development and operation of large meteorological models. [Methods] We introduce the Multi-Granularity Computing Resource Joint Evaluation Framework (MGCRJEF), which establishes modular models for parameter calculation, spatiotemporal-aware FLOPs assessment, memory usage prediction, and distributed communication analysis. By incorporating the spatiotemporal heterogeneity of meteorological data, it comprehensively evaluates the core hardware resource requirements of large meteorological models. [Results] Using the Pangu-Weather model, which is based on the Swin-Transformer architecture, as a case study, the framework uncovers the model’s resource demand characteristics. For instance, memory usage increases significantly with higher input resolutions, while communication overhead becomes a major performance bottleneck during multi-node training. These insights provide practical guidance for optimizing resource allocation. Furthermore, the framework’s estimated resource demands closely align with actual consumption, demonstrating its accuracy and effectiveness. [Conclusions] The MGCRJEF framework provides a standardized approach to assessing the resource demands of large meteorological models, facilitating resource planning in intelligent computing hardware environments. It offers both theoretical and practical references for model deployment and hardware optimization in the field of meteorology.

Key words: large meteorological AI models, multi-granularity computing resource joint evaluation framework, resource optimization, intelligent computing