数据与计算发展前沿 ›› 2024, Vol. 6 ›› Issue (3): 116-126.

CSTR: 32002.14.jfdc.CN10-1649/TP.2024.03.013

doi: 10.11871/jfdc.issn.2096-742X.2024.03.013

• 技术与应用 • 上一篇    下一篇

异构系统并行计算软件性能测评分析与实证研究

顾蓓蓓1,2(),邱霁岩1,2,迟学斌1,*()   

  1. 1.中国科学院计算机网络信息中心,北京 100083
    2.中国科学院大学,北京 101408
  • 收稿日期:2023-11-01 出版日期:2024-06-20 发布日期:2024-06-21
  • 通讯作者: *迟学斌(E-mail:chi@sccas.cn
  • 作者简介:顾蓓蓓,中国科学院计算机网络信息中心,博士研究生,高级工程师,主要研究方向为并行计算,高性能计算软件性能测评分析。
    本文中主要负责方法研究、实验设计、论文写作与格式校正等。
    GU Beibei, Computer Network Information Center, Chinese Academy of Sciences, doctoral student, her main research directions include parallel computing, performance evaluation and analysis of high performance computing software applications.
    In this paper, she is responsible for method research, paper writing, and format correction.
    E-mail: gbb@cnic.cn|迟学斌,中国科学院计算机网络信息中心,博士生导师,研究员,博士,主要研究方向为并行计算。
    本文中主要负责论文框架和实验指导。
    CHI Xuebin, Computer Network Information Center, Chinese Academy of Sciences, doctoral supervisor, researcher, PhD, his main research directions include high performance computing, parallel computing and software.
    In this paper, he is responsible for paper framework and experimental guidance.
    E-mail: chi@sccas.cn
  • 基金资助:
    国家自然科学基金(62372428)

Analysis and Empirical Research on Performance Evaluation of Parallel Computing Software in Heterogeneous Systems

GU Beibei1,2(),QIU Jiyan1,2,CHI Xuebin1,*()   

  1. 1. Computer Network Information Center, Chinese Academy of Science, Beijing, 100083, China
    2. University of Chinese Academy of Sciences, Beijing, 101408, China
  • Received:2023-11-01 Online:2024-06-20 Published:2024-06-21

摘要:

【目的】“并行计算软件性能测评”一直是超算领域重要的研究方向。在异构系统上对计算软件实际性能进行真实测评和分析,可以有效促进对异构系统计算软件生态的良性发展。【方法】本文首先通过调研文献对国内外并行计算软件性能测评方法进行研究分析,归纳总结出业界对并行计算软件性能测评的研究划分的3个重要阶段;通过并行计算矩阵乘积Cannon算法对软件的真实性能进行实证分析,并对运行时间和效率等重要指标进行多维度的实验分析。【结果】在同一节点下,不是使用越多的加速卡越能降低该并行程序的运行时间;不同规模矩阵在不使用加速卡和使用单个加速卡两种情况下,程序的并行效率均没有因为节点的增多而发生明显的变化。【结论】在异构系统计算软件中只关注并行效率不能如实反映该软件性能的真实水平。除节点间并行效率因素外,节点内部加速也成为反映并行计算软件真实水平的一项重要测评指标。

关键词: 异构系统, 并行软件, 性能测评, 并行效率

Abstract:

[Objective] The Performance evaluation of parallel computing software has always been an important research direction in the field of supercomputing. Real evaluation and analysis of the actual performance of computing software on heterogeneous systems can effectively promote the healthy development of the computing software ecosystem in heterogeneous systems. [Methods] This article first conducts research and analysis on parallel computing software performance evaluation methods, both domestically and internationally, through literature research, and summarizes three important stages of industry research on parallel computing software performance evaluation. Empirical analysis is conducted on the real performance of the software through the Cannon algorithm for parallel computation of matrix product, and multi-dimensional experimental analysis is conducted on important indicators such as execution time and efficiency. [Results] Under the same node, the execution time is not always reduced with more accelerator cards used. In the cases of using single or none accelerator card, the parallel efficiency of programs dealing with different matrix scales does not change significantly with the increase of nodes. [Conclusions] In heterogeneous system, the computing software focusing only on parallel efficiency cannot truly reflect the actual level of software performance. In addition to the efficiency factor of inter-node parallelism, intra-node acceleration has also become an important evaluation indicator reflecting the true level of parallel computing software.

Key words: heterogeneous systems, parallel software, performance evaluation, parallel efficiency