E+A Galaxy Search Based on a Domestic Heterogeneous Acceleration Platform: Parallelization Strategy and Implementation

doi:10.11871/jfdc.issn.2096-742X.2025.05.008

Abstract

[Objective] E+A galaxies are rare, short-lived post-starburst galaxies, and observational samples of them are critical for understanding galactic evolution and cosmic history. Modern sky surveys have amassed vast astronomical datasets, but detecting these objects efficiently remains a key challenge in contemporary astrophysical research. [Methods] This study proposes PEAS (Parallel E+A Searcher), a pipeline for accelerated E+A galaxy detection implemented on a domestic heterogeneous computing platform. The methodology has three phases. First, we analyze the dependencies in the serial search algorithm and decompose it into three parallelizable task operators. Second, building on the software stack of the target platform, we design a hierarchical distributed architecture for PEAS. Finally, we implement two parallelization schemes: a multi-core task-parallel approach using OpenMP and a multi-node data-parallel strategy using MPI. [Results] Validation on the LAMOST DR2 dataset confirms the accuracy of PEAS. Performance benchmarks on 260,000 galaxies from LAMOST DR10 show that, relative to a single CPU core, PEAS achieves a speedup of 22.30 on a 32-core system and of up to 107.06 on a single accelerator card. In terms of performance scalability, four accelerator cards achieve a speedup of 1.89 over one accelerator card, and four nodes achieve a speedup of 1.83 over one node. In terms of data scalability, the speedup is 6.93, approaching the data ratio of 8.6.

Keywords: domestic heterogeneous acceleration platform, parallel computing, E+A galaxy, rare astronomical target search

ZHENG Aiyu, MENG Xiangyu, ZHANG Boyu, ZHOU Lichan, YANG Haifeng. E+A Galaxy Search Based on a Domestic Heterogeneous Acceleration Platform: Parallelization Strategy and Implementation[J]. Frontiers of Data and Computing, 2025, 7(5): 102-112. https://cstr.cn/32002.14.jfdc.CN10-1649/TP.2025.05.008.

	Star	Galaxy	QSO
DR2	3779674	37665	8633
DR10	11473644	263444	80342

	CPU	CPU32	ACU
RS	11209.80	246.00	132.23
SI	20574.40	1063.77	151.32
Area	87.28	56.88	4.88
Total	31956.97	1432.77	298.45

	CPU/CPU32	CPU/ACU	CPU32/ACU
RS	45.56	84.76	1.86
SI	19.34	135.96	7.02
Area	1.53	17.85	11.63
Total	22.30	107.07	4.80
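
The ratios in the table above can be recomputed from the timing table that precedes it, taking speedup as the baseline time divided by the time on the faster configuration. The following is a minimal Python sketch of that arithmetic. The dictionary layout and the function name `speedup` are illustrative assumptions and are not part of PEAS; the numbers are copied from the timing table.

```python
# Timings copied from the timing table (units as reported in the source).
timings = {
    "RS":    {"CPU": 11209.80, "CPU32": 246.00,  "ACU": 132.23},
    "SI":    {"CPU": 20574.40, "CPU32": 1063.77, "ACU": 151.32},
    "Area":  {"CPU": 87.28,    "CPU32": 56.88,   "ACU": 4.88},
    "Total": {"CPU": 31956.97, "CPU32": 1432.77, "ACU": 298.45},
}


def speedup(baseline_time, faster_time):
    """Speedup = time on the baseline / time on the faster configuration."""
    return baseline_time / faster_time


# Recompute the three ratio columns for every row of the timing table.
for stage, t in timings.items():
    ratios = (
        speedup(t["CPU"], t["CPU32"]),    # CPU / CPU32
        speedup(t["CPU"], t["ACU"]),      # CPU / ACU
        speedup(t["CPU32"], t["ACU"]),    # CPU32 / ACU
    )
    print(stage, " ".join(f"{r:.2f}" for r in ratios))
```

Because the published timings are rounded, a recomputed ratio may differ from the tabulated one in the last digit.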

	1-1ACU	1-2ACU(S)	1-4ACU(S)
RS	132.23	76.72(1.72)	64(2.06)
SI	151.32	113.24(1.33)	82.12(1.84)
Area	4.88	3.8(1.28)	2.99(1.63)
Total	298.45	201.33(1.48)	157.63(1.89)

	1-1ACU	4-4ACU	S_1-4
RS	132.23	111.67	1.18
SI	151.32	38.28	3.95
Area	4.88	4.56	1.07
Total	298.45	162.52	1.83
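
One way to read the two scaling tables above is through parallel efficiency, the speedup divided by the number of processing units. The source does not report this metric; the sketch below derives it from the published Total timings, and the function name `efficiency` and the dictionary keys are illustrative assumptions.

```python
def efficiency(time_one_unit, time_n_units, n_units):
    """Parallel efficiency = (T_1 / T_n) / n; 1.0 would be ideal linear scaling."""
    return (time_one_unit / time_n_units) / n_units


# Total timings copied from the two scaling tables.
BASELINE_TOTAL = 298.45  # one node, one accelerator card (1-1ACU)
scaled_totals = {
    "one node, four cards (1-4ACU)": 157.63,
    "four nodes (4-4ACU)": 162.52,
}

for label, total in scaled_totals.items():
    # Both configurations are compared against the same single-card baseline.
    print(f"{label}: efficiency {efficiency(BASELINE_TOTAL, total, 4):.2f}")
```

An efficiency well below 1.0 is consistent with the per-stage rows, where the stages scale unevenly.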