Frontiers of Data and Computing ›› 2025, Vol. 7 ›› Issue (3): 149-161.

CSTR: 32002.14.jfdc.CN10-1649/TP.2025.03.012

doi: 10.11871/jfdc.issn.2096-742X.2025.03.012

• Technology and Application • Previous Articles     Next Articles

AGPU-Accelerated Framework for Non-Small Cell Lung Cancer Subtype Identification

HAN Xinyin1,2(),HAN Zidong3,JI Detao4,LI Chen1,LU Zhonghua1,*()   

  1. 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China
    2. University of Chinese Academy of Sciences, Beijing 100190, China
    3. Guangdong Laboratory of Artificial Intelligence and Digital Economy (Shenzhen), Shenzhen,Guangdong 518107, China
    4. Beijing institute of technology, Zhuhai, Guangdong 519088, China
  • Received:2024-12-05 Online:2025-06-20 Published:2025-06-25

Abstract:

[Objective] This study aims to optimize the computational performance of the Morphgene framework to address inefficiencies in processing large-scale pathology images and multi-omics data for non-small cell lung cancer (NSCLC) subtyping. [Methods] Comprehensive optimizations are applied to the framework’s pathology image patch processing, feature extraction, and K-means clustering modules through CPU thread pool scheduling, PyTorch tensor programming, GPU acceleration, and deep learning inference optimization techniques. Experiments conducted on NSCLC samples from the TCGA database validate the effectiveness of these optimizations and the framework’s subtyping performance. [Results] The optimized framework achieved over a 10-fold speedup in large-scale data processing while maintaining high subtyping accuracy. It successfully identified prognostically relevant subtypes, providing strong support for personalized treatments and survival predictions. [Limitations] The current optimizations are designed for specific file formats and patch sizes, requiring further research to adapt to smaller files or larger patches. [Conclusions] The GPU acceleration strategy significantly improves the computational efficiency of the Morphgene framework, making it a robust tool for NSCLC subtyping in precision medicine. Future work will focus on enhancing its multi-modal data integration and adaptability to broaden its clinical applications.

Key words: GPU, non-small cell lung cancer (NSCLC), multi-omics data integration, digital pathology image analysis, precision oncology