Frontiers of Data and Computing ›› 2025, Vol. 7 ›› Issue (6): 23-34.

CSTR: 32002.14.jfdc.CN10-1649/TP.2025.06.003

doi: 10.11871/jfdc.issn.2096-742X.2025.06.003

• Special Issue: Call for Papers for the 40th National Conference on Computer Security • Previous Articles     Next Articles

A Blockchain Anomaly Transaction Detection Model Based on Threat Environment Perception and Large Language Model Feature Enhancement

LIANG Fei1(),ZHANG Shixing2,*(),CHENG Zirui3   

  1. 1. Economic Crime Investigation Brigade of Beijing Municipal Public Security Bureau, Beijing 100061, China
    2. School of Intelligent Policing, China People’s Police University, Langfang, Hebei 065000, China
    3. School of Information and CyberSecurity, People’s Public Security University of China, Beijing 100038, China
  • Received:2025-07-24 Online:2025-12-20 Published:2025-12-17
  • Contact: ZHANG Shixing E-mail:475662476@qq.com;382927221@qq.com

Abstract:

[Objective] Current mainstream methods for detecting malicious behavior on blockchains rely on graph neural networks and primarily focus on feature aggregation. This paper improves traditional approaches by exploring the threat environment perception in blockchain transactions and leveraging large language models for feature enhancement. [Methods] The proposed Graph Community-Aware Augmentation with Large Language Models model first divides transactions into subgraphs using a community detection algorithm, treating the resulting communities as the environmental context of node addresses. The node features and their associated communities are then converted into textual descriptions, which are processed by a large language model to generate enhanced features. Finally, the original node features and the LLM-enhanced features are fused to form the final node representation. [Results] The model demonstrates an improved ability to capture and learn behavioral characteristics of malicious node addresses, even in scenarios with limited labeled samples. [Conclusions] Experimental results show that the proposed model outperforms traditional algorithms across multiple metrics on two public benchmark datasets.

Key words: Community Detection, Large Language Models, Blockchain