Frontiers of Data and Computing ›› 2026, Vol. 8 ›› Issue (1): 35-44.

CSTR: 32002.14.jfdc.CN10-1649/TP.2026.01.004

doi: 10.11871/jfdc.issn.2096-742X.2026.01.004

• Special Issue: Computational Finance • Previous Articles     Next Articles

An Analysis of Stock Trading Algorithms Based on Reinforcement Learning Methods

LIAO Yuming1(),LU Yutong1,2,*()   

  1. 1. School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, Guangdong 510006, China
    2. National Supercomputer Center in Guangzhou, Guangzhou, Guangdong 510006, China
  • Received:2025-02-28 Online:2026-02-20 Published:2026-02-02
  • Contact: LU Yutong E-mail:liaoym6@mail2.sysu.edu.cn;yutong.lu@nscc-gz.cn

Abstract:

[Objective] Stock trading represents a critical topic within the field of finance and has garnered significant attention for the integration of reinforcement learning models into trading algorithms. [Coverage] This paper collects literature related to Reinforcement Learning algorithms designed for stock trading in recent years. [Methods] This paper systematically outlines the core challenges of intelligent computing in stock market modeling, discusses the limitations of traditional statistical methods and machine learning models in the stock domain, and analyzes state-of-the-art stock reinforcement learning algorithms. By comparing the performance of three representative algorithms, this study conducts an empirical evaluation from three dimensions: return stability, risk control, and computational efficiency. Based on the experimental results, the paper thoroughly examines the strengths and weaknesses of each algorithm. Building on these findings, it proposes future research directions for this field, providing theoretical foundations and practical references for further studies. [Conclusions] Although the integration of stock reinforcement learning algorithms with models such as LSTM and NLP has shown promising performance in backtesting, the increased model complexity has significantly raised computational time costs, reducing their efficiency in real-time analysis scenarios. Future research needs to advance both efficient algorithm design and hardware adaptation optimization to address current limitations.

Key words: reinforcement learning, stock trading, high-performance computing