Reinforcement learning-based portfolio optimization with deterministic state transition

Song, GL; Zhao, TL; Ma, X; Lin, PG; Cui, CR
2025

【Author】 Song, Guangle; Zhao, Tianlong; Ma, Xiang; Lin, Peiguang; Cui, Chaoran

【Source】INFORMATION SCIENCES

【Impact Factor】8.233

【Abstract】Portfolio optimization has attracted substantial interest within the artificial intelligence community due to its significant impact on financial decision-making, risk management, and market analysis. Reinforcement learning fits portfolio optimization well because both aim to maximize cumulative returns. In reinforcement learning, state transition probabilities are often unknown and must be estimated. In portfolio backtesting experiments, however, state transitions are deterministic, which makes the conventional reinforcement learning approach of estimating state transitions suboptimal for portfolio optimization. To address this issue, this study decomposes portfolio optimization into two core tasks, prediction and profit policy optimization, and proposes a novel reinforcement learning framework that assumes deterministic state transitions. The framework comprises three main modules: feature extraction, prediction, and profit policy optimization. To model assets more effectively and comprehensively, we capture their temporal features, relational features, and market state. We introduce a patch-wise correlation method and an attribute-based gate to enhance feature extraction. In the profit policy module, we use a deterministic policy and train the policy network with a recursive reinforcement learning method based on Monte Carlo sampling. This enables dynamic adjustment of asset investment weights to maximize cumulative returns. Extensive experiments on cryptocurrency datasets demonstrate the superior performance of our approach, which achieves improvements of 36.6%-75.6% on the main evaluation metrics.
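
To make the deterministic-transition idea concrete, the following is a minimal sketch, not the authors' implementation. It assumes the state is a window of historical price relatives, that trades do not move prices, and that transaction costs are ignored. Under those assumptions the next state is read directly from recorded data and does not depend on the chosen weights, so cumulative wealth can be computed by a plain rollout. The names `backtest`, `softmax`, and `momentum_policy` are hypothetical.

```python
import math


def softmax(scores):
    """Map arbitrary scores to long-only weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def backtest(price_relatives, policy):
    """Roll a policy through recorded history and return cumulative wealth.

    price_relatives[t][i] is the price of asset i at the end of period t
    divided by its price at the start of period t (assumed data layout).
    policy(history) returns portfolio weights for the next period.
    """
    wealth = 1.0
    for t in range(1, len(price_relatives)):
        history = price_relatives[:t]      # state: past data only
        weights = policy(history)          # action: portfolio weights
        # The transition is deterministic: period t's data is already
        # recorded and is unaffected by the weights chosen above.
        period_return = sum(w * x for w, x in zip(weights, price_relatives[t]))
        wealth *= period_return
    return wealth


def momentum_policy(history):
    """Hypothetical toy policy: favor assets that rose in the last period."""
    return softmax(history[-1])
```

For example, `backtest(data, momentum_policy)` returns the final wealth of one unit of capital. A learned policy network would take the place of `momentum_policy`, with this cumulative wealth (or its logarithm) serving as the training signal.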

【Keywords】Reinforcement learning; Deterministic state transition probability; Portfolio optimization; Asset modeling

【Publication Date】February 2025

【Indexed Date】2024-10-26

【Document Type】Empirical data

【Subject Category】

Blockchain governance - Market governance - Digital assets

【DOI】 10.1016/j.ins.2024.121538

Vulnerabilities and attacks assessments in blockchain 1.0, 2.0 and 3.0: tools, analysis and countermeasures

Evaluation on the application mode of blockchain technology in water transportation engineering project management

Cashing out crypto: state of practice in ransom payments

Cryptocurrency and digital currency based on blockchain-enabled IoT: a bibliometric literature review