• Corpus ID: 236912965

Factor Representation and Decision Making in Stock Markets Using Deep Reinforcement Learning

  title={Factor Representation and Decision Making in Stock Markets Using Deep Reinforcement Learning},
  author={Zhaolu Dong and Shan Huang and Simiao Ma and Yining Qian},
Deep Reinforcement learning is a branch of unsupervised learning in which an agent learns to act based on environment state in order to maximize its total reward. Deep reinforcement learning provides good opportunity to model the complexity of portfolio choice in high-dimensional and data-driven environment by leveraging the powerful representation of deep neural networks. In this paper, we build a portfolio management system using direct deep reinforcement learning to make optimal portfolio… 

Figures from this paper



Algorithm Trading using Q-Learning and Recurrent Reinforcement Learning

This paper uses classic reinforcement algorithm, Q-learning, to evaluate the performance in terms of cumulative profits by maximizing different forms of value functions: interval profit, sharp ratio, and derivative sharp ratio and finds that this direct reinforcement learning framework enables a simpler problem representation than that in value function based search algorithm.

Deep Direct Reinforcement Learning for Financial Signal Representation and Trading

This work introduces a recurrent deep neural network for real-time financial signal representation and trading and proposes a task-aware backpropagation through time method to cope with the gradient vanishing issue in deep training.

Deep reinforcement learning for automated stock trading: an ensemble strategy

An ensemble strategy that employs deep reinforcement schemes to learn a stock trading strategy by maximizing investment return is proposed and shown to outperform the three individual algorithms and two baselines in terms of the risk-adjusted return measured by the Sharpe ratio.

Reinforcement Learning for Trading

This work extends previous work to compare Q-Learning to the authors' Recurrent Reinforcement Learning (RRL) algorithm and provides new simulation results that demonstrate the presence of predictability in the monthly S&P 500 Stock Index for the 25 year period 1970 through 1994, as well as a sensitivity analysis that provides economic insight into the trader's structure.

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

A financial-model-free Reinforcement Learning framework to provide a deep machine learning solution to the portfolio management problem, able to achieve at least 4-fold returns in 50 days.

Learning to trade via direct reinforcement

It is demonstrated how direct reinforcement can be used to optimize risk-adjusted investment returns (including the differential Sharpe ratio), while accounting for the effects of transaction costs.

Reinforcement learning in Portfolio Management and its interpretation

The goal of this paper is to make one of these machine learning methods transparent in the context of Portfolio management by interpreting the strategies implied by reinforcement learning (RL) and relating them to the strategies imply by academic portfolio advice with the help of their classical portfolio (CP) management models.

Adversarial Deep Reinforcement Learning in Portfolio Management

A so called Adversarial Training method is proposed and it can greatly improve the training efficiency and significantly promote average daily return and sharpe ratio in back test and results show that the agent based on Policy Gradient can outperform UCRP.

A Multiagent Approach to $Q$-Learning for Daily Stock Trading

A new stock trading framework is presented that incorporates multiple Q-learning agents, allowing them to effectively divide and conquer the stock trading problem by defining necessary roles for cooperatively carrying out stock pricing and selection decisions.


It is discovered that the deep-learning long short-term memory network has outperformed other trading strategies for the German blue-chip stock, BMW, during the 2010–2018 period.