• Taufik Faturohman, School of Business and Management, Institut Teknologi Bandung, Indonesia
• Teguh Nugraha, School of Business and Management, Institut Teknologi Bandung, Indonesia
Keywords: Deep reinforcement learning, Actor-critic framework, Islamic stock.


The Islamic principles used to identify stocks as Shari’ah-compliant inevitably restrict the number of stocks that Muslims can invest in and may consequently affect investment returns. In this paper, we examine the potential of Deep Reinforcement Learning (DRL) to optimize the portfolio returns of Islamic stocks. We model stock trading as a Markov Decision Process because of its stochastic and interactive nature. We then define the trading objective as a maximization problem and use actor-critic algorithms as the DRL agents. The selected portfolio consists of the 30 most liquid Islamic stocks in Indonesia, which constitute the JII index, and its performance is compared with that of the benchmark portfolio, namely the 45 most liquid conventional stocks, or LQ45. The performance is compared using several algorithms. The results show that trading Islamic stocks from January 2019 to December 2020 using the DRL agents could outperform the benchmark index of conventional stocks. Using DRL agents, fund managers would be able to optimize the portfolio on a daily basis, minimize risk during crises or turbulence, and outperform conventional stocks.
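As a rough illustration of what modelling stock trading as a Markov Decision Process can look like, the sketch below defines a toy state, action, and reward. It is not the paper's implementation: the names `TradingState` and `step`, the integer share-count actions, the absence of transaction costs, and the reward (change in portfolio value over one step) are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class TradingState:
    """Hypothetical MDP state: cash balance, shares held, and current prices."""
    cash: float
    holdings: List[int]    # shares held per stock
    prices: List[float]    # current price per stock

    def portfolio_value(self) -> float:
        return self.cash + sum(h * p for h, p in zip(self.holdings, self.prices))


def step(state: TradingState, action: List[int],
         next_prices: List[float]) -> Tuple[TradingState, float]:
    """Apply one transition of the toy trading MDP.

    action[i] > 0 buys shares of stock i, action[i] < 0 sells, 0 holds.
    The reward is assumed to be the change in portfolio value
    (no transaction costs are modelled here).
    """
    cash = state.cash
    holdings = list(state.holdings)

    # Sell first so that the proceeds can fund purchases.
    for i, a in enumerate(action):
        if a < 0:
            sold = min(-a, holdings[i])  # cannot sell more than is held
            holdings[i] -= sold
            cash += sold * state.prices[i]

    # Buy only what the remaining cash can afford.
    for i, a in enumerate(action):
        if a > 0:
            affordable = int(cash // state.prices[i])
            bought = min(a, affordable)
            holdings[i] += bought
            cash -= bought * state.prices[i]

    next_state = TradingState(cash, holdings, list(next_prices))
    reward = next_state.portfolio_value() - state.portfolio_value()
    return next_state, reward
```

In this framing, an actor-critic agent would learn a policy that chooses `action` from the state so as to maximize the cumulative reward, which is one way to read the maximization objective described above.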

Author Biography

Teguh Nugraha, School of Business and Management Institut Teknologi Bandung, Indonesia

Teguh Nugraha, S.Si., is currently an MBA student at the School of Business and Management, Institut Teknologi Bandung, Indonesia. He received a Bachelor’s degree in Mathematics from the Faculty of Natural Sciences and Mathematics, Institut Teknologi Bandung.

Teguh is also working as Head of Data at PT Setiap Hari Dipakai, a social commerce startup in Indonesia. Teguh's research interests are Islamic stocks, machine learning, and economics.




How to Cite
Faturohman, T., & Nugraha, T. (2022). ISLAMIC STOCK PORTFOLIO OPTIMIZATION USING DEEP REINFORCEMENT LEARNING. Journal of Islamic Monetary Economics and Finance, 8(2), 181-200.
