This article was automatically translated from the original Turkish version.
Deep Reinforcement Learning (DRL) is an artificial intelligence approach that combines the fundamental principles of reinforcement learning (RL) with the representational power of deep learning (DL). This method enables an agent to learn a policy through trial and error in an environment, with the goal of maximizing the total future rewards. DRL employs deep neural networks to perform this process in high-dimensional and complex state spaces.
The origins of reinforcement learning lie in behavioral psychology and optimal control theory. Pavlov’s conditioning experiments and Thorndike’s “Law of Effect” established the psychological foundation of RL, while Bellman’s work on dynamic programming and the Markov Decision Process (MDP) concept formed the mathematical backbone of modern RL algorithms.
From the 1980s onward, foundational RL algorithms such as TD(λ), REINFORCE, and Q-learning were developed. In 2013, Deep Q-Networks (DQN) achieved performance surpassing human levels on Atari games, thereby initiating the modern era of deep reinforcement learning (DRL), where deep learning and reinforcement learning converged.
DRL consists of four main components:
This structure is typically modeled within the framework of a Markov Decision Process (MDP), where the objective is to find an optimal policy that maximizes the expected cumulative reward for each state.

A representative diagram illustrating the core components of deep reinforcement learning
DRL algorithms are broadly classified into two categories:
Model-free methods are further divided into:
DRL is applied across various domains including games, robotic systems, natural language processing, and autonomous vehicles:
Krichen, Moez. 2023. "Deep Reinforcement Learning." In Proceedings of the 14th International Conference on Computing, Communication and Networking Technologies (ICCCNT), July 6–8, 2023, IIT Delhi, India. IEEE.
Sutton, Richard S., and Andrew G. Barto. 2021. "Appendix A: Reinforcement Learning and Deep Reinforcement Learning." In Reinforcement Learning: An Introduction. 2nd ed. Cambridge, MA: MIT Press.
Cebollada, Sergio, Luis Payá, Maria Flores, Adrián Peidró, and Oscar Reinoso. 2020. “A State-of-the-Art Review on Mobile Robotics Tasks Using Artificial Intelligence and Visual Data.” *Expert Systems with Applications* 114195. https://doi.org/10.1016/j.eswa.2020.114195.
Li, Zhongguo, Wen-Hua Chen, Jun Yang, and Yunda Yan. 2023. “AID-RL: Active Information-Directed Reinforcement Learning for Autonomous Source Seeking and Estimation.” *Neurocomputing* 544: 126281. https://doi.org/10.1016/j.neucom.2023.126281.
Nayeri, Zahra Makki, Toktam Ghafarian, and Bahman Javadi. 2021. “Application Placement in Fog Computing with AI Approach: Taxonomy and a State of the Art Survey.” Journal of Network and Computer Applications 185: 103078. https://doi.org/10.1016/j.jnca.2021.103078.
Wang, Fei, Xiaoping Zhu, Zhou Zhou, and Yang Tang. 2024. “Deep-Reinforcement-Learning-Based UAV Autonomous Navigation and Collision Avoidance in Unknown Environments.” *Chinese Journal of Aeronautics* 37 (3): 237–257. https://doi.org/10.1016/j.cja.2023.09.033.
Zhao, Yong, Bin Chen, XiangHan Wang, Zhengqiu Zhu, Yiduo Wang, Guangquan Cheng, Rui Wang, Rongxiao Wang, Ming He, and Yu Liu. 2022. “A Deep Reinforcement Learning Based Searching Method for Source Localization.” *Information Sciences* 588: 67–81. https://doi.org/10.1016/j.ins.2021.12.041.
Zhou, S. Kevin, Hoang Ngan Le, Khoa Luu, Hien V. Nguyen, and Nicholas Ayache. 2021. “Deep Reinforcement Learning in Medical Imaging: A Literature Review.” *Medical Image Analysis* 73: 102193. https://doi.org/10.1016/j.media.2021.102193.
No Discussion Added Yet
Start discussion for "Deep Reinforcement Learning" article
Historical Background
Core Components
Key Algorithms
Application Areas
Challenges
Development Areas and Research Directions