In this post, we describe the anatomy of most Deep Reinforcement Learning algorithms. We also cover the motivation for using RL over standard machine learning, On-Policy vs. Off-Policy learning, the Exploration-Exploitation Tradeoff, and other important RL concepts.