Foundations of Deep Reinforcement Learning with PyTorch
Ends soon! Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Foundations of Deep Reinforcement Learning with PyTorch
This course is part of Deep Reinforcement Learning Hands-On Specialization
Included with
Learn more
Recommended experience
Recommended experience
What you'll learn
Understand the core principles of reinforcement learning and agent-environment interactions
Gain hands-on experience with the OpenAI Gym API and Gymnasium for RL applications
Implement key deep RL algorithms, including Deep Q-Networks and the Cross-Entropy method
Skills you'll gain
Tools you'll learn
Details to know
April 2026
7 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 7 modules in this course
This course provides a deep dive into reinforcement learning (RL) with a focus on practical applications using PyTorch. You'll explore core concepts like the OpenAI Gym API, deep Q-networks, and advanced RL libraries. As RL becomes increasingly important in fields like AI, robotics, and gaming, mastering this skill will help you stay ahead in the rapidly evolving tech industry.
Through hands-on projects and real-world scenarios, you'll enhance your problem-solving abilities and gain practical expertise in building RL models. The course covers a wide range of topics, from tabular learning and the Bellman equation to complex deep Q-networks, ensuring that you develop both foundational and advanced RL skills. What sets this course apart is its blend of theoretical knowledge with practical coding exercises. You'll learn how to implement RL algorithms using PyTorch while understanding the underlying math and principles, providing a well-rounded approach to mastering reinforcement learning. This course is perfect for professionals and students with a background in machine learning or Python programming. Prior knowledge of deep learning or neural networks will be helpful but not required to start. This course is part one of a three-course Specialization designed to provide a comprehensive learning pathway in Reinforcement Learning. While it delivers standalone value, learners seeking an in-depth progression may benefit from completing the full Specialization.
This module introduces the foundational concepts of reinforcement learning, including the roles of agents, environments, and the flow of information through rewards and observations. Learners will explore Markov processes and how they evolve into Markov decision processes by incorporating actions and rewards. By the end, you'll understand the basic structure and challenges of designing reinforcement learning systems.
What's included
1 video7 readings1 assignment
1 videoβ’Total 1 minute
- Overviewβ’1 minute
7 readingsβ’Total 44 minutes
- What Is Reinforcement Learningβ’7 minutes
- Complications in RLβ’7 minutes
- The Agentβ’4 minutes
- Observationsβ’6 minutes
- Markov Decision Processesβ’7 minutes
- Markov Reward Processesβ’7 minutes
- Adding Actions to MDPβ’6 minutes
1 assignmentβ’Total 16 minutes
- Introduction to Reinforcement Learning Fundamentalsβ’16 minutes
This module introduces learners to the Gymnasium library and the OpenAI Gym API, essential tools for building and interacting with reinforcement learning environments in Python. You will explore environment structure, naming conventions, and how to create and use environments programmatically. Practical examples, including implementing a simple agent, will help solidify your understanding of these foundational RL tools.
What's included
1 video6 readings1 assignment
1 videoβ’Total 1 minute
- Overviewβ’1 minute
6 readingsβ’Total 34 minutes
- Introductionβ’6 minutes
- Hardware and Software Requirementsβ’4 minutes
- The OpenAI Gym API and Gymnasiumβ’7 minutes
- The Environmentβ’4 minutes
- Creating an Environmentβ’5 minutes
- The Random CartPole Agentβ’8 minutes
1 assignmentβ’Total 16 minutes
- Exploring OpenAI Gym and Gymnasium Fundamentalsβ’16 minutes
This module introduces the foundational concepts and practical tools for building deep learning models using PyTorch. Learners will explore tensor operations, automatic gradient computation, neural network components, loss functions, and experiment monitoring with TensorBoard and Ignite. By the end, you'll be equipped to construct, train, and evaluate neural networks efficiently.
What's included
1 video10 readings1 assignment
1 videoβ’Total 1 minute
- Overviewβ’1 minute
10 readingsβ’Total 57 minutes
- Introductionβ’8 minutes
- Tensor Operationsβ’4 minutes
- Gradientsβ’4 minutes
- Tensors and Gradientsβ’5 minutes
- NN Building Blocksβ’8 minutes
- Loss Functionsβ’5 minutes
- Monitoring with TensorBoardβ’4 minutes
- Plotting Metricsβ’10 minutes
- PyTorch Igniteβ’5 minutes
- GAN Training on Atari Using Igniteβ’4 minutes
1 assignmentβ’Total 16 minutes
- PyTorch Deep Learning Fundamentalsβ’16 minutes
This module introduces the cross-entropy method as a reinforcement learning technique, guiding learners through its implementation and application to classic environments like CartPole and FrozenLake. Learners will gain practical experience building and tuning neural network models to solve RL tasks using this approach.
What's included
1 video3 readings1 assignment
1 videoβ’Total 1 minute
- Overviewβ’1 minute
3 readingsβ’Total 29 minutes
- Introductionβ’8 minutes
- The Cross-Entropy Method on CartPoleβ’12 minutes
- The Cross-Entropy Method on FrozenLakeβ’9 minutes
1 assignmentβ’Total 16 minutes
- Cross-Entropy Method Fundamentalsβ’16 minutes
This module introduces foundational tabular reinforcement learning methods, focusing on the Bellman equation and its role in value-based algorithms. Learners will explore value and Q-functions, and implement value iteration and Q-iteration techniques using practical examples like FrozenLake.
What's included
1 video6 readings1 assignment
1 videoβ’Total 1 minute
- Overviewβ’1 minute
6 readingsβ’Total 31 minutes
- Introductionβ’4 minutes
- The Bellman Equation of Optimalityβ’4 minutes
- The Value of the Actionβ’4 minutes
- The Value Iteration Methodβ’5 minutes
- Value Iteration in Practiceβ’10 minutes
- Q-Iteration for FrozenLakeβ’4 minutes
1 assignmentβ’Total 16 minutes
- Bellman Equation and Tabular Learning Fundamentalsβ’16 minutes
This module introduces the principles and implementation of Deep Q-Networks (DQNs), covering foundational concepts such as the Bellman equation, value iteration, and tabular Q-learning. Learners will explore how neural networks can approximate Q-values in complex environments, optimize training using stochastic gradient descent, and evaluate DQN performance on challenging tasks like Atari Pong. By the end, students will understand both the theory and practical aspects of training deep reinforcement learning agents.
What's included
1 video8 readings1 assignment
1 videoβ’Total 1 minute
- Overviewβ’1 minute
8 readingsβ’Total 62 minutes
- Introductionβ’5 minutes
- Tabular Q-Learningβ’7 minutes
- Deep Q-Learningβ’6 minutes
- SGD Optimizationβ’6 minutes
- DQN on Pongβ’11 minutes
- The DQN Modelβ’4 minutes
- Trainingβ’14 minutes
- Running and Performanceβ’9 minutes
1 assignmentβ’Total 16 minutes
- Deep Q-Networks Fundamentalsβ’16 minutes
This module introduces key abstractions and tools for implementing deep reinforcement learning agents using higher-level libraries. Learners will explore agent architectures, policy distributions, experience sources, and replay buffers, gaining practical skills to build and train DQN-based models efficiently.
What's included
1 video5 readings1 assignment
1 videoβ’Total 1 minute
- Overviewβ’1 minute
5 readingsβ’Total 31 minutes
- Introductionβ’8 minutes
- The Agentβ’5 minutes
- PolicyAgentβ’4 minutes
- The ExperienceSource Classβ’5 minutes
- Experience Replay Buffersβ’9 minutes
1 assignmentβ’Total 16 minutes
- Reinforcement Learning Library Conceptsβ’16 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Machine Learning
- Status: Free Trial
Specialization
- Status: Free Trial
Course
- Status: Free TrialC
Coursera
Course
- Status: Free TrialD
DeepLearning.AI
Course
Why people choose Coursera for their career
Frequently asked questions
Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.
If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. Youβll be able to submit assignments once the session starts.
Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. Youβll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.
More questions
Financial aid available,
