Welcome to the course
  • Welcome!
  • Before you start - Videos quality!
  • Resources
Introduction
  • Introduction #1
  • Introduction #2
  • Introduction #3
  • Introduction #4
  • Environment setup / Installation
  • Lab. OpenAI Gym #1
  • Lab. OpenAI Gym #2
  • Lab. OpenAI Gym #3
  • Lab. OpenAI Gym #4
Tabular methods
  • Deterministic & Stochastic environments
  • Rewards
  • Bellman equation #1
  • Bellman equation #2
  • Resource - code
  • Lab. Algorithm for deterministic environments #1
  • Lab. Algorithm for deterministic environments #2
  • Lab. Algorithm for deterministic environments #3
  • Lab. Algorithm for deterministic environments #4
  • Lab. Test with stochastic environment
  • Q-Learning
  • Lab. Algorithm for stochastic environments
  • Exploration vs Exploitation
  • Lab. Egreedy
  • Lab. Adaptive egreedy
  • Bonus Lab. Value iteration
  • Homework
  • Homework. Solution
  • Homework. Tuning
Scaling up
  • Scaling up
  • Neural Networks review
  • Lab. Neural Networks review #1
  • Lab. Neural Networks review #2
  • Lab. Random CartPole
  • Lab. Epsilon egreedy revisited
  • Lab. Pytorch updated ( version 0.4.0 )
  • Article. Pytorch updated! (further versions)
  • Lab. OpenAI Gym + Neural Network #1
  • Lab. OpenAI Gym + Neural Network #2
  • Lab. OpenAI Gym + Neural Network #3
  • Lab. Extended logging
DQN
  • Deep Reinforcement Learning
  • Lab. Deep Reinforcement Learning
  • Lab. Tuning challenge
  • Experience Replay
  • Lab. Experience Replay #1
  • Lab. Experience Replay #2
  • Lab. Experience Replay #3
  • DQN
  • Lab. DQN
DQN Improvements
  • Double DQN
  • Lab. Double DQN
  • Dueling DQN
  • Lab. Dueling DQN
  • Lab. Dueling DQN Challenge
DQN with video output
  • CNN Review
  • Lab. Random Pong
  • Saving & Loading the Model
  • Lab. Pong from video output #1
  • Lab. Pong from video output #2
  • Lab. Pong from video output #3
  • Lab. Pong from video output #4
  • Lab. Pong from video output #5
  • Lab. Pong from video output #6
  • Potential improvements
  • Article. Stacking 4 images together
Final notes
  • What's next?