Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Reinforcement Learning

Deepmind marked the year 2017 by creating the best Go player in the world. How did they achieve this? With deep learning, of course, but more precisely with reinforcement learning.

Deep Blue beat human chess players with traditional game analysis. It would build a tree of possible outcomes and prune it with different strategies (like alpha/beta, but adapted to the space of possible outcomes of chess). But this was not possible with Go, which was never solvable by computers until Deepmind created their network and its training methods. Because without training, the network is useless!

In this chapter, we will do the following:

Look at different types of reinforcement learning
Explore the concept of Q-learning
Estimate a Q function via a table and via a neural network
Make a network play an Atari game using Q-learning

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Reinforcement Learning

Create new playlist

Sign In

Sign Up

Table of Contents for
Reinforcement Learning