Implementation of DAgger

The code is divided into three main parts:

  • Load the expert inference function to predict an action given a state.
  • Create a computational graph for the learner.
  • Create the DAgger iterations to build the dataset and train the new policy.

Here, we'll explain the most interesting parts, leaving the others for your personal interest. You can check the remaining code and the complete version in the book's GitHub repository.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset