Assessments

  1. In reinforcement learning lingo, we call the ______ a policy.
  2. State whether the following statement is True or False: The goal of reinforcement learning is to discover a good strategy. One of the most common ways to solve it is by observing the long-term consequences of actions in each state.
  3. We need to train the agent by taking actions to the environment and receiving ______.
  4. State whether the following statement is True or False: Using the contextual bandits, we cannot introduce and make the proper utilization of the state.
  5. To find the stock prices, we can use the _______ library in Python.
    1. get_prices
    2. plot_prices
    3. yahoo_finance
    4. finance_yahoo
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset