The test rewards, shown in the following diagram, are promising and indicate a clear benefit from using a dueling architecture:
Figure 5.10. A plot of the test rewards. The dueling DQN values are plotted in red and the DQN values are plotted in orange. The x-axis represents the number of steps.