References

¹"Reinforcement learning is learning what to do --- how to map situations to actions --- so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."

-Reinforcement learning: An Introduction - (1.1)

All reinforcement learning methods have to use learning by selection in one form or another and this contrasts sharply with supervised learning, where the feedback from the environment directly indicates what the correct action should have been.

-Reinforcement learning: An Introduction - (2.4)

²"It's not who has the best algorithm that wins, it's who has the most data."

Page updated

Google Sites

Report abuse