Iti0210w13

Week 14

Reinforcement learning. Q-Learning. AIMA 21.3.2

A Painless Q-Learning Tutorial

Q-Learning/SARSA

Reinforcement Learning: Example and Tutorial