10.4 Reinforcement Learning
Subsections
10.4.1 The General Philosophy
Terminology
The general framework
10.4.2 Evaluating a Plan via Simulation
Temporal differences
10.4.3 Q-Learning: Computing an Optimal Plan
Value iteration
Policy iteration
Steven M LaValle 2020-08-14