Next: 10.4.1 The General Philosophy Up: 10. Sequential Decision Theory Previous: Solutions for the average

10.4 Reinforcement Learning

Subsections

10.4.1 The General Philosophy
- Terminology
- The general framework
10.4.2 Evaluating a Plan via Simulation
- Temporal differences
10.4.3 Q-Learning: Computing an Optimal Plan
- Value iteration
- Policy iteration

Steven M LaValle 2020-08-14