Featured image of post Notes on the Derivation of Least Squares Policy Iteration

Notes on the Derivation of Least Squares Policy Iteration

Here are my notes on the derivation of the Least Squares Policy Iteration (LSPI) algorithm. The notes are based on the original paper by Lagoudakis and Parr.

    / [pdf]
Last updated on Feb 15, 2024 11:05 -0500
Feedback
FOOTER