Report copyright - Pseudo-MDPs and Factored Linear Action Modelszhangx/papers/Yaoetal14.pdflinear approximate policy iteration algorithms such as LSPI [4]. We propose two approaches of learning a factored
Please pass captcha verification before submit form