Reinforcement Learning - cs.uml.edu
Reinforcement Learning
91.450 Robotics I
Spring 2014
Prof. Yanco
Many of the slides in this presentation are from R. Sutton and A. Barto, as noted at the bottom of the slides
Reinforcement Learning for Robot Language Learning
Holly Yanco and Lynn Andrea Stein. “An Adaptive Communication Protocol for Cooperating Mobile Robots.” In From Animals to Animats 2: Proceedings of the Second International Conference on the Simulation of Adaptive Behavior, edited by J.-A. Meyer, H.L. Roitblat and S.W. Wilson. The MIT Press/Bradford Books, 1993, pp. 478-485.
Robot Pseudocode
Example run with 2 robots
Curse of dimensionality
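The slide body is not available in this transcript, so the following is only a generic illustration of the term, not the example used in the lecture. It assumes a tabular learner whose state is described by several discrete variables; the function name `q_table_entries` and the numbers (10 values per variable, 4 actions) are hypothetical and chosen for illustration only. The sketch shows that the number of table entries grows exponentially with the number of state variables.

```python
def q_table_entries(values_per_variable: int, num_variables: int, num_actions: int) -> int:
    """Count entries in a tabular action-value function over a factored state space.

    Assumption (not from the slides): every state variable takes the same
    number of discrete values, and every action is available in every state.
    """
    num_states = values_per_variable ** num_variables  # exponential in num_variables
    return num_states * num_actions


if __name__ == "__main__":
    # Hypothetical setting: 10 discrete values per variable, 4 actions.
    for n in (1, 2, 4, 8):
        entries = q_table_entries(10, n, 4)
        print(f"{n} state variables: {entries:,} table entries")
```

Each added state variable multiplies the table size by the number of values that variable can take, which is why a learner that must visit every entry quickly becomes impractical as the state description grows.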