aima.eecs.berkeley.edu/2nd-ed/slides-pdf/chapter02.pdf · 2004. 1. 30.
Intelligent Agents
Chapter 2
Reminders
Assignment 0 (lisp refresher) due 1/28
Lisp/emacs/AIMA tutorial: 11-1 today and Monday, 271 Soda
Outline
♦ Agents and environments
♦ Rationality
♦ PEAS (Performance measure, Environment, Actuators, Sensors)
♦ Environment types
♦ Agent types
Agents and environments
[Figure: an agent (marked "?") connected to its environment; labels: percepts, sensors, actions, actuators]
Agents include humans, robots, softbots, thermostats, etc.
The agent function maps from percept histories to actions:
    f : P* → A
The agent program runs on the physical architecture to produce f
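The difference between the agent function f and the agent program can be sketched in Python. This is only an illustration under assumptions, not the AIMA code: the names `make_agent_program` and `visit_both_squares` are hypothetical, and a percept is assumed to be a `(location, status)` tuple.

```python
from typing import Callable, Sequence, Tuple

Percept = Tuple[str, str]   # assumed format: (location, status)
Action = str

def make_agent_program(f: Callable[[Sequence[Percept]], Action]) -> Callable[[Percept], Action]:
    """Wrap an agent function f: P* -> A as a program fed one percept at a time."""
    history = []            # the percept history seen so far

    def program(percept: Percept) -> Action:
        history.append(percept)
        return f(tuple(history))   # f sees the whole history, not just the latest percept

    return program

def visit_both_squares(history: Sequence[Percept]) -> Action:
    """Hypothetical agent function whose answer depends on the full history."""
    location, status = history[-1]
    if status == "Dirty":
        return "Suck"
    visited = {loc for loc, _ in history}
    if visited >= {"A", "B"}:
        return "NoOp"       # both squares have been visited
    return "Right" if location == "A" else "Left"

program = make_agent_program(visit_both_squares)
actions = [program(p) for p in [("A", "Dirty"), ("A", "Clean"), ("B", "Clean")]]
print(actions)
```

The program only ever receives the current percept; remembering the history is its own job.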
Vacuum-cleaner world
[Figure: two squares, A and B]
Percepts: location and contents, e.g., [A, Dirty]
Actions: Left, Right, Suck, NoOp
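A minimal Python sketch of this world follows. The class name `VacuumWorld` and its dynamics (deterministic actions, moving Right from B stays in B) are assumptions made for illustration; the slide only fixes the percepts and the action names.

```python
class VacuumWorld:
    """Two-square world; dynamics are assumed, not specified by the slide."""

    def __init__(self, location="A", dirt=None):
        self.location = location
        # True means the square is dirty.
        self.dirt = dict(dirt) if dirt is not None else {"A": True, "B": True}

    def percept(self):
        """Return (location, contents), e.g. ("A", "Dirty")."""
        status = "Dirty" if self.dirt[self.location] else "Clean"
        return (self.location, status)

    def execute(self, action):
        """Apply one of Left, Right, Suck, NoOp to the world state."""
        if action == "Suck":
            self.dirt[self.location] = False
        elif action == "Right":
            self.location = "B"
        elif action == "Left":
            self.location = "A"
        elif action != "NoOp":
            raise ValueError(f"unknown action: {action}")

world = VacuumWorld(location="A", dirt={"A": True, "B": False})
first = world.percept()
world.execute("Suck")
world.execute("Right")
second = world.percept()
print(first, second)
```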
A vacuum-cleaner agent
Percept sequence              Action
[A, Clean]                    Right
[A, Dirty]                    Suck
[B, Clean]                    Left
[B, Dirty]                    Suck
[A, Clean], [A, Clean]        Right
[A, Clean], [A, Dirty]        Suck
...                           ...
function Reflex-Vacuum-Agent([location, status]) returns an action
    if status = Dirty then return Suck
    else if location = A then return Right
    else if location = B then return Left
What is the right function? Can it be implemented in a small agent program?
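One way to explore the question is to compare a table lookup with the small program. The sketch below is in Python and assumes percepts are `(location, status)` tuples; `TABLE` holds only the six rows shown on the slide, and `table_driven_action` is a hypothetical helper name.

```python
# Partial table copied from the slide: percept sequence -> action.
TABLE = {
    (("A", "Clean"),): "Right",
    (("A", "Dirty"),): "Suck",
    (("B", "Clean"),): "Left",
    (("B", "Dirty"),): "Suck",
    (("A", "Clean"), ("A", "Clean")): "Right",
    (("A", "Clean"), ("A", "Dirty")): "Suck",
}

def table_driven_action(percept_sequence):
    """Look up the whole percept sequence; None if the partial table has no entry."""
    return TABLE.get(tuple(percept_sequence))

def reflex_vacuum_agent(percept):
    """The small program from the slide: depends only on the latest percept."""
    location, status = percept
    if status == "Dirty":
        return "Suck"
    if location == "A":
        return "Right"
    if location == "B":
        return "Left"
    return None

# On every tabulated row, the small program gives the same action as the table.
agrees = all(reflex_vacuum_agent(seq[-1]) == action for seq, action in TABLE.items())
print(agrees)
```

With four possible percepts per step, a complete table for sequences up to length T needs 4 + 4^2 + ... + 4^T rows, whereas the reflex program stays a few lines long.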
Rationality
Fixed performance measure evaluates the environment sequence
– one point per square cleaned up in time T?
– one point per clean square per time step, minus one per move?
– penalize for > k dirty squares?
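The second candidate measure can be written down as a short Python function. This is a sketch under assumptions: the environment sequence is represented as a list of `(dirt, action)` pairs, the dirt state is the one observed at the start of each step, and the name `score` is hypothetical.

```python
def score(history, move_penalty=1):
    """Score an environment sequence.

    history: list of (dirt, action) pairs, one per time step, where dirt maps
    each square to True if it is dirty and action is the action taken.
    Awards one point per clean square per step and subtracts move_penalty per move.
    """
    total = 0
    for dirt, action in history:
        total += sum(1 for dirty in dirt.values() if not dirty)
        if action in ("Left", "Right"):
            total -= move_penalty
    return total

history = [
    ({"A": True, "B": False}, "Suck"),    # 1 clean square, no move
    ({"A": False, "B": False}, "Right"),  # 2 clean squares, one move
    ({"A": False, "B": False}, "NoOp"),   # 2 clean squares, no move
]
print(score(history))   # 1 + (2 - 1) + 2 = 4
```

Note that the measure looks only at the environment sequence and the moves made, not at what the agent believes it has achieved.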
A rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date
Rational ≠ omniscient
– percepts may not supply all relevant information
Rational ≠ clairvoyant
– action outcomes may not be as expected
Hence, rational ≠ successful
Rational ⇒ exploration, learning, autonomy
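Choosing by expected value can be illustrated with a few lines of Python. The probabilities and performance values below are invented for the example, and `expected_value` and `rational_choice` are hypothetical names.

```python
def expected_value(outcomes):
    """outcomes: list of (probability, performance) pairs for one action."""
    return sum(p * v for p, v in outcomes)

def rational_choice(model):
    """Pick the action with the highest expected performance under the model."""
    return max(model, key=lambda action: expected_value(model[action]))

# Hypothetical numbers: Suck usually works, but it can fail.
model = {
    "Suck":  [(0.9, 1.0), (0.1, 0.0)],   # expected 0.9
    "Right": [(1.0, 0.5)],               # expected 0.5
    "NoOp":  [(1.0, 0.0)],               # expected 0.0
}
best = rational_choice(model)
print(best)
```

In this sketch Suck is the rational choice even though it scores nothing one time in ten, which is the sense in which rational does not mean successful.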
PEAS
To design a rational agent, we must specify the task environment
Consider, e.g., the task of designing an automated taxi:
Performance measure?? safety, destination, profits, legality, comfort, ...
Environment?? US streets/freeways, traffic, pedestrians, weather, ...
Actuators?? steering, accelerator, brake, horn, speaker/display, ...
Sensors?? video, accelerometers, gauges, engine sensors, keyboard, GPS, ...
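A PEAS description is just a four-part record, so it can be captured as a small data structure. The Python sketch below uses a hypothetical `PEAS` dataclass; the entries are the ones listed on the slide (its trailing "..." means the lists are open-ended).

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class PEAS:
    """Task-environment description: the four PEAS components."""
    performance_measure: Tuple[str, ...]
    environment: Tuple[str, ...]
    actuators: Tuple[str, ...]
    sensors: Tuple[str, ...]

# Entries copied from the slide; the real lists are longer.
taxi = PEAS(
    performance_measure=("safety", "destination", "profits", "legality", "comfort"),
    environment=("US streets/freeways", "traffic", "pedestrians", "weather"),
    actuators=("steering", "accelerator", "brake", "horn", "speaker/display"),
    sensors=("video", "accelerometers", "gauges", "engine sensors", "keyboard", "GPS"),
)
print(taxi.sensors)
```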
Internet shopping agent
Performance measure?? price, quality, appropriateness, efficiency
Environment?? current and future WWW sites, vendors, shippers
Actuators?? display to user, follow URL, fill in form
Sensors?? HTML pages (text, graphics, scripts)
Environment types
                   Solitaire   Backgammon   Internet shopping        Taxi
Observable??       Yes         Yes          No                       No
Deterministic??    Yes         No           Partly                   No
Episodic??         No          No           No                       No
Static??           Yes         Semi         Semi                     No
Discrete??         Yes         Yes          Yes                      No
Single-agent??     Yes         No           Yes (except auctions)    No
The environment type largely determines the agent design
The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent
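The classification on this slide can be held as plain data, which makes it easy to ask which dimensions make an environment harder. The Python sketch below copies the slide's entries verbatim; `hard_properties` is a hypothetical helper, and treating anything other than a plain "Yes" as hard is an assumption of this example.

```python
PROPERTIES = ("observable", "deterministic", "episodic", "static", "discrete", "single_agent")

# Entries copied from the slide; "Partly" and "Semi" are kept as strings.
ENVIRONMENTS = {
    "Solitaire":         ("Yes", "Yes", "No", "Yes", "Yes", "Yes"),
    "Backgammon":        ("Yes", "No", "No", "Semi", "Yes", "No"),
    "Internet shopping": ("No", "Partly", "No", "Semi", "Yes", "Yes (except auctions)"),
    "Taxi":              ("No", "No", "No", "No", "No", "No"),
}

def hard_properties(name):
    """Return the properties for which the environment is not a plain 'Yes'."""
    return [prop for prop, value in zip(PROPERTIES, ENVIRONMENTS[name]) if value != "Yes"]

print(hard_properties("Solitaire"))
print(hard_properties("Taxi"))
```

The taxi fails all six tests, matching the slide's remark about the real world.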
Agent types
Four basic types in order of increasing generality:
– simple reflex agents
– reflex agents with state
– goal-based agents
– utility-based agents
All these can be turned into learning agents
Simple reflex agents
[Figure: simple reflex agent and its environment. Boxes inside the agent: Sensors; What the world is like now; Condition-action rules; What action I should do now; Actuators]
Example
function Reflex-Vacuum-Agent([location, status]) returns an action
    if status = Dirty then return Suck
    else if location = A then return Right
    else if location = B then return Left

;; Create an agent named joe whose program is the reflex vacuum program.
(setq joe (make-agent :name 'joe :body (make-agent-body)
                      :program (make-reflex-vacuum-agent-program)))

;; Return a closure that maps a percept (location status) to an action.
(defun make-reflex-vacuum-agent-program ()
  #'(lambda (percept)
      (let ((location (first percept)) (status (second percept)))
        (cond ((eq status 'dirty) 'Suck)
              ((eq location 'A) 'Right)
              ((eq location 'B) 'Left)))))
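The same agent can also be written with the condition-action rules held as data, which is closer to the box in the simple-reflex diagram. This Python sketch is an illustration, not part of the AIMA code: `RULES` and `make_simple_reflex_program` are hypothetical names, and returning NoOp when no rule matches is an assumption.

```python
# Each rule is (condition, action); the first rule whose condition holds fires.
RULES = [
    (lambda location, status: status == "Dirty", "Suck"),
    (lambda location, status: location == "A", "Right"),
    (lambda location, status: location == "B", "Left"),
]

def make_simple_reflex_program(rules):
    """Return a program mapping the current percept to an action, with no memory."""
    def program(percept):
        location, status = percept
        for condition, action in rules:
            if condition(location, status):
                return action
        return "NoOp"   # assumption: do nothing when no rule matches
    return program

program = make_simple_reflex_program(RULES)
actions = [program(p) for p in [("A", "Dirty"), ("A", "Clean"), ("B", "Clean")]]
print(actions)
```

Rule order matters here: the Dirty rule must come first, just as the first cond clause does in the Lisp version.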
Reflex agents with state
[Figure: reflex agent with state and its environment. Boxes inside the agent: Sensors; State; How the world evolves; What my actions do; What the world is like now; Condition-action rules; What action I should do now; Actuators]
Example
function Reflex-Vacuum-Agent([location, status]) returns an action
    static: last_A, last_B, numbers, initially ∞
    if status = Dirty then ...

;; last-A and last-B count the steps since each square was last cleaned.
(defun make-reflex-vacuum-agent-with-state-program ()
  (let ((last-A infinity) (last-B infinity))
    #'(lambda (percept)
        (let ((location (first percept)) (status (second percept)))
          (incf last-A) (incf last-B)
          (cond
            ((eq status 'dirty)
             (if (eq location 'A) (setq last-A 0) (setq last-B 0))
             'Suck)
            ;; Move only if the other square has not been cleaned recently.
            ((eq location 'A) (if (> last-B 3) 'Right 'NoOp))
            ((eq location 'B) (if (> last-A 3) 'Left 'NoOp)))))))
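For readers who prefer Python, the stateful program can be ported almost line for line. This is a sketch, not the AIMA code: the `threshold` parameter is introduced here for illustration (the Lisp version hard-codes 3), and `math.inf` stands in for `infinity`.

```python
import math

def make_reflex_vacuum_agent_with_state_program(threshold=3):
    """Return a program that remembers how long ago it cleaned each square."""
    # Steps since each square was last cleaned by this agent; infinity means never.
    last = {"A": math.inf, "B": math.inf}

    def program(percept):
        location, status = percept
        last["A"] += 1
        last["B"] += 1
        if status == "Dirty":
            last[location] = 0
            return "Suck"
        other = "B" if location == "A" else "A"
        if last[other] > threshold:
            # The other square may be dirty again, so go and look.
            return "Right" if location == "A" else "Left"
        return "NoOp"

    return program

program = make_reflex_vacuum_agent_with_state_program()
percepts = [("A", "Dirty"), ("A", "Clean"), ("B", "Dirty"), ("B", "Clean"), ("B", "Clean")]
trace = [program(p) for p in percepts]
print(trace)
```

Unlike the simple reflex agent, this one can return NoOp: it stays put while the other square was cleaned within the last few steps.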
Goal-based agents
[Figure: goal-based agent and its environment. Boxes inside the agent: Sensors; State; How the world evolves; What my actions do; What the world is like now; What it will be like if I do action A; Goals; What action I should do now; Actuators]
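A goal-based agent needs a model of what its actions do and a test for the goal. The Python sketch below is an assumption-laden illustration in the vacuum world: the deterministic model `result`, the goal "both squares clean", and the use of breadth-first search to find an action sequence are all choices made here, not taken from the slide.

```python
from collections import deque

ACTIONS = ("Suck", "Right", "Left")

def result(state, action):
    """Assumed deterministic model; state is (location, dirty_A, dirty_B)."""
    location, dirty_a, dirty_b = state
    if action == "Suck":
        return (location, dirty_a and location != "A", dirty_b and location != "B")
    if action == "Right":
        return ("B", dirty_a, dirty_b)
    return ("A", dirty_a, dirty_b)   # "Left"

def goal_test(state):
    """Assumed goal: both squares are clean."""
    return not state[1] and not state[2]

def plan(state):
    """Breadth-first search for a shortest action sequence that reaches the goal."""
    frontier = deque([(state, [])])
    seen = {state}
    while frontier:
        current, actions = frontier.popleft()
        if goal_test(current):
            return actions
        for action in ACTIONS:
            nxt = result(current, action)
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, actions + [action]))
    return None

steps = plan(("A", True, True))
print(steps)
```

The agent would execute the first action of the plan; "What it will be like if I do action A" corresponds to the calls to `result`.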
Utility-based agents
[Figure: utility-based agent and its environment. Boxes inside the agent: Sensors; State; How the world evolves; What my actions do; What the world is like now; What it will be like if I do action A; How happy I will be in such a state; Utility; What action I should do now; Actuators]
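A utility-based agent replaces the yes/no goal with a numeric score over states. The Python sketch below is a one-step illustration only: the model `predict`, the utility "number of clean squares", and the function `choose` are all hypothetical.

```python
def predict(state, action):
    """Assumed model: state is (location, dirty_A, dirty_B); returns the next state."""
    location, dirty_a, dirty_b = state
    if action == "Suck":
        return (location, dirty_a and location != "A", dirty_b and location != "B")
    if action == "Right":
        return ("B", dirty_a, dirty_b)
    if action == "Left":
        return ("A", dirty_a, dirty_b)
    return state   # "NoOp"

def utility(state):
    """Hypothetical utility: the number of clean squares."""
    return (not state[1]) + (not state[2])

def choose(state, actions=("Suck", "Right", "Left", "NoOp")):
    """Pick the action whose predicted state has the highest utility."""
    return max(actions, key=lambda a: utility(predict(state, a)))

best = choose(("A", True, False))   # Suck raises utility from 1 to 2
print(best)
```

Looking only one step ahead, every action ties when the current square is already clean; a fuller agent would combine the utility with lookahead over action sequences.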
Learning agents
[Figure: learning agent and its environment. Boxes: Performance standard; Critic; Learning element; Performance element; Problem generator; Sensors; Actuators. Arrow labels: feedback, changes, knowledge, learning goals]
Summary
Agents interact with environments through actuators and sensors
The agent function describes what the agent does in all circumstances
The performance measure evaluates the environment sequence
A perfectly rational agent maximizes expected performance
Agent programs implement (some) agent functions
PEAS descriptions define task environments
Environments are categorized along several dimensions:
    observable? deterministic? episodic? static? discrete? single-agent?
Several basic agent architectures exist:
    reflex, reflex with state, goal-based, utility-based