Professional Documents
Culture Documents
problem solving
Overview
Human agent:
eyes, ears, and other organs for sensors;
hands, legs, mouth, and other body parts for
actuators
Robotic agent:
cameras and infrared range finders for sensors; various motors
for actuators
Sensors
Mouth: Communication
Actuators Device
Head: General
Abilities
Motorics
Body: Application-
specific Abilities
Robots
Softbots
Mobile Agents
PDAs etc
Example: Vacuum-cleaner world
\input{tables/vacuum-agent-function-
table}
Rationality
Rational Agent:
For each possible percept sequence, a
rational agent should select an action that
is expected to maximize its performance
measure, given the evidence provided by
the percept sequence and whatever built-
in knowledge the agent has.
Rational agents
Performance measure: An objective criterion for success of
an agent's behavior, e.g.,
Robot driver?
Chess-playing program?
Spam email classifier?
Extremes
No autonomy ignores environment/data
Complete autonomy must act randomly/no program
PEAS:
Performance measure
Environment
Actuators
Sensors
Performance measure:
Time to complete course
Safe, fast, legal, comfortable trip, maximize profits
Environment:
Roads, other traffic, obstacles, pedestrians, customers
Actuators:
Steering wheel, accelerator, brake, signal, horn
Sensors:
Optical Cameras, lasers, sonar, accelerometer,
speedometer, GPS, odometer, engine sensors,
Keyboard.
PEAS
Environment:
Patient, hospital, staff
Actuators:
Screen display (questions, tests, diagnoses, treatments,
referrals)
Sensors:
Keyboard (entry of symptoms, findings, patient's answers)
PEAS
poker
medical
diagnosis
a clock a clock
\input{algorithms/table-agent-
algorithm}
Drawbacks:
Huge table
Take a long time to build the table
No autonomy
Even with learning, need a long time to learn the table entries
Agent types
Five basic types in order of increasing generality,
\input{algorithms/d+-agent-
algorithm}
The most effective way to handle partial
observability.
The agent should maintain some sort of
internal state that depends on the percept
history and thereby reflects at least some of
the unobserved aspects of the current state.
For example, for other driving tasks such as
changing lanes, the agent needs to keep
track of where the other cars are if it cant
see them all at once.
Model-based reflex agents
Very successful
18.6 kilometres
47 hours
50% attendance increase
1 tiny mistake (no injuries)
Architecture and Program
Program
Method of turning environmental input into actions
Architecture
Hardware/software (OS etc.) on which agents program
runs
RHINOs architecture:
Sensors (infrared, sonar, tactile, laser)
Processors (3 onboard, 3 more by wireless Ethernet)
RHINOs program:
Low level: probabilistic reasoning, vision,
High level: problem solving, planning (first order logic)
Knowledge of Environment
Programmed knowledge
Layout of the Museum
Doors, exhibits, restricted areas
Sensed knowledge
People and objects (chairs) moving
RHINO: no reflexes?
Dangerous, because people get everywhere
Goals:
Imagine:
A robot in the real world
A software agent dealing with web data streaming in
RHINO:
Invisible objects which couldnt be sensed
Including glass cases and bars at particular heights
Software adapted to take this into account
Determinism in the Environment
Non-deterministic environments
Have aspects beyond the control of the agent
Utility functions have to guess at changes in world
RHINO: non-deterministic
People moved chairs to block its path
Episodic Environments
In non-episodic environments:
Agent has to plan ahead:
Current choice will affect future actions
RHINO:
Short term goal is episodic
Getting to an exhibit does not depend on how it got to
current one
Long term goal is non-episodic
Tour guide, so cannot return to an exhibit on a tour
Static or Dynamic Environments
RHINO:
Fast decision making (planning route)
But people are very quick on their feet
Discrete or Continuous
Environments
Chess: discrete
RHINO: continuous
Visual data can be considered continuous
Choice of actions (directions) also continuous
RHINOs Solution to
Environmental Problems
Museum environment:
Inaccessible, non-episodic, non-deterministic, dynamic,
continuous
Internal structure
How to test of agent
whether agent
is acting
rationally Autonomous
Rational
Agent
Specifics of
the
Usual systems environment
engineering stuff
End of Chapter 2