Professional Documents
Culture Documents
2. (12 pts total, -1 each wrong answer, but not negative) For each of the following terms
on the left, write in the letter corresponding to the best answer or the correct definition
on the right. The first one is done for you as an example.
A Agent A Perceives environment by sensors, acts by actuators
Z Percept B States reachable from the initial state by a sequence of actions
U Percept Sequence C Guaranteed to find a solution if one is accessible
K Agent Function D Never over‐estimates cost of cheapest path to a goal state
P Performance Measure E Maximum number of successors of any node
L Rational Agent F Set of all leaf nodes available for expansion at any given time
M Fully Observable G Estimates cost of cheapest path from current state to goal state
Q Deterministic H Sequence of states connected by a sequence of actions
W Dynamic Environment I Represents a state in the state space
N Atomic representation J Guaranteed to find lowest cost among all solutions
R Factored representation K Maps any given percept sequence to an action
B State Space L Agent that acts to maximize its expected performance measure
I Search Node M Sensors give the complete state of environment at each time
V Link between nodes N Each state of the world is indivisible, with no internal structure
H Path O A node with no children in the tree
Y Abstraction P Evaluates any given sequence of environment states for utility
J Optimal Search Q Next state of environment is fixed by current state & action
C Complete Search R A state is a fixed set of variables or attributes, each with a value
X Expand a state S How a search algorithm chooses which node to expand next
O Leaf node T For n’ a successor of n from action a, h(n) ≤ cost(n, a, n’) + h(n’)
F Frontier U Complete history of everything agent has perceived
S Search Strategy V Represents an action in the state space
E Branching Factor W Environment can change while agent is deliberating
G Heuristic Function X Apply each legal action to state, generating a new set of states
D Admissible Heuristic Y Process of removing detail from a representation
T Consistent Heuristic Z Agent’s perceptual inputs at any given instant
3. (4 pts total, 1 pt each) Your book defines a task environment as a set of four things,
with the acronym PEAS. Fill in the blanks with the names for the PEAS components.
5. (4 pts total, 1 pt each) The heuristic path algorithm is a best-first-style search in which
the objective function is f(n) = (2−w)*g(n)+w*h(n) and the node of lowest f is expanded
next. You may assume that: the branching factor is finite; the search space contains
loops and cycles; h(n) is consistent; w ≥ 0; and each step cost ≥ ε > 0.
5c. When w = 1? A*
6. (5 pts total) Suppose you are given the following data about plant diseases:
LEAF INSECTS CLASS
moldy none healthy
wrinkled none healthy
green none healthy
moldy present diseased
wrinkled present diseased
green present healthy
6.a. (3 pts) Which choice of root node has the highest information gain resulting from
that choice? (LEAF, INSECTS) INSECTS
6.b. (2 pts) Using the root node that you chose in 6.a., draw a decision tree that
correctly predicts CLASS. present
INSECTS
none LEAF
moldy green
HEALTHY wrinkled
DISEASED HEALTHY
DISEASED
7. (8 pts total, 1 pt each) The sliding-tile puzzle consists of three black tiles (B), three
white tiles (W), and an empty space (blank). The starting state is:
B B B W W W
The goal is to have all the white tiles to the left of all the
black tiles; the position of the blank is not important.
7b. Does the search space have loops (cycles)? (Y=yes, N=no) Y
7e. Consider a heuristic function h1(n) = the number of black tiles to the left of the left-
most white tile. Is this heuristic admissible? (“Y" = yes, “N" = no) Y
7f. Consider a heuristic function h2(n) = the number of black tiles to the left of the right-
most white tile. Is this heuristic admissible? (“Y" = yes, “N" = no) Y
7g. Consider a heuristic function h3(n) = the number of black tiles to the left of the right-
most white tile plus the number of white tiles to the right of the left-most black tile. Is this
heuristic admissible? (“Y" = yes, “N" = no) N
7h. Consider a heuristic function h4(n) = h3(n) / 2. Is this heuristic admissible? (“Y" =
yes, “N" = no) Y
8. (2 pts total, 1 pt each) Suppose that there is no good evaluation function for a
problem (no cost function g), but there is a good comparison method: a way to tell
whether one node is better than another, but not to assign numerical values to either.
Answer Y (= yes) or N (= no).
8b. Suppose you also have a consistent heuristic, h(n). Is this enough to do A* search
and guarantee an optimal solution? N
9. (15 pts total, 1 pt each) Label the following as T (= True) or F (= False).
9.b. T A* search is both complete and optimal when the heuristic is consistent,
step costs ≥ ε > 0, and the total cost estimate is monotonic increasing.
9.c. F Most search effort is expended while examining the interior branch nodes
of a search tree.
9.d. F Uniform-cost search is both complete and optimal when the path cost
never decreases.
9.e. F Greedy best-first search is both complete and optimal when the heuristic
is optimal.
9.g. F A* search with a consistent heuristic and step costs ≥ ε > 0 can fail to find
the optimal solution if the search space contains loops.
9.h. F If the search space contains only a single local maximum (i.e., the global
maximum = the only local maximum), then hill-climbing is guaranteed to climb that
single hill and will find the global maximum.
9.i. T Taking two (or more) steps at once may help hill-climbing escape from
the ridge problem.
9.j. F Hill-climbing has very attractive space properties because it uses only
O(bd) space.
9.k. F Local beam search is like Depth-first search except that it keeps the N
best candidates at each level, NOT all of them.
9.l. F Simulated annealing accepts a move that makes things better with
probability exp(∆V ALUE/T).
9.m. F Simulated annealing will accept more moves at low temperature than at
high temperature.
9.o. T Simulated annealing uses constant space and can escape local optima.
10. (10 pts total) Basic definitions and expressions.
10a. (2 pts) Write down the definition of P(H | D) in terms of P(H), P(D), P(H and D), and
P(H or D).
10b. (2 pts) Write the expression that results from applying Bayes’ Rule to P(H | D).
10c. (2 pts) Write the definition of P(A or B) in terms of P(A), P(B), and P(A and B).
10d. (1 pt) T P(A and B) = P(A)P(B) if and only if A and B are independent.
10e. (1 pt) F P(A or B) = P(A) + P(B) if and only if A and B are independent.
10f. (1 pt) F P(A and B) = P(A)P(B) if and only if A and B are disjoint (i.e., do
not intersect, or do not occur together).
10g. (1 pt) T P(A or B) = P(A) + P(B) if and only if A and B are disjoint (i.e., do
not intersect, or do not occur together).
11. (18 pts total, 3 pts each) Label the following as Y (= yes) or N (= no) depending on
whether a perceptron with a “hard” decision boundary (step transfer function) can
correctly classify the examples shown. If your answer is Y (= yes), fill in a set of weights
that correctly classifies them. Use w0 as the threshold and wi as the weight for input xi.
All perceptrons have three Boolean inputs, x1, x2, and x3, and a “dummy” input, x0,
which is always equal to one. They all compute the decision function ∑ wi xi > 0.
You may not transform the input space, i.e., they operate on the stated inputs.
12. (5 pts total) Consider the following Bayesian Network. Variables A-D are Boolean:
A P(A=true) = 0.2 B P(B=true) = 0.7
C
D
A B P(C=true | A, B) B C P(D=true | B, C)
false false 0.1 false false 0.8
false true 0.5 false true 0.6
true false 0.4 true false 0.3
true true 0.9 true true 0.1
12.a. (1 pt) Use the chain rule to factor the full joint probability distribution over these
variables into a product of conditional probabilities, ignoring conditional independence
from the figure. Factor out the conditional probability of D first, C second, etc.
12.b. (3 pts) Use the structure of the network to eliminate irrelevant variables from 12.a
based on conditional independence, giving the minimum equivalent expression.
12.c. (1 pt) Substitute probabilities from the network into your equation 12.b to answer
the query: What is the probability that all four of these Boolean variables are false?
P(-a, -b, -c, -d) = P(-d | -c, -b) P(-c | -b, -a) P(-b) P(-a) = 0.2 * 0.9 * 0.3 * 0.8
= 0.0432