You are on page 1of 4

The Bridge and the Road

by Meir Retter
August 2017

1 Our Story
You are a traveler who needs to get to the other side of a canyon. There are two
ways you can get across: you can take the rickety bridge across the canyon, or
go around the canyon using the safe road. The bridge is dangerous and if you
choose to take it then you will have only an r probability (0 r 1) of making
it across without plummeting down to your death. That is, r is the safety of
the bridge. The road has no such hazard associated with it.
So why not just take the road? Well, there is a complication. There is a
hitman who wants you dead, and he has a killer robot at his disposal. Just as
you must choose whether to take the bridge or the road, the hitman will choose
one of these paths along which he will send his killer robot. You and the hitman
make these choices at the same time and independently.

There are 4 possible outcomes:


1. You and the hitman both choose the bridge. In this case, you might fall
to your death, and even if you dont, the robot kills you. Your probability of
survival is 0.
2. You and the hitman both choose the road. In this case, the robot kills
you. Your probability of survival is 0.
3. You choose the bridge and the hitman chooses the road. In this case,
youve escaped the killer robot but must get across the bridge safely. Your
probability of survival is r.
4. You choose the road and the hitman chooses the bridge. Congratulations,
youve escaped the robot and can safely walk along the road. Your probability
of survival is 1.
Obviously case 4 works out best for you, but you dont get to just choose case
4. You only get to decide to take the bridge or the road. So what should you do?
If this is a one-time scenario, then there is no definitive answer: whatever you
choose, the hitman could make the same choice, leaving you with a 0 probability
of survival. You could argue that you should take the road, because then if the
hitman doesnt make the same choice as you then you will live, but the hitman
might expect you to consider that and thus send his robot along the road!

1
Perhaps it would be better to take the bridge then...but the hitman might
anticipate this, and send his robot along the bridge...
We will instead consider the case where this scenario take place a large
number of times in a simulation. You can imagine it as a game, where the two
players (you and the hitman), each turn, choose bridge or road. If you
survive you get a point; if you die the hitman gets a point. This happens a large
number of times, and each players goal is to obtain as many points as possible.
Now, you could of course simply play the game based on your intuition,
trying to predict what your opponent will do and trick him and whatnot, but
instead we would like to find an optimal rule that you can use to make your
decisions. Of course, this rule will have an element of randomness to it; if it
was unchanging or followed a noticeable pattern then the hitman could take
advantage of that. Also, your strategy on a given turn should take into account
what the hitman has been observed to do on previous turns (if the hitman
chooses the road every single time, for example, you would be wise to start
choosing the bridge). One last thing: the hitman is no dummy either, and uses
strategy just like you to build his own optimal decision-making rule.
A few miscellaneous things to note before we proceed:
1. The value of r is common knowledge. You know r, the hitman knows r,
you know the hitman knows r, the hitman knows you know he knows r, etc.
2. It doesnt matter to the hitman whether you die by killer robot or by
falling off the bridge. He just needs you dead.
Now lets talk strategy.

2 Building a Decision Rule


Because you dont want to be predictable, your rule will not be some deter-
ministic algorithm like if the hitman has done this sort of stuff then take the
bridge, and if hes done that sort of stuff take the road. This would allow the
hitman to exploit you. Instead, your strategy will be as follows: each turn, you
will select a number y with 0 y 1 which will be your probability of taking
the bridge. Your probability of taking the road will be 1 y. You will choose y
based on r and your opponents activity in previous turns.
Of course, the hitman is doing all of this as well. Each turn he selects a
number k which will serve as the probability of him sending the killer robot
along the bridge. The robot is sent along the road with probability 1 k.
Now, suppose on a given turn youve selected a y and your opponent has
selected a k. Then your probability of survival ps is given by

ps (y, k, r) = (y)(1 k)(r) + (1 y)(k)(1)


= k ky + yr kyr

2
Now we ask a question that might seem pointless but actually is the key
to creating an optimal decision rule. If you somehow knew the value of k for
a turn, what would you select as y? Well, the values of k and r are known
constants so ps is just a function of y. We differentiate:
dps
= k + r kr
dy
Since k and r are known, dp dy is a constant. If its a positive constant,
s

then increasing y increases your probability of survival ps , and you should set
y = 1 (take the bridge). But if dpdy is negative, then the way to increase your
s

probability of survival is by decreasing y as much as possible, and you should


set y = 0 (take the road).
So, given k, you should take the bridge if and only if dp dy > 0. By simple
s

algebraic rearrangement, we find that this occurs when


r
k<
1+r
What does this mean intuitively? It means that you should only be willing to
take the bridge if the chance the robot will be sent along the bridge is sufficiently
low, where sufficiently depends on r. If the bridge was perfectly safe then you
would take it if and only if the robot has a < 50% chance of being sent there.
Substituting r = 1 into the formula shows this to be true. However, if the
bridge is very unsafe then you would obviously only take it if the killer was
almost certainly taking the road. Substituting values of r close to 0 verifies this
r
too. One last thing to note: the maximum that 1+r can ever reach is 12 . This
means if k > 12 , you should take the road no matter what.
Now we turn to the hitmans perspective. If he knew the y you selected
before a turn, what is his best move? Like before, we differentiate, but this
time with respect to k.
dps
= 1 y yr
dk
Keeping in mind that the hitman seeks to minimize ps , we see that he will
send the robot along the bridge if and only if dp
dk < 0, or equivalently
s

1
y>
1+r
This can be appreciated intuitively too. For example, if the bridge is very
unsafe, the killer might as well send his robot along the road. Substituting low
values of r into this formula, we see that the robot will be sent along the road
unless the hitman knows you are almost certainly taking the bridge. On the
other hand, if the bridge is safe (r = 1) then hell choose the bridge if and only
if your probability of taking the bridge is above 12 .

3
3 Applying the Decision Rule
Even though the value of k is not known to you, there are two ways that you
can practically apply this theory to the canyon game. The first is to realize that
an equilibrium is reached when at least one of the players chooses the border
value for his opponents decision rule. For example, suppose you always select
y = 1+r1
. Then dp dk = 0, that is, the hitmans choice of k has no effect on
s

r
your survival rate. Similarly, if the hitman selects k = 1+r , then dp
dy = 0. (In
s

the long term, both you and the hitman might as well choose these values as
your probabilities, because using any other value allows the opponent to take
advantage using a decision rule.) Recalling that

ps = k ky + yr kyr
1 r
we can substitute either y = 1+r , or k = 1+r , and derive your constant
resultant survival rate
r
ps =
1+r
Doing the substitutions for both y and k gives this too of course, but only
one is necessary.
A second way one might practically apply the decision rule is by proportion.
You make an arbitrary move on your first turn, and after that you keep track
of the fraction of times that your opponent took the bridge, use this as their
selected probability, and apply the inequality. It turns out that if even one of
r
the players does this then ps converges to 1+r , as before. Furthermore, if both
players do this, then the fractions of times they each will be choosing the bridge
will too converge to the values in the equilibrium described. (These facts were
verified by computer simulation.)

You might also like