Professional Documents
Culture Documents
Corresponding author.
ABSTRACT
The beer game and the supply line underweighting (SLU) theory are central to our knowledge of
decision making in dynamic environments such as supply chains. The core of these theories is
that people are incapable of recognizing the pipeline inventory and this is the main cause of
overordering and dysfunctional behavior. This article identifies lacunae in the theoretical and
empirical foundations of extant literature and proposes an alternate explanation, a correction
model, explaining why overreactions occur. We adopt a multi-method research design,
comprising a field case study and laboratory experiments, to ground our findings.
Subject Areas: Dynamic Decision Making, Misperception of Feedback, Supply Chain
Management.
1
INTRODUCTION
Dynamic decision tasks are those in which decisions made today alter the state of the system and
thus affect information that decisions to be made tomorrow will be based on (Diehl & Sterman,
1995). The feedback loop connecting decisions and environmental cues embodies the dynamic
nature of the system. Experiments have shown that people learn poorly when they are delayed in
receiving feedback about the outcomes of their actions. These studies have been conducted and
applied to a variety of settings such as firefighting (Brehmer, 1995), inventory control (Sterman,
1989a, b; Davis & Kottemann, 1995), and management of software development teams
(Sengupta & Abdel-Hamid, 1993). This substantial body of knowledge rests on the supply line
underweighting (SLU) theory. At its core, SLU holds that people are incapable of recognizing
the delayed effects of their actions (Sterman, 1989a, b; Diehl & Sterman, 1995; Croson &
Donohue, 2002, 2005, 2006; Senge, 2006; Croson, Donohue, Katok, & Sterman, 2008). SLU is
generally considered an undesirable cognitive bias that causes several organizational problems.
SLU implies that inventory managers are incapable of keeping track of in-transit shipments
and as a result, they overorder. The results are delayed learning, poor performance, and system-
wide chaos (Sterman, 2000; Senge, 2006). SLU bias is articulated through the stock management
formulation model (Sterman, 1989a), which is the building block of system dynamics modeling
in supply chain management, inventory management, and capacity management. The stock
management formulation model also has various other applications such as human learning, and
the modeling of organizational-level dynamics (Sterman, 2000, 2006; Dogan, 2007). Sterman
(1989a) first proposed this model in the beer game, which simulates a serial supply chain game
with four players. Participants are asked to minimize the total supply chain cost within the rules
of the game; customer demand is exogenous, and stock-out costs and inventory costs are charged
to each player. Decision making is complicated by the fact that there are lead times for
2
information and physical flows. Today the beer game is one of the most popular business
simulation games. More detailed descriptions may be found at www.systemdynamics.org and in
Sterman (1989a). Stermans model was later validated by several beer game based studies
(Sterman, 2000; Croson & Donohue, 2002, 2005, 2006; Sterman, 2006; Wu & Katok, 2006;
Croson et al., 2008).
Surprisingly, despite being developed back in 1989, SLU has only been studied in
laboratory settings and has never been tested in a field setting (to the best of our knowledge).
Exploring how SLU plays out in a real world supply chain setting is an interesting question in its
own right, given the salient position SLU occupies in literature. In addition to the lack of
empirical support, we believe that there are several reasonable arguments which cast serious
doubts on the robustness of SLUs theoretical basis. Evidence of SLU even within beer games is
not unequivocal; in fact, as we show in this paper, it is just as likely that the direction of effects is
opposite to that predicted by SLU. This exposes a need to reexamine the theoretical and
empirical support of SLU and seek out more robust explanations in order to understand and
improve managerial decision making. Our study responds to this need.
This study uses a multi-method approach (Boyer & Swink, 2008) involving a case study
and laboratory experiments to identify an alternative explanation that complements existing
theory. The behavior posited here is centered on interpersonal dynamics: frequently, customers
in the real world yell at unresponsive suppliers hoping that it will improve future deliveries, or at
least to vent their frustration. In some cases, however, free communication is not possible. For
example, in the beer game, participants are not allowed to speak to their supply chain partners.
Similarly, in the real world, a powerful supplier will not listen to the smaller buyer, so
communication may not be possible. In such cases, customers may inflate their orders
commensurate with the level of undersupply, hoping this correction will influence and
3
improve the suppliers behavior. This behavior, which we call the correction model, is developed
through a field case study and a lab experiment.
ANALYSIS OF LITERATURE
Overview
SLU bias was discovered by Sterman (1989a) through an experiment based on the beer game.
SLU was subsequently incorporated into the literature on organizational behavior and decision
making, and found applications in many settings outside inventory control. Sterman (1989a)
developed this theory through the stock management formulation, Equation (1), which is now the
workhorse of system dynamics modeling, and has applications in a wide array of business
domains (Sterman, 2000). In this formulation,
t
S and
t
SL
represent the inventory and supply
line positions respectively at time t . The desired stock and desired supply line are denoted by
*
S
and
*
SL , respectively, and are assumed to be constants chosen by the managers through the
anchoring and adjustment (A&A) heuristic (explained in the next section) at the commencement
of the task. Denoting
t
O as order rate,
t
L as the expected loss rate (demand from downstream),
S
! and
SL
! as the fractional adjustment rates for stock and supply line, and
SL
S
!
"
!
= as the
weighting ratio, the hypothesized decision rule is
( ) ( )
* *
0,
t t S t SL t
O Max L S S SL SL ! !
" #
= + $ + $
% &
. (1)
The beer game simulation is then used to generate data and estimate the coefficients of this
model. A weighting ratio less than 1implies a lower weight for the supply line than for inventory.
It is then argued that because there is no rational justification for weighting the two variables
differently it must constitute a cognitive bias and considered evidence of SLU.
4
Empirical Support for SLU
There is abundant evidence that people perform poorly in the beer game (Senge, 2006; Sterman,
2006; Gino & Pisano, 2008). SLU is thought to be a robust phenomenon based on the
consistently poor performance noticed in beer game studies (Sterman, 1989a; Sterman, 2000;
Croson & Donohue, 2002, 2005, 2006; Oliva & Gonalves, 2005; Dogan & Sterman, 2006;
Senge, 2006; Sterman, 2006; Croson et al., 2008). Consider, for example, the decision heuristic
hypothesized by Croson and Donohue (2006). These authors relaxed Stermans assumptions that
people anchor and adjust, and instead proposed a simpler formulation:
[ ]
0,
t O I t R t S t N t t
O Max I R S N ! ! ! ! ! ! = + + + + + (2)
Orders
t
O
are regressed against the independent variables: in-hand inventory
t
I , orders
received
t
R , incoming shipment
t
S , and total outstanding orders
t
N . According to this
formulation, if there were no SLU, then the decision-makers would be fully cognizant of each of
the four variables; therefore 1
I N S
! ! ! = = = " . Croson and Donohue (2006) tested this
formulation using their beer game simulation data, reasoning that
[i]f participants are accurately accounting for the supply line, the coefficient of orders outstanding
should be the same as the coefficient on inventory. If Stermans conjecture holds (i.e. participants
are underweighting the supply line), then we should find
N I
! ! > .
(Croson & Donohue, 2006, p. 330)
Their analysis did in fact seem to support their hypothesis. Based on the average values
( 0.0302
N
! = " and 0.2368
I
! = " ) it was concluded that SLU was supported (i.e.
N I
! ! > ).
Croson and Donohue first estimated the individual-level parameters using regressions of
individual players decisions and then averaged them over the complete sample to base their
conclusions. Averaging the parameters can, however, mask the individual effects. A similar
5
example would be to conclude normalcy based on the combined average weight of a sample
comprising only obese and underweight people. A more appropriate approach would be to use
the individual-level results and model estimates, or to first aggregate the data, and then
investigate the model for group effects. Therefore we first re-examined Croson and Donohues
(2006) results on the model estimates individually. This led to some interesting findings.
The absolute value of
I
! exceeded that of
N
! , as predicted by theory. It must be noted,
however, that for this dataset to support SLU, not only should
I
! exceed
N
! , but the sign of
each parameter must also be within the bounds set by the SLU framework. For about 60% of the
participants, the weight for the supply line was within the hypothesized range of 1 0
N
! " < <
which was consistent with Stermans SLU framework. However, surprisingly, for the remaining
40% of the participants the weight was 0 1
N
! < < (i.e. people were placing opposite-to-
hypothesized sign of weight to the supply line). It implied that the more the quantity on order,
the more the present order quantity. This behavior is quite at odds with the hypothesized SLU
behavior: rather than forgetting to subtract the supply line, the results indicated that subjects
were cognizant of the supply line, and were adding it to their net quantity. Moreover, whereas
the average value of
N
! was found to be !0.0302 (as noted previously, this was statistically
significant), it did not seem to be substantially significant:
N
! was very close to 0. Even if one
or two data points were removed, the results could swing the other way and indicate a positive
value instead of a negative. As Combs remarked with concern about management research,
I see more and more studies in which correlations and standardized regression coefficients of 0.05
or less receive the prized label highly significant I wonder whether the corresponding
phenomenal statistical power might mask other shortcomings of our research designs and leave us
with (phenomenal power and) itty-bitty effect sizes that limit the relevance of our research.
6
(Combs, 2010, p. 9)
Effect sizes of less than 0.05 would mean that the field is plagued by false positives (Ioannidis,
2005). Therefore, the beer game evidence of SLU does not seem robust.
Metters (1997) averred that ignoring prior orders is highly implausible in real-life
inventory management. However, Sterman offers the following anecdotal evidence in support of
SLU:
You cook on an electric range. To get dinner going as soon as possible, you set the burner under
your pan to high. After a while you notice the pan is just hot enough, so you turn the heat down.
But the supply line of heat in the glowing coil continues to heat the pan even after the current is
cut, and your dinner is burned anyway.
You arrive late and tired to an unfamiliar hotel. You turn on the shower, but the water is freezing.
You turn up the hot water. Still cold. You turn the hot up some more. Ahhh. Just right. You step
in. A second later you jump out screaming, scalded by the now too-hot water. Cursing, you
realize that, once again, youve ignored the time delay for the hot water to heat the cold pipes and
get to your shower.
(Sterman, 2006, p. 47)
These examples are intuitively appealing, but it is not readily evident to what extent they
support the SLU model. To begin with, they do not resemble real-life inventory management
settings. Managers usually have refined information about their own operations and in general
make good decisions (Bowman, 1963; Bolton & Katok, 2008). In regular and repeated tasks such
as inventory management, managers have at least a passing familiarity with their environment,
know the range of demand to expect, use rules of thumb for ordering, and would not routinely
make egregious mistakes, just as most of us do not routinely burn dinner or scald ourselves in
bathtubs at home or even in unfamiliar hotels. As previous studies in field environments have
7
shown, experienced decision makers deal effectively with temporal dependencies; Joslyn and
Hunts (1998) field experiment of police dispatchers decisions regarding disposition of police
units, and Kanfer and Ackermans (1989) study of air traffic controllers are only two examples.
As Gibson (2000, p. 144) notes, this contrasts with Stermans (1989a) original observation that
decision makers are fundamentally unable to cope with feedback delays.
Moreover, in real-life inventory management, the supply line is almost always explicitly
recorded by means of enterprise resource planning (ERP) systems or other electronic or manual
recording systems. It therefore seems difficult to miss the supply line while computing current
orders. Stermans anecdotal evidence may not be representative of such settings. Be that as it
may, barring these anecdotal examples and the beer game evidence, to the best of our
knowledge, there is no evidence from the field of real-life inventory management that supports
SLU. We now present analysis of the theoretical support of SLU.
Theoretical Support of SLU
Let us recapitulate the anchoring and adjustment (A&A) heuristic (Tversky & Kahneman, 1974).
This heuristic captures the bias wherein people anchor or overly rely on certain values, and then
adjust them to account for other elements of the situation. Usually, once the anchor is set, there is
a bias toward that value, as described by Tversky and Kahneman (1974). In this study, people
were asked to guess the percentage of African nations that are members of the United Nations.
Those asked whether the percentage was more, or less, than 10% guessed lower values (their
answer averaged 25%) than those who were asked if the percentage was more, or less, than 65%
(their answer averaged 45%) (Figure 1). This indicated that, firstly, people subconsciously
anchored their answer to the cue. This is the anchoring part of the heuristic. Secondly, suppose
30% was the answer that the subject pool would have given on average in the absence of any
externally provided anchor, the people then adjusted the anchored value toward their
8
independent answer. This meant that the adjustment was in the correct direction (i.e. toward their
independent belief of the answer) but its magnitude was insufficient (i.e. it stopped somewhere
mid-way between the anchor and the independent answer). This is the adjustment part of the
A&A heuristic.
Insert Figure 1 here
Against this background, the stock management formulation may be analyzed. We find
that although A&A is the theoretical foundation of the stock management formulation, the
literature has been ambiguous on what exactly the anchor is. Sterman (1989a, p. 324; 2000, p.
670; 2006, p. 29) identifies the anchor explicitly, but inconsistencies remain in how it is
explained with reference to the supporting equations (Sterman, 1989a, p. 331):
t t t t
IO L AS ASL = + +
( )
*
t S t
AS S S ! = "
Here the anchor is the expected loss rate [emphasis added]