Professional Documents
Culture Documents
Abstract. A model of decision making under bounded rationality is presented that combines
satisficing behavior with learning and adaptation through environmental feedback. The aspira-
t ions, or goals, of the decision maker dynamically adjust in response to t he observed sequence of
past decisions and their corres ponding effect s on the deci sion maker's objective functio n. A simple
linear res ponse model is employed to represent the beliefs of the decision maker concerning the
causal connection between hi s/ her deci sions and the resulting objective fun ction valu e. The com-
bination of t hese simple elements yields a decision process model rich in dynamic behavior; it can
exhibi t optimizing behavior in the long-run and chaoti c pseudo-random search in the short- run.
As such , the model bridges the gap between substanti ve rat ionality and procedural rationality.
Kev Words. Bounded rationality, adaptation, learning, aspirations, expectations, chaotic dynamics
Such arguments can be compelling, and the neoclassical [A] Solutions are simple-minded.
paradigm has had impressive success in accounting for
certain empirical regulariti es. All of this suggests that Second, the search process used to find a suitabl e solution
mathematical optimization as a characterization of lonq- obeys four principles. New altern at ive solutions are
run economic equilibrium needs to be taken seriou sly . If, synthesized from the modification of existing decision rul es
however, we are instead interested in the d1Jnamic response or the currently implemented solution. Alternatives are
of economic agents to an unanticipated change in the envi- more often considered one at a t ime - very seldom are all
ronment, it is just as surely necessary to model precisely the feasible alternatives available simultaneously. The
how decisions are actually made. Neoclassical short-run search for a new or better solution is undertaken only when
optimization models are not the answer in such circum- it is deemed necessary ; only when it is observed that goals
stances, for they are qualitat ively indistingu ishable from are not being met. When search for a solu t ion is under-
their long-run cousins. They still assume complete taken, a satisficing mode is used; t he fi rst solution found
information , just wh at is not available in t imes of rapid t hat is "good enough" is implemented and search terminat-
change and transition' ed. Therefore, the search for alternatives is:
473
474 K. D. Wall
[E] and employs satisficing behavior. If f(",(t)) > a(t+1), set 0:(t+1) = 0 and
stop the search process. (5)
Third, there is adaptaion and learning with respect to the
formation of goals, search strategies, and the information Finally, adaptation and learning requires that a(t+1) be
set upon which the decision process relies. Goals are defined adjusted to changing experience based on J(t):
by Simon in terms of the aspirations of the decision maker
and these are defined, in part, by the experiences of the a(t+1) = 1lI3(J(t)). (6)
decision maker. Search strategies, through experience, are
found to be either adequate or deficient and in need of
alteration. Exactly what constitutes relevant and impor-
tant information to which the decision maker must pay THE OPERATIONAL MODEL
attention also comes from experience and the lessons of
success and failure. Thus, there must be mechanisms for Any empirical implementation of the model described by
adaptation and learning in the formation of: (1) - (4) requires concrete form for J(t) and the mappings
llIi, i = 1, 2, 3. Broad guidance in such a task is provided
[F] aspirations; by three overriding considerations. First, research in cog-
nitive psychology has found that humans employ simple
[G] search strategies; linear relationships to capture causal connections between
variables and that linear models capture a great deal of this
[H] attending the environment. behavior (Dawes and Corrigan, 1974). Second, there is evi-
dence of persistence and inertia in the formation of beliefs
The manifestation of these tenets in mathematical terms and in the processing of information used to modify rela-
depends directly upon the type of problem under study. In tionships (Edwards, 1968; Einhorn, 1980). Third, percep-
economics the underlying decision problem is that faced by tion and judgement suffer from Significant biases with
a single decision maker who must select the values of N directly observed data subjectively assessed and interpreted
decision variables at time t so as to maximize the value of (Kahneman, Slovic, and Teversky, 1982, Parts II-IV). The
some objective function, f. The decision variables are repre- simplest possible mechanisms should be em- ployed with
sented by an N-vector, ~(t), indexed by t, the time at the focus primarily on linear relationships.
which their values have been fixed. The objective function
is then written as a function of ~(t), f = f(t) = f(~(t)). The With these considerations in mind the information set is
choice of ~(t) is limited by constraints and so the decision assumed to consist of nothing more than past and current
maker must select ~(t) from some set of feasible alterna- observations on f(t) and ",(t). Thus
tives denoted by $(t). Time is assumed to evolve discrete-
ly so t takes its values from the set integers, .;f. The J(t) = {f(T), ~(T); T = t, t-1 , t-2, .... }. (7)
decision variables are assumed to be continuously valued so
",(t) takes its values in $n. The objective function f is This is to be contrasted with the information set assumed
assumed to be a continously differentiable mapping from in the neoclassical optimizing paradigm where complete in-
$n to $1. The problem faCing our decision maker for each formation about f assumed; i.e., the functional form of f(t)
value of t can now be stated as: and all parameters, derivatives, etc. A further restriction
can be placed on Eq. 5 by requiring a finite "memory" and
max f(t) = f(",(t)) subject to ",(t) C $(t). this is done below by inclusion of only the current and N
previous observations are required.
At the end of each time period the decision maker observes
the resulting performance - the actual value of f(t) - and Equations (2) and (3) are replaced by a set of operations
any other variables of interest. Time is incremented by one consistent with [D] - [G] but involving only simple linear
period and the decision process repeated. functions. To wit, the decision maker at the beginning of
period t+ 1, holding aspirations a( t+ 1) , chooses "'( t+ 1) to
force the expected objective function, fe, at the end of the
Given this framework for the problem, the disiderata of a period to equal a( t+ 1). Algebraically this amounts to
behaviorally based model incorporating bounded rationality requiring "'( t+ 1) satisfy the constraint
are implemented as follows. First, consideration of [A] - [C]
suggests decision rules specifying a change in decision a(t+1) = fe(t+1) = fe(",(t+1)).
variables relative to previous values:
Furthermore, the decision maker uses a linear representa-
~(t+1) = ",(t) + ~",(t+1) tion of reality in constructing re:
= ",(t) + 0:(t+1)g(t+1) (1 ) fe(t+1) = f(t) + £t'~~(t+1). (8)
where g( t) represents the local search direction and 0:( t) The desired increment to ~(t) can then be obtained by
denotes the step length to be taken along this direction. computing the minimum norm solution to this equality
Furthermore, these two quantities are chosen on the basis constraint; Le.,
of an information set J( t) gathered from experience:
~~(t+1) = [a(t+1) - f(t)]£t/lI£tI1 2.
g(t+1) = 1lIt(J(t)) (2)
= {[at t+ 1)-f( t)l/ lI£tll }{£t!II£tll}
0:(t+1) = IlIM(t)). (3)
= 0:(t+1)g(t+1)
Realistic modeling would suggest that J(t) be very simple The step length is seen to depend upon the difference be-
and limited in its scope. In consideration of [D] - [F] the tween the achieved objective and the desired value of the
decision maker is presumed to hold at the end of time t an objective. Thus, if the decision maker is close to his/her
aspiration level a( t+ 1) representing a value of the objective goal then a small step is taken, while large discrepancies
function f(.) which he/she believes is satisfactory and a between the actual and desired objective value will lead to
reasonable goal to attain in the next time period. Motiva- more bold search. If the decison maker has acheived the
tion for continued search is provided by percieved failure to goal then no step is taken and satisficing behavior obtains.
achieve the currently defined goal; i.e., The search direction is seen to be a unit vector which
A Bounded Rationality Decision Process Model 475
approximates the direction of steepest ascent but uses only r(t+1) = [l-~r(t) + 6[f(t) -f(t-1)] (11)
"backward looking" information. It does, however, change
with experience and therefore embodies the learning and
adaptation required by [G]. a(t+1) = [l-tp]a(t) + tpf(t+1) +,r(t+1) (12)
The N-vector ~t represents the decision maker's best Step 3. Update estimate of the environmental response
estimate of the sensitivity of f( t) to changes in 2>( t). It is
comprised of nothing more than simple finite first differ- ~t = X(t)-l~f(t) (13)
ences derived from J(t). This formulation follows after the
work of Cyert and March (1962) and Day (1967) who argue Step 4. Determine new decision to implement
that humans employ finite first difference approximations
when dealing with the concepts of derivative and gradient. !l(t+1) = ~t/lI~tll; (14)
In the one-dimensional case where 2>(t) is a scalar ~t is
simply a(t+1) = [a(t+1) -f(t)lIlI~tll; (15)
M(t) = X(t)~t
DYNAMIC PROPERTIES OF THE MODEL
where X'(t) is the NxN matrix obtained by columnwise
concatenation of the vectors 2>(t-j) - 2>(t-j-1), O<j<N-1. If The model presented above is inherently dynamic, repre-
the rows of X(t) are linearly independent then a unique senting the evolution of decision making over time by a set
expression exists for ~t: of difference equations. In the terminology of dynamic sys-
tems, it is capable of describing transient behavior or the
short-run fluctuations of the decision variables as they are
incrementally altered in the search for a satisfactory so-
lution. For such models two overriding issues arise: (1) the
All that remains to be speci fied is the exact form of (4), existence of steady-state, or long-run, solutions; and (2)
and here we appeal to March and Simon (1958) and March the stability of these solutions.
(1988) as a point of departure. Aspirations are hypothesized
to adjust to experience by a linear adaptive adjustment For use of the model in economics it seems reasonable to
mechanism. Two additional terms, however, are added demand that the model demonstrate neoclassical optimiz-
representing environmental feedback from the ob- served ing behavior in the long-run under circumstances where
values of fl t) and the observed rate of change in f( t). The learning and adaptation would produce an accumulation of
result is a generalized double exponential smoothing of f(t): knowledge leading to complete information. In other words,
it seems reasonable to demand that the neoclassical opti-
mum be a solution to the model and that the model be able
to converge to this solution.
f(t+1) = [l-,O]f(t) + ,Of(t)
The remainder of this section presents some initial results
r(t+1) = [l-~r(t) + 6[f(t) - f(t-1)] demonstrating that these demands can be met. First the
following theorem demonstrates that under certain condi-
tions bounded rationality eventually leads to perfectly
a(t+1) = [l-tp]a(t) + <pf(t+1) + ,r(t+l). optimizing behavior.
Aspirations are seen to adjust to exponentially smoothed THEOREM 1. Consider f: se n .... se l , strictly concave and differ-
estimates of f(t) and ~f(t). Thus it is not only the per- entiable on an open bounded set die sen. Let 2>* E dlsatisfy
cieved level of the objective function but also the percieved
rate of change of the objective function that influence aspi-
*
f(2)*) > f(2)) for all 2> E <?f, 2> 2>*, and take art) = f* for all
t . Define 2>(t+1) to be the element of $(t) e dlnearest to
rations. Goal formation is argued to be significantly affect-
ed by the direction of change of the objective with a rapidly
rising (falling) f(t) leading the decision maker to increase ~(t+1) = 2>(t) + a(t+1)!l(t+1)
(decrease) the value of alt+1) above (below) what would
have been the case by keying only on the level of f(t). The for t = 1, 2, 3, ... and where the initial conditions have been
inclusion of a rate of change effect does not appear to have assigned so that frO) > f(-l) > .. . > f(-N). Then the
been considered in the existing li terature. sequence {2>( t)} converges to 2>*.
= [f* - f(~(0))]!;o/II!;o1l2 Fig. 1. Profit falls off rapidly for capital-to-labor ratios less
than 0.15 and production rates less than 15000 units. The
= DfCrJ*)[~* - ~(0)]!;o/II!;o112 "ridges" encountered just prior to these two precipices of-
ten confound many search algorithms by inducing zig-zag-
and ging behavior. More importantly, the profit function is
11 ~(1) - ~(O) 11 ~ IIDf(!J*)[~* - ~(O)lIl/II!;o11 strictly concave only within a neighborhood of~* = [0.3067,
47,274.41 ]' (at which point f* = $183,662.7). Thus for large
~ Df(!J*) 11 II~* - ~(O) 11/11 Df(!J) 11· values of Xl and X2, strict concavity does not hold.
Simulation Example. A very ~ood two dimensional example A much more interesting example is presented in Figs. 5-7.
is found in Day and Tinney (1968). A firm seeks to maxi- Here a more realistic and challenging situation has been
mize its profit through the manipulation of two decision simulated where the environment is no longer stable. Two
variables: X1(t) , its capital-to-labor ratio during period t, shifts in the , parameters of the profit function have been
and X2(t), its production level during period t. At the end introduced, corresponding to shifts in demand curve and in
of each period the firm observes its profit f(~(t)) = f(X1(t), the factor input supply curves. Over the period of the
X2(t)) = f(t) defined as: simulation there are now three different maxima that must
be sought:
f(~(t)) = lToX1(t)1-lT1 -1)Od(X2(t)_w2x1 (t))/WO]01 ~* = [0.31,47274]', f* = $183663;
The difficulty presented to our model of decision making For the model to be truly useful in empirical research it
can be gauged from the contour plot of f(~) displayed in must permit estimation and testing in conjunction with
A Bounded Rationality Decision Process Model 477
time series data. This requires casting eqs. (9) - (17) in a tions can be specified, but the ones presented in this paper
form suitable for econometric work and one framework are believed to be the most simple. Even so, these simple
immediately suggests itself. specifications, when coupled with learning and adaptation
demonstrate almost amazing abilities.
Equations (10) - (12) comprise a recursive system of linear
difference equations that bring to mind a state-space A number of questions are to be investigated in the near
approach employing a Kalman filter for generating model future. First there is the proof of convergence to the
residuals. neoclassical optimum when a(t) is not fixed at h. The
convergence of the algorithm has been demonstrated in
To see this, define as the state a 3xl vector, ll(t), where simulation, but is seen to depend in a complicated way on
a(O), the parameters ,8, /, 0, and cp, and the specification of
ll( t) = [i( t), i( t), a( t )J' $(t). A proof appears possible with the aid of a theorem on
algorithmic convergence by Zangwill (1967, Chpt. 11).
and define 2i(t) to be a vector of (N+l) exogenous, or
predetermined, inputs: Next there is the investigation of the effects of a changing
environment. The second example shows the algorithm
capable of finding new optima as the environment
2i(t) = [fit), f(t-l), ... , f(t-N); ,
undergoes shifts. It seems important to establish if limits
~(t), ~(t-1), ... , ~(t-N)J . exist to the ability of the model to learn enough to keep
pace with rapidly changing environments. For example,
A a state space form representation of the decision model "At what 'speed' of change in the environment does the
immediately results:
model be~in to lose its ability to 'catch up' with the
optimum? '
ll(t) = Fs(t-l) + Gz(t) (18)
Finally, there are a number of normative questions that the
l:( t) = H(2i( t) )ll( t) + 12(2i( t)) (19)
model can be used to answer. For example, if we assume a
where boundedly rational decision maker, as above, how can the
information set be ammended to help the decision maker
(1-,8) 0 0]
E= [
0 (1-0) 0 ,Q = [,80 0 0 ... 0]
-0 0 ... 0 , converge more rapidly to the maximum. In a rapidly
changing environment, how can the information set be
<.p / (l-cp) 0 0 0 ... 0 altered to aid in tracking the optimum and, hopefully , in
catch up with it? In other words , what can be done to
J:!(2i(t)) = [ Q : Q : ctlll!;tlI2J, increase learning and speed adaptation? The mod el appears
ideally suited to help in answering these and similar
I2(2i(t)) = [~(t-1) - f(t-l)!;tlll!;tlI2J questions.
and l:(t) = ~(t). The state space model is linear in the state Actual use of the model in macro- and micr~conometric
but time-varying in the output equations because of the studies has yet to be done. It is hoped that the inherent
terms involving !;t!II!;tIl 2. It should be noted, however, that dynamics of the model will make it ideal in explaining why
this time variation in the coefficient matrix H is due solely "getting there is half the fun". It may yield useful
to exogenous variables and may be treated as predetermin- econometric estimates of speeds of adjustment and
ed, Estimating the model can be accomplished by a number explanations of the ability of an industry to compete in a
of techniques, for example that used in Burmeister, Wall, changing world economy , or of the response of a national
and Hamilton (1986) , Kalaba and Tesfatsion (1980,1988), economy to shocks in world supplies of energy and capital.
or Pagan (1980). If one is willing to state stochastic
hypotheses regarding additive disturbances in (18) - (19) ,
then a Gaussian likelihood function can be postulated and REFERENCES
the methods of the first and last references above apply. If
one does not wish to introduce additive random errors then Baumol , W. J. and R. E. Quandt (1964). Rules of Thumb
a least-squares approach can be taken and the methods of and Optimally Imperfect Decisions. A.E.R., 54, pp.
Kalaba and Tesfatsion applied. In either case, the methods 23--46.
are well developed , tried and true. Burmeister, E., K. D. Wall , and J. D. Hamilton (1986).
Estimation of Unobserved Expected Monthly Infla-
tion using Kalman Filtering, J. Bus. and Econ.
DISCUSSION AND CONCLUSIONS Stats., 4, no. 2, pp 147-1 60.
Cyert, R. m. and J. G. March (1962). A Behavioral Theory
The primary purpose of this paper is the presentation of an of the Firm. Prentice-Hall, Englewood Cliffs, N. J.
operational model of boundedly rational decision making Dawes, R. and B. Corrigan (1974). Linear models in
that can be used in empirical studies of dynamic economics. decision making. Psychological Bull. 8], pp. 95-106.
The model's main attributes are: (1) its inherently dynamic Day, R. H. (1967). Profits, learning, and the convergence of
nature, for it naturally explains the dynamic evolution of satisficing to marginalism. Q. J . E. , 81, pp.302-311.
decsions by framing them in terms of difference equations; Day, R. H. and E. H. Tinney (1968). How to cooperate in
(2) its ability to depict both neoclassical optimizing business without really trying. J. P. E., 76, pp.
behavior and non-neoclassical suboptimizing behavior; (3) 583-600.
its very modest assumptions concerning the information set Edwards , W. (1968) Conservatism in human information
in the possession of the decision maker; (4) its very modest processing. In B. Kleinmuntz (Ed.), Formal repre-
demands on the computational and cognitive abilities of the sentation of human judgement. Wiley,New York,
decision maker; and (5) its explicit incorporation of pp.17-52.
learning and adaptation and the simple mechanisms by Einhorn , H. (1980). Learning from experience and sub-
which this takes place. Learning and adaptation is the optimal rules in decision making. In T. S. Walls ten
dominant component in the model, for it determines (Ed.), Cognitive Processes in Choice and Decision
whether one obtains the optimizing solution or a Behavior, Lawrence Erlbaum, Hillsdale, N. J.
suboptimal result. The former may be charcterized as a Friedman, M. (1953). Essays in Positive Economics. Univ.
special limiting case where , given a stable environment and of Chicago Press, Chicago.
sufficient time, learning and adaptation is adequate to Hansen, L. P. and T. J. Sargent (1981). Formulating and
guide the search to the overall maximum. Suboptimaization estimating dynamic rational expectations model. In
occurs when learning or adaptation allows aspirations to R. E. Lucas and T. J. Sargent (Eds.) Rational Expec-
converge too rapidly to the actual objective function value tations and Econometric Practice. Univ. of Minn.
thus terminating search prematurely. Press, Minneapolis, pp.91-126.
Kahnemann, D., P. Slovic, and A. Tversky (Eds.) (1982).
The model is but one operationalization of the key Judgement under Uncerlianty: Heuristics and Biases,
mappings in eqs. (2), (3), and (6). Other operationaliza- Cambridge Univ. Press, Cambridge.
478 K. D. Wall
Kalaba, R. and L. Tesfatsion (1980) . A Least--£quares Pagan, A. (1980) . Some identification and estimation
model specification test for a class of dynamic results for regression models with stochastically
non linear economic models with systematically varying coefficients. J . Econometrics, 13, pp.341-363.
varying parameters. J. Opt. Theory Appl., 32, Radner, R. (975) . A behavioral model of cost reduction.
pp.538-567. Bell J . Econ. 6, Spring, pp. 196- 215.
Kalaba, R. and L. Tesfatsion (1988) . Exact sequential Shoemaker, P. (1980). Experiments on Deci8ions under
filtering, smoothing, and prediction for nonlinear Risk: The Expected Utility Hypothesis, Martinus
systems. Nonlinear Analysis: Theory, Methods, and Niyhoff Publishing, Boston.
Applications, 12, pp.599-615. Simon, H. A. (1955) . A Behavioral Model of Rational
March, J . G. (1988). Variable risk preference and adaptive Choice. Q. J . E., 69, pp 99-118.
aspirations. J. Econ. Behavior Org., 9, pp.5-24. Simon, H. A. (1957). Models of Man. Wiley & Sons, New
March, J . G. and H. A. Simon (1958). Organizations. Wiley York.
& Sons , New York. Winter, S. (1971). Satisficing, selection, and the innovating
Nelson, R. R. and S. Winter (1973). Toward an evolution- remnant. Q.J.E. , 85, pp.237-261.
ary theory of economic capabilities , A. E. R., 63, Zangwill, W. I. (1969). Nonlinear Programming: A Unified
pp .440--4 49. Approach, Print ice-Hall, Englewood Cliffs, N. J.
-" -
~ .rrn~~~~~--~--~--~--~--~-'~~~~~-r~~~~~
0-
Cl
o
o
a:l
o
"'
C')
o