Professional Documents
Culture Documents
1---
Human
Factors
inComputiig
Systems WI ’940 “Celebrating
hwdepemfenw”
238
Boston,
Massachusetts
USA* April24-28,1994 Human
Factors
in Compu[ing
Sys[ems
II%?
information are placed in the more expensive-to-access, but develop the methodology. In the remainder of this paper,
more capacious, desk file drawer, and large quantities of we perform a series of analyses as exercises to establish the
information are in the file cabinet. For simplicity, it is feasibility and utility of this concept. It should be noted
assumed that the average time to access the information that the emphasis is on the development of the
within each of these categories is the same, and we have methodology itself and that while useful information may
ignored the staircase function produced by the fact that the be revealed about the systems we analyze (after all, that is
repositories are of discrete sizes. the purpose!), our analyses do not constitute a complete
evaluation of these systems, since that would require
CURVE B considering what other figures of merit might be relevant as
~c~ well.
IMPROVEMENT)
EMPIRICAL CHARACTERIZATION OF A SYSTEM
In our first analysis, we empirically measure the function of
Fig. 1 for one particular task done by users of the Spiral
Calendar, an application of the Information Visualizer [6]
(HIERARCHICAL developed by Mackinlay and DeLine [5]. A user can access
the schedule for various calendar dates by selecting objects
that represent the appropriate period, then selecting the unit
within that period, and so on. For example, to select June
4, 1982, the user would select the year 1982 within the
Decade 1980-1989 object, this would cause the Year 1982
TIME COST (S)
object to fly up and grow large. Then the user would select
Fig. 1. Cost of Knowledge Characteristic Function June from that Year object. This would cause the June
Month object to fly up. The user would select 4 from the
Now suppose that we were able to invent some device or June month object causing the containing Week object to
procedure that improves information access. The Cost of fly up. Finally, the user would select 4 from the week
Knowledge Characteristic Function should show that object, causing the daily schedule for that day to appear.
improvement by having at least some portion above the There could be fewer selections or more selections
original curve (e.g., Curve B in Fig. 1). Notice that we can depending on the condition of the display from which the
harvest this benefit in two ways, as shown by the arrows, user started.
If we keep the time cost the same, we could access more
documents (arrow a). On the other hand, if we keep the
number of documents the same, we can access them for a
lower cost (arrow b). In this way, the Cost-of-Knowledge
Characteristic Function is intended to help us reason
through more complicated consequences of system
improvements than just thinking that one system is better
than another.
239
ml HumanFactorsinComputingSystems
,,, ,,.
CHI’94 * “Celebra/i~]:
I/l(erdepetrde/ire”
1993). These days were chosen to lie on a logarithmic The set of 11 tasks was randomized into a block of trials.
scale We specifically chose such a large range because we Each user performed 5 of these blocks, each block
want to understand how the interaction scales with size. separately randomized, for a total of 55 trials/user. Users
were allowed to take a break between blocks if they wished.
Users On each trial, the user flipped a page in a notebook asking
The measurement was done on four users, members of the him or her to navigate through the calendar to the day
professional staff of the lab. Users varied in their display of a specific date. The trials were videotaped and the
experience. Two had never used the Information Visualizer, time measured from when the user had turned from the
one was one of the designers of the Spiral Calendar. notebook and was facing the display until the day page was
done displaying.
Procedures
As a warm-up, the user first performed a set of 11 accesses Empirical Results
to dates different from, but similar to in time, those The results of the measurement are in Table 1, column (4).
actually used in the experiment. These served to help the Plotting time to access a date as a function of number of
user assimilate the procedures of the experiment, to learn days back (column 2) gives Fig. 3. As might be expected,
how to operate the Spiral Calendar, and to ask questions. the time required increases with the number of days back.
As a limitation on the prototype, only data from 1993 was This is obviously because there are more steps in the
actuaIIy contained in the database and the calendars said dialogue to reach distant dates. How much additional time
1993, regardless of the simulated year. Users were simply can be understood by analyzing the predominant method
told of this limitation and that this data was being used to users utilized.
simulate a larger range of years.
-+=T%m‘h ;T
2 3 Sep. 4, 1993 11 T
2 In A,, (T ‘7$2 100’2 1A
Methods = Decadedisplsy
GET-YEAR ..0 if neceess
The cost in time for accessing some date can be GOAtJ SELECT-YEAR: (17197
characterized in terms of the major methods available to POINT-TO (171 9))
users. Let us take the most extreme case, accessing the date *Year-display
G ET. MONTH . . . if nmesaary
November 23, 1719. A GOMS model [2] for this GOAL: SELECT-MONTH: (November)
procedure would be
‘O’NT-TO(No~fiRtl-clisplay
GET-DAY . . . if n*ssar-y
CENTURY-METHOD =
GOAL: SELECT-WEEK: [??]
GOAL: DO-TASK POINT-TO [23]
GOAL: GET-DATE a Week-display
TURN-TOrMANUSCRIPTl
GET-DAY . . . if necessary
GET.DATE GOAL: SELECT-DAY: [23]
GOA&E+~~~~S-DAY-CALEN DAR POINT-TO (23))
. . . . if n~sqry
+ DaydispJay
GOAL: SELECT-CENTURY (1700’s)
POINT-TO (Century=1700-1790s))
= Century-display Neglecting the initial part of this method that has to do
GET-YEAR . . . if necessary
with our experimental procedure, the method can be
GOAL: SELECT-DECADE (171 (t’s)
POINT-TO (1710-1719)) summarized in terms of seven cycles of pointing and
240
Boston,
Massachusetts
USA* April24-28,1994 Human
Factors
in Compu[ig
Sys[ems
R
L—–
display+me each for Century, Deeade, Year, Month, Week, accessible (column 10) as a function of the cost in time
and Day. Other methods used in our measurement are the (column 8) in Table 1. This is done in Fig. 5 (Curve A:
same, except that the larger units of time, such as the Spiral Calendar). This graph, the first actual calculation of
century, or the decade, or even the month, are eliminated if this concept we have been able to achieve, shows how, as
the date is close enough, To make the methods easy to talk would be expected in a reasonable system, the small
about, we name them by the largest unit of time selected. amounts of knowledge can be accessed quickly, larger
The method for each task is listed in column (5) of Table 1. amounts of knowledge rquire longer times. For simplicity
we have omitted the stair-case detail of the curve that would
track abrupt representation shifts (e.g., from month to
year). The metric is roughly linear, in semi-log
30
coordinates, indicating that the items accessed increase
T Century exponentially. This linear shape of the curve may be a
natural form for describing accessibility y with cost for well-
designed systems.
o~
1 100 10000 1000000
a o~
o 2 4 6
NUMBER OF SELECT-DISPLAY
DAYS BACK CYCLES
Fig. 3. Time as a function of number of days back.
Model for Access Time Fig. 4. Time as a function of the number of selection-
On Fig. 3, we have circled those data points that are done display cycles.
with the same method and collapsed the data cells to one
data point per method in Table 1, columns (5) and after. At COMPUTATIONAL CHARACTERIZATION OF
this point, we can use our data to fit a simple model in DESIGN IMPROVEMENTS
order to characterize the direct walk time. To a first order of With the Cost-of-Knowledge Characteristic Function
analysis, the time (in seconds) to select a date is just conceptually in hand, we can now use it to help reason
proportional to the number of cycles required (l?ig. 4). By a about and discover variants in the system design. Let us
regression analysis, discuss briefly the effect of some design changes on the
system measured in the last section.
Time to Access= 3.3 + 3.5 * NCycles. (1)
One possibility is to speed up the system response. We
This model allows us to give a smoother characterization notice (from Eq. 1) that the time per picking cycle is on the
than the individual data points and we list the model times order of 3.5 s. This would seem to be relatively long.
in Table 1, column (8). Assuming a mouse point of around 1 s [2] and a response
animation time of 1 s [3] suggests that 2 s/cycle should be
Number of elements accessed possible The discrepancy suggests re-examination of the
Finally, we compute the number of days that it is possible animation algorithms< We now replot (as recalculated from
to reach in less than or equal to a certain amount of time. Eq. 1) the Cost of Knowledge Characteristic Function that
This is done by computing the number of days existing
would have resulted from a faster 2 second user-action cycle
within the different periods serviced by the methods in (Curve Bin Fig. 5). The curve is tilted upwards indicating
Table 1. We list the result in Table 1, column (10). With
an improvement in the system (if it can be achieved
the model, we can make reasonable estimates of what the
computationally and if some other phenomenon does not
data would be for other dates not actually measured (e.g., intervene). Notice that in this case we have plugged the
1,000,000 days distant). results from previous models into our new model to cascade
Cost-of-Knowledge Characteristic Function the speed with which we can think about design variants.
We can now plot the Cost-of-Knowledge Characteristic
Another possibility is to eliminate the Week display (or
Function of Fig. 1 by plotting the number of elements
probably better, to integrate it into the same display as the
241
Human
Factors
in Computig
Systems
10000000
T C: No Week
F-
(n 1000000
0
0
100000
w 10000
A
m
z
w
u)
1000
Ill
0
0
a 100
10
1=
I
~p u ,
1 I I
1 i
I
1
o 20 40 60 80 100 120
COST (S)
Fig. 5. Cost of Knowledge Characteristic Function for several variants of the Spiral Calendar and the CM calendar program.
Day) in order to reduce the number of action-display cycles Knowledge Characteristic Function Fig. 5 grows a bump at
required. If we were to do just this, then the Cost of the bottom (Curve C), because of the larger branching
factor at the Month level. In Curve D, we combine both
Table 2. Computation of Cost of Knowledge Characteristic variants.
Function for Sun CM Calendar.
With the Cost of Knowledge Characteristic Function, we
are led to view variants in terms of their effect on the cost-
structure of access rather than just on a single point.
242
Boston,
Massachusetts
USA* April24-28,1994 Human
Factors
in Computing
Systems
!!!Ei!
These operators include the system response time of this PROBABILITY DENSITY FUNCTION
particular system. They do not count the mental Fimlly, it should be noted that whereas we have determined
preparation time [2]. The time taken by the different the Cost of Knowledge Function, we have not as yet
methods is given in Table 3, which analyzes the weighted the items accessed by their value. There are
predominant method used. (Remember, methods were several such weighings, but one of the most important is
restricted to be direct walk). to weight items by a probability density function describing
their frequency of use.
Table 3. Analysis of methods for CM calendar task.
Anderson and Schooler [1] have shown that for many
ACIION ANALYSIS different kinds of information (e.g., news articles in the
Select Date P New York Times or messages in electronic mail) the
display probability an item D days old will be needed is given by
Month2 Month Select PR.EV button 2P
Pr(neededlll days ago) = A /(A+ De), where
_ ChJ)hJ_
Select Day
where A and C are constants. For the case of electronic
Year Month Select year on m+2P mail, A = .34 and C = 0.83, hence
display VIEW pulldown
menu Pr{neededlD days ago) = 0.34/ (0.34+ D0.83).
Year Select Month
Month Select Day If we multiply this function with the Cost of Knowledge
Year(n) Month Select Year on m+ 3P+ Characteristic Functions in Fig. 5 to obtain a curve
display VIEWpulldown (n-2)B expressing the expscted cost of accessing different numbers
menu of items (see Fig. 6). In the case of the calendars, it
Year Select PREV button expresses the fact that the user is likely to access recent
(n-1) times dates much more frequently. The area under the curves is
Year Select Month related to the total costs of using the two programs and the
Month Select Day curve also shows in what area the costs are concentrated.
Fig. 6 shows that the expected cost for the user is heavily
In this case, we are not trying to predict method times, but contained in the in the most recent hundred days. For this
to analyze them, so, as before we use regression analysis to reason, the Spiral Calendar prototype tested would be more
assign numbers to the operators. The regression gives expensive to use than CM (if only the direct walk feature
were considered). On the other hand, if the task involved
Time = 1.3 + 3.9 m + 1.4 P + 0.36 B . (2) reference to historical dates, then a different probability
density function would be appropriate.
This equation (the equivalent of Eq. 1 for CM) is used to
determine a smoother version of the Cost of Knowledge
Characteristic Function in Table 2, column (5). Finally, 1.8~
the number of days accessible within a given iso-cost 1,6
contour is determined from an inspection of the program 1.4
displays and summarized in Table 2, column (6).
before, we plot number of items accessible (column 6)
against cost (column 5) in Curve E in Fig. 5.
As
0.6
0.4
\
A ‘
\
Spiral Calendar
243
m , Human
,—__—-,
–,
Factors
inComputiig
Systems .
likely consequences for proposed variants of these systems themselves logarithmically, when the user’s shift among
in terms of the metric. Finally, we introduced the next step available access methods is taken into account. This would
in the development of this analysis, taking into account the be interesting to know as well as knowing what are
frequency of access for different items of information, characteristic values for the exponents.
244