Professional Documents
Culture Documents
Cube Analyst
CUBE ANALYST
VERSION 6.1.0
60-010-1
April 24, 2013
Contents
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
What is Cube Analyst? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
Scope of this document. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Whats new? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Common elements and variations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Reading this document . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Conventions used in this document . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Computing resources . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
Cost information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Chapter 2
Estimation System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
Framework for handling different data consistently . . . . . . . . . . . . . . . . . . . . 12
Objectives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Handling data variability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Options for users . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Considerations for users . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Deciding what information to input. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Inputting data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
Estimating the matrix. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
Analyzing the estimated matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Improving the estimated matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
Estimating highway and public transport matrices. . . . . . . . . . . . . . . . . . . . . 20
Overview of Cube Analyst. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
iii
Contents
Chapter 3
Chapter 4
Mathematical Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Mathematical notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Explaining the letters and symbols . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Notation used in the estimation equation . . . . . . . . . . . . . . . . . . . . . . . . . 34
Introduction to the mathematics in Cube Analyst . . . . . . . . . . . . . . . . . . . . . . 35
Main mathematical features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Estimation equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
Model parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
Maximum likelihood objective function . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Describing the variation in data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Optimizer: Finding the minimum value . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
Mathematical summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Maximum likelihood method: Background theory . . . . . . . . . . . . . . . . . 48
Application of maximum likelihood to Cube Analyst. . . . . . . . . . . . . . . 49
Cube Analyst objective function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
Cube Analyst trip estimation model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Estimating model parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Optimization procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
Parameter errors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
Cell reliability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
Extensions to the calculations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Chapter 5
iv
Contents
Routings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
Highways . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
Public transport . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
Setting confidence levels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
Characteristics of the data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
Deciding on confidence values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Tuning estimation performance. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
Control of routing information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
Analyzing the results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
Chapter 6
Estimation Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
Study area. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
Estimating the matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
Evaluation: Sensitivity analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
Including part-trip data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Chapter 7
Hierarchic Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
Introduction to hierarchic estimation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
Approaches to estimating very large matrices . . . . . . . . . . . . . . . . . . . . . 94
Different levels of detail: Districts and zones. . . . . . . . . . . . . . . . . . . . . . . 94
Different approaches to hierarchic estimation . . . . . . . . . . . . . . . . . . . . . 95
Alternative approaches to hierarchic estimation . . . . . . . . . . . . . . . . . . . . . . . 96
Estimation with mixed district and zonal detail . . . . . . . . . . . . . . . . . . . . 96
Local matrices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Summary of the hierarchic estimation process . . . . . . . . . . . . . . . . . . . . 99
Defining districts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Running Cube Analyst for hierarchic estimation . . . . . . . . . . . . . . . . . . . . . . 106
Parameter ZCONF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Chapter 8
Chapter 9
Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Summary of Reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
Sample reports . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
Average confidence level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
Final five iterations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Contents
Chapter 10
Files . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
Chapter 11
Chapter 12
Chapter 13
Chapter 14
Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
Estimation with prior trip and count data only . . . . . . . . . . . . . . . . . . . . . . . . 162
vi
Contents
Estimation with prior trip, count, and trip end data . . . . . . . . . . . . . . . . . . . 163
Estimation with warm start and cost data . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
Estimation with highways part trip data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165
Estimation with public transport part-trip data . . . . . . . . . . . . . . . . . . . . . . . 166
Hierarchic estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
Example of screenline volumes report . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
Contents
Chapter 1, Introduction
Chapter 9, Reports
ix
Introduction
Whats new?
Background
Computing resources
Cost information
Introduction
What is Cube Analyst?
Introduction
Scope of this document
Introduction
Whats new?
Whats new?
Cube Analyst can now estimate Cube Voyager Public Transport
matrices by using an intercept file output by the Cube Voyager PT
program.
Introduction
Background
Background
Cube Analyst enables transport planners to estimate origindestination (O-D) trip matrices and to maintain the currency of
existing O-D matrices, while minimizing survey costs.
As is described in Introduction to the mathematics in Cube
Analyst on page 35, Cube Analyst is suitable for estimating present
day matrices, but not for forecasting future year trip matrices.
The software contains a number of novel and distinctive features. It
was first developed as a collaborative venture with the Dutch
Ministry of Transport, the Rijkswaterstaat. Subsequently, studies
and developments undertaken for Centro (the Passenger Transport
Executive for the West Midlands area of England) led to a
broadening of the softwares capabilities to consider public
transport passenger matrices, as well as highway (vehicle) matrices,
and to estimate detailed matrices for very large study areas.
Introduction
Common elements and variations
Introduction
Reading this document
Introduction
Conventions used in this document
Technical term introduced for the first time, in upper and lower
case italics.
For example: Hessian
Introduction
Computing resources
Computing resources
Cube Analyst is a major system. The programs ensure that the
mechanics of operation for the user are straightforward, but it
requires familiarity with a number of programs, especially for data
preparation and analysis of results, and this should be taken into
account when planning to use it for the first time.
Cube Analyst is designed about a number of rigorous principles,
including the calibration of the mathematical estimation model
which the program undertakes. One consequence is that it is
computationally intensive; the differing sets of data are considered
simultaneously and this requires the availability of relatively large
amounts of random access memory (RAM), memory, and of disk
space.
Introduction
Cost information
Cost information
For highways, cost data is produced by Citilabs products.
For public transport in TRIPS, cost data is produced by MVPUBM.
10
Estimation System
Objectives
Estimation System
Framework for handling different data consistently
12
The system can work with little data, but the accuracy of the
estimated matrix is improved as more data is input.
Estimation System
Objectives
Objectives
The aim of Cube Analyst is to maximize the value of existing data
and to limit the need for costly surveys. As such, it is mainly
concerned with processing information in the best (statistical)
manner; though the accuracy of the estimated matrix remains
strongly affected by the amount and the quality of the information
input by the user.
Beside the role of estimating matrices for individual studies, Cube
Analyst is suited for use with regular surveys designed to keep
matrix information up-to-date.
Estimation System
Handling data variability
14
Estimation System
Options for users
Estimation System
Considerations for users
16
Inputting data
Estimation System
Considerations for users
Example
Data
Car ownership
Traffic growth
Counts
Land use
New bypass
Changes in:
Road/public
transport network
Traffic management
New bus/rail services
Travel habits
Out-of-town shopping
Estimation System
Considerations for users
Inputting data
Information may be input in the form of matrices, as trip ends, or as
network-related information. This data is prepared by the user
within Cube, which offers a variety of modes of data entry. Extra
information is required on data variability. This is input in the same
form as the information to which it corresponds. Each data item, for
example each count, trip end, etc., may have an individual
confidence level attached to it, but in many cases global values will
be used.
18
Estimation System
Considerations for users
Estimation System
Estimating highway and public transport matrices
20
Estimation System
Overview of Cube Analyst
For the most part, Cube Analyst simply reads the set of users
input data at this initial stage. However Cube Analyst also
analyzes and restructures routing information (from the TRIPS
route choice probability (RCP) file or Cube Voyager path file),
and count data, from the screenline file, into a more concise
and efficient file, called the intercept file. This restructuring can
be relatively lengthy so, as noted in Considerations for users
on page 16, it is possible to re-use an Intercept file once it has
been created. For Cube Voyager users, the creation of the
Intercept file is handled by the HIGHWAY program.
2. Calculation initiation
Estimation System
Overview of Cube Analyst
The user should note that these options for tuning the
performance of Cube Analyst exist, but should not necessarily
be concerned to apply them, as the default operation is usually
entirely satisfactory. It requires some experience with a
particular estimation problem to determine its best strategy.
3. Function evaluation
22
Estimation System
Overview of Cube Analyst
Estimation System
Overview of Cube Analyst
24
Types of data
Sets of data
Types of data
Cube Analyst can operate using some or all of the following types
of data:
Link counts
Turning counts
Trip ends
Routing information
Part-trip data
Link counts
For highways, this information may be surveyed with considerable
accuracy and exploit automatic counters, but it may not show the
current demand for travel (which the O-D matrix should represent)
if congestion has constricted flows.
For public transport, this data is often obtained from estimates of
passenger numbers in buses and rail carriages, and is of inherently
limited accuracy (but may still be usefully exploited by Cube
Analyst).
For both modes, it should be observed that matrices normally
apply to average situations for which individual counts will match
to only some extent.
26
Turning counts
The same comments as for link counts apply. Note that turning
counts may only be applied when inputting a Cube Voyager path
file. They are not supported for an estimation using a TRIPS RCP file.
Trip ends
The total number of trips generated from and attracted to zones
(G&A) may be obtained either from surveys or from mathematical
land-use type models. Surveys are appropriate when zone
boundaries are such that traffic may be counted entering and
leaving zones on distinct trips, rather than merely passing through
the zone. This tends to occur only for some zones, for example a car
park or an industrial estate, but these are often important zones for
a study.
It is possible to use data derived from both methods, for example, a
few zones surveyed and the remainder derived from a model, with
the resulting trip ends distinguished through differing confidence
levels.
Routing information
It is possible to survey routing data, though this is rarely done. The
modelling of routing is often not a very good replication of actual
(erratic) driver or passenger routing, and it is often not possible to
place much reliability on this otherwise important data. Cube
Analyst is therefore designed to use routing information, as far as
possible, only where the precise routing does not matter. Thus, for
skim cost information small variations in routes may be ignored,
while count information is used in bottleneck situations where
the number of routes is limited to a few alternative links (ideally
one).
28
Part-trip data
This data is surveyed in the form of matrices where the recorded
origin and destination are not necessarily the ultimate origin and
destination of the trip. This is illustrated in the figure Definition of
part-trip data that shows the recorded part of trip (S - E) relative to
the total trip (O - D). It is possible for one or both of points S and E
to coincide with the corresponding points O and D. For highways,
this data is typically obtained from licence plate matching surveys,
and from on-board surveys recording passenger boarding and
alighting points for public transport.
Sets of data
Cube Analyst estimates one matrix at a time, and the data should
form a set related to this particular matrix, that is, the data should
correspond to the same time period (hour(s) of day, day of week,
time of year) as the matrix. It should also correspond to the same
units of flow (vehicles, pcus, passengers, etc.). Sometimes the user
will have to transform data (for example, by factoring) to achieve
this, and this will usually imply a reduction (small or large) in
confidence levels for the transformed data.
Also, only one set of information may be input into Cube Analyst for
an estimation. Hence, if multiple sets exist, say, several traffic counts
for the same link, then the user must derive a single set. This may
simply be to choose the most recently surveyed set, or it might be a
weighted average of all available sets. Multiple sets of data usually
allow confidence levels to be increased relative to single sets of
data.
30
Mathematical Background
Mathematical notation
Mathematical summary
Mathematical Background
Mathematical notation
Mathematical notation
This section discusses mathematical notation. Topics include:
32
Symbol
Description
alpha
beta
eta
theta
lambda
xi (upper case)
xi (lower case)
Mathematical Background
Mathematical notation
Symbol
Description
exponent
Mathematical Background
Mathematical notation
Description
Origin zone
Destination zone
Link count
Screenline count (from count sites
.....)
=
Model parameters
H observed
h estimated
Description
Number of trips from i to j
Number of trips from origin i
Number of trips to destination j
Number of trips through link k
34
Mathematical Background
Introduction to the mathematics in Cube Analyst
Estimation equation
Model parameters
Mathematical Background
Introduction to the mathematics in Cube Analyst
with the users input data. The outputs are calculated from an
estimation equation, which must be provided. These points are
further explained below.
Given the range of possible input data, the full mathematical
expression of Cube Analyst is complex, but it involves some
principal components which we use to describe the essential
features of Cube Analyst. Mathematical summary on page 48
explains the standard Cube Analyst calculations by summarizing
the main mathematical steps. Extensions to the calculations on
page 57 shows how additional features are accommodated in the
calculations. This section continues with explaining Cube Analysts
mathematics in largely descriptive terms, while introducing the
main equations. Throughout this section, the mathematical
notation is defined Mathematical notation on page 32, where it is
not otherwise clear from the text.
Estimation equation
The heart of the estimation is an equation (estimation model)
whose output, , corresponds to the values of the cells of Cube
Analysts output matrix for trips between zones and . The form
of this mathematical estimation model in Cube Analyst is:
.....(1)
This equation contains the following elements:
its output,
36
Mathematical Background
Introduction to the mathematics in Cube Analyst
and
Mathematical Background
Introduction to the mathematics in Cube Analyst
Model parameters
For Cube Analyst, therefore, the estimated matrix is entirely
dependent on the values given to the model parameters. Cube
Analyst is thus, in effect, solely concerned to establish the most
appropriate values for these model parameters. (Cube Analysts
calculations are in parameter space, which accounts for some of
the behavior that may be observed in Cube Analysts output to the
screen and log file while it is computing, where the values of the
matrix may change in an apparently erratic manner.) Cube Analysts
calculations are mainly in the nature of a search for the best
model parameter values. Apart from the estimation equation itself,
the main features of the Cube Analyst calculations are:
We now describe the general issues for Cube Analyst when setting
model parameter values.
Unless the user supplies an input model parameter file (created
either by an earlier run of Cube Analyst), the model parameters are
automatically initialized to 1.0. From equation (1), it may be seen
that the initial estimate is identical to the prior matrix (or based on
the cost matrix, equation (2), if no prior matrix value exists).
It is possible to compare the estimated matrix with all of the items
of the users input data. For example, the sum of rows and columns
of the estimated matrix may be compared with input trip ends
(Mathematical summary on page 48 shows this in mathematical
terms for all data items). If the result of this comparison indicates
38
Mathematical Background
Introduction to the mathematics in Cube Analyst
Mathematical Background
Introduction to the mathematics in Cube Analyst
40
Mathematical Background
Introduction to the mathematics in Cube Analyst
Mathematical Background
Introduction to the mathematics in Cube Analyst
The assumption is made, therefore, that all input data for Cube
Analyst is subject to variation which may be described by the
Poisson probability distribution function (pdf ).
42
Mathematical Background
Introduction to the mathematics in Cube Analyst
Mathematical Background
Introduction to the mathematics in Cube Analyst
44
Mathematical Background
Introduction to the mathematics in Cube Analyst
Mathematical Background
Introduction to the mathematics in Cube Analyst
46
Mathematical Background
Introduction to the mathematics in Cube Analyst
3. Quasi-Newton method
4. Method of scoring.
Mathematical Background
Mathematical summary
Mathematical summary
This section presents a further explanation of Cube Analysts
calculations, as given in Introduction to the mathematics in Cube
Analyst on page 35.
Topics include:
Optimization procedure
Parameter errors
Cell reliability
48
Mathematical Background
Mathematical summary
= random variable
= observation
= parameter (or function of a parameter)
The likelihood function is then defined to be:
.....(5)
where:
that is,
is a set of
observations
that maximizes
=above)
Mathematical Background
Mathematical summary
.....(7)
Taking logarithms of equation (7) leads to:
.....(8)
It may be noted that
= constant
50
Mathematical Background
Mathematical summary
Description
Nij
Oi
Dj
QK
Number of trips
through screenline K
where: RijK is the proportion of trips in matrix cell (i, j) using screenline K
Comment
Screenline counts
Trip origins
Mathematical Background
Mathematical summary
Objective function, M =
Comment
Trip destinations
Prior matrix
Cost matrix derived
.... (12)
where
indicates summation over cells which are zero in the
prior matrix, but not the cost matrix.
52
Mathematical Background
Mathematical summary
Parameters.
2. Use the table Observed data, H, and estimated equivalents, h
on page 51 to calculate
data.
3. Calculate
leads to
.....(16)
where
.....(17)
and
.....(18)
Note:
are constants
Mathematical Background
Mathematical summary
is undefined if
or
.....(13)
where
= constant
or
let
Then differentiating (13) gives:
(for each )
.....(19)
(for each )
.....(20)
(for
( )
) .....(21)
.....(22)
.....(23)
Finally, we substitute (19) to (23) into (16) for each value of , and
use an optimization procedure to choose parameter values that
give values of that minimize the objective function (9).
54
Mathematical Background
Mathematical summary
Optimization procedure
Given an initial guess
Cube Analyst computes the maximum
likelihood estimates by generating a sequence of estimates
from
where
is a suitable step length, and
vector given by
denotes a search
is given by
.....(24)
From equation (16) we can write
.....(25)
This leads to
.....(26)
Mathematical Background
Mathematical summary
and
When
is calculated by the quasi-Newton method (as previously
described in Introduction to the mathematics in Cube Analyst on
page 35), the Hessian matrix updates the expected value,
,
using the BFGS update formula.
Parameter errors
The optimization produces an estimate
the variance of
parameter , and an estimate of the parameter value itself,
Therefore,
Standard Error =
.....(27)
Cell reliability
The sensitivity of the estimate of
, is defined to be
.....(28)
where is the objective function,
second differentials.
56
Mathematical Background
Extensions to the calculations
Mathematical Background
Extensions to the calculations
h
FZTZ =
Tij
PZTZ
FZTR =
Ti1
PZTR
FRTZ =
T1j
PRTZ
58
Overview
Matrices
Trip ends
Screenlines
Routings
Overview
There are a series of data preparation tasks which are discussed in
the following sections. Most of the tasks only require data files to
be created in a relatively mechanistic manner, but two of the tasks
require the user to make considered choices. These are discussed in
Screenlines on page 65 and in Setting confidence levels on
page 70.
The final sections in this chapter explain the estimation stage in
terms of tasks facing the user. As Cube Analyst usually requires
minimal input from the user, apart from the supply of prepared
data files, the estimation stage is very straightforward. However,
advice is given on possible ways of improving the speed of
estimation. This may be achieved through:
The final set of activities for the user are to analyze the results to
assess the quality of the estimation, partly to determine if and how
they might need to be improved. This topic is discussed in
Analyzing the results on page 75.
The ideas introduced in this chapter are subsequently illustrated in
later chapters with an example application of Cube Analyst, based
on an actual study. Further details on points covered in this section
are provided in the standardized estimation procedures.
60
Matrices
Cube is used to set or modify individual cells or ranges of cells. This
also permits confidence levels to be easily set to global or
individual values. For example, you can use a prior matrix (Table
101) to give information about basic trip patterns.
Prior matrix (Table 101)
|
| 20 |
| TABLE = 101
(Prior
)
| 20 |
|
1
2
3
4
5
6
7
8
9
10 | 20 |
|
------------------------------------------------+ 20 |
| 1:
1
1
0
5
45 126
50
21
30
55 | 20 |
| 2:
1
5
0
70 125
36
38
50
58
14 | 20 |
| 3:
1
1
0
2 108 119
90
69 148
44 | 20 |
| 4:
69
3
0
1
6
7
6
3
25
3 +----+
| 5:
100
1
0 192
71
20
12
11
14
7 |
| 6:
36
2
0
88
52
6
3
7
16
13 |
| 7:
62
3
0
32
36
58
9
63
9
61 |
| 8:
0
1
0
64
65
30 119
19 121
64 |
| 9:
0
7
0
57 123
70 178 279
7
38 |
| 10:
0
10
0
7
31
3
1
10
21
3 |
| 11:
0
13
0
19
35
4
96 170
28
29 |
| 12:
0
5
0
41 286
52 103 117
29
56 |
| 13:
0
9
0
24
99
50
90
91
23
12 |
| 14:
4
3
14
20
56
19
67
58
21
7 |
| 15:
28
2
36
1 185
1
1
2
15
1 |
+----------------------------------------------------------+
62
Trip ends
Trip ends may be determined either by reference to an existing
matrix, surveys (for example, of parking), or they may be calculated
from equations.
64
Screenlines
Screenlines are used to minimize the effects of assignment errors.
Screenlines are defined as the set of count sites which intercept
traffic/passenger flows between sets of zones which share the
same general corridors of movement (across which the screenlines
are suitably located).
The extent of a screenline is determined by the number of
alternative (reasonable) paths which are available. In many public
transport networks where services are sparse, or in rural highway
networks, there may only be a single reasonable route between
one general area and another. In this case, screenlines may
correspond to single links (although they are still treated as
screenlines in this context of Cube Analyst). In general, however, a
screenline will represent a set of links.
In the case of highways, a useful type of screenline is provided by a
river or a railway line, that has only a few crossing points. In this
case all traffic must be routed through known points, and so
assignment error associated with the screenline will be minimized.
For Cube Analyst, there is no difference between a group of traffic
counts on separate links (that form a logical screenline) and a single
link count amalgamating the flows on separate traffic lanes.
There will normally be few, if any, screenlines that entirely bisect a
study area and so intercept all trips either side of it. Cube Analyst
therefore employs the concept of partial screen lines. They are
partial in the sense that they do not extend between the
boundaries of a study area, but they intercept all trips between, at
least, certain defined pairs of zones.
The method for defining such partial screenlines is manual, and
based partly on judgement and the availability of count data sites.
The routing information, together with user-defined screenlines, is
used to define the set of O-D pairs whose routes they intercept. The
aim is to group count sites into screenlines that balance the
objectives to:
1. Maximize the number of O-D pairs that have all routes passing
through a screenline.
2. Minimize the number of O-D pairs per screenline, as this
66
Function
Northern
Western
Eastern
Central Area
Routings
Matrix estimation requires information about which routes are
used to connect each pair of origin and destination zones, and the
probability that each route is used. Ideally this would come from
survey information, but this is onerous and not very practical, so
the method uses modeling instead. This routing information is one
of the outputs from the assignment process. For TRIPS users it is
stored in the route choice probability (RCP) file. For Cube Voyager
Highway users, it is stored in the Cube Voyager path file. For Cube
Voyager Public Transport users, it is stored in route files.
This section discusses two types of routings:
Highways
Public transport
Highways
The main requirement for Cube Analyst is for the routings to reflect
all reasonable alternative paths whilst avoiding spreading out too
much so that they become unrealistic.
For Cube Voyager users, the paths reflected in the Intercept file
derive from combining the all-or-nothing paths from each
assignment iteration into one set. This can be done directly in the
HIGHWAY program. Alternatively, HIGHWAY can be used to
generate a path file, and the appropriate path sets and volumes
selected from it for use in Cube Analyst.
TRIPS users could use a similar approach, or apply one of the
stochastic methods. When considering networks where congestion
is a factor, the assignment itself relies on the trip matrix that the
estimation is trying to provide. Hence it may be preferable to apply
routes derived using methods that can calculate multiple routes
between zones based on stochastic (statistical) methods, rather
than to rely on the paths from a capacity-restrained assignment.
TRIPS supports two such methods, known after their originators as
Burrell and Dial. Both methods can be used successfully with Cube
68
Public transport
Cube Voyager PT outputs to route files by user class. Many controls
affect the routing, but a factors file provides a means to determine
the extent of multirouting. TRIPS automatically produces
multiroute paths and can also store them in a RCP File. The
determination of which links are used to connect pairs of origindestination zones is a function of a path building algorithm which
generates a set of reasonable paths. These are based on
considerations of generalized cost, which reflect users data about
transit times, fares, boarding and transfer penalties, and so on. A
submode split model can be used to reflect passenger biases when
deciding if different modes (bus, metro, rail, etc.) are candidates for
inclusion into the set of reasonable paths.
70
72
Even when trip ends have been determined simply from the
row and column totals of the prior matrix, the aggregation of
the data means that the trip end confidences will be higher
than the corresponding individual cell confidences. For this
reason, trip end data should always be used when a prior
matrix is input.
74
Iteration number
Step size
Part-trip data
76
78
Estimation Process
Study area
Data
Estimation Process
Study area
Study area
This section discusses a highways based application of Cube
Analyst for an 82-zone study area for the town of Guildford in
Surrey, UK, (pop. 100,000). The network shown has a major bypass
for the town, which is shown as a thicker line. Zone centroid
connectors are shown as pale blue lines. Eleven zones were
designated cordon-crossing zones at the study area boundary.
80
Estimation Process
Data
Data
The network was well provided by current traffic counts and these
were all given a confidence level of 80, which served as a
benchmark for other data confidences. Most of the trip end data
was synthesized, by disaggregation of UK Department of Transport
data with reference to zonal population and employment figures,
and was given confidence levels of 40. Higher confidence values of
80 were set for external trip ends, determined from a cordon
crossing survey, and to a set of five zones in the town center area
that were the subject of a car park survey. An out-of-date trip
matrix existed, which served as the prior matrix, and which was
given a uniformly low confidence for each cell of 5. Sixteen
screenlines were defined, which are shown in the following figure.
Estimation Process
Data
82
Estimation Process
Estimating the matrix
Estimation Process
Estimating the matrix
ZONE
1
2
3
4
5
6
7
8
9
10
Some
30
31
32
33
34
35
36
37
38
39
40
Some
70
71
72
73
74
75
76
77
78
79
80
81
82
84
GENERATIONS
ATTRACTIONS
NO
OBS.
EST. OBS-EST
%
OBS.
EST. OBS-EST
%
4869.0
4324.3
544.7
11.2%
3657.0
3591.6
65.4
1.8
3825.0
3745.0
80.0
2.1%
2984.0
3571.1 -587.1 -19.7
1798.0
2559.5 -761.5 -42.4%
5715.0
5710.1
4.9
0.1
419.0
383.2
35.8
8.5%
558.0
528.3
29.7
5.3
1256.0
1572.5 -316.5 -25.2%
2018.0
2156.1 -138.1 -6.8
2045.0
1731.1
313.9
15.4%
2084.0
1998.6
85.4
4.1
1935.0
1815.4
119.6
6.2%
2112.0
2194.3
-82.3 -3.9
1794.0
1894.8 -100.8
-5.6%
2673.0
2815.2 -142.2 -5.3
3662.0
3364.9
297.1
8.1%
4763.0
4247.7
515.3 10.8
430.0
388.9
41.1
9.5%
273.0
307.1
-34.1 -12.5
missing....
3870.0
3176.5
693.5
17.9%
2370.0
2375.0
-5.0 -0.2
2778.0
2618.2
159.8
5.8%
1304.0
1616.1 -312.1 -23.9
5450.0
4633.8
816.2
15.0%
3257.0
3175.7
81.3
2.5
2943.0
2741.1
201.9
6.9%
3006.0
2807.4
198.6
6.6
736.0
806.5
-70.5
-9.6%
1151.0
1107.2
43.8
3.8
368.0
785.9 -417.9 -113.5%
930.0
909.7
20.3
2.2
4042.0
4062.2
-20.2
-0.5%
1523.0
1570.5
-47.5 -3.1
1821.0
1964.4 -143.4
-7.9%
2026.0
2083.9
-57.9 -2.9
4719.0
4763.3
-44.3
-0.9%
2683.0
2326.7
356.3 13.3
3116.0
3440.8 -324.8 -10.4%
6410.0
6234.9
175.1
2.7
3030.0
3369.2 -339.2 -11.2%
5227.0
6016.3 -789.3 -15.1
missing....
1829.0
1639.9
189.1
10.3%
1251.0
1214.6
36.4
2.9
1089.0
1160.4
-71.4
-6.6%
1298.0
1364.2
-66.2 -5.1
4396.0
4122.0
274.0
6.2%
4226.0
3952.8
273.2
6.5
10600.0 11231.3 -631.3
-6.0% 11100.0 11146.4
-46.4 -0.4
6950.0
6931.0
19.0
0.3%
5806.0
5720.2
85.8
1.5
9200.0
9605.6 -405.6
-4.4%
9200.0
9384.8 -184.8 -2.0
14423.0 15045.9 -622.9
-4.3% 14313.0 14109.6
203.4
1.4
1008.0
824.4
183.6
18.2%
722.0
655.4
66.6
9.2
2270.0
2217.8
52.2
2.3%
2270.0
2236.2
33.8
1.5
5665.0
5396.6
268.4
4.7%
5665.0
5465.7
199.3
3.5
26660.0 26727.9
-67.9
-0.3% 28912.0 27872.0 1040.0
3.6
5310.0
5258.3
51.7
1.0%
5990.0
5940.6
49.4
0.8
6033.0
6390.8 -357.8
-5.9%
6085.0
6601.1 -516.1 -8.5
Estimation Process
Estimating the matrix
Estimation Process
Evaluation: Sensitivity analysis
Flow % differences
Screenline
(i)
(ii)
(i)
(ii)
Before (80)
1713
291
6.1
1.1
After (200)
1094
219
3.9
0.9
86
Estimation Process
Including part-trip data
Prior matrix
Trip ends
Link counts
Estimation Process
Including part-trip data
Part-trip data
The figure Part-trip data and link counts shows the two sets of link
flow information which were used. Link counts, shown as open
bandwidths, and part trip, as shown previously in the figure Parttrip data shown as link flows, using bandwidths on page 87. It may
be noted that some links had both link count data and part trip
data. In this application, the confidence levels for link counts were
set higher, at 80 or more, than those for part-trip data, which were
set at 60 in recognition of the sampling process inherent in license
plate surveys.
88
Estimation Process
Including part-trip data
Maximum
5.0
95.0
47.8
47.8
60.0
Minimum
5.0
200.0
80.0
80.0
60.0
Number of
5.0
80.0
40.0
40.0
60.0
6642
16
82
82
226
Stepsize
(Tolerance=0.00010)
0.0003559
0.0001208
0.0001890
0.0001123
0.0000580
Objective
Value
-8859208.83
-8859208.83
-8859208.83
-8859208.83
-8859208.83
Matrix
Total
229655.8
229656.0
229655.9
229655.9
229655.9
38 iterations because:
Estimation Process
Including part-trip data
90
Estimation Process
Including part-trip data
Estimation Process
Including part-trip data
92
Hierarchic Estimation
Defining districts
Hierarchic Estimation
Introduction to hierarchic estimation
94
Hierarchic Estimation
Introduction to hierarchic estimation
The total number of trips in the zonal and district matrices is the
same.
Hierarchic Estimation
Alternative approaches to hierarchic estimation
Local matrices
96
Hierarchic Estimation
Alternative approaches to hierarchic estimation
Hierarchic Estimation
Alternative approaches to hierarchic estimation
Local matrices
When using hierarchic estimation, Cube Analyst first estimates a
district matrix, which is used to influence the calculation of a set of
local matrices. These local matrices contain a mixture of zonal detail
and district-based information. The estimated zonal detail is
captured automatically by Cube Analyst and, as each local matrix is
estimated, is used to develop progressively an update of the entire
matrix at the zonal level of detail. The district matrix simply
represents the zonal matrix aggregated into a district matrix,
although the district matrix may be non-square, that is, there may
be a different number of origin and destination districts. Further
information about districts is given later in this section.
Consider a local matrix that is an extension of the combined district
and zonal matrix shown and discussed in Estimation with mixed
district and zonal detail on page 96.
98
Hierarchic Estimation
Alternative approaches to hierarchic estimation
A local matrix is defined for each origin and destination district pair
(the unshaded part in the figure represents one such pair), and the
fully estimated (zonal) matrix is produced when all local matrices
have been estimated.
Information involving trips from the RoW is obtained from the
district matrix. This element, and the fact that the total number of
trips is the same (in principle) for each local matrix, ensures that
consistency is maintained across the entire study area, even though
detail is calculated separately in estimations for different parts.
Hierarchic Estimation
Alternative approaches to hierarchic estimation
The following figure shows a study area divided into many (small)
zones (denoted by ij). These are grouped into a number of fewer
(and larger) districts (denoted by IJ). Subsequent topics in this
chapter give more information about creating districts.
100
Hierarchic Estimation
Alternative approaches to hierarchic estimation
network but does not aggregate the screenline count data. This
treatment of data is reflected in Cube Analysts reports on the
district matrix (see Figure 7.12b).
Cube Analyst can estimate all Local matrices in one run, but the
user may exercise considerable control over this process.
This example relates to a single Local matrix, but this stage is
repeated for all Local matrices. The example considers the same
structural elements introduced in the discussion on Zonal
estimation controlled by district matrix on page 98. The
information used to estimate Zonal cells, referenced as Mij,
includes:
Prior matrix and trip ends are used at zonal level in the
estimation
Hierarchic Estimation
Alternative approaches to hierarchic estimation
102
Hierarchic Estimation
Alternative approaches to hierarchic estimation
Hierarchic Estimation
Defining districts
Defining districts
Hierarchic estimation is a heuristic method which approximates the
formal mathematical methodology provided by a standard run of
Cube Analyst. It is most appropriate when the study area is large
enough to encompass sub-areas which can become districts where
the travel patterns are reasonably independent of one another.
The purpose of the estimated district matrix is, largely, to consider
the inter-district movements, while the focus of local matrices is the
intra-district movements. Because precision (greater detail) is
associated with the latter, it is desirable to minimize the amount of
inter-district movements.
The number of local matrices is approximately the square of the
number of districts. It therefore can make a considerable difference
to computational times whether, say, 10 districts are chosen (about
100 local matrix estimations) or 8 districts (about 64 local matrix
estimations).
Not all study area zones may be allocated to districts in this way,
either because some or all trips from or to a zone do not pass
through a screenline, or because allocation of the zone to a district
would violate the maximum number of zones per district. Zones
are then allocated to the adjacent district, based on the coordinates
associated with zone centroids. The effect of allocating zones to
district which is not based on routing behavior is potentially to
worsen the effects of the approximation implicit in hierarchic
estimation. In many cases, this worsening may be negligible in
practice, but will be more significant if those zones involve
relatively large numbers of trips, or if a significant proportion of
zones are involved. It is this latter consideration which makes it
inadvisable to use hierarchic estimation on study areas with less
than 500 zones.
The considerations involved in defining districts may be
summarized as:
104
Hierarchic Estimation
Defining districts
Hierarchic Estimation
Running Cube Analyst for hierarchic estimation
106
Hierarchic Estimation
Running Cube Analyst for hierarchic estimation
Parameter ZCONF
The extent of the constraining effect of the district matrix on the
local matrices is determined by Cube Analyst parameter ZCONF,
which acts as a confidence level, treating the district matrix as
observed data and the local matrix as estimated. For the local
matrix estimation, therefore, the district matrix is just another item
of observed data and ZCONF should be set in relation to
confidence levels for other items of observed data.
From the users point of view, the setting of ZCONF should be a
reflection of the degree and importance of the interaction between
districts, in terms of trips which cross more than one origin or
destination district boundary. (An effect of the automatic
generation of districts is to minimize such boundary crossings.) The
district matrix contains information about these interactions; if they
are important then the district matrix should be made
correspondingly significant with a relatively high setting of ZCONF.
A low value of ZCONF allows local matrices to reflect local data
more precisely, at the expense of the larger picture across the
entire study area. A possible symptom of an inappropriate setting
of ZCONF might be an unwarranted distortion of the distribution of
trip costs/lengths in the estimated matrix.
Hierarchic Estimation
Running Cube Analyst for hierarchic estimation
108
This chapter discusses the process for using Cube Analyst. Topics
include:
Outputs: overview
Estimation process
110
Outputs: overview
The outputs from Cube Analyst are:
112
Estimation process
The only program directly involved in the estimation process itself
is Cube Analyst, although other Cube programs play an important
part in the pre- and post processing of the data.
The data used may be some or all of the data described earlier in
Input data: overview on page 110.
Cube Analyst may also use model parameters, gradient search, and
intercept files from a previous run of Cube Analyst for the current
estimation to warm start the calculations.
Internally Cube Analyst can be considered to be made up of two
main parts each of which is executed alternately, namely:
Estimation model
Optimization step
114
Reports
This chapter discusses reports you can prepare with Cube Analyst.
Topics include:
Summary of Reports
Sample reports
Reports
Summary of Reports
Summary of Reports
The Analyst reports described in this chapter are saved in a print
(*.prn) file during program execution. The reports include:
Memory requirements.
116
0 = Normal Termination
4 = Warning. Review the print file and find the (W) tag for
information on the warning(s).
Reports
Summary of Reports
Reports
Sample reports
Sample reports
This section contains examples of reports:
Zone attractions
District matrix
Local matrix
118
Reports
Sample reports
Stepsize
(Tolerance= 0.0001)
0.0004152
0.0005055
0.0003342
0.0002368
0.0000781
Objective
Value
-4735264.48
-4735264.48
-4735264.49
-4735264.49
-4735264.49
Matrix
Total
239547.4
239548.6
239550.3
239551.0
239551.0
40.0
40.0
40.0
40.0
40.0
40.0
40.0
40.0
40.0
40.0
80.0
4869.0
3825.0
1798.0
419.0
1256.0
2045.0
1935.0
1794.0
3662.0
430.0
9200.0
4387.9
3763.3
2562.3
386.7
1574.9
1743.0
1827.1
1904.4
3288.3
381.3
9347.4
-481.1
-61.7
764.3
-32.3
318.9
-302.0
-107.9
110.4
-373.7
-48.7
147.4
-9.9%
-1.6%
42.5%
-7.7%
25.4%
-14.8%
-5.6%
6.2%
-10.2%
-11.3%
1.6%
Reports
Sample reports
Zone attractions
ZONE NO CONFIDENCE
1
40.0
2
40.0
3
40.0
4
40.0
5
40.0
6
40.0
7
40.0
8
80.0
9
40.0
10
40.0
11
80.0
<continued>
ATTRACTIONS
OBSERVED ESTIMATED
3657.0
3586.4
2984.0
3500.3
5715.0
5556.7
558.0
518.4
2018.0
2162.9
2084.0
1948.0
2112.0
2129.7
976.0
1030.0
2673.0
2804.5
0.0
0.0
5665.0
5549.3
ESTM-OBSV
-70.6
516.3
-158.3
-39.6
144.9
-136.0
17.7
54.0
131.5
0.0
-115.7
(ESTM-OBSV)/OBSV(%)
-1.9%
17.3%
-2.8%
-7.1%
7.2%
-6.5%
0.8%
5.5%
4.9%
n/a%
-2.0%
The trip end summaries can also be produced with the zone labels. Short
zone labels are printed if NODLAB=T, LNGLAB=F:
ATTRACTIONS
ZONE NO,NAME CONFIDENCE OBSERVED ESTIMATED ESTM-OBSV (ESTM-OBSV)/OBSV(%)
1 <Beaumont>
40.0
3777.0
3382.2
-394.8
-10.5%
2 <Cross_Ro>
40.0
3482.0
3441.1
-400.9
-10.4%
3 <Binley_S>
40.0
5815.0
5220.2
-594.8
-10.2%
<continued>
Long zone labels are printed if NODLAB=T and LNGLAB=T. The example below
shows hierarchic zone numbers and long zone labels in the report:
ZONE CONFIDENCE OBSERVED ESTIMATED ESTM-OBSV (ESTM-OBSV)/OBSV(%)
NUMBER & NAME
28480 <Beaumont Avenue>
40.0
5069.0
5544.5
475.5
9.4%
28172 <Cross Roads, town centre>
40.0
4025.0
4392.2
367.2
9.1%
27848 <Binley Street>
40.0
1898.0
2076.7
178.7
9.4%
<continued>
120
Reports
Sample reports
Maximum
Minimum
Number
of
Elements
Trip matrix confidence levels
Screen line confidence levels
Trip end (dest) confidence levels
Trip end (orig) confidence levels
Part Trip confidence levels
10.0
80.0
46.7
46.7
7.0
10.0
80.0
80.0
80.0
7.0
10.0
80.0
40.0
40.0
7.0
1083
2
95
95
594
Reports
Sample reports
District matrix
REPORTING PRIOR/ESTIMATED MATRIX TOTALS
CONFIDENCE
PRIOR ESTIMATED ESTM-PRIOR (ESTM-PRIOR)/PRIOR(%)
20.0 238498.0
240291.2
1793.2
0.8%
REPORTING OBSERVED/ESTIMATED GENERATIONS AND ATTRACTIONS
GENERATIONS
DISTRICT CONFIDENCE OBSERVED ESTIMATED ESTM-OBSV (ESTM-OBSV)/OBSV(%)
1
40.0 14616.0
13368.5
-1247.5
-8.5%
2
40.0 48050.0
47995.1
-54.9
-0.1%
3
40.0
7855.0
7711.1
-143.9
-1.8%
4
40.0 40478.0
42008.8
1530.8
3.8%
5
40.0 62530.0
59877.4
-2652.6
-4.2%
6
40.0 15462.0
16832.4
1370.4
8.9%
7
40.0 18734.0
19158.2
424.2
2.3%
8
40.0
6744.0
6707.7
-36.3
-0.5%
9
40.0 26890.0
26631.9
-258.1
-1.0%
ATTRACTIONS
DISTRICT CONFIDENCE OBSERVED ESTIMATED ESTM-OBSV (ESTM-OBSV)/OBSV(%)
1
40.0 21562.0
22434.0
872.0
4.0%
2
40.0 43850.0
43476.7
-373.3
-0.9%
3
40.0 43963.0
44217.7
254.7
0.6%
4
40.0 21627.0
20809.8
-817.2
-3.8%
5
40.0 30926.0
30638.1
-242.9
-0.8%
6
40.0 37198.0
39445.7
2247.7
6.0%
7
40.0
8070.0
7973.4
-96.6
-1.2%
8
40.0 15332.0
16906.5
1574.5
10.3%
9
40.0 14423.0
14344.4
-78.6
-0.5%
REPORTING OBSERVED/ESTIMATED SCREEN LINE COUNTS
SCREENLINE CONFIDENCE OBSERVED ESTIMATED ESTM-OBSV OBSV(%)
NO & NAME
1 A'shot Rd W-E
80.0
11677.0
11303.8
-373.2
-3.2%
2 A'shot Rd E-W
80.0
11677.0
11947.6
270.6
2.3%
<continued>
122
NO OF ODs
244
244
Reports
Sample reports
Local matrix
REPORTING SIDE CONSTRAINTS ON MATRIX TOTALS
DISTRICT
IN-PRIOR ESTIMATED ESTM-DISTRICT (ESTMDISTRICT)
CONSTRAINT
/ZONAL(%)
WITHIN DISTRICT
1506.2
958.0
1456.7
-49.5
-3.3%
FROM DISTRICT
6204.9
6276.0
5966.2
-238.7
-3.8%
TO DISTRICT
19303.5
16610.0
19181.9
-121.6
-0.6%
NOT IN DISTRICT
213276.5 214654.0 214338.2
1061.7
0.5%
MATRIX TOTAL
240291.2 238498.0 240943.0
REPORTING OBSERVED/ESTIMATED GENERATIONS AND ATTRACTIONS
GENERATIONS
ZONE NO CONFIDENCE OBSERVED ESTIMATED ESTM-OBSV (ESTM-OBSV)/OBSV(%)
R-o-W
40.0 233504.0
233520.1
16.1
-0.0%
25
40.0
1557.0
1625.1
68.1
-4.4%
26
40.0
1753.0
1654.5
-98.5
-5.6%
27
40.0
378.0
338.8
-44.2
-11.7%
28
40.0
1535.0
1339.8
-195.2
-12.7%
55
40.0
211.0
232.5
21.5
10.2%
56
40.0
875.0
878.7
3.7
0.4%
60
40.0
268.0
296.6
28.6
10.7%
61
40.0
1278.0
1061.8
-216.2
-16.9%
ZONE NO
R-o-W
30
44
48
53
72
77
ATTRACTIONS
CONFIDENCE OBSERVED ESTIMATED
40.0 215324.0
220304.4
40.0
2370.0
2431.7
40.0
12392.0
11794.8
80.0
1708.0
1757.0
40.0
209.0
273.1
80.0
4226.0
3691.0
80.0
722.0
661.0
ESTM-OBSV (ESTM-OBSV)/OBSV(%)
4980.4
2.3%
91.7
3.9%
-597.2
-4.8%
49.0
2.9%
64.1
30.7%
-535.0
-12.7%
-61.0
-8.5%
OBSV(%) NO OF ODs
-1.9%
244
7.7%
244
Reports
Sample reports
Note that as for standard estimations, short and long zone labels
can be shown in the trip end reports. The label for the R-O-W
(Rest-of-the World) will be left blank.
124
10
Files
Required (R)/
optional (O)3
Symbolic1
parameter
File2
ext
I/O
File description
.CTL
(R)
IMAT1
.MAT
Prior Trip/
(R)
.DAT
(R) If TRPEND=T
IDAT2
.DAT
Model
Parameters File
(R) If MODPAR=T or
if WARMST=T
IDAT3
.GDS
Gradient Search
File
(R) If WARMST=T
IDAT4
.DAT
Screenline File
(R) If SCRFIL=T
IDAT5
.ICP
Intercept File
(R) If WARMST=T
or If INTCPT=T
INET1
.NET
Network File
(R) If PRTTRP=T
PATH1
.PTH
VOYAGER Path
File
(R) If WARMST=F
and INTCPT=F and
no IDAF1 input
IDAF1
.RCP
Route Choice
Probability File
(R) If WARMST=F
and INTCPT=F and
no PATH input
10
Files
Required (R)/
optional (O)3
Symbolic1
parameter
File2
ext
I/O
File description
IDAF2
.PTL
Lines File
IDAF3
.DDF
District
Definition File
(R) If DSTRCT=T
IDAT6
.DAT
Local Matrix
Control File
(R) If DSTRCT=T
IMAT2
.MAT
Partial Estimated
Matrix
Coordinate File
IDAT7
.DAT
WARMST=T
hierarchic numbering
OMAT1
.MAT
Estimated Trip
Matrix
(R)
ODAT1
.DAT
Model
Parameter File
(R)
ODAT2
.GDS
Gradient Search
File
(R)
ODAT3
.DAT
Optimization
Log File
(R)
ODAT4
.ICP
Intercept File
ODAT5
.DAT
Text Intercept
File
(O)
ONET1
.NET
Network File
OPRN
.NET
Print File
(R)
126
11
Control Data
&PARAM keywords
&OPTION keywords
11
Control Data
&PARAM keywords
&PARAM keywords
It is usual to leave Cube Analyst control parameters to their default
values, with the user only setting the parameters associated with
data input and output file definition. These are described in
Standard user control parameters.
Of the remaining parameters, there is a set which is sometimes
changed (Secondary user control parameters on page 133) and
another which is rarely changed (Tuning control on page 135).
Most of the parameters in this last set are connected to the
operation of Cube Analysts optimization process, and hence are
only of interest when there is evidence of poor performance in
achieving convergence.
Topics in this section include:
Tuning control
Type = Integer(4)
Default = 101, 102, 0, 0
Example: TABLES=101,102,103,104
The input matrix numbers to be used. They are respectively the
prior trip matrix and confidence levels, and the cost matrix and
confidence levels.
MATID
Type = Character(60)
Default = Blank
Example: Estimated Matrix for Study Area
128
Control Data
&PARAM keywords
Type = Integer
Default = 2 if hierarchic numbering, 0 otherwise
Range = 0-3
Example: WIDEND = 0
Indicates the format of the Screenline File:
0 = Cube Analyst establishes the format automatically. For this to
happen, all numeric entries in the file need to be right justified for
Cube Analyst to determine the file format unambiguously.
1 = Version six format. This supports the record format where both
link flow and link toll data are stored in the same record type.
2 = Version seven format. Two types of screenline record format can
be defined at version seven; link flow records (S in column one) or
link toll records (T in column one). The former type must be used
for Cube Analyst runs.
3 = Version seven format with a CNode column inserted to support
the input of turning counts in addition to link counts.
If hierarchic numbering is in use (HIERND = T), WIDEND must be set
to 2 to use the version seven format
MFORM
Type = Integer
Default = 0
Range = 0-4
Example: MFORM = 0
Indicates the format of the output matrix:
11
11
Control Data
&PARAM keywords
0 = Save the matrix in the same format as the input matrix. This is
the default action.
1 = TRIPS
2 = TP+/VOYAGER
3 = TRANPLAN
4 = MINUTP
DEC
Type = Integer(50)
Default = 1
Range = 1 to Number of paths sets in the input VOYAGER path file
Example = 1,3
Applies only when a VOYAGER path file is input. It defines the path
sets to apply when building the intercepts for the screenlines. At
least one set must be specified, and sets are referenced by their
number rather than by their name.
130
Control Data
&PARAM keywords
PVOLS
Type = Integer(50)
Default = 1
Range = 0 to number of volumes in the input VOYAGER path file
Example = 1,2
Applies only when a VOYAGER path file is input. It defines the
volumes to apply when building the intercepts for the screenline. If
a value of 0 is specified, the volumes will be ignored, and the
weighting of alternative routes will be solely defined by the
iteration factors. Otherwise, PVOLS is a list of numbers that are 1 or
more, representing the selected volume.
NETID
Type = Character(40)
Default = Blank
Example: NETID='Network with estimated link flows'
Network identifier. Up to 40 alphanumeric characters can be used
to describe the contents of the output network. The identifier
should be enclosed in single quotes ('). NETID is only used if reading
part trip data (PRTTRP=T).
EFLOW
Type = Integer
Default = 2
Range = 1-20
Example: EFLOW = 4
The number of the volume field in the output network into which
the total link flows estimated by Cube Analyst will be written.
EFLOW is only used if PRTTRP=T.
11
11
Control Data
&PARAM keywords
ELINEn
Type = Integer
Default = ELINE(n)=2+n*2
PT Only
Range = 1-20
Example: ELINE1 = 6
ELINE(n) is the number of the volume field in the output network
into which the link flows estimated by Cube Analyst will be written
for line group n. ELINEn is only used if PRTTRP=T and doing a Public
Transport matrix estimation.
NFLOW
Type = Character(4)
Default = 'EFLW'
Example: NFLOW = 'EFLW'
Volume field identifier. Up to four alphanumeric characters can be
used to indicate the contents of the volume field specified by
EFLOW. The identifier should be enclosed in single quotes (').
EFLOW is only used if reading part trip data (PRTTRP=T).
NLINE
Type = Character(4)*8
Default = 'NLINEn='ELGn'
PT Only
Example: NLINE1 = 'ELG1'
Volume field identifier. Up to four alphanumeric characters can be
used to indicate the contents of the volume field specified by
ELINEn. The identifier should be enclosed in single quotes (').
NLINEn is only used if PRTTRP=T and doing a Public Transport
matrix estimation.
132
Control Data
&PARAM keywords
ZCONF
Type = Integer
Default = 100
Range = 1-10000
Example: ZCONF =200
Confidence level for side constraints applied to local matrices and
derived from estimated district matrix.
Type = Integer
Default = 3000
Range = 1-999999
Example: MXITER = 1500
The maximum number of iterations. Cube Analyst will stop if this
number of iterations has been reached and no convergence has
been achieved. The model parameter and gradient search files are
written out and can be used to restart Cube Analyst (from the
position it was in when it stopped) and the optimization continued.
The currently estimated matrix is also output.
ITERH
Type = Integer
Default = Generated by Cube Analyst
Range = 1-9999
Example: ITERH = 4000
The number of iterations between recalculations of the estimated
Hessian matrix.
11
11
Control Data
&PARAM keywords
UTOL
Type = Real
Default = 0.0001
Range = 0.0-99.0
Example: UTOL = 0.05
The accuracy tolerance in detecting convergence or failure. When
the maximum absolute size of the search vector is less than this
value then the procedure will be deemed to have converged.
IREP
Type = Integer
Default = 3
Range = 1-3
Example: IREP = 2
Reporting level for the optimization log file. See Information in the
optimization log file on page 157.
IHTYPE
Type = Integer
Default = 4
Range = 0-4
Example: IHTYPE = 2
This controls the type of optimization process used by Cube
Analyst, as shown in the following table. The difference for values 1
- 4 correspond to differences in the way the initial Hessian matrix,
H0, is calculated.
Methods of optimization
134
Optimization process
Value of IHTYPE
Comments
Steepest Descent
Simple searching
Quasi-Newton
1,2,3
Newton/Hybrid Newton
Control Data
&PARAM keywords
Tuning control
The parameters documented in this section would normally be
changed only in response to an error message generated by Cube
Analyst. In the event of this occurring, please contact
support@citilabs.com for advice.
MXCALL
Type = Integer
Default = 5000
Range = 1-999999
Example: MXCALL = 10000
The maximum number of function evaluations. (This should be
greater than MXITER. At least one function evaluation is required at
each iteration, possibly more.)
MXFREE
Type= Integer
Default= 4
Range= 1-10
Example: MXFREE = 7
The number of times a parameter may be freed from its bounds.
UMAX
Type = Real
Default = 1.0
Range = 0.0-1000.0
Example: UMAX = 0.5
The maximum allowed search step. If the maximum absolute value
of the search vector (called UNORMX in the log report) is greater
than this then the entire search vector is multiplied by a term
UMAX/UNORMX so that the new maximum entry is equal to UMAX.
11
11
Control Data
&OPTION keywords
&OPTION keywords
Note: Options TRIPM and COSTM work in conjunction with one
another.
TRIPM
Type = Logical
Default = True
If TRIPM = T then the input matrix file will contain at least two
tables. The first will be the prior trip matrix; the second will be the
associated confidence levels.
IF COSTM = F then these will be the only two matrices present in
the file.
TRIPM = F is only allowed if COSTM = T.
COSTM
Type = Logical
Default = False
If COSTM = T and TRIPM = T, then the input matrix file will contain
four two tables. The first two are as described above; the third will
be the cost matrix and the fourth will be the associated confidence
levels.
If COSTM = T and TRIPM = F, then the cost and confidence level
matrices will be the first and second supplied in the file.
SCRFIL
Type = Logical
Default = True
If SCRFIL = T then an input screenline file is supplied. See
Screenline file on page 140.
136
Control Data
&OPTION keywords
TRPEND
Type = Logical
Default = True
If TRPEND = T then an input trip end data file is supplied. See Trip
end file on page 142.
INTCPT
Type= Logical
Default= False
If INTCPT=T then an input Intercept file is supplied. See Intercept
file on page 149.
MODPAR
Type= Logical
Default= False
If MODPAR = T then an input model parameter file is supplied. See
Model parameter file on page 144.
WARMST
Type= Logical
Default= False
If WARMST = T then gradient search, model parameter, and
intercept files are supplied to warm start the estimation calculation.
The input of these files from a previous run of the same model
should assist the speed of optimization, but see Approaches to
running Cube Analyst on page 152.
HIERND
Type= Logical
Default= Set automatically from RCP input file; False if no RCP file
input.
11
11
Control Data
&OPTION keywords
Type= Logical
Default= False
If NODLAB=T, zone labels will be included in the Cube Analyst
reports. A coordinate file must be supplied containing node labels.
Coordinate file on page 143.
LNGLAB
Type= Logical
Default= False
LNGLAB=T to include long zone labels in the Cube Analyst reports.
Note that NODLAB = T must also be set to use long labels. If
NODLAB=T and LNGLAB=F, the short zone labels will be used. A
coordinate file must be supplied containing node labels. See
Coordinate file on page 143.
PRTTRP
Type= Logical
Default= False
If PRTTRP = T then an input network file is supplied which contains
part-trip link flows.
DSTRCT
Type= Logical
Default= False
If DSTRCT = T then a local matrix control file and district definition
file must be supplied. See Local matrix control file on page 147
and District definition file on page 148.
138
12
Screenline file
Coordinate file
Intercept file
12
Screenline file
This file is required if SCRFIL = T.
The screenline file is used to supply link/turn count and confidence
level data to Cube Analyst.
There are two formats of the file supported. The original format
(indicate by parameter WIDEND=2) just supports link counts. An
alternative format (WIDEND=3) has an extra column to allow
turning counts to be specified.
This section describes both formats:
140
Columns
Type
Contents
Character
2-5
Integer
Screenline number
6 - 14
Integer
Anode of link
15 - 23
Integer
Bnode of link
24 - 33
Real
34 - 40
Integer
41 - 58
Character
60
Integer
61 - 70
Integer
71 - 80
Integer
Type
Contents
Character
2-5
Integer
Screenline number
6 - 14
Integer
Anode of link/turn
15 - 23
Integer
Bnode of link/turn
24 - 32
Integer
33 - 42
Real
43 - 49
Integer
50 - 67
Character
70
Integer
Notes:
The file can contain a mixture of link and turning counts. For
link counts, the Cnode should be left blank.
12
12
Type
Content
1 - 10
Integer
Zone Number
11 - 20
Real
Generations
21 - 40
unused
41 - 50
Real
Attractions
51 - 60
Integer
61 - 70
Integer
142
Coordinate file
The input coordinate file must be supplied if option NODLAB has
been set to TRUE. The file supplies the correspondence between
node numbers and their hierarchic equivalents.
The format of the file is summarized below:
Columns
Type
Content
*1 - 10
Integer
11 - 20
Integer
X coordinate
21 - 30
Integer
Y coordinate
*31 - 40
Integer
41 - 48
Character
49 - 80
Character
Notes:
12
12
Description
Record one
(if COSTM = T)
Where:
144
Columns
Type
Content
1 - 23
Character
24 - 31
Integer
32 - 39
Integer
40 - 47
Integer
48 - 55
Integer
Number of screenlines
56 - 63
Integer
64 - 77
Real
78 - 91
Real
Step size
92 - 99
Integer
Remaining records
Default if parameter
not defined
Columns
Type
Content
1-8
Integer
Parameter number
10 - 22
Real
Parameter value
1.0
24 - 36
Real
0.1 E-6
38 - 50
Real
1.0 E10
52 - 64
Real
1.0
65 - 89
100 - 107
12
12
146
Type
Content
1 - 10
Integer
Origin District
11 - 20
Integer
Destination District
12
12
148
Intercept file
This file is required if INTCPT = T or WARMST = T.
Output by Cube Analyst and Cube Voyager HIGHWAY and PT, this
binary file stores information on routings and screenlines in a
concise format. Once established, it may be re-input to Cube
Analyst to save (substantial) processing times when Cube Analyst is
estimating or re-estimating for data where neither the routings or
screenline locations definitions have been altered. This file cannot
be edited by the user.
Note that there is also a text file version of the intercept file that can
be output. Its purpose is for information only; it is not intended for
subsequent input to Cube Analyst or any other program. The file is
written to if the file is named, and is generated from either the
input or output binary intercept file, depending upon which is
used. For each screenline it shows:
12
12
150
13
Computation times
13
Initial estimation
Initial estimation
Only basic input data is required, as contained in the routes (RCP for
TRIPS, PATH or ICP for Cube Voyager users), matrix, trip end, and
screenline files (as well as optionally a network file with part-trip
data). Program control parameters are allowed to take default
values. Occasionally, an input model parameter file may be
required to influence the model form by fixing some parameter
values.
Re-Estimation with altered data: Warm starting
152
If model parameters are fixed or freed from run to run then the
gradient search file from one run should not be used in the next.
Note that Cube Analyst may itself constrain model parameters. This
occurs when the estimated Hessian matrix is found to be,
mathematically speaking, non-positive definite, which arises
when one or more model parameters are degenerate. That is, a
model parameter is not contributing independently of another
model parameter.
In these circumstances, Cube Analyst gives a message reporting
how many such model parameters it is constraining, which is of the
form:
ME (I): XXX MODEL PARAMETERS ARE NOT CONTRIBUTING TO THE ESTIMATION
13
13
154
The model parameter file contains a value for the parameter, the
upper and lower bounds for the parameter, and a scaling factor.
The scaling factor is used only to assist the optimization process in
ensuring that maximum accuracy is obtained. It should be set
equal to the expected value of the parameter in the final
solutionit is only necessary to make this scaling factor of the
same order of magnitude if there are difficulties in ensuring
convergence. If no such difficulties are apparent then the scaling
factor can just be set to 1.0.
The lower and upper bounds for the parameters allow the user to
specify the degrees of freedom which are permitted. In particular if
a model parameter is set to 1.0 and its lower and upper bounds are
also set to this value then a number of standard forms for the
matrix estimation process can be achieved. For example:
Setting all a(i), b(j) and X(k) equal to a fixed value (for example,
1.0) together with their bounds then the problem is reduced to
a Gravity model driven only by the cost data (that is, only and
are allowed to vary). (Note that varying values of a(i) and b(j)
can be used to scale the numbers of trips per zone).
13
13
Setting a(i), b(j) and and to a fixed value and allowing x(k) to
be free provides a link constraint model.
Setting a(i) and b(j) to a fixed value defines a link gravity model.
Growth
factor
Gravity
Growth
factor link
Link
gravity
Routing
information
(RCP)
Trip costs
Link
constraint
156
13
13
158
Computation times
Program control parameters should usually take their default
values. However, computation times may be improved by setting
the set of parameters shown in the following table.
Parameters for influencing computation times
Control parameter
Comment
MXITER
ITERH
UTOL
13
13
Files
Files
For Cube Voyager users, the intercept file can be generated by
HIGHWAY and PT. In this case the intercept file name should always
be defined, along with the option INTCPT=T. Alternatively,
HIGHWAY can be used to generate a Cube Voyager Path file, which
is input to Cube Analyst, which will create the screenline Intercepts
from the paths.
160
14
Examples
Hierarchic estimation
14
Examples
Estimation with prior trip and count data only
162
Examples
Estimation with prior trip, count, and trip end data
14
14
Examples
Estimation with warm start and cost data
164
Examples
Estimation with highways part trip data
14
14
Examples
Estimation with public transport part-trip data
166
Examples
Hierarchic estimation
Hierarchic estimation
The following control data would be suitable for estimating a very
large public transport matrix where the data included part-trip data
as well as a trip matrix, trip ends, and count data.
Column
1...5...10...15...20...25...30...35..40..45..50..55..60T
itle:
Large PT Estimation using Part Trip Total Link Flow
&FILES IMAT1='PRIOR.MAT',
IDAT1='TEND.DAT',
IDAT4='SCRL.DAT',
IDAF1='ROUTES.RCP',
IDAF2='LINES.PTL',
IDAF3='DISTRICT.DDF',
IDAT6='LMCTL.DAT',
OPRN='MVESTM.PRN',
OMAT1=ESTM.MAT',
ODAT1='PARM.DAT',
ODAT2='GRAD.GDS',
ODAT3='LOG.DAT',
ODAT4='INTCPT.ICP'
ONET1='ESTMPTRP.NET' &END
&PARAM MATID='Estimated Matrix using Part Trip data',
NETID='Estimated Flows (total Part Trip flow)',
EFLOW=2,
NFLOW='ETOT' &END
&OPTION PRTTRP=T,
DSTRCT=T &END
14
14
Examples
Example of screenline volumes report
168
22670.0
22494.5
-175.5
OBSV(%) NO OF ODs
-2.9%
244
0.5%
244
-4.6%
160
-1.3%
160
-7.9%
383
-2.8%
687
1.7%
904
-0.8%
870
Index
A
analysis
Cube Analyst results 75
C
calculations
hierarchic estimation 57
common elements 6
computation times 159
computing resources 9
confidence levels
setting 70
controlling routing information 74
conventions used 8
coordinate file 143
cost data
example using 164
sources of 10
cost distribution function 29
COSTM 136, 136
count data, example using 162, 163
Cube Analyst
overview 21
running for hierarchic estimation 106
running, approaches to 152
D
data
Cube Analyst estimation process 81
sets, Cube Analyst 30
types, Cube Analyst 26
DDF 106
defining
districts 104
Index
I
approaches to 95
example 167
large matrices, approach for 94
levels of detail 94
matrices calculated 57
overview 94
HIERND 136
&OPTION keyword, Cube Analyst 137
highway matrix
estimating with part trip data, example 165
estimating, compared to public transport 20
I
ICP 74
IHTYPE
&PARAM keyword, secondary parameter 134
including part-trip data 87
inputs
estimating O-D matrix 110
INTCPT 136, 149
intercept file
description 149
introduction 2
IREP 128, 157
ITERH
&PARAM keyword, secondary parameter 133
influencing computation time with 159
L
large matrices, estimating 112
link counts, Cube Analyst input 26
LMC 106
LNGLAB 136, 136
local matrices
control file 147
hierarchic estimation 98
M
mathematical notation
equation 34
letters and symbols 32
mathematics
calculations summary 48
introduction 35
MATID, &PARAM keyword 128
matrices
estimation process 83
preparing for analysis 61
maximum likelihood method 48
mixed districts 97
170
Index
S
approaches 152
from Cube Voyager 160
hierarchic estimation 106
running Cube Cargo
hierarchic estimation 106
S
screenline file
description, Cube Analyst 140
screenline volumes report 168
screenlines 65
SCRFIL 136, 140
selecting model form 155
Sensitivity Analysis 86
Setting 70
Confidence Levels 70
Data 30
Side Constraints 57
Study Area 80
summary of reports 116
T
TABLES user control parameter 128
The Estimation Model 113
The Optimization Step 113
traffic counts 64
Trip Cost Matrix 26, 26
Trip End Data 163
Trip End File 142
trip ends
data description 28
determining 63
trip matrix
estimating in Cube Analyst 2
TRIPM 136, 136
TRPEND 136, 142
Tuning 73
Estimation Performance 73
U
UMAX 128, 128
user options 15
UTOL 128, 159, 159
V
Variations 6
W
Warm Start 164
WARMST 136, 150
Whats New in Version 7.1 4
WIDEND 128, 128
Z
ZCONF 106, 128, 128
Zonal Detail 96
Index
Z
172
Citilabs, Inc.
1211 Miccosukee
Tallahassee FL 32308 USA
World Wide Web
www.citilabs.com