You are on page 1of 6

This article was downloaded by: [Florida Atlantic University]

On: 11 November 2014, At: 01:09


Publisher: Taylor & Francis
Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House,
37-41 Mortimer Street, London W1T 3JH, UK

International Journal of Pavement Engineering


Publication details, including instructions for authors and subscription information:
http://www.tandfonline.com/loi/gpav20

Identification of Homogeneous Sections from Road Data


a b
Rahul Misra & Animesh Das
a
Department of Civil Engineering , Indian Institute of Technology , 781 039, Guwahati, India
b
Department of Civil Engineering , Indian Institute of Technology , 208 016, Kanpur, India
Published online: 31 Jan 2007.

To cite this article: Rahul Misra & Animesh Das (2003) Identification of Homogeneous Sections from Road Data, International
Journal of Pavement Engineering, 4:4, 229-233, DOI: 10.1080/10298430410001672237

To link to this article: http://dx.doi.org/10.1080/10298430410001672237

PLEASE SCROLL DOWN FOR ARTICLE

Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained
in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no
representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and
are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and
should be independently verified with primary sources of information. Taylor and Francis shall not be liable for
any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of
the Content.

This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://
www.tandfonline.com/page/terms-and-conditions
The International Journal of Pavement Engineering, Vol. 4 (4) December 2003, pp. 229–233

Identification of Homogeneous Sections from Road Data


RAHUL MISRAa,† and ANIMESH DASb,*

a
Department of Civil Engineering, Indian Institute of Technology, Guwahati 781 039, India; bDepartment of Civil Engineering,
Indian Institute of Technology, Kanpur 208 016, India

(Received 10 August 2002; Revised 1 January 2004)

For highway design or maintenance projects, a long road stretch needs to be delineated into some
homogeneous sections, on the basis of various pavement responses, such as, subgrade CBR value, field
moisture content, soil type, deflection, pavement distress parameters, roughness, etc. Cumulative
Downloaded by [Florida Atlantic University] at 01:09 11 November 2014

difference approach (CDA) for identification of homogeneous sections is suggested in the AASHTO
Guide for Design of Pavement Structures [AASHTO (1986) Guide for Design of Pavement Structures,
AASHTO, Washington, DC, pp. III-17– III-20 and Appendix J; AASHTO (1993) Guide for Design of
Pavement Structures, AASHTO, Washington, DC, pp. III-19 –III-24 and Appendix J]. This paper
identifies some of the limitations associated with this method and suggests improved yet simplistic
methodology for identification of homogeneous sections based on a combined approach of
classification and regression tree (CART) and exhaustive search. Examples are drawn from synthetic
and realistic data.

Keywords: Homogeneous section; Cumulative difference approach; Change-point; AASHTO Guide;


CART method

INTRODUCTION This paper also attempts to suggest simplistic procedure for


delineation of data into homogeneous sections where the
Pavement condition parameters, like subgrade strength, shortcomings of CDA are taken care of. Examples are
deflection due to test load, skid resistance, or level of presented to show the effectiveness of the proposed
various distresse, vary along a road’s length. These data methodology over the CDA.
are used for design of new roads, or rehabilitation of
existing roads. If design is based on the data of a particular THE CDA
parameter of the whole stretch taken together, it will lead
to an uneconomical design. Therefore, pavement response In the CDA, the term Zx is defined as the difference in the
data along a long stretch of road needs to be delineated cumulative area values at a given distance (x) between
into sections which are “relatively uniform”, referred to as the actual response and the average response (AASHTO,
homogeneous sections, and the design performed 1993). Mathematically,
individually for each of these homogeneous sections. 0X 1
n
This results in economy in design without compromising
Xn
B x i CX
nt
reliability level. Zx ¼ ai 2 @ i¼1 A ai ð1Þ
For identification of homogeneous sections, cumulative i¼1 Lp i¼1
difference approach (CDA) is suggested in the AASHTO’s
Guide (1986, 1993). CROW (1998) also suggests a similar and
methodology. However, there are situations where CDA

becomes sensitive to local variations of data (Divinsky r i21 þ r i
et al., 1997) and fails to delineate homogeneous sections ai ¼ xi ; ð2Þ
2
effectively. Also this method does not offer a choice of
the number of homogeneous sections a designer intends to where, xi is the distance along the road section up to i-th
delineate. These issues have been addressed in this paper. response measurement; n is the n-th pavement response

*Corresponding author. E-mail: adas@iitk.ac.in



Current address: Assistant Systems Engineer, Tata Consultancy Services, C-56, Phase-2, Noida 201 305, India. E-mail: rahul_iitg@yahoo.com

ISSN 1029-8436 print/ISSN 1477-268X online q 2003 Taylor & Francis Ltd
DOI: 10.1080/10298430410001672237
230 R. MISRA AND A. DAS

FIGURE 1 Identification of homogeneous section by Cumulative


Difference Approach [the data points are taken from the AASHTO’s FIGURE 2 Homogeneous sections delineated on the basis of the mean
Guide (1993) example]. value [the data points are taken from the AASHTO’s Guide (1993)
example].

measurement; nt is the total number of pavement response


Downloaded by [Florida Atlantic University] at 01:09 11 November 2014

measurements taken in the project; ri is the pavement


response value of the i-th measurement; Lp is the total
project length.
The Zx values are plotted versus distance along the road
length and the points where the slope of the Zx curve
changes its algebraic sign are taken as the border line bet-
ween the consecutive homogeneous sections (AASHTO,
1993). These points have been described as change-points
(Brodsky and Darkhovsky, 1993, Internet site of Cardiff
University),‡ and are referred to as such in the rest of the
paper.
FIGURE 3 Cumulative difference approach may fail to identify
A sample plot of the pavement response values as existence of other homogeneous sections within a homogeneous section.
read from in the AASHTO’s Guide (1993) and the
resulting Zx values, are plotted in Fig. 1. The change-
points, thus identified, are shown by vertical lines and all of them may either be completely above or
intersecting the Zx curve. below the mean horizontal line. In such a case, Zx
curve does not change its sign, hence CDA fails to
identify the existence of other possible homogeneous
Discussion on the CDA
sections. Such a situation is shown in Fig. 3 (plotted
In fact, the CDA is equivalent to finding those points at from synthetic data) with AB, BC, CD and DE as
which the response curve crosses the mean horizontal line, individual homogeneous sections, where both AB and
which includes all the data-points. This can be explained BC lie below the mean line, and similarly, where
as follows. As long as the response plot lies below CD and DE lie above the mean line. In such a
the mean line, the Zx value keeps on decreasing, as the situation, CDA could only identify AC and CE as
difference between the cumulative and the average area is two homogeneous sections (because the slopes
negative. The slope of the Zx curve changes its sign only change in magnitudes at points B and D, but not in
when the response curve crosses the average line. their signs).
This is cross-checked in Fig. 2, where the homogeneous . Figure 4 consists of a modified form of response
sections delineated are the same as those delineated by the values of the AASHTO guide example, where the
CDA procedure shown in Fig. 1. values in between the points A and B are deliberately
CDA, though a quick and straight-forward method for elevated to higher level by adding fixed offset value
identification of homogeneous sections, has certain to all the data-points in this zone. It is interesting to
limitations, as identified in the following: observe from Fig. 4 that CDA is now unable to
identify homogeneous sections which were pre-
. As mentioned above, CDA is the same as finding viously delineated (refer Fig. 1), as the average of
those points at which the response curve crosses the the response has shifted to a larger value. Thus, as
mean horizontal line. But there may be cases when per CDA, the data level of one part of the stretch
more than one homogeneous sections with different affects identification of homogeneous sections of
average response levels may exist consecutively, other part of the stretch, which is undesirable.


http://www.cf.ac.uk/maths/stats/changepoint, last accessed on 25th June, 2002)
IDENTIFICATION OF HOMOGENEOUS SECTIONS 231

2001). Some of them involve complexities in terms of


implementation, or assumptions related to the distribution
pattern of the response data. The approach suggested in
the present case is simple, basic and does not involve any
assumption related to the distribution pattern of the
response data. The suggested approach is presented
through the following three sub-sections.

Development of Classification and Regression Trees


(CART)
FIGURE 4 In Cumulative difference approach data level of one part The first step of the approach deals with creation of a large
of the stretch may affect identification of homogenous sections in the
other part. collection of partitions of the original data set by
recursively dividing the parent section into two child
sections. This can be better explained by taking a simple
. As well, CDA does not have any control over the example. Figure 5 presents a plot of synthetic random data
where the first 20 data points have a different mean and
Downloaded by [Florida Atlantic University] at 01:09 11 November 2014

number of homogeneous sections a designer intends


to choose. Statistically, one should be able to find out standard deviation to the other 20 points. The data-set may
desired number of homogeneous sections from a be divided into two parts at any point (say at “A”,
given response data of a road stretch, depending on as shown in the figure). A term, squared error,
the reliability level chosen. There may not be any may be calculated as the sum of squared difference
restriction on maximum length of a homogenous between the data points and the mean of each part, added
section, but from practical considerations, there must together. Thus,
be a provision for specifying minimum length of a X
A X
nt
homogeneous section. Because, frequent changes in Squared error ¼ ðr i 2 r L Þ2 þ ðr i 2 r R Þ2 ; ð3Þ
specification along the road stretch may cause i¼1 i¼A
inconvenience from the construction point of view.
Choice of minimum section length can not be where, r L is the average response value of the left part of A,
incorporated in CDA. r R is the average response value of the right part of A.
Now, by shifting the point A from one end of the data to
These limitations of CDA have prompted the authors to the other, it is possible to find a specific position where the
look for an improved method for delineation of value of squared error is minimum. This is an exhaustive
homogeneous sections, which is easy to implement and search. This exhaustive search procedure can easily
also works efficiently in all such situations. Divinsky et al. divide a given stretch into two homogeneous sections.
(1997) pointed out that CDA being sensitive to the mean However, this becomes an optimization problem when the
value of data-points, some methodology based on standard given section is to be delineated into homogeneous sections
deviation needs to be devised. He suggested that the whole of more than two in number. The solution in that case is
data-set could be replaced by moving the standard likely to require a very large computation time, specially
deviation of the chosen base-length operated on the same when the data size is large. Instead, the above procedure
data, and then the same CDA can be adopted to delineate could be operated again on the two sections individually
the homogeneous sections. However, for obvious reasons, which have just been delineated, resulting in four
this method is sensitive to what base-length is chosen, as delineated sections and so on (Montgomery et al., 2001).
shown by Divinsky et al. (1997), who chose a base-length
of five data-points for their study based on road roughness
data of Israel. Also, this method finally uses CDA for
delineation, so the limitations involved with CDA
discussed above are equally valid in this case.

PROPOSED METHOD

Some formulations are available in literature for


estimation of change-points. These are based on multiple
regression, generalized linear models, cluster analysis,
maximum likelihood estimation, autoregressive
moving average, exponential smoothening method,
Bayesian approach, etc. (Basseville and Nikiforov, 1993; FIGURE 5 Exhaustive search for dividing the data-points into two
Brodsky and Darkhovsky, 1993; Montgomery et al., homogeneous sections.
232 R. MISRA AND A. DAS
Downloaded by [Florida Atlantic University] at 01:09 11 November 2014

FIGURE 6 Original tree (a) and sub-tree (b) (for eight delineated sections) for Alaska roughness data (GPS section 1008, 25th May, 1990, Repeat 1).

This can be recursively done on the given data until each TABLE I Sections delineated by the proposed method [distance unit]
section delineated is smaller than a given minimum length
specified based on practical considerations. AASHTO Modified AASHTO
Guide example (Fig. 1) Guide example (Fig. 4)
The result obtained can be represented in the form
of a binary tree which has been illustrated with the help 0.0– 5.0 0.0– 5.0
5.0– 25.0 5.0– 25.5
of a sample data in Fig. 6. The data (Internet site University 25.0–33.5 25.5– 30.5
of Michigan){ represents the roughness level of a particular 33.5–52.5 30.5– 39.0
road section (GPS section 1008, May 1990) in Alaska. 52.5–57.5 39.0– 52.5
57.5–65.0 52.5– 65.0
Figure 6a shows the recursive binary splitting of this data in 65.0–76.5 65.0– 76.5
the form of a binary tree, till it has reached the minimum
specified section length, which has been taken as 50 data-
points in this case.
The Algorithm

Selection of Best Sub-tree Thus, the algorithm can be summarized in the form of
following two stage process:
Since the above mentioned binary tree has been
obtained by delineating sections with sizes close to the . The given section is recursively divided into homo-
minimum specified length, the number of sections geneous sections using binary splitting based on
resulted may be considerably large. Now, depending on a minimum squared error criterion. If any section
the requirement of the number of sections to be comes out to be less than the permissible minimum
delineated, a designer may choose to suitably combine section length, the process aborts further sub-division
some of these sections. This can be represented in the for that section.
form of a sub-tree derived from the original binary tree. . The large collection of sections delineated as a result of
Thus, for a given number of homogeneous sections to be previous step is reduced to the required number
delineated, one may obtain a number of such sub-trees of sections by merging individual sections with
as possible solutions. The best solution can be identified other sections. This is done by exhaustive search.
as that combination of sections which has the least sum The combination, which gives the least sum of standard
of standard deviations. Figure 6b now shows the best deviations, is the intended delineation.
sub-tree obtained for the sample data (as used in Fig. 6a),
when the number of sections delineated was restricted An algorithm of a similar yet more rigorous in approach
to eight. has been suggested in a recent paper by Gey and Lebarbier

{
http://www.umtri.umich.edu/erd/roughness, last accessed on 25th June, 2002
IDENTIFICATION OF HOMOGENEOUS SECTIONS 233

FIGURE 7 Plot of the Alaska roughness data and various possible homogeneous sections (GPS section 1008, 25th, May, 1990, Repeat 1).
Downloaded by [Florida Atlantic University] at 01:09 11 November 2014

(2002), where they have used penalty functions when the . The algorithm does not involve any assumption of the
number of change points are unknown. distribution pattern of the data-points.
. The algorithm takes into account the constraint of
minimum size of section to be delineated.
RESULTS AND DISCUSSION . It also provides the flexibility to choose the number of
sections to be delineated.
Based on the above algorithm, a computer code has been
developed in C language, and operated on various
synthetic and realistic data. Table I shows the sections
delineated for the AASHTO example (Fig. 1) and modified CONCLUSIONS
data (Fig. 4). An example on modified data (Column 2 of
Table I) shows that this could successfully identify The limitations of CDA (AASHTO, 1993; CROW, 1998)
a separate homogeneous section from distance unit for identification of homogeneous sections from road data
39.0 –52.5, which was provided deliberately. of a given stretch are discussed. A combined approach
Figure 7 shows various number of homogeneous based on classification and regression tree and exhaustive
sections that can be delineated depending on the search, is suggested in this paper, so as to delineate
requirement of the designer. Corresponding squared errors homogeneous sections more effectively. The proposed
are also put in Fig. 7. The data points in this figure have method takes care of the limitations of the CDA, and is
been taken from the same source as that of used in Fig. 6 simple, fast and flexible.
(GPS section 1008, May 1990, Internet site of
University of Michigan),{ and the minimum section
length is chosen to be 50 data-points in this case. Thus the
References
proposed method has a better flexibility in terms of
adjustments the possible in specifying minimum section (1986) Guide for Design of Pavement Structures (AASHTO, Washington,
DC), pp III-17–III-20 and Appendix J.
length and the number of delineated sections desired. (1993) Guide for Design of Pavement Structures (AASHTO, Washington,
As per CDA the sections delineated are 0 – 10, 10– DC), pp III-19–III-24 and Appendix J.
325, 325 – 575, 575 –760 and 760– 1000. These values are Basseville, M. and Nikiforov, N. (1993) The Detection of Abrupt
Changes—Theory and Applications (Prentice-Hall, Englewood
also derived from the proposed method at the first part of Cliffs, NJ), Information and Systems Sciences Series.
the algorithm, when the minimum section length is specified Brodsky, B. and Darkhovsky, B. (1993) Nonparametric Methods in
as 5. Subsequently, some other homogenous sections than Change-Point Problems (Kluwer Academic Publishers, Dordrecht,
The Netherlands).
those delineated by CDA are chosen by the proposed (1998) “Deflection Profile – Not a Pitfall Anymore”, The National
method. Since the proposed method performs an Information and Technology Centre for Transport and Infrastructure
exhaustive search on least sum of standard deviations, it (CROW), The Netherlands, Appendix II, pp.149–153.
Divinsky, M., Nesichi, S. and Livneh, M. (1997) “Development of a Road
can be said that the sections delineated are statistically Roughness Profile Delineation Procedure”, Journal of Testing and
more significant than those of CDA. Evaluation 25(4), 445 –450.
Thus, the advantages of the proposed algorithm can be Gey, S. and Lebarbier, E. (2002) “A CART based Algorithm for Detection
of Multiple Change Points in the Mean for Large Samples”, preprint
identified as: prepublication no. 2002-10 of Université de Paris XI, http://www.
math.u-psud.fr/ , biblio/ppo/2002/fic/ppo_2002_10.ps, last accessed
. The algorithm stated above is simple, fast, can work on on 25th June, 2002.
Montgomery, D.C., Peck, E.A. and Vining, G.G. (2001) Introduction to
any random data of any size, and overcomes the Linear Regression Analysis, Wiley Series in Probability and
limitations associated with CDA. Statistics, 3rd ed. (Wiley, New York).

You might also like