Professional Documents
Culture Documents
1029/2001WR001125, 2002
[1]
Stochastic processes; 1833 Hydrology: Hydroclimatology; 1821 Hydrology: Floods; KEYWORDS: rainfall
extremes, regional estimation, partial duration series, generalized least squares regression, L moments
Citation: Madsen, H., P. S. Mikkelsen, D. Rosbjerg, and P. Harremoes, Regional estimation of rainfall intensity-duration-frequency
curves using generalized least squares regression of partial duration series statistics, Water Resour. Res., 38(11), 1239,
doi:10.1029/2001WR001125, 2002.
1. Introduction
[2] The rainfall intensity-duration-frequency (IDF) relationship is widely applied in hydraulic and hydrological
engineering for design of structures that control storm
runoff and flooding. For a site where rainfall measurements
are available frequency analysis can be performed for
development of the IDF relationship. For design purposes
national maps of IDF relationships have been constructed,
e.g., in the United States [Hershfield, 1961], UK and
Ireland [Natural Environment Research Council (NERC ),
1975], and Denmark [Danish Water Pollution Control
Committee (DWPCC ), 1974]. These maps are based on
pooling information from regional stations and simple
interpolation between the sites.
[3] The construction of rainfall IDF maps in the traditional manner has two major deficiencies. First, if the
length of the at-site rainfall record is small as compared to
the design return period, the estimated IDF relationship is
very uncertain. Secondly, since the spatial variability of
extreme rainfall characteristics may be large even within
Copyright 2002 by the American Geophysical Union.
0043-1397/02/2001WR001125
21 - 1
21 - 2
2. Modeling Framework
[9] The rainfall intensity for a given duration is described
by a stochastic variable Z with observations {zi, i = 1, 2, . . ., m}
where m is the total number of rain events in the historical
time series. The extreme value model is based on the PDS
method where the population includes all events above a
threshold level. Basically, two different methods are available for choosing the threshold level. In type I sampling, a
threshold level z0 is explicitly defined and the events {zi >
z0, i = 1, 2, . . ., n} are included in the PDS. In type II
sampling, the n largest events are included in the PDS
{z(1) z(2) . . . z(n)}.
[10] In the present study type I sampling is applied,
implying that the average annual number of threshold
exceedances l becomes a regional variable. In the case of
type II sampling using a constant l in the region to define
the PDS, the threshold level becomes a regional variable.
An example of this approach is given by Mikkelsen et al.
[1995, 1996]. The regional modeling procedure described in
the following can be applied also in the case of type II
sampling, simply by considering z0 as a regional variable
instead of l. Furthermore, if neither z0 nor l are considered
constant in the region, the regional modeling procedure is
easily extended by including an additional parameter to be
regionalized.
[11] The exceedances in the PDS are described by a
stochastic variable X = Z z0, Z > z0 with observations {xi,
i = 1, 2, . . ., n} where n is the PDS sample size. It is generally
assumed that the occurrence of exceedances can be described
by a Poisson process with constant or one-year periodic
intensity, implying that the number of exceedances is Poisson-distributed with intensity l. In the basic PDS model
the one-parameter exponential distribution was applied for
modeling the exceedance magnitudes [Shane and Lynn,
1964; Todorovic and Zelenhasic, 1970]. Several alternative
two-parameter distributions have been proposed, including
the generalized Pareto [e.g., Hosking and Wallis, 1987;
Rosbjerg et al., 1992a; Madsen and Rosbjerg, 1997a,
1997b; Madsen et al., 1997a, 1997b], gamma [Zelenhasic,
1970], Weibull [Ekanayake and Cruise, 1993], and lognormal [Rosbjerg et al., 1991] distributions.
[12] In a PDS context the T-year event is usually defined
as the (1-1/(lT ))-quantile in the distribution of the exceedances [e.g., Rosbjerg, 1985]. Denote by F(x; A) the cumulative distribution for the exceedance magnitudes, the T-year
event can be written
1
zT z0 F1 1
;A
lT
21 - 3
s; m R s; t R s
zRT s z0 g l
2
p
X
bk Aik ei di
kl
q i b0
@g 2
R sg
Varfl
@l
2
2
@g
@g
VarfmR sg
VarftR2 sg 4
@m
@t2
where
partial derivatives
are evaluated at (l, m, t,) =
R the
s; m R s; t R s . In (4) it has been utilized that the
l
2
mutual dependence between the regional parameter estimates can be neglected [Madsen and Rosbjerg, 1997a].
[14] For estimation of the regional IDF relationships using
(3) (4) the modeling framework presented herein includes
the following elements: (1) evaluation of regional homogeneity for each of the three PDS parameters, (2) for the
parameters that show a significant regional variability, evaluation of the potential of describing the variability from
relevant physiographic and climatic characteristics, and (3)
determination of a regional distribution for the exceedances.
In the following sections the different modeling elements are
described.
Covfei ; ej g
8 2
< sei ;
ij
i 6 j
Covfdi ; dj g
8
< s2d ;
ij
i 6 j
0;
where
T
q 1 ; q 2 ; . . . ; q M
T
B 0b0 ; b1 ; . . . ; bp
1
1 A11 A1p
C
B
C
B
B
:
: C
AB:
C
A
@
1 AM1 AMp
ij Covfei di ; ej dj g
8 2
< sei s2d ;
ij
i 6 j
q R s AsT B;
Varfq R sg AsT
X
B
As s 2d
10
21 - 4
M
1 X
l i
M i1
11
s
^2ei
c
;
ni
M
1 X
s
^2
M i1 i
12
nV 2
ni
13
21 - 5
Figure 1. Empirical probability distributions of the 10-min rain intensity for the 41 stations.
direct use of the sample estimates may result in an error
covariance matrix that can not be inverted, and hence
provides an ill-posed solution of (7) [Tasker and Stedinger,
1989]. To overcome this problem the correlation coefficients are smoothed by relating the sample estimates to the
distance between stations. In this case an exponential
correlation function is used
h
rij j
dij
wdij 1
exp
dij
1nj
wdij 1
14
4. L Moment Analysis
[25] For determination of a regional parent distribution L
moment analysis is applied. The goodness of fit measure
proposed by Hosking and Wallis [1993] was formulated for
discriminating between various three-parameter distributions. For specific use in PDS modeling where the candidate
distributions are the one-parameter exponential distribution
and different two-parameter distributions, the test statistic
has been reformulated. First, consider a two-parameter
distribution. Since the L skewness of a two-parameter
distribution is determined by the LCV estimate, the distance
between the regional estimate of the L skewness and the
theoretical L skewness for the considered distribution can
be used as a measure of the goodness of fit. To test the
significance of this distance, it is related to the sampling
uncertainty of the regional L skewness estimate. Thus the
goodness of fit measure can be formulated
G3
t3 tDIST
3
s3
15
t2 tEXP
2
s2
16
5. Application Example
5.1. Rainfall Data
[28] In 1979 a new system of high-resolution automatic
rain gauges was introduced in Denmark. The measuring
network covers a total area of 43,000 km2, and the distances
between gauging stations range between 1 and 300 km. All
21 - 6
Figure 2. Spatial structure of the intersite correlation due to concurrent exceedances: (top) 10-min
intensity and (bottom) 24-hour intensity.
stations are equipped with tipping bucket gauges with a
bucket size of 0.2 mm. The raw data that consist of the
number of tips per min are transformed into one-min
intensities for individual rain events. The preliminary separation of rain events is defined as periods exceeding one
hour without precipitation. At present, 90 stations have been
connected to the measurement system, and the longest
records consist of more than 18 years of data. In the regional
study, 41 stations with more than 10 years of data have been
included, corresponding to a total of about 650 station
years. Details of the measurement system and the quality
control of the data are given by Jrgensen et al. [1998].
[29] From the original data of one-min intensities, maximum intensities averaged over different durations were
extracted. Denote by i(t) the one-min rain intensity at time
t. The mean intensity at time t, y(t), with duration t is
defined as
R tt=2
yt
tt=2
ixdx
17
18
Units
mm
m
eastings, northings
km
degrees
21 - 7
Duration
Mean, years1
Residual Variance,
(years1)2
Regression Model
Based on MAP,
Residual Variance,
(years1)2
10 min
30 min
60 min
3h
6h
12 h
24 h
48 h
3.22
3.10
3.10
3.01
2.81
2.52
2.63
3.02
0.195
0.278
0.256
0.211
0.166
0.215
0.460
0.664
0.120
0.181
0.184
0.172
0.071
0.043
0.056
0.075
21 - 8
Figure 3. GLS estimate of the Poisson parameter and corresponding 95% confidence interval compared
with observed values: (top) 10-min intensity and (bottom) 24-hour intensity.
Table 3. Results of the GLS Regression Analysis for the Mean Exceedancea
Regional Mean Model
Subregion 2
Subregion 3
Duration
Mean,
mm/s
Residual
Variance,
(mm/s)2
Mean,
mm/s
Residual
Variance,
(mm/s)2
Mean,
mm/s
Residual
Variance,
(mm/s)2
Mean,
mm/s
Residual
Variance,
(mm/s)2
10 minb
30 min
60 min
3h
6h
12 h
24 h
48 h
3.33
1.61
0.948
0.436
0.260
0.166
9.42 102
5.71 102
0
0
0
1.13 103
3.37 104
3.31 104
9.64 105
3.21 105
3.33
1.61
0.948
0.517
0.340
0.234
0.131
7.56 102
0
0
0
3.33
1.61
0.948
0.432
0.257
0.162
0.131
7.56 102
0
0
0
0
0
0
1.76 104
6.44 104
3.33
1.61
0.948
0.432
0.257
0.162
9.40 102
5.81 102
0
0
0
0
0
0
0
0
a
b
2.41
6.83
1.04
1.76
6.44
103
104
104
104
104
Subregions: 1, western Copenhagen area; 2, eastern Copenhagen area; 3, the rest of the country.
One outlier station excluded from the analysis (large Cooks D statistic).
21 - 9
10 min
30 mina
60 mina
3h
6h
12 h
24 h
48 h
a
Mean
Residual Variance
0.516
0.545
0.536
0.521
0.542
0.536
0.543
0.528
0
0
0
0
0
0
0
7.0 104
One outlier station excluded from the analysis (large Cooks D statistic).
GAM
WEI
LN
GP
EXP
10 min
30 min
60 min
3h
6h
12 h
24 h
48 h
2.8
1.1a
1.2a
2.8
3.1
2.5
2.5
2.1
1.7a
0.5a
0.2a
1.6a
1.6a
1.2a
1.0a
0.7a
7.6
9.2
8.7
6.4
6.4
6.2
6.6
7.8
0.7a
2.6
1.9a
0.5a
0.6a
0.4a
0.9a
1.1a
2.7
6.0
4.7
3.2
6.2
4.5
5.6
4.9
Figure 4. L moment ratio estimates for the 10-min intensity compared to the theoretical relationships
for a number of candidate distributions.
21 - 10
Figure 5. IDF curve and corresponding 68% confidence interval for a location in the Copenhagen area
(region 1) with a mean annual rainfall of 600 mm. Return period T is given in years.
distributed parent, the regional T-year event estimate, cf. (3),
reads
2
R
1
41
zRT s z0 m R s R
k s
k R s
1
^ R sT
l
!k R s 3
5;
19
1
2
t R2 s
6. Conclusions
[44] A general framework for regional analysis and modeling of extreme rainfall characteristics has been developed.
The model is based on a PDS parameterization, which
includes the average annual number of exceedances, the
mean value of the exceedance magnitudes, and the LCV to
be assessed from regional data. A GLS regression model that
explicitly accounts for intersite correlation and sampling
uncertainties has been introduced for evaluating the regional
heterogenity of the PDS parameters. For the parameters that
show a significant regional variability, the GLS model is
subsequently applied for describing the variability from
physiographic and climatic characteristics. The resulting
References
Adamowski, K., Y. Alila, and P. J. Pilon, Regional rainfall distribution for
Canada, Atmos. Res., 42, 75 88, 1996.
Allerup, P., H. Madsen, and F. Vejen, A comprehensive model for correcting point precipitation, Nord. Hydrol., 28, 1 20, 1997.
Cong, S., Y. Li, J. L. Vogel, and J. C. Schaake, Identification of the underlying distribution form of precipitation by using regional data, Water
Resour. Res., 29(4), 1103 1111, 1993.
Danish Water Pollution Control Committee (DWPCC), Estimation of IDF
curves (in Danish), Publ. 16, Danish Soc. of Eng., Teknisk Forlag, Denmark, 1974.
Ekanayake, S. T., and J. F. Cruise, Comparisons of Weibull- and exponen-
21 - 11