Professional Documents
Culture Documents
to Thunderstorm Prediction
Waylon Collins
NOAA/National Weather Service
Philippe Tissot
Texas A&M University Corpus Christi
Presentation Outline
I. Artificial Intelligence (AI) & Artificial Neural Networks (ANN)
II. Project #1: Thunderstorm Artificial Neural Network (TANN)
(a) Motivation
(b) Design/Framework
(c) Model Development/Optimization Strategy
(d) Most recent Results/Conclusions
(e) Operational TANN
III. Project #2: Improving TANN in connection with a NASA-funded
project Improved Convective Initiation Forecasting in the Gulf of
Mexico Region
5.
6.
7.
8.
9.
10.
Sensory perception
Pattern Recognition
Learning and knowledge acquisition
Inference from incomplete information
Ability to deal with unfamiliar situations
Adaptability to new, yet related situations (through expectational knowledge)
Inductive Reasoning
Common Sense
Display of Emotions
Inventiveness
de Silva, C. W., 2000: Intelligent Machines: Myths and Realities. CRC Press LLC
Rumelhart, D. E., J. L. McClelland, and the PDP Research Group, 1986: Parallel Distributed Processing:
Explorations in the Microstructure of Cognition. Volume 1: Foundations. MIT Press: Cambridge, MA.
X=1
Beal R. and T. Jackson, 1990: Neural Computing: An Introduction, Institute of Physics Publishing.
=1
cat speed-of-serial.ksh
#!/bin/ksh
# Initialize x
x=0
N=1000000
# Compute the sum of x after N iterations
time for ((i = 1; i <= $N; i += 1))
do
x=$((i**4))
done
# The answer
echo The sum of the 4th power of integers
1 through $N is $((x+1))
echo $?
./speed-of-serial.ksh
real
user
sys
The sum
through
0
0m6.07s
0m6.04s
0m0.02s
of the 4th power of integers 1
1000000 is 9.99999999999999983e+23
net
s
Processing within cell body
Source http://www.mathworks.com/help/pdf_doc/nnet/nnet_ug.pdf
de Silva, C. W., 2000: Intelligent Machines: Myths and Realities. CRC Press LLC
Image adapted from: Hagan, M. T., H. B. Demuth, and M. Beale, 1996: Neural Network Design,
International Thomson Publishing Inc.
URL http://www.mathworks.com/help/toolbox/nnet/
URL http://www.mathworks.com/help/toolbox/nnet/
URL http://www.mathworks.com/help/toolbox/nnet/
Error-Correction Learning
1
2
Memory-based Learning
Hebbian Learning
Competitive Learning
Boltzmann Learning
Haykin, S., 1999: Neural Networks: A Comprehensive Foundation, 2nd Edition, Prentice Hall.
Feed-back
Feed-forward
Linear
Nonlinear
Supervised
Hopfield Model
Self-organizing
Boltzmann
Machine
Feature Maps
Unsupervised
Karayiannis, N. B., 1993: Artificial Neural Networks: Learning Algorithms, Performance Evaluation, and
Applications. Kluwer Academic Publishers, Boston
ART
Continuous
Valued Input
Binary Input
Supervised
Hopfield Net
Hamming Net
Unsupervised
Carpenter/
Grossberg
Classifier
Supervised
Perceptron
Multi-Layer
Perceptron
Unsupervised
Kohonen
Self-Organizing
Map
Lippmann, R. 1987: An Introduction to Computing with Neural Nets, IEEE ASSP Magazine. pp 4-22.
7 :
(1)
=
=
(9)
(2)
= (3)
=
=
(4)
= (5)
= (6)
= (7)
7 :
= (8)
= (10)
1
2
= 1 (11)
= ()
12 ,
Beal R. and T. Jackson, 1990: Neural Computing: An Introduction, Institute of Physics Publishing.
(13)
= (14)
= (15)
(16)
= (17)
18 , (9)
()
19 ,
Beal R. and T. Jackson, 1990: Neural Computing: An Introduction, Institute of Physics Publishing.
j
B
Bishop, C. M, 1995: Neural Networks for Pattern Recognition, Oxford University Press.
Hagan, M. T., H. B. Demuth, and M. Beale, 1996: Neural Network Design, International Thomson Publishing Inc.
Compute
derivatives of
training error
w.r.t. ANN
weights via
EBP
Evaluate
training
error
Perform feedforward
computation
for all cases
in training set
No
Assign random
initial values for
all the weight
parameters
Perform
feedforward
computation for
all samples in
validation set
Evaluate
validation error
Yes
Select a neural
network structure
START
Desired
accuracy
achieved?
Evaluate test
error as an
independent
quality measure
for ANN model
quality
Perform
feedforward
computation for
all samples in
testing set
STOP
Training
Zhang, Q.J. K.C. Gupta, and V.K. Devabhaktuni, 2003: Artificial neural networks for RF and
microwave design: from theory to practice, IEEE Trans. Microwave Theory Tech., 51, pp. 1339-1350
2.
3.
4.
5.
6.
7.
8.
Zhang, Q.J. K.C. Gupta, and V.K. Devabhaktuni, 2003: Artificial neural networks for RF and microwave
design: from theory to practice, IEEE Trans. Microwave Theory Tech., 51, pp. 1339-1350
Haykin, S., 1999: Neural Networks: A Comprehensive Foundation, 2nd Edition, Prentice Hall.
Hagan, M. T., H. B. Demuth, and M. Beale, 1996: Neural Network Design, International Thomson Publishing
Inc.
Training Set
Training Set
Training Set
Training Set
Training Set
Training Set
Training Set
Training Set
Training Set
Testing Set
Yang, C.C., S.O. Prasher, J.A. Landry, H.S. Ramaswamy, and A. Ditommaso, 2000: Application of artificial
neural networks in image recognition of crop and weeds, Canadian Agricultural Engineering Vol. 42, 147152
Corn
1-A
1-B
Weeds
Yang, C.C., S.O. Prasher, J.A. Landry, H.S. Ramaswamy, and A. Ditommaso, 2000: Application of artificial
neural networks in image recognition of crop and weeds, Canadian Agricultural Engineering Vol. 42, 147-152
Yang, C.C., S.O. Prasher, J.A. Landry, H.S. Ramaswamy, and A. Ditommaso, 2000: Application of
artificial neural networks in image recognition of crop and weeds, Canadian Agricultural Engineering
Vol. 42, 147-152
1.
2.
3.
4.
5.
Motivation
Design/Framework
Model Development/Optimization Strategy
Most recent Results/Conclusions
Operational TANN
= 1+ez ; z = +
Model
K
j=1 j x j
; if
{ <,,
Wilks, D. S., 2006: Statistical Methods in the Atmospheric Sciences. 2nd Edition, Elsevier, Oxford.
Wilks, D. S., 2006: Statistical Methods in the Atmospheric Sciences. 2nd Edition, Elsevier, Oxford.
Predictors acquired from NWP predictions yet too coarse to account for processes that
trigger individual storms
McCann 1992, Schmeits 2005
High resolution (4-km) NWP models successful predicting thunderstorm
occurrence/mode, yet not location/timing.
Fowle and Roebber 2003, Weisman et. al. 2008
Higher resolution (1-km) NWP models may not provide additional skill owing to
extreme sensitivity of model output to initial condition, parameterization trigger
thresholds, predictability limits of NWP models
Elmore et. al. (2002)
Verification Domain
1o radius
McCann (1992)
Lee and Passner (1993)
1o grid
100-km radius
3-7
12
ANN
Expert System
6825 km2
6-12
Logistic Regression
7200 km2
5000 km2
729 km2
48
6
24
MOS
ANN
Adaptive Boosting
Current Study
400km2
12
ANN
Input latitude-longitude values: Inverse and Forward computer software (United States
National Geodetic Society) Output: GIS software run by Rick Smith (TAMUCC)
Box 238 (smaller box region); (+) center and four corners of the box. Locations of
NAM grid points (12-km) (); values of NAM parameters/ variables based on bilinear interpolation at box center. () MPE grid points (4-km). Airplane symbol
depicts location of VCT METAR; circle depicts 9.26 km (5.00 nm) radius relative
to VCT. Figure credits: Sergei Reid, Julien Clifford (TAMUCC)
Description (Units)
PWAT
MR850
RH850
CAPE
CIN
LI
Uxxx,Vxxx
VVxxx
Dropoff Proxy
LCL
LCL_T
CP
VShearS8
VShear86
Description (Units)
Uxxx,Vxxx
HIlow
CTP Proxy
VShearS7
Description (Units)
JD
Description (Units)
SMC_MAXGRAD
SMC_SD
SMC_MEAN
SMC_MAX
GMORANI
NDRY
Number of dry days [MPE=0] since the previous rain day (days)
Cheng, W. Y. Y., and W. R. Cotton, 2004: Sensitivity of a Cloud-Resolving Simulation of the Genesis of
a Mesoscale Convective System to Horizontal Heterogeneities in Soil Moisture Initialization. Journal of
Hydrometeorology, 5, pages 934-958.
Comparison of mean soil moisture content (SMC) from the API model (solid
black line) to that from the Land Surface Microwave Emission Model
(LSMEM; gray) based on TRMM (Tropical Rainfall Measuring Mission)
Microwave Imager 10.65 GHz (X-band) radiometer (Gao et. al, 2006), for Box 238.
Fischer, M. M., and A. Getis (eds.), 2010: Handbook of Applied Spatial Analysis: Software Tools,
Methods and Applications, Springer-Verlag Berlin Heidelberg, DOI 10.1007/978-3-642-036477_14,
ij
I= -0.393
I=0.000
I=+0.393
I=+0.857
Longley, P. A., M. F. Goodchild, D.J. Maguire, and D. W. Rhind, 2005: Geographic Information
Systems and Science, 2nd Edition, John Wiley & Sons, Ltd., 517pp.
Source: Avissar, R., and Y. Liu. 1996. Three-dimensional numerical study of shallow convective clouds and
precipitation induced by land surface forcing. J. Geophys. Res. 101, 7499-7518.
Parameter (Units)
SMC Day 2125/3287
GMORANI
0.24
2
1
)
SMC_MAXGRAD
1.66
SMC_MEANGRAD 2 1)
0.35
2
SMC_SD ( )
1.07
2
SMC_MEAN ( )
6.04
2
SMC_MAX ( )
9.39
NDRY
1.00
0.9
0.9
0.8
0.8
0.7
0.7
POD
AUC =0.89595
0.6
0.5
0.4
POD
0.5
0.4
0.3
0.2
0.2
0.1
0.1
0.1
0.2
0.3
0.4
0.5
0.6
False Alarm Rate
0.7
0.8
0.9
AUC =0.85828
0.6
0.3
Threshold selection,
preferably: POD
>75% & F<25%
0.1
0.2
0.3
0.4
0.5
0.6
False Alarm Rate
0.7
0.8
0.9
Symbol
Equation
POD
a/(a+c)
b/(b+d)
FAR
b/(a+b)
CSI
a/(a+b+c)
PSS
(ad-bc)/[(b+d)(a+b)]
HSS
2(ad-bc)/[(a+c)(c+d)+(a+b)(b+d)]
ORSS
(ad-dc)/(ad+bc)
CSS
(ad-bc)/[(a+b)(c+d)]
GSS
(a-ar)/(a+b+c-ar); ar=[(a+b)(a+c)]/n
POD
0.54
FAR
0.79
F
0.08
PSS
0.46
CSI
0.18
HSS
0.26
YuleQ
0.86
CSS
0.19
GSS
0.15
0.20
0.19
0.21
0.13
0.11
0.14
9 hour Predictions/Forecasts
TANN
TANN-MOS
NDFD
0.92
0.94
0.96
0.79
0.80
0.78
0.36
0.37
0.34
0.57
0.54
0.62
0.21
0.20
0.22
0.22
0.20
0.25
0.90
0.90
0.96
0.38
0.83
0.06
0.32
0.14
0.20
0.80
0.15
0.11
0.11
0.10
0.10
0.07
0.06
0.07
12 hour Predictions/Forecasts
TANN
TANN-MOS
NDFD
0.96
0.88
0.73
0.88
0.89
0.88
0.40
0.38
0.27
0.54
0.48
0.46
0.11
0.11
0.12
0.12
0.12
0.14
0.92
0.83
0.76
Box 103
NASA Research Opportunities in Space and Earth Sciences (ROSES) funded project
Principal Investigators:
John Mecikalski, University of Alabama in Huntsville (UAH)
John Williams, National Center for Atmospheric Research (NCAR)
Philippe Tissot, Texas A&M University-Corpus Christi (TAMUCC)
Others: David Ahijevych (NCAR), Nathan Bledsoe (UAH), Waylon Collins (NOAA),
Chris Jewett (UAH), Wayne MacKenzie (NOAA), Whitney Rutledge (TAMUCC),
Mathew Saari (UAH), John Walker (UAH).
Acquire satellite data (GOES visible, 3.9, 6.5, 10.7 and 13.3 m)
Mesoscale Atmospheric Motion Vectors (MAMV) derivation (GOES)
Convective Cloud Mask (CCM) generation (pre-convective/immature clouds)
Cloud Object Tracking (OT) (via MAMVs and CCM)
Interest Field (IF) (Spectral & Temporal differencing tests) calculations
CI prediction ( 2h) determination (threshold: 5/6 IF critical thresholds met)
CI Interest Field
10.7 m BT
10.7 m BT time trend
6.5-10.7 m BT difference
13.3-10.7 m BT difference
6.5-10.7 m BT time trend
13.3-10.7 m BT time trend
Critical Value
0
4
15
35 10
25 5
3
>
15
3
>
15
Breiman, L., 2001: Random Forests, Machine Learning, 45, pp. 5-32.
Algorithm
Consider data set [X,Y] X=Input (features) Y=Target
Draw Z bootstrap samples from a random subset of original data set
For each bootstrap sample q generate a decision tree
For each node in the decision tree, choose random subset m of M features
At each node, determine the optimal split of each variable based on subset m
The foregoing results in a decision tree based on bootstrap sample q
Repeat for every bootstrap sample, thus creating a forest of decision trees
To create the classifier, apply input vector X down the Q trees in the forest
For each input case, each tree votes for the class
We classify each input instance into the class with the most votes over all trees Q
Breiman, L., 2001: Random Forests, Machine Learning, 45, pp. 5-32.
Breiman, L., 2001: Random Forests, Machine Learning, 45, pp. 5-32.