Professional Documents
Culture Documents
DEPARTMENT
REDGEMAN@UIDAHO.EDU
OF STATISTICS
OFFICE: +1-208-885-4410
DR. RICK EDGEMAN, PROFESSOR & CHAIR SIX SIGMA BLACK BELT
Although Gauss made valuable contributions to both theoretical and practical astronomy, his principal work was in mathematics and mathematical physics. In theory of numbers, he developed the important prime-number theorem (see E). He was the first to develop a nonEuclidean geometry (see GEOMETRY), but Gauss failed to publish these important findings because he wished to avoid publicity. In probability theory, he developed the important method of least squares and the fundamental laws of probability distribution, (see PROBABILITY; STATISTICS). The normal probability graph is still called the Gaussian curve. He made geodetic surveys, and applied mathematics to geodesy (see GEOPHYSICS). With the German physicist Wilhelm Eduard Weber, Gauss did extensive research on magnetism. His applications of mathematics to both magnetism and electricity are among his most important works; the unit of intensity of magnetic fields is today called the gauss. He also carried out research in optics, particularly in systems of lenses. Scarcely a branch of mathematics or mathematical physics was untouched by Gauss.
The linear portion of the terminology refers to the response variable being expressed as a linear combination of the driver variables.
Model Estimation
Ordinarily the regression coefficients (the s) are of unknown value and must be estimated from sample information. The estimate of a given coefficient, i, is often symbolized by i. Although there are well-established statistical/ mathematical methods for determining estimates by these methods, they are generally tedious and are well-suited to performance by a computer. The resulting estimated model is:
^
^
Interval Estimation
Estimates will vary from sample to sample and it is useful to have estimates of the standard deviations of these estimates, S^ . These i estimated standard deviations tend to be included in the regression output of most statistical packages and can be used in the formation of confidence intervals for the true value of i, that is:
^ i +/- t/2,n-(p+1)Si ^
Where t/2,n-(p+1) is the value of Students t distribution with n-(p+1) degrees of freedom that places a proportion /2 in the upper tail of the t-distribution.
Where F,p,n-(p+1) is the value of F with p numerator df and n-(p+1) denominator df that places in the upper tail of the distribution.
In the anova table we have the following: SSR = Sum of Squares due to Regression SSE = Sum of Squares due to Error or Residual SST = Sum of Squares Total MSR = SSR/p = Mean Square Regression MSE = SSE/[n-(p+1)] = Mean Square Error or Residual
That is: SST = SSR + SSE So that R2 = SSR/SST represents the proportion of variation in Y that is explained by the behavior of the driver variables. R2 is the coefficient of determination.
(ei - ei-1)2/SSE
Which may range in value from 0 to 4. Tables of lower and upper critical values of D, denoted by dl and du, respectively, are widely available for significance levels of = .01 and = .05. The corresponding autocorrelation coefficient which may range from -1 to +1 is given by: ra = 1 - (D/2)
Day
1
2 3 4 5 6 7 8 9 10 11 12 13
X
31
91 13 69 70 64 38 50 94 82 15 42 27
Y
-2
9 -4 15 12 6 7 4 23 24 -2 -4 -6
Day
14
15 16 17 18 19 20 21 22 23 24 25 26
X
66
73 45 21 7 69 38 2 99 36 82 58 20
Y
14
10 -2 -12 -1 11 5 -17 18 8 9 21 -5
Day
27
28 29 30 31 32 33 34 35 36 37 38 39
X
88
97 93 72 6 55 15 10 21 88 55 27 66
Y
19
12 19 10 11 23 12 16 20 43 23 16 32
Day
40
41 42 43 44 45 46 47 48 49 50
X = 2,448 161,128
X
3
35 40 64 43 30 73 46 82 35 2
X2 =
Y
13
25 15 32 17 28 33 19 32 22 6
X = Order Volume
Descriptive Statistics: Volume, Res.Time Variable Volume Res.Time N 50 50 Mean 48.96 12.78 StDev 29.02 12.42 Variance 842.32 154.38 Sum 2448.00 639.00 Sum of Squares 161128.00 15731.00
Note:
SX2 = 842.32 = (X2 n(X2bar))/(n-1) = (161,128 50(48.962))/(49) and SX = 29.02 SY2 = 154.38 = (Y2 n(X2bar))/(n-1) = (15,731 50(12.782))/(49) and SY = 12.42 SXY = 181.42 = (XY n(Xbar)(Ybar)/(n-1) = (40,175 50(48.96)(12.78))/(49)
Correlations: Volume, Res.Time Pearson correlation of Volume and Res.Time = 0.503 P-Value = 0.000
Volume Res.Time
Second, get the correlation coefficient (rXY) and the coefficient of determination (r2).
These are: rXY = SXY/(SXSY) = 181.42/ (29.02*12.42) = .503 and
1 0
r2 = (.503)2 = .253 ^ ^ Next, get the estimates of the Slope ( ), Intercept ( ), and Regression Equation^ = + X Y ^ ^
0
1
These are:
^ ^
^ ^ ^
Y = 0 + 1X = 2.24 + 0.215X BE ABLE TO USE THIS.
NOTE THAT:
R-Sq = 25.3%
R-Sq(adj) = 23.8%
Analysis of Variance
Source Regression Residual Error Total DF 1 48 49 SS 1914.6 5650.0 7564.6 MS 1914.6 117.7 F 16.27
Unusual Observations
Obs 36 Volume 88.0 Res.Time 43.00 Fit 21.19 SE Fit 2.59 Residual 21.81 St Resid 2.07R
Percent
50 10 1 -2 -1 0 1 Standardized Residual 2
Frequency
10
15 20 25 30 35 Observation Order
40
45
50
Y|X=X*
^ = ^ + ^ X* = Y 0 1
2 for
OF STATISTICS
OFFICE: +1-208-885-4410
DR. RICK EDGEMAN, PROFESSOR & CHAIR SIX SIGMA BLACK BELT