Professional Documents
Culture Documents
Univariate Data
Central tendency
Mean
Median
Mode
Trimean
The trimean is almost as resistant to extreme scores as the median and is less subject to
sampling fluctuations than the arithmetic mean in extremely skewed distributions.
It is less efficient than the mean for normal distributions. .
The trimean is a good measure of central tendency and is probably not used as much as it
should be.
Trimmed Mean
Spread
Range
Largest-Smallest
Sensitive to extremes
Q3-Q1/2
Good for skewed distributions
Not better than SD for normal distributions
In a normal distribution, about 68% of the scores are within one standard deviation of the mean
and about 95% of the scores are within two standard deviations of the mean.
Supplement with semi inter quartile range when possibility of extremes.
Skew
Graphical Representations
Frequency Polygons& Cumulative Frequency Polygons
Histograms & Bar Graphs
Interval vs no of scores
Shape varies and depends on size of interval
Bar graphs use space between bars and are used for qualitative variables
Pearsons Correlation
Spearmans Rho
Probability
If A & B are not Independednt,
P(A & B) = p(A) * P(B|A)
Binomial Distribution
Assumptions:
Events are:
1.
2.
3.
4.
Dichotomous
Mutually exclusive
Independent
Randomly selected
Subjective Probability
Normal Distribution
A family of distributions that have the same general shape (Bell).
Defined by mean and standard deviation.
Called z-distribution
Z is normal only if X is normal.
Can convert between scores (z) and percentile ranks (Area under the curve).
Sampling Distribution
Example : Avgs of 12,13,14 yr lds to infer about avg of a bunch of teens from all age groups
When abs value of correlation s low (<0.4) then sampling distribution is approximately normal.
For higher values, negative skew.
Fishers z' = .5[ln(1+r) - ln(1-r)]
Fishers transformation is normally distribute with
Convert both to zs
Probability that a sample value is exactly equal to a specific value is 0 (continuous distribution)
So to calculate value at 10, calculate area under curve between 9.5 and 10.5
Point Estimation
Characteristics of Estimators
Unbiasedness
consistency
Relative efficiency
Estimating Variance
= (X-)/N
s = (X-M)/(N-1)
Confidence Intervals
CI for Mean, SD Known
Values needed
o Sample Mean
o Z-value depends on level of confidence
o Standard Error
CI is M - z M M + z M
95% z-value = 1.96
99% z-value = 2.58
is used as an estimate of SD
Use t-value instead of z-value
M - t sM M + t sM
M is the sample mean, sM is an estimate of M, and t depends on the degrees of freedom and
the level of confidence
Genera Formula
Compute Md = M1 - M2
Compute
Find z (1.96 for 95% interval; 2.58 for 99% interval)
Lower limit = Md - z
Upper limit = Md + z
Compute Md = M1 - M2
Compute SSE1 = (X - M1 ) for Group 1 and SSE2 = (X - M2 ) for Group 2
Compute SSE = SSE1 + SSE2
Compute df = N - 2 where N = n1 + n2
Compute MSE = SSE/df
Find t-value
Lower limit = Md - t
Upper limit = Md + t
Compute
Compute df = k(n-1) where k is the number of groups and n is the number of subjects in each
group.
Find t for the df and level of confidence
Lower limit = L - t sL
Upper limit = L + t SL
CI on Pearsons Correlation
Use procedure the r to z' table to convert the lower and upper limits from z' to r
5. Compute:
6. Lower limit = z'1 - z'2 - (z)(
8. Use an r to z' table to convert the lower and upper limits to r's.
Hypothesis Testing
Null Hypothesis