Professional Documents
Culture Documents
Statistics
Review of Statistical Concepts
Mathematical science pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It produces quantities calculated from a random sample taken from a population of interest
Probability
Descriptive Statistics
Probability is the measure of how likely an event will occur or has occurred. An event is one or more outcomes of an experiment .An outcome is the result of a single trial of an experiment The probability of event A is P(A) ; 0 P(A) 1
Statistics Probability
Numerical and Graphical Examples
Probability represents uncertainty while statistics describe the sample data or imply information from the sample data
Industrial Engineering
Population
The entire set of potential observations (items, people, etc) about whose properties we would like to learn
Descriptive Statistics
Sample
The set of observational units (items, people, etc) whose properties our study is to observe. When we select a sample by scientific randomization, we are more easily able to generalize our conclusions to the population of interest. For a given characteristic, the collection of measurements that are actually observed
For practical reasons sample is used to represent the population. Data is collected for the sample members in an observational or experimental setting. This data can then be subjected to statistical analysis.
When we measure something in a population it is called a parameter. When we measure something in a sample it is called a statistic
2
Industrial Engineering
Descriptive Statistics
Summarize the population data by describing what was observed in the sample numerically or graphically. Numerical descriptors include mean and standard deviation for continuous data types , while frequency and percentage are more useful in terms of describing categorical data .
Descriptive Statistics
Inferential Statistics
Uses patterns in the sample data to draw inferences about the population. These inferences may take the form of: answering yes/no questions about the data (hypothesis testing) estimating numerical characteristics of the data (estimation), describing associations within the data (correlation), modeling relationships within the data (regression), extrapolation, interpolation, or other modeling techniques like ANOVA, time series, and data mining.
Industrial Engineering
Location The location is the expected value of the output being measured. For a stable process, this is the value around which the process has stabilized.
Descriptive Statistics
Spread The spread is the expected amount of variation associated with the output. This tells us the range of possible values that we would expect to see.
Shape
The shape shows how the variation is distributed about the location. This tells us if our variation is symmetric about the mean or if it is skewed.
Industrial Engineering
Numerical Mean Median Mode Standard Deviation variance range skewness kurtosis
Graphical scatter plot boxplot histogram boxplot histogram Box plot histogram probability plot
Descriptive Statistics
Shape
Industrial Engineering
Describing Variation
Review of Statistical Concepts
Variation in output comes from several sources such as materials, machines, methods, measurements, environment and people
Variation can be measured and described numerically or graphically Variation means the data is distributed
Descriptive Statistics
Median, The median is the physical center of the distribution. It is the value in the middle when the values of the distribution are arranged sequentially. Mode The mode is the most frequent value in the distribution. It is simply the value that appears most often
6
Industrial Engineering
Descriptive Statistics
Descriptive Statistics
Standard Deviation and Variance - both measure how far on average scores deviate or differ from the mean. The standard deviation is the average deviation about the mean
Industrial Engineering
Descriptive Statistics
Box Plot
A graphical display that provides important quantitative information about a data set. Some of this information is location or central tendency; Spread or variability; Departure from symmetry; and Identification of outliers
Descriptive Statistics
Histogram
A graphical display of a grouped frequency distribution as a way to abbreviate the values we are dealing with in a distribution.
Industrial Engineering
Minitab Example..
Descriptive Statistics