Professional Documents
Culture Documents
Description
This module reviews commonly used descriptive statistics and how to calculate those statistics.
Learning Objectives
1. 2. 3. 4. 5. 6. Describe the statistical procedures that should be used to describe categorical data. Define and calculate the following measures used to describe categorical data: counts and percentages. Describe the procedures that can be used to summarize continuous data. Define and calculate the following measures of central location: mean, median, and mode. Explain the uses of the midrange and geometric mean. Define and calculate the following measures of "spread": standard deviation, percentiles, quartiles, interquartile range, and range.
Key Concepts
In this module we will examine the important first step of data analysis description of the data. Last week we learned that the first step of any statistical analysis is to enter data into a paper or electronic "line listing." Once the data are entered, they must be checked, corrected, rechecked, and then carefully saved. Both the integrity and the privacy of the data must be protected at all times. Data should not be stored in non-secure places such as your email or on an unprotected mobile device or computer. The security of data has become particularly problematic with automated back up systems and mobile devices. Thus, you must be particularly careful about where you store data. Data should always be password protected. And, you must be extremely cautious about sending the data or sharing the data with others. Once the data are entered, checked, corrected, and rechecked, and you have ensured that the data are securely stored and protected, you are ready to begin the descriptive analysis!
The process of describing the data is essential. Data description is always the first step of any data analysis. The importance of performing the descriptive statistics can be compared to the importance of assessing your patient before performing any interventions or testing. The same is true of data. You must carefully perform the right assessments before proceeding with testing. What you learn in in the descriptive analysis directs all subsequent assessments and testing. So, the importance of this step cannot be overemphasized!
Before any data analysis can be performed, you must determine the level of measurement for each variable. This is important because the level of measurement determines the type of descriptive statistics that can be used to describe the data.
Continuous data are measured at the interval or ratio level. There are many descriptive statistics that can be used with continuous data.
The two important descriptive statistics that should be used with all continuous data are to describe the location of the "center" of the data and to describe the "spread" or dispersion of the data around that center. So, remember this always describe the center and spread. Describing the Center of the Data. The measures of the center of the data are known as the measures of "central location" or "central tendency." The measures most commonly used are the mean, median or mode. Watch this brief video on how to calculate the mean median, and mode: Calculation of the Mean Median and Mode. You might also enjoy this charming and memorable little song that explains the concepts very simply in less than 2 minutes! Mean, Median, and Mode Song. There are two other measures of central location that are less commonly used: the geometric mean and the midrange. The geometric mean is used for data that are measured on a logarithmic scale. Some laboratory data are measured on this scale for example, titers. The midrange is almost never used! The midrange is the point exactly halfway between the smallest and largest measurement. Describing the Spread of the Data. The measures of "spread" or "dispersion" most commonly used are the standard deviation, range, percentiles, quartiles, and the interquartile range. Watch these two videos on how to calculate the standard deviation: Standard DeviationExample 1 and Standard Deviation - Example 2. The two following videos describe how to calculate the range, quartiles, and interquartile rage: Range, Quartiles, and Interquartile Range: Example 1 and Range, Quartiles, and Interquartile Range: Example 2. This web site demonstrates the simplest method of obtaining the interquartile range (IQR): Simple Calculation of the IQR. There is another measure of spread that is less commonly used: the Coefficient of Variation (CV). It describes the amount of spread in relationship to the mean and is calculated as a ratio of the standard deviation to the mean. The CV is commonly used to describe the variability laboratory data and is occasionally used for other measures as well. If you see this in a publication remember that it is a measure of spread and, smaller values are considered good.
When describing categorical data, the standard practice is always to report both a count (frequency) and percentage. When describing continuous data, the standard practice is always to report both a measure of central tendency and a measure of spread.
Nurses use descriptive statistics all the time to describe the events in their unit or clinic. For categorical variables, you will always use frequencies (counts) and percentages. You have been using counts since elementary school! You know how to perform a count! Likewise, you know how to perform a percentage! We use counts and percentages almost daily! How many patents, what percentage developed infections, and so forth. For continuous variables, you must know how the measures of central location and the measures of spread. You will use these all of the time! Next week will learn how to determine which measure of central location and which measure of spread should be used.
Required Reading
Kim, M., & Mallory, C. (2014). Statistics for evidence-based practice in nursing. Burlington, MA: Jones & Bartlett. Read Chapter 6 U. S. Department of Health and Human Services, Centers for Disease Control and Prevention. (2006). Principles of epidemiology in public health practice (3rd ed.). Atlanta, GA: Author. You can access the required readings as follows:
The lessons in the book can be accessed individually at this web site: CDC Epidemiology Book web version. Read Lesson 2, Section 2-8. The book is available as an electronic book in pdf format at the following link: CDC Epidemiology Book - pdf version. Read Lesson 2, pp. 7-58.
Homework Assignment
1. 2. 3.
4. 5.
Please complete Homework Assignment #3. The link to the assignment is located below. This assignment is worth 5% of your course grade. Please be sure to follow the instructions listed below. You must complete the assignment by yourself. You may NOT seek assistance from another person. You may use any of the course resources, your books, and the internet to help you complete the assignment. For the questions that require a short written answer, please respond using sentences that are written using proper grammar, spelling and punctuation. The quality of the writing will be graded as well as the accuracy of the answer. Proper grammar, spelling, and punctuation are worth 10% of the score on these questions. There is no time limit on the assignment. You can enter the assignment multiple times until the deadline for completing the assignment (Sunday night at 11 pm Central Time).