This document discusses hypothesis testing and provides examples. It covers:
1) The key components of hypothesis testing including dependent and independent variables, hypotheses, and appropriate statistical tests.
2) Three examples of hypothesis tests with different study designs, variables, and appropriate statistical analyses. The examples assess the relationship between socioeconomic status and age of menarche, the effect of aspirin on temperature, and differences in protein intake among diet groups.
3) Guidance on selecting the correct statistical test based on the scale of variables and number of groups, including t-tests, ANOVA, correlation, regression, and non-parametric alternatives.
This document discusses hypothesis testing and provides examples. It covers:
1) The key components of hypothesis testing including dependent and independent variables, hypotheses, and appropriate statistical tests.
2) Three examples of hypothesis tests with different study designs, variables, and appropriate statistical analyses. The examples assess the relationship between socioeconomic status and age of menarche, the effect of aspirin on temperature, and differences in protein intake among diet groups.
3) Guidance on selecting the correct statistical test based on the scale of variables and number of groups, including t-tests, ANOVA, correlation, regression, and non-parametric alternatives.
This document discusses hypothesis testing and provides examples. It covers:
1) The key components of hypothesis testing including dependent and independent variables, hypotheses, and appropriate statistical tests.
2) Three examples of hypothesis tests with different study designs, variables, and appropriate statistical analyses. The examples assess the relationship between socioeconomic status and age of menarche, the effect of aspirin on temperature, and differences in protein intake among diet groups.
3) Guidance on selecting the correct statistical test based on the scale of variables and number of groups, including t-tests, ANOVA, correlation, regression, and non-parametric alternatives.
Department 2013 Introduction Variable Dependen : Independen : Group : Observation : Hypotheses (one/two-sided) H 0 : H 1 : Hypotheses Testing: Worksheet Hypothesis Testing Basic Principals Type of measurement Characteristics of variable Descriptive statistics Categorical Nominal Unorderedcategories Counts, proportions Ordinal 1 Ordered categories with intervals that are not quantifiable Counts, proportions, and median Numerical Continuous Infinite number of values Ranked spectrum with quantifiableintervals Counts, proportions, median, mean, standarddeviation Discrete 2 Limitedtointegers Process of Classification and Measurement Can increase the objectivity of our knowledge, reduce bias, and provide a means of communication. Clinical investigators prefer variables that contain more information and thereby provide greater power and/or smaller sample size. Continuous >Discrete >Ordinal >Nominal Overview Variable and scale of measurement Probability distribution Estimation Hypothesis Why is hypothesis testing so important? It provides an objectives framework for making decisions using probabilistic methods rather thanrelyingonsubjectiveimpressions People can formdifferent opinions bylooking at data, but a hypothesis test provides a uniform decision making criterion that is consistent for all people 7 Why is hypothesis testing so important? The research hypothesis is a specific version of the research question that summarizes the main elements of the study (sample, predictor and outcome variables The basis of the tests of statistical significance. A good hypothesis must be based on a good research question. It should also be simple, specific, and stated in advance. Case I Age at menarche (age of starting menstrual periods) is an important risk factor for breast cancer and possibly ovarian cancer. In general, women with earlier age at menarche have a higher incidence of breast cancer. The long- term trend in developed countries is that age at menarche has been declining over the past 50 years. One hypothesis is that women with higher childhood socioeconomic status (SES) have an earlier age at menarche. Questions for case I: What are the dependent and independent variables in the study? What is the appropriate scale of measurements for both variables? Is it paired or unpaired observation? What are the null hypothesis and alternative hypothesis? Is it one-sided or two-sided? What is the appropriate hypothesis testing (parametric/non parametric) that we can use to test the null hypothesis? Describe data analysis in descriptive statistic Answer Dependent : age at menarche (year, numerical) Independent childhood socioeconomic status (lower and higher, nominal) Unpaired observation, no matching procedure H 0 : age at menarche higher SES = age at menarche lower SES H 1 : age at menarche higher SES < age at menarche lower SES (one-sided) Test of distribution Only one group are normally distributed non parametric test Descriptive Statistic Variable Descriptive Statistic Age of menarche (median/IQR) 12.4 (2) SES (n/%) lower 30 (60) higher 20(40) In a pediatric clinic a study is carried out to see how effective aspirin is in reducing temperature. Twelve 5-year-old children suffering from influenza had their temperatures (F) taken immediately before andonehour after administrationof aspirin. Case II Questions for case II What are the dependent and independent variables in the study? What is the appropriate scale of measurements for both variables? Is it paired or unpaired observation? What are the null hypothesis and alternative hypothesis? Is it one-sided or two-sided? What is the appropriate parametric test that we can use to test the null hypothesis? Answer Dependent : body temperature (F, numerical) Independent : pem. aspirin Paired observation from the same person or subject, the dependent/outcome variables are measured twice H 0 : temperature before - temperature after =0 H 1 : temperature before - temperature after >0 (one- sided) Test of distribution Both group are normally distributed parametric test Descriptive Statistic Temperature (mean/SD) Descriptive Statistic Before 102.2 (0.7989) After 100.45(0.5649) Researchers want to compare protein intake (mg) among three groups of postmenopausal women: Women eating a standard American diet (SAD) Women eating a lacto-ovo-vegetarian diet (LAC) Women eating a strict vegetarian diet (VEG) Case III Questions for case III: What are the dependent and independent variables in the study? What is the appropriate scale of measurements for both variables? Is it paired or unpaired observation? What are null hypothesis and alternative hypothesis? What is the appropriate parametric test that we can use to test the null hypothesis? Answer Dependent : protein intake (mg, numerical) Independent : diet type (SAD, LAC, and VEG, nominal) Unpaired observation, no matching procedure H 0 : SAD = LAC = VEG H 1 : at least one of the pairs has mean difference of protein intake Test of distribution Both group are normally distributed parametric test Descriptive Statistic Protein (mean/SD) Descriptive Statistic STD 74.7(5.056) LAC 56.7(5.559) VEG 46.7(5.559) Is your Dependent Variable (DV) continuous? YES NO Is your Independent Variable (IV) continuous? Is your Independent Variable (IV) continuous? Correlation or Linear Regression YES YES YES Do you have only 2 treatments? NO NO Logistic Regression Chi Square NO T-test ANOVA If I have one Dependent Variable, which statistical test do I use?