Professional Documents
Culture Documents
HOW STATISTICS ARE USED Define statistics Reasons for studying statistcs Descriptive vs inferential statistics Surveys vs experiments Retrospective vs prospective studies Descriptive vs analytical surveys Define bias Purpose and component of clinical trial
Recorded data Number of traffic accidents Size of enrollment Number of patients visiting a clinic
Denote characteristics calculated for a set of data Mean Standard deviation Correlation coefficient
Body of techniques and procedures dealing with the collection, organization, analysis, interpretation, and presentation of information that can be stated numerically
Uses of statistics....
Inferential statistics is concerned with reaching conclusions from incomplete information Uses information obtained from a sample to say something about an entire population
Sources of Data
Observational surveys Planned surveys Examples: effects of atomic bomb explosion (no controlled or assigned subjects) Experiments Reduction of blood pressure among veterans with the use of anti-hypertensive and placebo
Categories of surveys
Health researchers conduct surveys on human populations... categories: Restrospective studies (also referred to as case-control studies) Prospective studies (also called cohort studies)
Categories of surveys
Retrospective studies use past data from selected cases and controls to determine differences, if any, in the exposure to a suspected factor.
The researcher identifies individuals w/ a specific disease or condition (CASE) Also, a comparable sample w/o the disease or condition (CONTROL)
Categories of surveys
PURPOSE OF THE COMPARISON: To determine if the 2 groups differ as to their exposure to some specific factor Examples: Smoking habits of women who bore premature babies with those of women who carried their pregnancies to term The researcher seeks to determine whether there is statistical relation b/w the possible STIMULUS VARIABLE, or causative factor (SMOKING), and the OUTCOME VARIABLE (PREMATURITY)
Categories of surveys
DISADVANTAGEs: Data were usually collected for other purposes Incomplete data Surveys frequently fail to include relevant variables that may be essential to determining whether the 2 groups studied are comparable Thus, results may be in a cloud of doubt Unknown biases frequently hinder these studies
Categories of surveys
ADVANTAGEs:
Economical Applicable to the study of rare studies Strong possibility of obtaining answers quickly bec the cases are usually easily identified
Categories of surveys
STEPS in Sample selection: Out come variable (disease) Stimulus variable (factor)
Categories of surveys
PROSPECTIVE STUDIES The researchers enroll a group of healthy persons (COHORT) and follow them over a certain period to determine the frequency with which a disease develops Example: Framingham study (based on the presence or absence of one or more variables) Smoking, diabetes, hypertension, obesity) At the beginning of the study, these are only suspected of being related to CVD or stroke
Categories of surveys
PROSPECTIVE STUDIES Steps: First look on at the key variables of interest because the disease being studied have not yet occurred In Framingham study, as subjects either developed CVD or had a stroke, the key variables were compared for the individuals who acquired the disease and those who did not
Categories of surveys
PROSPECTIVE STUDIES Steps: The researcher could then begin to determine the degree of importance of each variable in relation to CVD or stroke
Categories of surveys
PROSPECTIVE STUDIES Advantages: Accurate estimation of disease incidence in a population Possibility of including potentially relevant variables (age, gender, ethnicity, occupation) that are related to the outcome variable Data are collected under uniform conditions and for specified reasons Better opportunities to draw appropriate conclusions or comparisons Limited amount of bias (systematic error)
Categories of surveys
PROSPECTIVE STUDIES Disadvantages: Not used to establish or prove a causal relationship (because the variables cannot be randomly assigned or manipulated)
Clinical Trials
A carefully designed experiment that is generally considered to be the best method for evaluating the effectiveness of a new drug or treatment method Used extensively to test the efficacy of new drugs and treatments Required by FDA before drugs & other medical products receive approval
Clinical Trials
2 key features:
1. randomization 2. blinding ****both helps minimize bias
Clinical Trials
BLINDING... ***the study subjects and/or the investigators do not know who is in the control group and who is in the experimental group
Purpose: ***to reduce the likelihood that study assessments will be biased because subject or investigator behavior has been influenced by knowledge of treatment group assignment
Clinical Trials
Single-blind study.... ***the subject does not know if she or he is in the treatment or the control group
Double-blind study... ***neither the subject nor the investigator knows to which group the subject is assigned ***a neutral party keeps track of who is in which group and typically discloses it only at the conclusion of the trial.
Clinical Trials
RANDOMIZATION... ***method most likely to consistently reduce bias by producing equivalent treatment and control groups. Treatment group... ***receives a potentially therapeutic agent ***compared with the control group
Clinical Trials
Control group... ***receives a placebo therapeutic agent
or
the
standard
***In RANDOMIZATION, each subject in the trial is randomly assigned to either the experimental or the control group
Clinical Trials
Message: ***Trials that use non-random assignments tend to produce biased overestimates of true therapeutic efficacy***(Schechtman)
***answer could be PROBABLY NOT!!! ***there is likely to be something unique about your school that makes it non-representative of college students in general.
***a 50% response rate might be good enough to ensure a representative sample ***but sampling bias cannot be prevented
***a person is interviewed in each household of the selected blocks ***more economical technique than the random selection of persons throughout the city
***Best technique and widely used is the use of a computer program to select random numbers ex: SPSS
THINGS TO LEARN: 1. distinguish between ***qualitative & quantitative variables ***discrete and continuous variables ***symmetrical, bimodal, and skewed distributions ***positively and negatively skewed disrtibutions 2. construction & interpretation of frequency table 3. types of graphs for displaying quanti & quali data
***to succesfully explain your data, your first task is to classify and organize the data 3 ways of organizing & presenting data: 1. tables 2. graphs 3. numerical techniques
***if incorrect numbers are entered, the analysis will be mathematically correct, but there will be computererror
*** VERIFY, VERIFY, VERIFY
VALID PERCENT ***the % out of 100, using only those subjects with data CUMULATIVE PERCENT ***the % of all previous cases plus the current interval
-abscissa (w/c depicts the class boundaries -ordinate (w/c depicts the frequency of observations
-various shapes: ***symmetrical ***bimodal ***rectangular ***skewed to the right ***skewed to the left refer to Figure 3.4
Box and whisker plots -graphical examination of data -done by determining the median and the quartile statistics (to be discussed later)
PROBABILITY
**Applies exclusively to a future event, never to a past event
**stated numerically **defined in the range of 0 to 1 (never more, never less) ***a probability of 1.0 = event will happen with certainty
PROBABILITY
**probability of 0 = event will not happen
**probability of 0.5 = event should occur once in every two attempts, on average **probability close to 1.0 = event is more likely to happen **probability close to 0 = event is unlikely to happen
PROBABILITY
**defined as: the ratio of the number of ways the specified event can occur to the total number of equally likely events that can occur **probability of an event = P(E) -----the proportion of times a favorable event will occur in a long series of repeated trials
PROBABILITY
**P(E) = n / N = number of favorable outcomes number of possible outcomes
ex: one coin = 2 possible outcomes of fair toss coin: head or tail P (H) = 1 / 2
P (T) = 1 / 2
PROBABILITY
**P(E) = n / N = number of favorable outcomes number of possible outcomes
ex: dice = 6 equally possible outcomes N = 6 (1,2,3,4,5,6) P=3/6 P=2/6
NORMAL DISTRIBUTION
**NORMAL LIMITS ----used to classify levels of patients as being healthy or otherwise Ex: normal level of cholesterol = 200 mg/dl ***for patient with above value = indicates a significantly increased risk for coronary heart disease
NORMAL DISTRIBUTION
**giving normal limits is quite critical..WHY?
---given value may not have been accurate or may have been faulty which cause tragic and patient might be given unnecessary treatment or others might fail to receive a needed treatment
NORMAL DISTRIBUTION
**ex; SERUM ALBUMIN ---normal limits for albumin is calculated by adding and subtracting 2 standard deviations from the mean of a large set of observations obtained from a group of presumably healthy persons ---this will provide the limits that contain the middle 95% (w/c is the normal range)
NORMAL DISTRIBUTION
CLINICAL LIMITS **the lower & upper 2.5% points for any distribution, normal or otherwise, of healthy persons
NORMAL DISTRIBUTION
**Properties of normal distribution
1. bell-shaped curve = symmetrical about the mean 2. all normal distribution have a particular internal distribution for the area under the curve ----the relative area between any 2 designated points is always the same
NORMAL DISTRIBUTION
**Properties of normal distribution 3. exponential equation ---the normal distribution is a theoretical distribution defined by 2 parameters (the mean and the standard deviation)
HYPOTHESIS TESTING
**One of the principal objectives of research is comparison: ----how does one group differ from another? Ex; ---what is the mean serum cholesterol level of a group of middle-aged men? ---How does it differ from women of the same group age? ---how does it differ from that of men of other age group?
HYPOTHESIS TESTING
**Parameter or unknown characteristic of a population ---usually estimated from a statistic computed from sample data
HYPOTHESIS TESTING
Definition of terms:
HYPOTHESIS --- a statement of belief used in the evaluation of population values NULL HYPOTHESIS (Ho) --- a claim that there is no difference between the population mean and the hypothesized value
HYPOTHESIS TESTING
Definition of terms:
ALTERNATIVE HYPOTHESIS (H1) --- a claim that disagrees with the null hypothesis. If the null hypothesis is rejected, we left with no choice but to fail to reject the alternative hypothesis that mean is not equal to hypothesized value.
HYPOTHESIS TESTING
TEST STATISTIC --- a statistic used to determine the relative position of the mean in the hypothesized probability distribution of sample means CRITICAL REGION --- the region on the far end of the distribution ****one-tailed test --- if only one end of the distribution is involved
HYPOTHESIS TESTING
CRITICAL REGION --- the region on the far end of the distribution ****two-tailed test --- if both ends of the distribution are involved CRITICAL VALUE --- the number that divides the normal distribution into the region where we will reject the null hypothesis and the region where we fail to reject the null hypothesis
HYPOTHESIS TESTING
SIGNIFICANCE LEVEL --- the level that corresponds to the area in the critical region ---this area is usually small (means the results are infrequent and deemed unusual) ----this means statistically significant in the language of statistictians
HYPOTHESIS TESTING
TEST OF SIGNIFICANCE --- a procedure used to establish the validity of a claim by determining whether the test statistic falls in the critical region.
*** if it does, the results are referred to as SIGNIFICANT. ***this test is sometimes called the HYPOTHESIS TEST.