Investigate
Linear Regression
LT 2G: I can use technology to calculate the correlation coefficient and the regression equation and describe their relationship with the linear model (for example: slope and y-intercept) that best fits a given data set.
I do:
What would the intensity of an earthquake be in a city 200 miles from Parkfield?
Graphing!
To graph, press STAT PLOT (2nd Y=) and make sure Plot 1 is turned ON. Then press ZOOM, select 9:ZoomStat, and press ENTER.
J-TPS: Which graph displays a stronger positive correlation? How do you know?
III. Correlation
A. The linear correlation coefficient (r) measures the strength and the direction of a linear relationship between two variables.
1. If x and y have a strong positive linear correlation, r is close to +1 (+1 indicates a perfect positive fit). Positive values indicate a relationship between x and y such that as values for x increase, values for y also increase.
2. If x and y have a strong negative linear correlation, r is close to -1 (-1 indicates a perfect negative fit). Negative values indicate a relationship between x and y such that as values for x increase, values for y decrease.
3. If there is no linear correlation or a weak linear correlation, r is close to 0. A value near zero means that there is little or no linear relationship between the two variables; the data may be randomly scattered or may follow a nonlinear pattern.
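The three cases above can be checked numerically. The following is a minimal Python sketch of the usual Pearson formula for r; the function name `pearson_r` and the small data sets are made up for illustration and are not taken from the lesson.

```python
import math

def pearson_r(xs, ys):
    """Pearson linear correlation coefficient for paired samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical illustration data (not from the lesson).
xs = [1, 2, 3, 4, 5]
r_up = pearson_r(xs, [2, 4, 6, 8, 10])      # points on a rising line
r_down = pearson_r(xs, [10, 8, 6, 4, 2])    # points on a falling line
r_curve = pearson_r([-2, -1, 0, 1, 2], [4, 1, 0, 1, 4])  # points on y = x^2
```

Here `r_up` comes out as +1, `r_down` as -1, and `r_curve` as 0. The last case lies exactly on a parabola, so an r near 0 speaks only to the lack of a linear pattern.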
I do
The table shows a relationship between points allowed and games won by a football team over eight seasons.
a. Find the equation of the line of best fit and the correlation coefficient. (Round to the hundredths place if necessary.)
b. Discuss correlation and causation for the data set.
c. If 225 points were allowed, how many games do you think the team would have won?
Line of best fit: y = -0.02x + 9.91
Correlation coefficient: r = -0.91
There is a strong negative correlation between points allowed and games won by a football team. This relationship does not necessarily show causation, because other factors (namely the offense) also help determine whether a team wins or loses. If 225 points were allowed, y = -0.02(225) + 9.91 = 5.41, so the team would have won approximately 5 games.
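The prediction step can be sketched in Python. This assumes the fitted line is y = -0.02x + 9.91, with the negative slope inferred from r = -0.91 and from the stated result of 5.41; the helper name `predict` is hypothetical.

```python
def predict(slope, intercept, x):
    """Evaluate the regression line y = slope * x + intercept at x."""
    return slope * x + intercept

# Assumed coefficients: slope -0.02 games per point allowed, intercept 9.91 games.
games = predict(-0.02, 9.91, 225)

# A team cannot win a fraction of a game, so round to the nearest whole game.
games_rounded = round(games)
```

With these coefficients, `games` is about 5.41 and `games_rounded` is 5, matching the answer above.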
You do
Eight adults were surveyed about their education and earnings. The table shows the survey results. Find the equation of the line of best fit and the correlation coefficient. Discuss correlation and causation for the data set. How much would you expect to earn with 19 years of education?
The equation of the line of best fit is y ≈ 5.59x - 30.28, and r ≈ 0.86. There is a strong positive correlation. A cause-and-effect relationship is likely (more education often contributes to higher earnings), although the correlation alone does not prove it. With 19 years of education, you would earn approximately 75.93 thousand dollars.
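The arithmetic behind that estimate can be written out step by step, assuming the rounded coefficients 5.59 and -30.28 and earnings measured in thousands of dollars:

```latex
\begin{align*}
y &= 5.59x - 30.28 \\
  &= 5.59(19) - 30.28 \\
  &= 106.21 - 30.28 \\
  &= 75.93
\end{align*}
```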
The regression equation has the form y = ax + b.
The intercept, b, is the value of y when x = 0. Although we need the intercept to draw the line of best fit, it is only statistically meaningful when x can actually take values close to zero.
The slope, a, is almost always important for interpreting data because it describes the rate at which y changes for each one-unit increase in x.
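For readers curious where a and b come from, here is a minimal Python sketch of the ordinary least-squares formulas, which is presumably what a calculator's linear regression reports. The function name `fit_line` and the five data points are made up for illustration.

```python
def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a*x + b."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are equal; the slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx              # slope: change in y per one-unit change in x
    b = mean_y - a * mean_x    # intercept: value of the line at x = 0
    return a, b

# Hypothetical illustration data (not from the lesson).
a, b = fit_line([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
```

For this made-up data the slope is about 0.6 and the intercept is about 2.2, so y rises by roughly 0.6 for each one-unit increase in x.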
Regression Equation
A psychologist has collected data on the number of times x per year that a college student goes out on a date and the number of hours y of homework that student does per week. She came up with the following equation of the regression line:
y = -0.2x + 30
Interpret the slope and y-intercept.
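One way to read the slope and intercept off this equation is to evaluate it at x = 0 and at two x values one unit apart. The sketch below does that in Python; the function name `homework_hours` is hypothetical.

```python
def homework_hours(dates_per_year):
    """Regression line from the example: y = -0.2x + 30."""
    return -0.2 * dates_per_year + 30

# y-intercept: predicted weekly homework hours for a student with 0 dates per year.
at_zero = homework_hours(0)

# Slope: predicted change in weekly homework hours for one additional date per year.
change_per_date = homework_hours(11) - homework_hours(10)
```

Here `at_zero` is 30 and `change_per_date` is about -0.2. In words, a student who never goes on a date is predicted to do 30 hours of homework per week, and each additional date per year is associated with about 0.2 fewer hours of homework per week.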
I DO
A study investigated the relationship between the age x, in years, of a young person and the time y, in minutes, in which the child can run one mile. Data were collected from children between the ages of 8 and 15. The equation of the regression line was found to be y = -0.5x + 17.