Math TEKS Algebra 1 - Texas Comprehensive Center
Math TEKS Algebra 1 - Texas Comprehensive Center
Math TEKS Algebra 1 - Texas Comprehensive Center
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Tab 3: <strong>Algebra</strong> I<br />
Table of Contents<br />
Master Materials List 3-ii<br />
Spaghetti Regression 3-1<br />
Handout 1: Spaghetti Regression 3-6<br />
Transparency 1/Handout 2: Scatterplot 3-7<br />
Handout 3: Activity 1 Goodness-of-Fit 3-8<br />
Transparency 2 3-11<br />
Transparency 3 3-12<br />
Transparency 4 3-13<br />
Transparency 5 3-14<br />
Handout 4: Measuring 3-15<br />
Transparency 6 3-16<br />
Handout 5: Activity 2 3-17<br />
Handout 6: Activity 3 Absolute Value vs. Squaring 3-27<br />
Handout 7: Supplemental Material 3-31<br />
Understanding Correlation Properties with a Visual Model 3-34<br />
Handout 1: Activity 1 3-42<br />
Handout 2: Activity 2 3-53<br />
Handout 3: Activity 3 - Correlation vs. Causation 3-61<br />
Handout 4: Activity 3, Part B – Headlines 3-65<br />
Handout 5: Supplemental Reading 3-66<br />
Tab 3: <strong>Algebra</strong> I: Table of Contents 3-i
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Tab 3: <strong>Algebra</strong> I<br />
Master Materials List<br />
Graphing calculator<br />
Spaghetti or linguine<br />
Tape<br />
Colored markers<br />
Straightedge<br />
Computer with internet access and Java 1.4<br />
Yard stick<br />
Spaghetti Regression: Transparencies and handouts<br />
Correlation: Transparencies and handouts<br />
The following materials are not in the notebook. They can be accessed on the MTR<br />
website until the 9-12 MTR CDs are available.<br />
Java Applet (http://mathteks2006.net/applets)<br />
PowerPoint presentation: Correlation vs. Causation<br />
(http://mathteks2006.net/documents/correlation.ppt)<br />
Tab 3: <strong>Algebra</strong> I: Master Materials List 3-ii
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity: Spaghetti Regression<br />
Overview: Participants will investigate the concept of the “goodness-of-fit” and its<br />
significance in determining the regression line or best-fit line for the data.<br />
<strong>TEKS</strong>: This activity supports teacher content knowledge underlying the following<br />
<strong>TEKS</strong>.<br />
(A.2) Foundations of functions. The student uses the properties and<br />
attributes of functions.<br />
The student is expected to:<br />
(D) collect and organize data, make and interpret scatterplots (including<br />
recognizing positive, negative, or no correlation for data approximating<br />
linear situations), and model, predict, and make decisions and critical<br />
judgments in problem situations.<br />
Background:<br />
Fitting the graph of an equation to a data set is covered in all mathematics courses from<br />
<strong>Algebra</strong> I to Calculus and beyond. This module explores the concept in-depth, providing<br />
the participants with an understanding beyond that in ordinary secondary texts. The<br />
idea is to provide the background knowledge needed to understand the process of<br />
modeling.<br />
To enrich the study of functions, the <strong>TEKS</strong> call for the inclusion of problem situations<br />
which illustrate how mathematics can model aspects of the world. In real life, functions<br />
arise from data gathered through observations or experiments. This data rarely falls<br />
neatly into a straight line or along a curve. There is variability in real data and it is up to<br />
the student to find the function that best 'fits' the data. Regression, in its many facets, is<br />
probably the most widely used statistical methodology in existence. It is the basis of<br />
almost all modeling.<br />
This activity supports teacher knowledge underlying <strong>TEKS</strong> A.2.D, wherein students<br />
create scatterplots to develop an understanding of the relationships of bivariate data.<br />
This includes studying correlations and creating models from which they will predict and<br />
make critical judgments. As always, it is beneficial for students to generate their own<br />
data. This gives them ownership of the data and gives them insight into the process of<br />
collecting reliable data. Teachers should naturally encourage the students to discuss<br />
important concepts such as goodness-of-fit. Using the graphing calculator facilitates this<br />
understanding. Students will be curious about how the linear functions are created, and<br />
teachers should help students develop this understanding.<br />
Spaghetti Regression 3-1
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Knuth and Hartmann in Technology-Supported <strong>Math</strong>ematics Learning Environments<br />
discuss the common approach to this topic:<br />
A common instructional practice is to have students plot the data on a<br />
coordinate plane, and then ask them to use a piece of spaghetti to<br />
represent the line that they will “fit” to the data. Students are typically<br />
instructed to position the spaghetti noodle so that it appears to be as<br />
close as possible to each point—visually determining the “best” fit. At<br />
this point students might determine the equation for their line and then<br />
use that equation in making predictions about additional points.<br />
Alternatively, the objective for the lesson might be to determine a line of<br />
best fit analytically, usually by using the statistical capabilities of a<br />
graphing calculator, and then to use the resulting equation in a similar<br />
fashion (i.e., to make predictions). In the former situation, the line that<br />
students identified as their line of best fit has not been determined<br />
mathematically and may or may not be the best fit in reality. In the latter<br />
example, the line has been determined mathematically, but students<br />
may not have an understanding of “what the calculator did” in<br />
determining the equation for the line or why the line is called a least<br />
squares line of best fit (the most commonly used line of best fit).<br />
Moreover, teachers often may not attempt to explain the underlying<br />
ideas, since the focus of the lesson may be on the use of the equation<br />
for the line. In either situation, ideas underlying the least squares line of<br />
best fit are not beyond the grasp of students and should be a topic of<br />
discussion.<br />
Participants will investigate the concept of the “goodness-of-fit” and its<br />
significance in determining the regression line or best-fit-line for the data.<br />
Development sequence:<br />
Activity 1 What is meant by “best”?<br />
What are non-analytical methods used by students to determine fit?<br />
Develop an analytical measure for fit.<br />
Discuss various measures, including residuals.<br />
Activity 2<br />
Activity 3<br />
Appendix<br />
Develop the least squares regression method via absolute value<br />
regression.<br />
Explore the effects of squaring the residuals and contrast it with using<br />
the absolute value of the residuals.<br />
Deriving the regression formula via algebra and then thru<br />
calculus.<br />
Historical notes.<br />
Materials: Graphing calculator<br />
Spaghetti Regression 3-2
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Spaghetti or linguine<br />
Tape<br />
Colored markers<br />
Straightedge<br />
Computer with internet access (Activity 3)<br />
Transparencies: 1-6 (pages 3-7, 3-11 – 3-14, 3-16)<br />
Handout 1 (page 3-6)<br />
Handout 2 (page 3-7)<br />
Handout 3 (pages 3-8 – 3-10)<br />
Handout 4 (page 3-15)<br />
Handout 5 (pages 3-17 – 3-21)<br />
Handout 6 (pages 3-27 – 3-28<br />
Handout 7 (pages 3-31 – 3-33)<br />
Grouping: 4-5 per group<br />
Time: 1½ -2 hours<br />
Lesson:<br />
Procedures Notes<br />
Activity 1<br />
Have participants read and discuss Handout<br />
1, Spaghetti Regression:<br />
Overview/Learning<br />
Objectives/Background, (page 3-6).<br />
Give each participant 3-5 pieces of<br />
spaghetti, the Transparency 1/Handout 2,<br />
Scatterplot (page 3-7) and Handout 3,<br />
Activity 1: Goodness of Fit, (page 3-8 ).<br />
Have the participants examine the plot and<br />
visually determine a line of best-fit (or trend<br />
line) using a piece of spaghetti. They then<br />
tape the spaghetti line onto their graph.<br />
Ask: Who has the best line in your group?<br />
How can we determine this?<br />
Ask: What is meant by best?<br />
Ask: What is meant by a close fit?<br />
See, How Do You Find the Line of Best-<br />
Fit? (page 3-10), to discuss methods<br />
students use for placing trend lines. (Do not<br />
discuss how to measure yet; see below )<br />
Discuss the importance of modeling and<br />
student discussions of concepts such as<br />
goodness-of-fit (see the Trainer Notes<br />
Background discussion above.)<br />
This should be done individually so that<br />
there is variation in the choice of lines within<br />
each group.<br />
This page discusses the general idea behind<br />
linear regression. To determine a line of<br />
best fit you must have an agreed upon<br />
measure of “goodness”. If that measure is<br />
“closeness of the points to the line”, the best<br />
line is then the line with the least total<br />
distance of points to the line. There are<br />
many methods for measuring “closeness.”<br />
The most common is the method of least<br />
Spaghetti Regression 3-3
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
discuss how to measure yet; see below.) squares.<br />
Have the participants use a second piece of<br />
spaghetti to measure the distance from each<br />
point to the line and break off that length.<br />
Each member of a group must measure the<br />
same way. Thus, each group must decide<br />
their method for measuring before they<br />
begin.<br />
Groups may measure vertically, horizontally,<br />
perpendicularly, etc.<br />
Have the participants line up their spaghetti<br />
distances to determine who in their group<br />
has the closest fit. Then, they replace the<br />
segments and tape them to their scatterplot.<br />
Have each group present their method and<br />
results. A good way to accomplish this is to<br />
have the “winner” from each table come up<br />
to the front. They can then be grouped by<br />
their method of measurement. Have each<br />
share, discuss, compare, and contrast.<br />
Distribute Handout 4, Measuring, (page 3-<br />
15) to discuss three ways (vertically,<br />
horizontally, perpendicularly) to measure the<br />
space between a point and the line. Discuss<br />
the meaning of a residual and why it is used<br />
in evaluating the accuracy of a model.<br />
Activity 2<br />
Intuitively, we think of a close fit as a good<br />
fit. We look for a line with little space<br />
between the line and the points it is<br />
supposed to fit. We would say that the best<br />
fitting line is the one that has the least<br />
space between itself and the data points<br />
which represent actual measurements.<br />
Encourage diversity in measuring methods<br />
among the groups to add depth to the<br />
following discussions.<br />
This will determine the total error (i.e., total<br />
distance from their line to the data).<br />
Discuss the fact that since the groups used<br />
different methods of measuring, we cannot<br />
determine best-of-fit for the entire class.<br />
Discuss accuracy of measurement. Did they<br />
measure from the edge of each point or the<br />
middle, etc.?<br />
Why measure vertically? The sole purpose<br />
in making a regression line is to use it to<br />
predict the output for a given input. The<br />
vertical distances (residuals) represent how<br />
far off the predictions are from the data we<br />
actually measured.<br />
Spaghetti Regression 3-4
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
Distribute Handout 5, Activity 2, (pages 3-<br />
17 – 3-21).<br />
Tell the participants we will now determine<br />
who has the best trend line in the class.<br />
Tell participants to look for “FYI:” in the<br />
activity for calculator help.<br />
Have participants stop when they finish #5<br />
and use overhead 2 to cultivate a class<br />
discussion of the questions in #5 before<br />
proceeding. It is important that participants<br />
understand why the residuals must be<br />
absolute valued or squared before summing.<br />
Transparency 6 reproduces the figure on<br />
page 3-19.<br />
Activity 3<br />
Distribute Handout 6, Activity 3, (pages 3-<br />
27 – 3-28).<br />
Participants will need a computer with Java<br />
version 1.4.<br />
Have participants open the applet<br />
Regression and work through handout.<br />
www.mathteks2006.net/applets<br />
Supplemental Material<br />
Ask participants to read the Handout 7,<br />
Supplemental Material, (pages 3-31 – 3-<br />
33).<br />
In Activity 1 the groups used different<br />
measures of goodness-of-fit; thus the best<br />
trend line of the class could not be<br />
determined.<br />
The participants will need a Graphing<br />
Calculator.<br />
Encourage the calculator-capable<br />
participants to help out within their groups.<br />
In this activity, an interactive java applet is<br />
used to investigate several data sets and<br />
contrast geometrically and numerically the<br />
effect of using the square of the residuals<br />
vs. the absolute value of the residuals.<br />
Encourage the participants to test their own<br />
conjectures and share/discuss with group.<br />
The Supplemental Material discusses two<br />
ways to minimize the sum of the squared<br />
residuals which leads to the formula that the<br />
calculator uses to find the least squares<br />
regression line.<br />
Historical notes are included about who<br />
originally developed least squares<br />
regression and where the term regression<br />
comes from.<br />
Spaghetti Regression 3-5
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Overview<br />
Spaghetti Regression<br />
Participants will investigate the concept of the “goodness-of-fit” and its significance in<br />
determining the regression line or best-fit line for the data.<br />
Learning Objectives<br />
This activity supports Teacher Content Knowledge needed for A2D: The student<br />
is expected to collect and organize data, make and interpret scatterplots<br />
(including recognizing positive, negative, or no correlation for data approximating<br />
linear situations), and model, predict, and make decisions and critical judgments<br />
in problem situations.<br />
Background<br />
Fitting the graph of an equation to a data set is covered in all mathematics<br />
courses from <strong>Algebra</strong> I to Calculus and beyond. The objective of this module is<br />
to explore the concept in-depth to provide understanding beyond that in ordinary<br />
secondary texts.<br />
To enrich the study of functions, the <strong>TEKS</strong> call for the inclusion of problem<br />
situations which illustrate how mathematics can model aspects of the world. In<br />
real life, functions arise from data gathered through observations or experiments.<br />
This data rarely falls neatly into a straight line or along a curve. There is<br />
variability in real data and it is up to the student to find the function that best 'fits'<br />
the data. Regression, in its many facets, is probably the most widely use<br />
statistical methodology in existence. It is the basis of almost all modeling.<br />
This activity supports teacher knowledge underlying <strong>TEKS</strong> A.2.D, wherein<br />
students create scatterplots to develop an understanding of the relationships of<br />
bivariate data; this includes studying correlations and creating models from which<br />
they will predict and make critical judgments. As always, it is beneficial for<br />
students to generate their own data. This gives them ownership of the data and<br />
gives them insight into the process of collecting reliable data. Teachers should<br />
naturally encourage the students to discuss important concepts such as<br />
goodness-of fit. Using the graphing calculator facilitates this understanding.<br />
Students will be curious about how the linear functions are created, and teachers<br />
should help students develop this understanding.<br />
Handout 1<br />
Spaghetti Regression 3-6
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Scatterplot<br />
Transparency 1/Handout 2<br />
Spaghetti Regression 3-7
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity 1 Goodness-of-Fit<br />
Objective: To Investigate the concept of goodness of fit and develop an<br />
understanding of residuals in determining a line of best-fit.<br />
1. Examine the plot provided and visually determine a line of best-fit (or trend line)<br />
using a piece of spaghetti. Tape your spaghetti line onto your graph.<br />
2. Now let us investigate the “goodness” of the fit. Use a second piece of spaghetti to<br />
measure the distance from the first point to the line. Break off this piece to represent<br />
that distance. Each person at the table must measure in the same way, so discuss<br />
the method you will use before starting. Repeat this for each point.<br />
3. Line up your spaghetti distances to determine who in your group has the closest fit.<br />
Determine the total error; i.e., total distance from your line to the data. Then replace<br />
the segments and tape them to your scatterplot.<br />
Total error = _______<br />
Handout 3<br />
Spaghetti Regression 3-8
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity 1 Goodness-of-Fit – Possible Solutions<br />
Objective: To Investigate the concept of goodness of fit and develop an<br />
understanding of residuals in determining a line of best-fit<br />
1. Examine the plot provided and visually determine a line of best-fit (or trend line)<br />
using a piece of spaghetti. Tape your spaghetti line onto your graph.<br />
Trainer notes: Use the page titled How Do You Find the Line of Best-Fit? to discuss<br />
methods students use for placing trend lines. This page discusses the general idea<br />
behind linear regression. To determine a line-of best fit, you must have an agreed upon<br />
measure of “goodness.” If that measure is closeness of the points to the line, the best<br />
line is then the line with the least total distance. There are many methods for measuring<br />
“closeness.” The most common is the method of least squares.<br />
2. Now let us investigate the “goodness” of the fit. Use a second piece of spaghetti to<br />
measure the distance from the first point to the line. Break off this piece to represent<br />
that distance. Each person at the table must measure in the same way, so discuss<br />
the method you will use before starting. Repeat this for each point.<br />
Encourage at least one group to use the shortest distance from the point to the line (i.e.,<br />
the perpendicular distance.) Have each group present their method and results. A good<br />
way to accomplish this is to have the “winner” from each table come up to the front.<br />
They can then be grouped by their method of measurement. Have each share, discuss,<br />
compare, and contrast.<br />
Discuss the fact that since that the groups used different methods of measuring, we<br />
cannot determine best-of-fit for the entire class.<br />
Discuss the accuracy of their measurements. Did they measure from the edge of each<br />
point or the middle, etc.?<br />
3. Line up your spaghetti distances to determine who in your group has the closest fit.<br />
Determine the total error. (i.e., total distance from your line to the data.) Then,<br />
replace the segments and tape them to your scatterplot.<br />
Total error = _______<br />
Use the page titled Measuring to discuss three ways to measure the space between a<br />
point and the line. Discuss the meaning of a residual and why it is used in evaluating the<br />
accuracy of a model.<br />
Spaghetti Regression 3-9
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
How Do You Find the Line of Best Fit? – Possible Solutions<br />
So you’ve observed some data. You have a set of data points (x,y). You've plotted<br />
them, and they seem to be pretty much linear. How do you find the line that best fits<br />
those points? "That’s simple," your students say. "Put them into a TI-83 and look at the<br />
answer." Okay, but let us ask a deeper question: How does the calculator find the<br />
answer?<br />
What is meant by Best?<br />
First, we have to agree on what we mean by the "best fit" of a line to a set of points.<br />
Why do we say that the line on the left fits the points better than the line on the right?<br />
And can we say that some other line might fit them better still?<br />
Transparency 2 (page 3-11)<br />
Look at the following students’ responses to the task: draw a line of best fit for the data.<br />
What reasoning might they have given for their choice of lines?<br />
Passes through the most points, equal number of points above and below, passes through the<br />
end points, etc. [Transparencies 3-5 (pages 3-12 – 3-14)]<br />
Usually we think of a close fit as a good fit. But, what do we mean<br />
by close?<br />
How close are these points?<br />
Spaghetti Regression 3-10
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Discuss criteria that might be used to assess the “closeness” of these points? How many<br />
different ways might it be done?<br />
Transparency 2<br />
Spaghetti Regression 3-11
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Transparency 3<br />
Spaghetti Regression 3-12
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Transparency 4<br />
Spaghetti Regression 3-13
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Transparency 5<br />
Spaghetti Regression 3-14
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Measuring<br />
There are at least three ways to measure the space between a point and the line:<br />
vertically in the y direction, horizontally in the x direction, and the shortest distance from<br />
a point to the line (on a perpendicular to the line.)<br />
In regression, we usually choose to measure the space vertically. These distances<br />
are known as residuals.<br />
• Why would you want to measure this way? What do the residuals represent in relation<br />
to our function? Consider the purpose of the line and the following diagram.<br />
The purpose of regression is to find a function that can model a data set. The function is<br />
then used to predict the y values (or outputs, f(x) ) for any given input x. So, the vertical<br />
distance represents how far off the prediction is from the actual data point (i.e., the<br />
“error” in each prediction.) Residuals are calculated by subtracting the model’s<br />
predicted values, f(xi), from the observed values, yi.<br />
Residual = yi −<br />
f ( xi<br />
)<br />
Handout 4<br />
Spaghetti Regression 3-15
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Transparency 6<br />
Spaghetti Regression 3-16
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity 2<br />
Objective: Investigate various methods of regression.<br />
Whose model makes the best predictions? Let us compare everyone’s lines using<br />
the residuals.<br />
Before we begin, we need to know the equation for your spaghetti function,<br />
f(x) = mx + b. Assume the lower left corner of the graph is (0,0).<br />
f(x) = __________________<br />
1. Enter your function at Y1= in the calculator.<br />
2. Enter the actual data into L1 and L2. Put the x-values in L1 and the y-values in L2.<br />
Make certain that the x’s are typed in correspondence to the y’s.<br />
x 2 5 6 10 12 15 16 20 20<br />
y 14 19 9 21 7 21 18 10 22<br />
3. Place the predicted values, f(xi), created by your function, in L3. To do this, place<br />
your cursor on L3 and enter your function, using L1 as the inputs of the function.<br />
(See below.)<br />
FYI: Y1 can be found under [vars] → [Y-vars] → [1:function] → 1:Y1<br />
Handout 5-1<br />
Spaghetti Regression 3-17
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
4. Compute the residuals (the distances between the predicted values, f(xi) , and actual<br />
y values) and place them in L4. This can be done by entering L4 = L2-L3.<br />
5. On your home screen compute Sum(L4). Record your group’s functions and the<br />
corresponding sums.<br />
FYI: Sum can be found under [2 nd ][stat] → [math] → 5:sum<br />
Function Sum of the residual errors<br />
• Examine your values in L4. What is the meaning of a negative residual in terms of the<br />
graph and in terms of the function’s predictions? What is the meaning of a positive or<br />
negative total for the functions in #5?<br />
Handout 5-2<br />
Spaghetti Regression 3-18
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Examine the following student’s work.<br />
• In L4 what is the meaning of 39.23? What is the corresponding value in your<br />
table? Describe its meaning.<br />
• What is the meaning of a low total residual error? Is it a good measure of fit?<br />
Why or why not?<br />
Handout 5-3<br />
Spaghetti Regression 3-19
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
There are two possible ways to fix the above problem. One way is to take the absolute<br />
value of the residual; the other is to square the residual. Taking the absolute value of<br />
the residuals is synonymous with using our spaghetti segments to measure the vertical<br />
error.<br />
6. Find Sum(abs(L4)). Record your group’s functions and the corresponding sums.<br />
FYI: abs can be found under [2 nd ][0]<br />
Function Sum of the residual error<br />
• Compare with those in the class to determine who now has the lowest total<br />
error.<br />
Note: The calculator’s regression method uses the squared residuals when measuring<br />
the goodness-of-fit of a regression line.<br />
Let us compare our lines of best-fit, using the squared residuals.<br />
7. Find the total of the squared residuals by Sum((L4) 2 ) . This is often referred to as<br />
the Sum of the Squared Errors, noted SSE.<br />
Function SSE<br />
Handout 5-4<br />
Spaghetti Regression 3-20
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
• Compare with those in the class to determine who has the lowest sum of the squared<br />
errors. Did the best line in the group change? Why or why not?<br />
Let us compare our lines against the calculator’s regression line.<br />
8. Use your calculator to compute the linear regression function, f(x) = mx + b.<br />
f(x) = ___________________<br />
9. Enter the function into Y1 and place the function’s predicted values f(xi) in L3, i.e., L3 =<br />
Y1(L1).<br />
10. Quickly, compute the sum of squared errors by using SUM((L2- L3) 2 ).<br />
SSE = ________<br />
• How do the functions in the class compare to this one?<br />
11. Create a scatterplot and graph your group’s functions and the calculator’s regression<br />
function. Examine visually the goodness of fit of each in regard to their SSE.<br />
At least two methods exist for evaluating goodness of fit: taking the absolute value of<br />
the residuals and squaring the residuals. Although taking the absolute value seems<br />
most intuitive, relying on squaring does several things. The most desirable one is that it<br />
simplifies the mathematics needed to guarantee the “best” line. (See the appendix.)<br />
In Activity 3, you can investigate how squaring the residuals when measuring our<br />
goodness-of-fit affects the choice of the regression line.<br />
Understanding what you are looking for is always the toughest part of any problem, so<br />
the hard part is done. You now know how to measure “goodness” of fit. We can also<br />
say exactly what the calculator means by the line of best-fit. If we compute the residuals<br />
(i.e., the error in the y direction), square each one, and add up the squares, we say the<br />
line of best-fit is the line for which that sum is the least. Since it is a sum of squares,<br />
the method is called the Method of Least Squares! This is the most commonly used<br />
method but, as we have seen, it isn’t the only way!<br />
Handout 5-5<br />
Spaghetti Regression 3-21
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity 2 - Possible solutions<br />
Objective: Investigate various methods of regression.<br />
Whose model makes the best predictions? Let us compare everyone’s lines using<br />
the residuals.<br />
Before we begin, we need to know the equation for your spaghetti function,<br />
f(x) = mx + b. Assume the lower left corner of the graph is (0,0).<br />
f(x) = __1/3 x + 9________________<br />
1. Enter your function at Y1= in the calculator.<br />
2. Enter the actual data into L1 and L2. Put the x-values in L1 and the y-values in L2. Make<br />
certain that the x’s are typed in correspondence to the y’s.<br />
x 2 5 6 10 12 15 16 20 20<br />
y 14 19 9 21 7 21 18 10 22<br />
3. Place the predicted values, f(xi), created by your function, in L3. To do this, place your<br />
cursor on L3 and enter your function, using L1 as the inputs of the function. (See below.)<br />
FYI: Y1 can be found under [vars] → [Y-vars] → [1:function] → 1:Y1<br />
Spaghetti Regression 3-22
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
4. Compute the residuals (the distances between the predicted values, f(xi) , and actual y<br />
values) and place them in L4. This can be done by entering L4 = L2-L3.<br />
5. On your home screen compute Sum(L4). Record your group’s functions and the<br />
corresponding sums.<br />
FYI: Sum can be found under [2 nd ][stat] → [math] → 5:sum<br />
Function Sum of the residual errors<br />
Y= 1/3 x + 9 24.66<br />
Y= ¼ x + 11 15.5<br />
Y= 5/4 x 8.5<br />
Y= 2x + 3 -98<br />
• Examine your values in L4. What is the meaning of a negative residual in terms of the graph<br />
and in terms of the function’s predictions? What is the meaning of a positive or negative total<br />
for the functions in #5?<br />
In the graph, a negative residual in L4 means the actual point is below the line. In<br />
terms of the function’s predictions a negative residual means the function over predicted value.<br />
A positive sum of the residuals means you have more total under predictions than over<br />
predictions and vise versa for a negative sum of the residuals.<br />
Spaghetti Regression 3-23
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Examine the following student’s work.<br />
• In L4 what is the meaning of 39.23? What is the corresponding value in your table?<br />
Describe its meaning.<br />
It means this person’s function under predicted the value by 39.32.<br />
• What is the meaning of a low total residual error? Is it a good measure of fit? Why or<br />
why not?<br />
This is not a good measure of fit because large under predictions could be cancelled by<br />
large over predictions hence making the sum small, as in the above example.<br />
Spaghetti Regression 3-24
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
There are two possible ways to fix the above problem. One way is to take the absolute value of<br />
the residual; the other is to square the residual. Taking the absolute value of the residuals is<br />
synonymous with using our spaghetti segments to measure the vertical error.<br />
6. Find Sum(abs(L4)). Record your group’s functions and the corresponding sums.<br />
FYI: abs can be found under [2 nd ][0]<br />
Function Sum of the residual error<br />
Y= 1/3 x + 9 52<br />
Y= ¼ x + 11 48.5<br />
Y= 5/4 x 64.5<br />
Y= 2x + 3 124<br />
• Compare with those in the class to determine who now has the lowest total error.<br />
Note: The calculator’s regression method uses the squared residuals when measuring the<br />
goodness-of-fit of a regression line.<br />
Let us compare our lines of best-fit, using the squared residuals.<br />
7. Find the total of the squared residuals by Sum((L4) 2 ) . This is often referred to as the Sum<br />
of the Squared Errors, noted SSE.<br />
Function SSE<br />
Y= 1/3 x + 9 338<br />
Y= ¼ x + 11 289.375<br />
Y= 5/4 x 676.375<br />
Y= 2x + 3 2488<br />
Spaghetti Regression 3-25
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
• Compare with those in the class to determine who has the lowest sum of the squared<br />
errors. Did the best line in the group change? Why or why not?<br />
The best line could change. In regression using the absolute value and using the square<br />
may not agree, because it changes how you define what the best line is.<br />
Let us compare our lines against the calculator’s regression line.<br />
8. Use your calculator to compute the linear regression function, f(x) = mx + b.<br />
f(x) = _.156 x + 13.83__________________<br />
9. Enter the function into Y1 and place the function’s predicted values f(xi) in L3, i.e., L3 =<br />
Y1(L1).<br />
10. Quickly, compute the sum of squared errors by using SUM((L2- L3) 2 ).<br />
SSE = _259.67_______<br />
• How do the functions in the class compare to this one?<br />
The calculator linear regression function should have a lower SEE than the classes<br />
functions.<br />
11. Create a scatterplot and graph your group’s functions and the calculator’s regression<br />
function. Examine visually the goodness of fit of each in regard to their SSE.<br />
At least two methods exist for evaluating goodness of fit: taking the absolute value of<br />
the residuals and squaring the residuals. Although taking the absolute value seems<br />
most intuitive, relying on squaring does several things. The most desirable one is that it<br />
simplifies the mathematics needed to guarantee the “best” line. (See the appendix.)<br />
In Activity 3, you can investigate how squaring the residuals when measuring our<br />
goodness-of-fit affects the choice of the regression line.<br />
Understanding what you are looking for is always the toughest part of any problem, so<br />
the hard part is done. You now know how to measure “goodness” of fit. We can also<br />
say exactly what the calculator means by the line of best-fit. If we compute the residuals<br />
(i.e., the error in the y direction), square each one, and add up the squares, we say the<br />
line of best-fit is the line for which that sum is the least. Since it is a sum of squares,<br />
the method is called the Method of Least Squares! This is the most commonly used<br />
method but, as we have seen, it isn’t the only way!<br />
Spaghetti Regression 3-26
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity 3 Absolute Value vs. Squaring<br />
OBJECTIVE: It is important to understand the effect squaring has on the residuals and<br />
the placement of a regression line. In this activity, we will use an interactive java applet<br />
to investigate several data sets and contrast geometrically and numerically the effect of<br />
using the square of the residuals vs. the absolute value of the residuals.<br />
1. Place three points forming a triangle on the graph. Select “plot line” and place a<br />
trend line on the graph.<br />
2. Select “Draw residuals.” Using the handle points, adjust your line to visually<br />
minimize the length of the residuals.<br />
Select “Show Trend Line Equation.” ____________________<br />
3. Select “Draw (residuals) 2 .” Using the handle points, adjust your line to visually<br />
minimize the area of the squares.<br />
Equation of the line: ____________________<br />
4. Now select “Sum of the residuals” and adjust your line to numerically minimize the<br />
|residuals|. Record the equation and total: ___________________<br />
5. Now select “Sum of the (residuals) 2 ” and adjust your line to numerically minimize the<br />
(residuals) 2 . Record the equation and total:_________________<br />
6. Create a situation where the sum of the squares is less than the sum of the absolute<br />
value.<br />
7. Create a data set in which the least absolute value and least squares methods agree<br />
on the line of best fit.<br />
8. Place the following ordered pairs (4, 1), (4, 4), (-4, 0), and (-4, -3) in the table. Find<br />
the line of best fit for each method.<br />
• Compare and contrast these two methods.<br />
Handout 6-1<br />
Spaghetti Regression 3-27
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
• How does squaring the residuals affect how individual data points contribute to the<br />
total error? Does squaring increase or decrease the effect of an individual residual on<br />
the total error?<br />
• What is the effect of an outlier point on each of the possible trend lines for each<br />
method?<br />
Further investigation<br />
Another method for finding regression lines is Chebyshev’s Best-Fit Line Method, also<br />
known as the MinMax Method, which finds the line with the minimum maximum<br />
residual. Chebyshev’s evaluates each line based on its largest residual and takes the<br />
line with the smallest (largest residual ) as the regression line.<br />
• Use Chebyshev’s method in the previous graphs to determine a line of best fit. How<br />
does it compare to the least absolute value and least squares methods? How it is<br />
affected by outliers?<br />
Handout 6-2<br />
Spaghetti Regression 3-28
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity 3 Absolute Value vs. Squaring – Selected Answers<br />
OBJECTIVE: It is important to understand the effect squaring has on the residuals and<br />
the placement of a regression line. In this activity, we will use an interactive java applet<br />
to investigate several data sets and contrast geometrically and numerically the effect of<br />
using the square of the residuals vs. the absolute value of the residuals.<br />
1. Place three points forming a triangle on the graph. Select “plot line” and place a<br />
trend line on the graph.<br />
2. Select “Draw residuals.” Using the handle points, adjust your line to visually<br />
minimize the length of the residuals.<br />
Select “Show Trend Line Equation.” ____________________<br />
3. Select “Draw (residuals) 2 .” Using the handle points, adjust your line to visually<br />
minimize the area of the squares.<br />
Equation of the line: ____________________<br />
4. Now select “Sum of the |residuals|” and adjust your line to numerically minimize the<br />
|residuals|. Record the equation and total: ___________________<br />
5. Now select “Sum of the (residuals) 2 ” and adjust your line to numerically minimize the<br />
(residuals) 2 . Record the equation and total:_________________<br />
6. Create a situation where the sum of the squares is less than the sum of the absolute<br />
values. Participants should notice the effect squaring has on each residual. Place<br />
the points close to the line so that the residuals are less than 1.<br />
7. Create a data set in which the least absolute value and least squares methods agree<br />
on the line of best fit. Various possible answers<br />
8. Place the following ordered pairs (4, 1), (4, 4), (-4, 0), and (-4, -3) in the table. Find<br />
the line of best fit for each method. Note:The absolute value line is not unique.<br />
• Compare and contrast these two methods.<br />
Various answers: Note, both methods are valid. However, the absolute value method<br />
does not always give a unique regression line.<br />
Spaghetti Regression 3-29
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
• How does squaring the residuals affect how individual data points contribute to the<br />
total error? Does squaring increase or decrease the effect of an individual residual on<br />
the total error? If the residual is less than one, squaring decreases it’s effect on the<br />
total squared residual. If the residual is greater than one, squaring increases it’s effect<br />
on the total squared residual. Thus, the squaring method rewards small errors and<br />
penalizes large residual errors. This penalizing and rewarding effect of the least<br />
squares method is often described as desirable by statisticians. The absolute value<br />
methods however treats all residuals the same (with equal contempt).<br />
• What is the effect of an outlier point on each of the possible trend lines for each<br />
method? Since squaring will give disproportion weight to the outlier when compared to<br />
the absolute value method it will have a greater effect on the sum errors of the least<br />
squares regression line.<br />
Further investigation<br />
Another method for finding regression lines is Chebyshev’s Best-Fit Line Method, also known<br />
as the MinMax Method, which finds the line with the minimum maximum residual.<br />
Chebyshev’s evaluates each line based on its largest residual and takes the line with<br />
the smallest (largest residual ) as the regression line.<br />
• Use Chebyshev’s method in the previous graphs to determine a line of best fit. How<br />
does it compare to the least absolute value and least squares methods? How it is<br />
affected by outliers?<br />
Spaghetti Regression 3-30
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Supplemental Material<br />
Two ways to minimize the sum of the squared residuals<br />
The key to solving this or any problem is understanding exactly for what you are<br />
looking. Our model, or line of “best fit”, f ( x)<br />
= mx+<br />
b , will be one that minimizes the<br />
sum of the squares of the vertical distances between the actual points and the predicted<br />
2<br />
ones, i.e., the residuals = yi − f ( xi<br />
) . It can be written L = ∑ ( y − f ( x)<br />
) or<br />
∑<br />
2<br />
L = ( y − ( mx + b ) ) .<br />
∑<br />
2<br />
What kind of equation is L = ( y − ( mx + b ) ) ? That’s right, quadratic. And we<br />
actually know enough about quadratics from <strong>Algebra</strong> II to solve this problem. But, one<br />
of the key words in the above paragraph is minimize, which should also make you think<br />
Calculus! This gives us an easy alternative approach.<br />
Let us examine this quadratic more closely.<br />
∑<br />
L = ( y − ( mx + b )<br />
=∑<br />
2<br />
)<br />
2 2<br />
2<br />
( m x + 2bmx<br />
+ b − 2myx<br />
− 2by<br />
+ y<br />
It may look daunting, but remember, m and b are the only unknowns here. x and y are<br />
just numbers supplied by each of the actual points in our scatterplot.<br />
2 2<br />
2<br />
Expanding L farther, L = m ∑ x + bm∑<br />
x + nb − 2m∑<br />
xy − 2b∑<br />
y + ∑<br />
2 y<br />
(You might want to double check all this! Why let someone else have all the fun?)<br />
Remember that the summations are just constants! So now we have a choice to use<br />
calculus to find its minimum or use <strong>Algebra</strong> II to find its vertex.<br />
Let’ try the Calculus!<br />
In calculus, the minimum occurs here where the derivative is equal to zero. Since we<br />
have two variables, m and b, we will want to take the derivative of each variable<br />
separately. (These are called partial derivatives.)<br />
∂L<br />
= 2m<br />
∂m<br />
2<br />
∑x+ 2b∑x−2∑<br />
xy =<br />
0<br />
Handout 7-1<br />
Spaghetti Regression 3-31<br />
2<br />
)<br />
2
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
∂L<br />
= 2m y<br />
∂b<br />
∑x+ 2nb<br />
− 2∑<br />
=<br />
0<br />
All that’s left is to solve this system of equations by elimination or substitution. Take<br />
your pick.<br />
Using substitution, b in the second equation looks easiest to solve for. So, we get<br />
∑ y − m∑<br />
x<br />
b = . Substituting for b into the first equation and simplifying, we get<br />
n<br />
n∑xy<br />
+ ∑ x∑y<br />
m =<br />
.<br />
2<br />
2<br />
n x − x)<br />
∑<br />
(∑<br />
And that’s it. Your calculator or computer just sums the x’s, the y’s, the xy’s, etc. and<br />
out pops the slope and y-intercept of your regression equation. It is not hard, but<br />
certainly tedious when done by hand.<br />
(You may wonder how we know it is a minimum and not a maximum. The second<br />
derivative is 2; a positive second derivative means it must be a minimum.)<br />
Let us try it with <strong>Algebra</strong>!<br />
Here we go. Remember that we want to find the minimum of<br />
L = m<br />
2<br />
2<br />
2<br />
∑ x + bm ∑ x + nb − 2m<br />
∑ xy − 2b∑<br />
y + ∑<br />
2 y<br />
and that all of those summations are just constants. Thus, L is a quadratic with respect<br />
to m or b. This can be seen easily by rearranging.<br />
2<br />
L(m) = ( ∑ x ) m2 2 2<br />
+ ( 2b<br />
∑ x − 2∑<br />
xy)<br />
m − ( 2b<br />
∑ y + ∑ y + nb )<br />
L(b)= n b2 2 2<br />
2<br />
+ ( 2m<br />
∑ x − 2∑<br />
y)<br />
b + ( m<br />
∑ x − 2m∑<br />
xy −∑<br />
y )<br />
Handout 7-2<br />
Spaghetti Regression 3-32<br />
2
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Do they open up or down? The leading coefficients, ∑ 2<br />
x and n, are both positive, so<br />
the answer is up.<br />
From <strong>Algebra</strong> II, we know the vertex of Ax 2 − B<br />
+ Bx + C occurs at .<br />
2A<br />
So m =<br />
b =<br />
− ( 2b∑<br />
x − 2<br />
2<br />
2 x<br />
∑<br />
− ( 2m∑<br />
x − 2∑<br />
y)<br />
2n<br />
∑<br />
xy)<br />
=<br />
=<br />
∑ xy − b<br />
∑<br />
x<br />
∑ y − m∑<br />
x<br />
.<br />
n<br />
Handout 7-3<br />
Spaghetti Regression 3-33<br />
2<br />
∑<br />
x<br />
, and<br />
∑ ∑ ∑<br />
∑ (∑<br />
n xy + x y<br />
Substituting one into the other, we get m =<br />
and<br />
2<br />
2<br />
n x − x)<br />
∑<br />
∑<br />
∑<br />
∑ ∑<br />
(∑<br />
2<br />
y x − x xy<br />
b = . This is exactly the same result as before.<br />
2<br />
2<br />
n x + x)<br />
Some Historical Notes<br />
Who invented the method of least squares? It is not clear. Often credit is given to<br />
Karl Friedrich Gauss (1777–1855), who was first published on this subject in 1809. But<br />
the Frenchman Adrien Marie Legendre (1752–1833) published a clear example of the<br />
method four years earlier. Legendre was in charge of setting up the new metric system<br />
of measurement, and the meter was to be one ten-millionth of the distance from the<br />
North Pole through Paris to the Equator. Surveyors had measured portions of the arc<br />
but to get the best measurement for the whole arc, Legendre developed the method of<br />
least squares. He would probably use GPS today, but he was still amazingly accurate.<br />
Where does the term "regression” come from? The term was first used by Sir<br />
Francis Galton (1822-1911) in his hereditary studies. He wanted to predict the heights<br />
of sons from their father’s heights. He learned that a tall father tended to have sons<br />
shorter than himself, and a short father tended to have sons taller than himself. The<br />
heights of sons thus regressed towards the mean height of the population over several<br />
generations. The term "regression” is now used for many types of prediction problems,<br />
and does not merely apply to regression towards the mean.
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Activity: Understanding Correlation Properties with a Visual Model<br />
Overview: This activity encourages participants to visually explore the meaning of<br />
correlation and to recognize correlation patterns.<br />
.<br />
<strong>TEKS</strong>: This activity supports teacher content knowledge underlying the<br />
following <strong>TEKS</strong>:<br />
§111.32. <strong>Algebra</strong> I<br />
(a) Basic understandings.<br />
(5) Tools for algebraic thinking. Techniques for working with functions<br />
and equations are essential in understanding underlying<br />
relationships. Students use a variety of representations (concrete,<br />
pictorial, numerical, symbolic, graphical, and verbal), tools, and<br />
technology, (including, but not limited to, calculators with graphing<br />
capabilities, data collection devices, and computers) to model<br />
mathematical situations to solve meaningful problems.<br />
(A.2) Foundations for functions. The student uses the properties and<br />
attributes of functions.<br />
The student is expected to:<br />
(D) collect and organize data, make and interpret scatter plots<br />
(including recognizing positive, negative, or no correlation for data<br />
approximating linear situations), and model, predict, and make<br />
decisions and critical judgments in problem situations.<br />
Vocabulary: correlation, regression, Pearson Product moment correlation, causation<br />
Procedure: Participants use a computer to investigate correlation values and to<br />
practice estimating correlation values for scatterplots.<br />
After completing the activity, participants should have a visual feel for<br />
numerical correlation values, and should also be able to relate numerical<br />
values of correlation to contextual situations. Participants are also<br />
encouraged to investigate and understand the relationship between<br />
correlation and causation.<br />
Materials: Computer with internet access and Java 1.4<br />
PowerPoint slides: Correlation vs. Causation<br />
Handout 1 (pages 3-42 – 3-46)<br />
Handout 2 (pages 3-53 – 3-55)<br />
Handout 3 (pages 3-61 – 3-64)<br />
Handout 4 (page 3-65)<br />
Handout 5 (pages 3-66 – 3-67)<br />
Yard stick<br />
Photocopy of a forearm.<br />
Understanding Correlation Properties with a Visual Model 3-34
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Grouping: Individually or pairs<br />
Time: 2 - 2½ hours<br />
Lesson:<br />
Procedures Notes<br />
Activity 1 CSI Correlation<br />
Part A:<br />
Participants use a computer to<br />
investigate how the modeling process is<br />
used to generate new knowledge.<br />
Distribute Handouts 1 and 2, Activities 1<br />
and 2, (pages 3-42 – 3-46 and pages 3-<br />
53 – 3-55).<br />
Read the crime scene scenario.<br />
Participants will collect data from 8<br />
people using a yard stick.<br />
Participants will use a computer to<br />
investigate correlation values. Using the<br />
applet Correlation.<br />
www.mathteks2006.net/applets<br />
Hand out the photo copy of the<br />
assailants forearm. The participants will<br />
then extrapolate the assailants height.<br />
Part B: A Closer Look:<br />
Participants use a computer to<br />
investigate correlation values. Have<br />
participants open the applet Correlation.<br />
www.mathteks2006.net/applets<br />
Changes in the data set are investigated.<br />
Outliers, changes in scale, and the<br />
geocenter of a set of data are discussed.<br />
The forearm should be measured from<br />
the elbow to the wrist.<br />
Participants should discuss measuring<br />
techniques and degree of accuracy.<br />
You will need a photocopy of the<br />
assailants forearm to distribute to each<br />
group. If possible use someone who is a<br />
bit out of the normal range. For<br />
example, the tallest or shortest<br />
participant. This will cause the<br />
participants to extrapolate instead of<br />
interpolate.<br />
After completing Activity 1, participants<br />
should have a visual feel for numerical<br />
correlation values and should also be<br />
able to relate numerical values of<br />
correlation to contextual situations.<br />
Understanding Correlation Properties with a Visual Model 3-35
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
Activity 2<br />
Part A:<br />
The goal of this activity is to gain an<br />
intuitive understanding of r. Using the<br />
web applet Correlation, scatterplots are<br />
easily constructed. By clicking and<br />
dragging points, participants can change<br />
the data sets and investigate the effect<br />
on the correlation.<br />
Part B: The r Game<br />
Have participants play a game with<br />
several classmates to develop deeper<br />
understanding of correlations, leverage<br />
points, and geocenters. Participants use<br />
the web applet Correlation,<br />
http://mathteks006.net/applets, to create<br />
scatterplots with a specific correlation.<br />
(See Part B handout for further<br />
directions.)<br />
The participants should play several<br />
times until they have a good intuition of<br />
how each point’s relationship with the<br />
others affects the correlation.<br />
Activity 3 Correlation vs. Causation<br />
This activity explores the relationship<br />
between correlation and causation.<br />
Part A:<br />
Give out Handout 3, Activity 3 - Part A,<br />
(pages 3-61 – 3-64) or use the Power<br />
Point provided and lead a class<br />
discussion of correlation and causation.<br />
The dynamic nature of the applet allows<br />
you to see how the correlation changes<br />
when a data point is added or moved.<br />
Without technology, such intuition would<br />
take years to develop.<br />
When interpreting the correlation<br />
coefficient, you should always look at the<br />
scatterplot first to see if the relationship<br />
is linear. If it is, you may calculate the<br />
correlation coefficient. Always<br />
remember that a visual analysis of data<br />
is quite valuable in addition to a<br />
numerical analysis.<br />
Understanding Correlation Properties with a Visual Model 3-36
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
Correlation vs.<br />
Causation<br />
In a Gallup poll, surveyors asked, “Do you<br />
believe correlation implies causation?’”<br />
64% of American’s answered “Yes” .<br />
38% replied “No”.<br />
The other 8% were undecided.<br />
Ice-cream sales are strongly<br />
correlated with crime rates.<br />
Therefore, ice-cream causes<br />
crime.<br />
There is a humorous article discussing<br />
this poll in the appendix.<br />
If correlation implies causation, this<br />
would be a fabulous finding! To reduce<br />
or eliminate crime, all we would have to<br />
do is stop selling ice cream. Even<br />
though the two variables are strongly<br />
correlated, assuming that one causes<br />
the other would be erroneous. What are<br />
some possible explanations for the<br />
strong correlation between the two?<br />
One possibility might be that high<br />
temperatures increase crime rates<br />
(presumably by making people irritable)<br />
as well as ice-cream sales.<br />
Understanding Correlation Properties with a Visual Model 3-37
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
The Simpsons<br />
(Season 7, "Much Apu About Nothing")<br />
Homer:Not a bear in sight. The "Bear<br />
Patrol" is working like a charm!<br />
Lisa: That's specious reasoning, Dad.<br />
Homer:[uncomprehendingly] Thanks,<br />
honey.<br />
Lisa: By your logic, I could claim that<br />
this rock keeps tigers away.<br />
Homer:Hmm. How does it work?<br />
Lisa: It doesn't work; it's just a<br />
stupid rock!<br />
Homer:Uh-huh.<br />
Lisa: But I don't see any tigers<br />
around, do you?<br />
Homer:(pause) Lisa, I want to buy your<br />
rock.<br />
Without proper prope r interpretation,<br />
inte rpre tation,<br />
causation should not be<br />
assumed, or even implied.<br />
Cons ider the following res earc earch h<br />
undertaken by the Univers ity of <strong>Texas</strong><br />
Health S cience <strong>Center</strong> at S an Antonio<br />
appearing to s how a link between<br />
cconsumption ons umption of diet diet s oda and weight<br />
gain.<br />
The The ss tudy tudy of of more more than than 600 600 normal--weight<br />
normal weight<br />
people people found, found, eight eight years years later, later, that that they they<br />
were were 65 65 percent percent more more likely likely to to be be<br />
overweight overweight if if they they drank drank one one diet diet ss oda oda a a<br />
day day than than if if they they drank drank none. none. And And if if they they<br />
drank drank two two or or more more diet diet ss odas odas a a day, day, they they<br />
were were even even more more likely likely to to become become<br />
overweight overweight or or obes obese. e.<br />
An entertaining demonstration of this<br />
fallacy once appeared in an episode of<br />
The Simpsons (Season 7, "Much Apu<br />
About Nothing"). The city had just spent<br />
millions of dollars creating a highly<br />
sophisticated "Bear Patrol" in response<br />
to the sighting of a single bear the week<br />
before.<br />
Our students and the general public<br />
often take such relationships as causal.<br />
By no means does this state that diet<br />
soda causes obesity - but there is a<br />
strange pattern at play here.<br />
A relationship other than causal might<br />
exist between the two variables. It is<br />
possible that there is some other<br />
variable or factor that is causing the<br />
outcome. This is sometimes referred to<br />
as the "third variable" or "missing<br />
variable" problem.<br />
• What are some other possible<br />
plausible alternative explanations<br />
to our diet soda/obesity research<br />
example?<br />
Understanding Correlation Properties with a Visual Model 3-38
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
A re lationship othe r than causal<br />
m ight e xist be twe e n the two<br />
variables. It's possible that the re<br />
is some other variable or factor<br />
that is causing th e outcom e . This<br />
is some tim e s re fe rre d to as the<br />
"third va ria b le " or "m issin g<br />
variab le " proble m .<br />
Ice cream sales and the number of shark<br />
attacks on swimmers are correlated.<br />
Skirt lengths and stock prices are highly<br />
correlated (as stock prices go up, skirt<br />
lengths get shorter).<br />
The number of cavities in elementary<br />
school children and vocabulary size are<br />
strongly correlated.<br />
The re are two re lationships<br />
which can be mistaken for<br />
causation:<br />
1. Common re sponse<br />
2. Confounding<br />
We must be very careful in interpreting<br />
correlation coefficients. Just because<br />
two variables are highly correlated does<br />
not mean that one causes the other. In<br />
statistical terms, we simply say that<br />
correlation does not imply causation.<br />
There are many good examples of<br />
correlation which are nonsensical when<br />
interpreted in terms of causation.<br />
Understanding Correlation Properties with a Visual Model 3-39
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
→Z → X &Y<br />
1 . Common Re sponse :<br />
Both Xand Yre spond to change s in<br />
some unobse rve d variable , Z. All<br />
three of our previous examples are<br />
examples of common response.<br />
2. Confounding<br />
The effect of Xon Yis indistin guishab le<br />
from the effects of other explanatory<br />
variable s on Y. When studying medical<br />
tre atm e nts, the “place bo e ffe ct” is an<br />
example of confounding.<br />
When can we imply<br />
causation?<br />
Controlled experiments<br />
must be performed.<br />
Unless data have been gathered by experimental<br />
means and confounding variables have been<br />
eliminated, correlation never implies causation.<br />
The placebo effect is the phenomenon<br />
that a patient's symptoms can be<br />
alleviated by an otherwise ineffective<br />
treatment, since the individual expects or<br />
believes that it will work.<br />
For example, if we are studying the<br />
effects of Tylenol on reducing pain, and<br />
we give a group of pain-sufferers Tylenol<br />
and record how much their pain is<br />
reduced, the effect of Tylenol is<br />
confounded with the effect of giving them<br />
any pill. Many people will report a<br />
reduction in pain by simply being given a<br />
sugar pill with no medication.<br />
Experimental research attempts to<br />
understand and predict causal<br />
relationships. Since correlations can be<br />
created by an antecedent, Z, which<br />
causes both X and Y, or by confounding<br />
variables, controlled experiments are<br />
performed to remove these possibilities.<br />
Still the great Scottish philosopher David<br />
Hume has argued that we can only<br />
perceive correlation, and causality can<br />
never truly be known or proven.<br />
Understanding Correlation Properties with a Visual Model 3-40
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Procedures Notes<br />
Part B: Headlines<br />
Distribute Handout 4, Part B, (page 3-<br />
65). Participants brainstorm common<br />
causes of confounding variables for<br />
various headlines and related<br />
correlations.<br />
Within your group, brainstorm common<br />
causes or confounding variables. Write<br />
your ideas below and be prepared to<br />
share.<br />
Handout 5, Supplemental Reading,<br />
(pages 3-66 – 3-67).<br />
This is a humorous article discussing the<br />
correlation, causation debate.<br />
Power point slides of the headlines are<br />
included to help in a summary<br />
discussion of this activity.<br />
Understanding Correlation Properties with a Visual Model 3-41
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
ACTIVITY 1<br />
This module opens with an explanation of the way that paired measurements can be<br />
plotted in two-dimensional space. Next, positive and negative relationships are<br />
discussed and participants are asked to predict values using a regression equation. It<br />
concludes with a discussion of outliers.<br />
PART A<br />
Consider the following.<br />
At approximately 6:45 a.m., Tuesday morning, Principal Espinoza saw something<br />
strange as he opened the backdoor to B. Wyatt High School. As he entered the<br />
hallway, he immediately discovered the broken glass from the classroom door. It was a<br />
9 th grade <strong>Math</strong> classroom. The computers were missing, the desks were overturned,<br />
and the prized school banner was torn from the wall. The perpetrators were long gone,<br />
but they had left something behind. Next to the desk, where Mrs. Joe’s computer once<br />
sat, was the imprint of a forearm on the board. When the police arrived, they<br />
immediately began to gather forensic evidence. Mr. Espinosa, knowing your love of CSI<br />
and Numb3rs, asks you to help gather data to help identify the bandits.<br />
Bones of the arm can reveal interesting facts about an individual. But can they reveal a<br />
person's height? Forensic anthropologists team up with law enforcers to help solve<br />
crimes. Let us combine math with forensics to see how.<br />
Collect data for 8 people.<br />
Person Forearm<br />
Length<br />
(inches)<br />
Height<br />
(inches)<br />
Handout 1-1<br />
Understanding Correlation Properties with a Visual Model 3-42
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
1. From the table, describe any relationships you see between the variables forearm<br />
length and height.<br />
Making a scatter plot can provide a useful summary of a set of bivariate data (two<br />
variables). It gives a good visual picture of the relationship between the two variables<br />
and aids in the interpretation of the correlation coefficient and regression model. The<br />
scatterplot should always be drawn before working out a linear correlation coefficient or<br />
fitting a regression line.<br />
A positive association is indicated on a scatterplot by an upward trend (positive<br />
slope), where larger x-values correspond to larger y-values and smaller x-values<br />
correspond to smaller y-values. A negative association would be indicated by the<br />
opposite effect (negative slope) where the higher x-values would correspond to lower yvalues.<br />
Or, there might not be any notable linear association.<br />
2. We will use the web applet Correlation for further investigation in the following<br />
exercises. Enter the forearm length and height data into the table and examine the<br />
scatterplot.<br />
In 1896, Karl Pearson gave the formula for calculating the correlation coefficient known<br />
as r. (To see it, select show equation for r.) He argued that it was the best indicator of<br />
linear relationships. It derives its name from linear, meaning “straight line,” and corelation<br />
meaning to "go together." The drudgery of computing the correlation coefficient<br />
by hand is quite ominous. However, today’s calculators can easily compute r. It is often<br />
referred to as the Pearson Product Moment Correlation Coefficient.<br />
We can generally categorize the strength of correlation as follows:<br />
• Strong |r| > 0.8<br />
• Moderate: 0.5< |r |
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
If variables are strongly correlated, we often use one to predict the other. A gross<br />
example from forensic science is using the size and larva stage of maggots to predict<br />
time of death. Linear regression is the method used to create these mathematical<br />
prediction models. Given X, we can predict Y. If the correlation is high enough, record<br />
the function for the regression line.<br />
3. Using the information you collected, try predicting the height of our assailant for Mr.<br />
Espinosa. A copy of the police imprint from our assailant is attached.<br />
● What would increase your confidence in this prediction?<br />
In real life, mathematics always begins with a question. What do you want to know?<br />
This is followed by data collection. If it is bivariate data, scatterplots are drawn to give<br />
the “big picture.” If the relationship looks linear, the correlation coefficient is calculated<br />
to quantify the relationship. If the r value is reasonable, a linear function can be found<br />
that is used to predict what has not been observed; in our case, the height of the<br />
assailant.<br />
Handout 1-3<br />
Understanding Correlation Properties with a Visual Model 3-44
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
PART B – A Closer Look<br />
Now let us look more closely at how we measure the strength of associations between<br />
data sets. The correlation coefficient can range from -1 to 1. ( ± 1 being a perfect linear<br />
correlation between the two variables.) If the variables are completely independent, the<br />
correlation is 0. However, the converse is not true since the correlation coefficient<br />
detects only linear dependencies between two variables.<br />
Let us investigate changes in our data set.<br />
1. Click and drag one point of your scatterplot until the correlation is 0.3. Record the<br />
coordinates.<br />
● Is the placement of this point unique?<br />
● What does the new point represent in terms of the context?<br />
An outlier is an observation that lies an abnormal distance from other values in a<br />
sample. In a sense, this definition leaves it up to you, the analyst, to decide what will be<br />
considered abnormal. Before abnormal observations can be singled out, it is necessary<br />
to characterize normal observations. If the data point is in error, it should be corrected if<br />
possible. If there is no reason to believe that the outlying point is in error, it should not<br />
be deleted without careful consideration.<br />
● Would you consider your point an outlier? Why?<br />
2. Suppose a “mistake” was made. All the forearm sizes were reported in centimeters<br />
(1 in. = 2.54 cm.), and all the heights were recorded in inches. A student tells you<br />
that the correlation will be too low saying that increasing the forearm data by a factor<br />
greater than 1 will spread the points in a graph. Do you agree with the student?<br />
How would you explore this issue?<br />
Handout 1-4<br />
Understanding Correlation Properties with a Visual Model 3-45
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
● What do you suppose would happen to our correlation value if we changed to<br />
different height scale?<br />
3. Delete the outlying point from your table. Now, add two additional points to make a<br />
correlation of 0.99. Discuss the placement of your points.<br />
The geocenter, also called the center of mass or centroid is the “average” point of the<br />
data. If we have the points (x1,y1), (x2,y2) (x3,y3), and (x4,y4) then the coordinates of<br />
the geocenter would be ⎛ x 1+ x2<br />
+ x3<br />
+ x4<br />
y1<br />
+ y2<br />
+ y3<br />
+ y4<br />
⎞.<br />
The further a point is from the<br />
⎜<br />
⎝<br />
4<br />
,<br />
geocenter of the data the more “leverage” it has. (Note: The regression line always<br />
passes through this point.)<br />
4<br />
Students often have a naïve sense of correlation. We should look to extend their<br />
understandings. Dynamic applications such as Geometers Sketch Pad and web<br />
applets open up new avenues for exploration and deeper understandings. By allowing<br />
students to explore and test their own conjectures, they take ownership of their<br />
mathematical understandings.<br />
Handout 1-5<br />
Understanding Correlation Properties with a Visual Model 3-46<br />
⎟<br />
⎠
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
ACTIVITY 1 - Selected Answers<br />
This module opens with an explanation of the way that paired measurements can be<br />
plotted in two-dimensional space. Next, positive and negative relationships are<br />
discussed and participants are asked to predict values using a regression equation. It<br />
concludes with a discussion of outliers.<br />
PART A<br />
Consider the following.<br />
At approximately 6:45 a.m., Tuesday morning, Principal Espinoza saw something<br />
strange as he opened the backdoor to B. Wyatt High School. As he entered the<br />
hallway, he immediately discovered the broken glass from the classroom door. It was a<br />
9 th grade <strong>Math</strong> classroom. The computers were missing, the desks were overturned,<br />
and the prized school banner was torn from the wall. The perpetrators were long gone,<br />
but they had left something behind. Next to the desk, where Mrs. Joe’s computer once<br />
sat, was the imprint of a forearm on the board. When the police arrived, they<br />
immediately began to gather forensic evidence. Mr. Espinosa, knowing your love of CSI<br />
and Numb3rs, asks you to gather data to help identify the bandits.<br />
Bones of the arm can reveal interesting facts about an individual. But can they reveal a<br />
person's height? Forensic anthropologists team up with law enforcers to help solve<br />
crimes. Let us combine math with forensics to see how.<br />
Collect data for 8 people. (The number can vary, use at least 7.)<br />
Person Forearm<br />
Length<br />
(inches)<br />
Height<br />
(inches)<br />
10.5 63<br />
10 66<br />
11.5 72<br />
10.25 62<br />
10.5 66<br />
11.5 71<br />
12.5 76<br />
Understanding Correlation Properties with a Visual Model 3-47
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
1. From the table, describe any relationships you see between the variables forearm<br />
length and height.<br />
As forearm length increases, the height increases.<br />
Making a scatter plot can provide a useful summary of a set of bivariate data (two<br />
variables). It gives a good visual picture of the relationship between the two variables<br />
and aids in the interpretation of the correlation coefficient and regression model. The<br />
scatterplot should always be drawn before working out a linear correlation coefficient or<br />
fitting a regression line.<br />
A positive association is indicated on a scatterplot by an upward trend (positive slope)<br />
where larger x-values correspond to larger y-values and smaller x-values correspond to<br />
smaller y-values. A negative association would be indicated by the opposite effect<br />
(negative slope), where the higher x-values would have lower y-values. Or, there might<br />
not be any notable linear association.<br />
2. We will use the web applet Correlation for further investigation in the following<br />
exercises. Enter the forearm length and height data into the table and examine the<br />
scatterplot.<br />
Understanding Correlation Properties with a Visual Model 3-48
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
In 1896, Karl Pearson gave the formula for calculating correlation coefficients known as<br />
r. (To see it, select show equation for r.) He argued that it was the best indicator of<br />
linear relationships. It derives its name from linear, meaning “straight line,” and corelation<br />
meaning to "go together." The drudgery of computing them by hand is quite<br />
ominous. However, today’s calculators can easily compute them. It is often referred to<br />
as the Pearson Product Moment Correlation.<br />
We can generally categorize the strength of correlation as follows:<br />
• Strong: |r| > 0.8<br />
• Moderate: 0.5< |r |
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
their forearm? These questions of reliability are very important in applying the<br />
ideas of this activity to data from any real life experiment.<br />
In real life, mathematics always begins with a question. What do you want to know?<br />
This is followed by data collection. If it is bivariate data, scatterplots are drawn to give<br />
the “big picture.” If the relationship looks linear, the correlation coefficient is calculated<br />
to quantify the relationship. If the r value is reasonable, a linear function can be found<br />
that is used to predict what has not been observed, in our case, the height of the<br />
assailant.<br />
PART B – A Closer Look -Selected Answers<br />
Now let us look more closely at how we measure the strength of association between<br />
data sets. The correlation coefficient can range from -1 to 1. ( ± 1 being a perfect linear<br />
correlation between the two variables.) If the variables are completely independent, the<br />
correlation is 0. However, the converse is not true since the correlation coefficient<br />
detects only linear dependencies between two variables.<br />
.<br />
Let us investigate changes in our data set.<br />
1. Click and drag one point of your scatterplot until the correlation is 0.3. Record the<br />
coordinates.<br />
2.<br />
Ex.(8.07, 72.6) Answers may vary<br />
Understanding Correlation Properties with a Visual Model 3-50
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
● Is the placement of this point unique?<br />
No, it is not. Encourage participants to find several.<br />
Further Questions: How many places can the point be placed? 3, 4, infinite?<br />
● What does the new point represent in terms of the context?<br />
Answers may vary. The example point (8.07, 72.6) represents a person that is<br />
approximately 6 feet and ½ inches tall with 8 inch forearms.<br />
An outlier is an observation that lies an abnormal distance from other values in a<br />
sample. In a sense, this definition leaves it up to you, the analyst, to decide what will be<br />
considered abnormal. Before abnormal observations can be singled out, it is necessary<br />
to characterize normal observations. If the data point is in error, it should be corrected if<br />
possible. If there is no reason to believe that the outlying point is in error, it should not<br />
be deleted without careful consideration.<br />
● Would you consider your point an outlier? Why?<br />
Answers may vary. Yes. The point represents an abnormal situation. He is 6 feet<br />
tall and has the arms of a child or possibly an amputee.<br />
2. Suppose a “mistake” was made. All the forearm sizes were reported in centimeters<br />
(1 in. = 2.54 cm.) and all the heights were recorded in inches. A student tells you that<br />
the correlation will be too low saying that increasing the forearm data by a factor<br />
greater than 1 will spread the points in a graph. Do you agree with the student?<br />
How would you explore this issue?<br />
Encourage participants to try it. This question should encourage a healthy<br />
discussion of important misconceptions. How does the visual spread of the data<br />
affect the correlation? What if both height and forearm are recorded in<br />
centimeters? How would this dilation affect the correlation? This confronts the<br />
students’ belief that if we spread the data, the correlation should diminish.<br />
However, changes in scale do not affect correlations.<br />
● What do you suppose would happen to our correlation value if we changed to a<br />
different height scale?<br />
It should not affect the correlation. Changes in scale do not affect correlations.<br />
3. Delete the outlying point from your table. Now, add two additional points to make a<br />
correlation of 0.99. Discuss the placement of your points.<br />
Understanding Correlation Properties with a Visual Model 3-51
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
The geocenter, also called the center of mass or centroid is the “average” point of the<br />
data. If we have the points (x1,y1), (x2,y2) (x3,y3), and (x4,y4) then the coordinates of<br />
the geocenter would be<br />
. The further a point is from the geocenter<br />
⎛ x 1+ x2<br />
+ x3<br />
+ x4<br />
y1<br />
+ y2<br />
+ y3<br />
+ y4<br />
⎞<br />
⎜<br />
,<br />
⎟<br />
⎝ 4<br />
4 ⎠<br />
of the data the more “leverage” it has. (Note: The regression line always passes<br />
through this point.)<br />
Students often have a naïve sense of correlation. We should look to extend their<br />
understandings. Dynamic applications such as Geometers Sketch Pad and web<br />
applets open up new avenues for exploration and deeper understandings. By allowing<br />
students to explore and test their own conjectures, they take ownership of their<br />
mathematical understandings.<br />
Understanding Correlation Properties with a Visual Model 3-52
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
ACTIVITY 2<br />
H.G Wells once said,”Statistical thinking will one day be as necessary as the<br />
ability to read and write.”<br />
The goal of this activity is to gain an intuitive understanding of r. Using the web applet<br />
Correlation, scatterplots can easily be constructed. The dynamic nature of the applet<br />
allows you to see how the correlation changes when a data point is added or moved.<br />
PART A<br />
1. Clear your table and place two points on the graph. Note the correlation.<br />
● Would any two points have the same value? Explain.<br />
A student remarks that “when r is undefined, it means there is no linear model for the<br />
data.” Do you agree? How would you explore/explain this?<br />
2. Make a lower left to upper right pattern of 10 points with a correlation of 0.7.<br />
3. Make a vertical stack of 9 data points on the left side of the window. Add a 10 th point<br />
somewhere to the right and drag it until the correlation again reaches 0.7. Is its<br />
placement unique?<br />
4. Make another scatter plot with 10 data points in a curved pattern that starts at the<br />
lower left, rises to the right, then falls again at the far right. Adjust the points until<br />
you have a smooth curve with a correlation close to 0.7.<br />
● Does any other curved pattern have this same correlation?<br />
● What can you conclude about the numerical value of a correlation?<br />
Handout 2-1<br />
Understanding Correlation Properties with a Visual Model 3-53
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Juan and Kaylee have collected data for a class experiment. They correctly found an rvalue<br />
of .15. Juan claims that no function will model the data. But, Kaylee says that the<br />
r-value is wrong because she has found a good one. Is this possible? How would you<br />
help these students?<br />
5. Make a data plot with a correlation of 0 by placing 8 to 10 points on the graph.<br />
6. Enter 4 points in the table to make a perfect rectangle. Note the correlation value.<br />
7. Create several other data sets with a horizontal or vertical line of symmetry and note<br />
the correlation value.<br />
Let us take a closer look at the numerical value of r by investigating the equation that<br />
produces this quantity. n is the number of points.<br />
8. Select “Show equation for r” and examine the formula to determine why and when<br />
the correlation is undefined. (Hint: use two points)<br />
So far, we have developed some intuitions about r. Its formal definition is quite<br />
complex. However, r 2 is much simpler. So we mention it here. r 2 is the fraction of total<br />
variation in the y variable that can be explained by the regression equation. The rest of<br />
the variation is due to randomness or some other factors. For example, if the correlation<br />
coefficient is 0.7 then r 2 = 0.49 meaning that 49% of the variation in the y-variable can be<br />
explained by the regression equation. The other 51% is due to some other factors. How<br />
does this affect your understanding of how the strength of correlations are categorized<br />
in part A of activity 1? Consider some other “strong,” “moderate,” and “weak” r-values.<br />
Handout 2-2<br />
Understanding Correlation Properties with a Visual Model 3-54
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
PART B – The r Game<br />
One basic rule when interpreting the correlation coefficient is to “First look at the<br />
scatterplot to see if the relationship between variables is linear.” If it is, you may<br />
calculate the correlation coefficient. Always remember that a visual analysis of data is<br />
quite valuable in addition to a numerical analysis.<br />
To understand r, it is important to understand how individual points affect the value of<br />
correlations. The relationship of outliers, leverage points and non-leverage points to the<br />
geocenter of a set of data are explored in this simple exercise.<br />
Use the web applet Correlation at http://mathteks006.net/applets to practice creating<br />
scatterplots with a specific correlation.<br />
1. Challenge your classmates to place seven points on the graph that have a correlation<br />
of 0.7.<br />
2. You are not allowed to delete or drag points once they are placed on the graph.<br />
3. Price is Right Rules – The closest r-value without going over wins!<br />
Play several times varying the number of points and r-value.<br />
Handout 2-3<br />
Understanding Correlation Properties with a Visual Model 3-55
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
ACTIVITY 2 -Selected Answers<br />
H.G Wells once said, ”Statistical thinking will one day be as necessary as the<br />
ability to read and write.”<br />
The goal of this activity is to gain an intuitive understanding of r. Using the web applet<br />
Correlation, scatterplots can easily be constructed. The dynamic nature of the applet<br />
allows you to see how the correlation changes when a data point is added or moved.<br />
PART A<br />
1. Clear your table and place two points on the graph. Note the correlation.<br />
● Would any two points have the same value? Explain.<br />
No, depending on the placement of the points, it may be 1, -1, or undefined.<br />
A student remarks that “when r is undefined, it means there is no linear model for the<br />
data.” Do you agree? How would you explore/explain this?<br />
No, consider the points (1, 4) and (3,4) modeled by y= 4. There is no correlation<br />
for these points since when the x-values increase, the y-values neither increase<br />
or decrease. The correlation is not zero since the variables are not independent<br />
of one another.<br />
Understanding Correlation Properties with a Visual Model 3-56
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
2. Make a lower left to upper right pattern of 10 points with a correlation of 0.7.<br />
3. Make a vertical stack of 9 data points on the left side of the window. Add a 10 th point<br />
somewhere to the right and drag it until the correlation again reaches 0.7. Is its<br />
placement unique? No<br />
Understanding Correlation Properties with a Visual Model 3-57
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
4. Make another scatter plot with 10 data points in a curved pattern that starts at the<br />
lower left, rises to the right, then falls again at the far right. Adjust the points until<br />
you have a smooth curve with a correlation close to 0.7.<br />
● Does any other curved pattern have this same correlation?<br />
● What can you conclude about the numerical value of a correlation?<br />
Juan and Kaylee have collected data for a class experiment. They correctly found an rvalue<br />
of .15. Juan claims that no function will model the data well. But, Kaylee says that<br />
the r-value is wrong because she has found a good model. Is this possible? How<br />
would you help these students?<br />
It is possible. Since correlation measures only a linear relationship, to have r<br />
close to or equal to zero does not mean that there is no relationship between X<br />
and Y! For example, a relationship might be quadratic.<br />
Understanding Correlation Properties with a Visual Model 3-58
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
5. Make a data plot with a correlation of 0 by placing 8 to 10 points on the graph.<br />
6. Enter 4 points in the table to make a perfect rectangle. Note the correlation value.<br />
Understanding Correlation Properties with a Visual Model 3-59
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
7. Create several other data sets with a horizontal or vertical line of symmetry and note<br />
the correlation value.<br />
Let us take a closer look at the numerical value of r by investigating the equation that<br />
produces this quantity. n is the number of points.<br />
8. Select “Show equation for r” and examine the formula to determine why and when<br />
the correlation is undefined and zero. (Hint: use two points)<br />
So far, we have developed some intuitions about r. Its formal definition is quite<br />
complex. However, r 2 is much simpler, so we mention it here. r 2 is the fraction of total<br />
variation in the y variable that can be explained by the regression equation. The rest of<br />
the variation is due to randomness or other factors. For example, if the correlation<br />
coefficient is 0.7 then r 2 = 0.49 meaning that 49% of the variation in the y-variable can be<br />
explained by the regression equation. The other 51% is due to other factors. How does<br />
this affect your understanding of how the strength of correlations are categorized in part<br />
A of activity 1? Consider some other “strong,” “moderate,” and “weak” r-values.<br />
Understanding Correlation Properties with a Visual Model 3-60
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
ACTIVITY 3 - Correlation vs. Causation<br />
Objective<br />
To explore the relationship between correlation and causation.<br />
PART A<br />
In a Gallup poll, surveyors asked, “Do you believe correlation implies causation?” 64%<br />
of American’s answered “Yes” while only 38% replied “No”. The other 8% were<br />
undecided.<br />
Consider the following:<br />
Ice-cream sales are strongly correlated with crime rates.<br />
Therefore, ice-cream causes crime.<br />
If correlation implies causation, this would be a fabulous finding! To reduce or eliminate<br />
crime, all we would have to do is stop selling ice cream. Even though the two variables<br />
are strongly correlated, assuming that one causes the other would be erroneous. What<br />
are some possible explanations for the strong correlation between the two? One<br />
possibility might be that high temperatures increase crime rates (presumably by making<br />
people irritable) as well as ice-cream sales.<br />
An entertaining demonstration of this fallacy once appeared in an episode of The<br />
Simpsons (Season 7, "Much Apu About Nothing"). The city had just spent millions of<br />
dollars creating a highly sophisticated "Bear Patrol" in response to the sighting of a<br />
single bear the week before.<br />
Handout 3-1<br />
Understanding Correlation Properties with a Visual Model 3-61
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Homer: Not a bear in sight. The "Bear Patrol" is working like a charm!<br />
Lisa: That's specious reasoning, Dad.<br />
Homer: [uncomprehendingly] Thanks, honey.<br />
Lisa: By your logic, I could claim that this rock keeps tigers away.<br />
Homer: Hmm. How does it work?<br />
Lisa: It doesn't work; it's just a stupid rock!<br />
Homer: Uh-huh.<br />
Lisa: But I don't see any tigers around, do you?<br />
Homer: (pause) Lisa, I want to buy your rock.<br />
Correlations are often reported inferring causation in newspaper articles, magazines,<br />
and television news. But, without proper interpretation, causation should not be implied<br />
or assumed.<br />
Consider the following research undertaken by the University of <strong>Texas</strong> Health Science<br />
<strong>Center</strong> at San Antonio, appearing to show a link between consumption of diet soda and<br />
weight gain.<br />
The study of more than 600 normal-weight people found, eight years later, that they were 65<br />
percent more likely to be overweight if they drank one diet soda a day than if they drank none.<br />
And if they drank two or more diet sodas a day, they were even more likely to become<br />
overweight or obese.<br />
Our students and the general public often take such relationships as causal. By no<br />
means does this state that diet soda causes obesity - but there is a strange pattern at<br />
play here.<br />
A relationship other than causal might exist between the two variables. It is possible that<br />
there is some other variable or factor that is causing the outcome. This is sometimes<br />
referred to as the "third variable" or "missing variable" problem.<br />
• What are some other possible plausible alternative explanations to our diet<br />
soda/obesity research example?<br />
We must be very careful in interpreting correlation coefficients. Just because two<br />
variables are highly correlated does not mean that one causes the other. In statistical<br />
terms, we simply say that correlation does not imply causation. There are many<br />
Handout 3-2<br />
Understanding Correlation Properties with a Visual Model 3-62
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
good examples of correlation which are nonsensical when interpreted in terms of<br />
causation.<br />
For example, each of the following are strongly correlated:<br />
• Ice cream sales and the number of shark attacks on swimmers.<br />
• Skirt lengths and stock prices (as stock prices go up, skirt lengths get shorter).<br />
• The number of cavities in elementary school children and vocabulary size.<br />
• Peanut butter sales and the economy.<br />
Two relationships which can be mistaken for causation are:<br />
1. Common response: Both X and Y respond to changes in some unobserved variable,<br />
Z. All three of our examples above are examples of common response.<br />
2. Confounding variables: The effect of X on Y is hopelessly mixed up with the effects<br />
of other variables on Y.<br />
When studying medical treatments, the placebo effect is an example of confounding.<br />
The placebo effect is the phenomenon that a patient's symptoms can be alleviated by<br />
an otherwise ineffective treatment, since the individual expects or believes that it will<br />
work.<br />
For example, if we are studying the effects of Tylenol on reducing pain, and we give a<br />
group of pain-sufferers Tylenol and record how much their pain is reduced, the effect of<br />
Tylenol is confounded with giving them any pill. Many people will report a reduction in<br />
pain by simply being given a sugar pill with no medication.<br />
False causes can be ruled out using the scientific method. This is done through a<br />
designed experiment.<br />
In practice, three conditions must be met in order to conclude that X causes Y, directly<br />
or indirectly:<br />
1) X must precede Y<br />
2) Y must not occur when X does not occur<br />
3) Y must occur whenever X occurs<br />
Handout 3-3<br />
Understanding Correlation Properties with a Visual Model 3-63
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Experimental research attempts to understand and predict causal relationships (X → Y).<br />
Since correlations can be created by an antecedent, Z, which causes both X and Y<br />
(Z → X & Y), or by confounding variables, controlled experiments are performed to<br />
remove these possibilities. Unless data has been gathered by experimental means and<br />
confounding variables have been eliminated, one can not infer causation.<br />
Still the great Scottish philosopher David Hume has argued that we can only perceive<br />
correlation, and causality can never truly be known or proven.<br />
Handout 3-4<br />
Understanding Correlation Properties with a Visual Model 3-64
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
PART B - Headlines<br />
Consider the following headlines and their matching correlations from various sources.<br />
Accepted uncritically, each might be used to "prove" one’s point of view in an article.<br />
Within your group, brainstorm common causes or confounding variables. Write your<br />
ideas below and be prepared to share.<br />
Correlated variables Causation factors<br />
1. Kids’ TV Habits Tied to Lower IQ Scores<br />
IQ scores and hours of TV time (r = -0.54)<br />
2. Eating Pizza ‘Cuts Cancer Risk’<br />
Pizza consumption and cancer rate (r = -0.59)<br />
3. Gun bill introduced to ward off crime<br />
Gun ownership and crime (r = 0.71)<br />
4. Reading Fights Cavities<br />
Number of cavities in elementary school children and their<br />
vocabulary size (r = 0.67)<br />
5. Graffiti Linked to Obesity in City Dwellers<br />
BMI and amount of graffiti and litter (r =0.45)<br />
6. Stop Global Warming: Become a Pirate<br />
Average global temperature and number of pirates ( r = -0.93)<br />
Handout 4<br />
Understanding Correlation Properties with a Visual Model 3-65
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Supplemental Reading<br />
NEW POLL SHOWS CORRELATION IS CAUSATION<br />
WASHINGTON (AP) The results of a new survey conducted by pollsters<br />
suggest that, contrary to common scientific wisdom, correlation does in fact imply causation.<br />
The highly reputable source, Gallup Polls, Inc., surveyed 1009 Americans during the month of<br />
October and asked them, "Do you believe correlation implies causation?" An overwhelming<br />
64% of American's answered "YES", while only 38% replied "NO". Another 8% were<br />
undecided. This result threatens to shake the foundations of both the scientific and mainstream<br />
community.<br />
"It is really a mandate from the people." commented one pundit who wished to remain<br />
anonymous. "It says that The American People are sick and tired of the scientific mumbo-jumbo<br />
that they keep trying to shove down our throats, and want some clear rules about what to<br />
believe. Now that correlation implies causation, not only is everything easier to understand, it<br />
also shows that even Science must answer to the will of John and Jane Q. Public."<br />
Others are excited because this new, important result actually gives insight into why the result<br />
occurred in the first place. "If you look at the numbers over the past two decades, you can see<br />
that Americans have been placing less and less faith in the old maxim 'Correlation is not<br />
Causation' as time progresses." explained pollster and pop media icon Sarah Purcell. "Now,<br />
with the results of the latest poll, we are able to determine that people's lack of belief in<br />
correlation not being causal has caused correlation to now become causal. It is a real advance<br />
in the field of meta-epistemology."<br />
This major philosophical advance is, surprisingly, looked on with skepticism amongst the<br />
theological community. Rabbi Marvin Pachino feels that the new finding will not affect the plight<br />
of theists around the world. "You see, those who hold a deep religious belief have a thing called<br />
faith, and with faith all things are possible. We still fervently believe, albeit contrary to strong<br />
evidence, that correlation does not imply causation. Our steadfast and determined faith has<br />
guided us through thousands of years of trials and tribulations, and so we will weather this storm<br />
and survive, as we have survived before."<br />
Joining the theologists in their skepticism are the philosophers. "It's really the chicken and the<br />
egg problem. Back when we had to worry about causation, we could debate which came first.<br />
Now that correlation IS causation, I'm pretty much out of work." philosopher-king Jesse "The<br />
Mind" Ventura told reporters. "I've spent the last fifteen years in a heated philosophical debate<br />
about epistemics, and then all of the sudden Gallup comes along and says, "Average household<br />
consumption of peanut butter is up, people prefer red to blue, and...by the way, CORRELATION<br />
IS CAUSATION. Do you know what this means? This means that good looks actually make<br />
you smarter! This means that Katie Couric makes the sun come up in the morning! This means<br />
that Bill Gates was right and the Y2K bug is Gregory's fault." Ventura was referring to Pope<br />
Gregory XIII, the 16 th century pontiff who introduced the "Gregorian Calendar" we use today,<br />
and who we now know is to blame for the year 2000.<br />
The scientific community is deeply divided on this matter. "It sure makes my job a lot easier."<br />
confided neuroscientist Thad Polk. "Those who criticize my work always point out that, although<br />
highly correlated, cerebral blood flow is not 'thought'. Now that we know correlation IS causal, I<br />
can solve that pesky mind-body problem and conclude that thinking is merely the dynamic<br />
movement of blood within cerebral tissue. This is going to make getting tenure a piece of cake!"<br />
Handout 5-1<br />
Understanding Correlation Properties with a Visual Model 3-66
<strong>Math</strong>ematics <strong>TEKS</strong> Refinement 2006 – 9-12 Tarleton State University<br />
Anti-correlationist Travis Seymour is more cynical. "What about all the previous correlational<br />
results? Do they get grandfathered in? Like, the old stock market/hemline Pearson's rho is<br />
about 0.85. Does this mean dress lengths actually dictated the stock market, even though they<br />
did it at a time when correlation did not imply causation? And what about negative and<br />
marginally significant correlations? These questions must be answered before the scientific<br />
community will accept the results of the poll wholeheartedly. More research is definitely<br />
needed."<br />
Whether one welcomes the news or sheds a tear at the loss of the ages-old maxim that hoped<br />
to eternally separate the highly correlated from the causal, one must admit that the new logic is<br />
here and it's here to stay. Here to stay, of course, until next October, when Gallup, Inc. plans<br />
on administering the poll again. But chances are, once Americans begin seeing the<br />
entrepeneurial and market opportunities associated with this major philosophical advance, there<br />
will be no returning to the darker age when causal relationships were much more difficult to<br />
detect.<br />
http://www.obereed.net/hh/correlation.html<br />
Handout 5-2<br />
Understanding Correlation Properties with a Visual Model 3-67