AP STAT Section 3.3: Correlation and Regression Wisdom

AP STAT Section 3.3: Correlation and Regression Wisdom

AP STAT Section 3.3: Correlation and Regression Wisdom EQ: What are influential points and lurking variables and how do they impact the association between two variables? Recall:

Outliers--- points that are well removed from the trend that the other points seem to follow. Outliers in Univariate Data Set value in a set of data that does not fit with the rest of the data more than 3 standard deviations from the mean

lies outside the 1.5(IQR) fences Outliers in Bivariate Data extreme with respect to other y values in regression, a point that has an unusually large residual Influential Point in Bivariate Data

when removed the regression line changes leverage on the regression coefficient (aka known as slope) normally outliers in x direction, but are not always outliers in terms of regression (i.e. residual not large)

350 Heart attacks 300 Outlier 250

200 150 Influential observation 100 50 0

0 2 4 6 8

10 Wine consumption 12 14

16 The original data set is graphed at the right. Classify the new point as an outlier and/or an influential point. State whether its presence increases or decreases the strength of the association of the variables.

Outlier Decreases strength of association Outlier and Influential Influential Increases strength of Decreases strength of association

association Go over graphs on pp 235 236 Important Notes: Outlier points are almost always influential, but not vice-versa. Outliers in y-direction may influence y-intercept, but not the slope of the

regression line. Test for influential point is what happens to regression line when point is removed No RULE for determining outliers and influential points. Be able to explain what happens to correlation, slope, yintercept, and coefficient of determination when these points are

added to or removed from a scatterplot. Assignment: p. 238 239 #59 62 During the months of March and April of a certain year, the weekly weight increases of a puppy in New York were collected. For the same time f rame, the retail price increases of snowshoes in Alaska were collected.

Create a scatterplot for this data. Analyze both your graph and the summary statistics in a few sentences. Conclusion? The weight increase of a puppy in New York is CAUSING the price of snowshoes in Alaska to increase

or vice-versa. OF COURSE NOT!! Be sure your relationship makes sense. Scatterplots and correlation do not demonstrate causation. Association does not imply causation! Causation follows from linear

regression only. Lurking Variables --- has an important effect and yet is not included among the predictor variables under consideration. Perhaps its existence is unknown or its effect unsuspected. What

could be a lurking variable in these examples? a. There is a strong positive correlation between the foot length of K-12 students and reading scores. b. Students who need tutors have lower test scores than students who dont.

c. A survey shows a strong positive correlation between the percentage of a country's inhabitants that use cell phones and the life expectancy in that country. A group of college students believes that herbal tea has remarkable powers. To test this belief, they make weekly visits to a local nursing home, where they visit with the residents and serve them herbal tea. The nursing home staff reports that after several months

many of the residents are more cheerful and healthy. A skeptical sociologist commends the students for their good deeds but scoffs at the idea that herbal tea helped the residents. Identify the explanatory and response variables in this informal study. Then explain why lurking variables account for the observed association. KEY IDEAS TO FOCUS

ON: Univariate Data Bivariate Data Assignment: p. 242 243 #63, 64, 66, 67 p. 244 247 #69, 70, 73

Recently Viewed Presentations

  • Miracles - Joel D. Heck

    Miracles - Joel D. Heck

    The Circumstances. On Sept. 27, 1942, Lewis gave a talk on miracles at the Church of St Jude on the Hill, London. He later used some of that talk for the book, and the talk is now incorporated into God...
  • Determination of The Relationship Between Cardiac Output ...

    Determination of The Relationship Between Cardiac Output ...

    Resting Metabolic Rate: All subjects reported to the lab between 6 and 8am having fasted overnight. Subjects reclined in chair for 30 minutes to return to a resting metabolic condition. Data (O2 uptake, CO2 production, Respiratory Quotient) were collected for...
  • New Years Resolutions for SPAG Time is ticking

    New Years Resolutions for SPAG Time is ticking

    Can you make up a SPAG anagram of your own then test your partner? mole coins. flu plots. spare photo. a little carpet. lace us. Time is ticking… unscramble . the . key words . and/or note down when to...
  • St Peter's Huddersfield - Beaumont Street PDL Course

    St Peter's Huddersfield - Beaumont Street PDL Course

    Beaumont Street Studios University Certificate in Digital Media Personal Development and Learning PowerPoint Presentation St Peter's Huddersfield St Peter's Huddersfield Heritage And future Built by James P Pritchett in the 1830s.
  • George Orwell & 1984 - KHHS Super Seniors

    George Orwell & 1984 - KHHS Super Seniors

    George Orwell. Real name is Eric Blair. Born in 1903 in Bengal, a part of the British colony of India. Attended various boarding schools in England from the age of 5 until the age of 18. Lived in Burma after...
  • Integrated Resource Planning Stakeholder Outreach Meeting April 17,

    Integrated Resource Planning Stakeholder Outreach Meeting April 17,

    Introductions, Pat Poli. Upper Peninsula Insight, Art Thayer. Upper Peninsula Insight, Brandy Brown. Review of Comments Received, Pat Poli. Questions and Discussion. Upper Peninsula Considerations. Art Thayer: Energy Efficiency Programs Director.
  • The Internet and the World Wide Web - sites.google.com

    The Internet and the World Wide Web - sites.google.com

    Web site is a collection of documents. Document is a web page. Pages are published to the web. Hypertext Markup Language (HTML) Creates web pages. Describes how pages should look. Content enclosed in tags <tag>content</tag> Teaching tip. Show your students...
  • The Sherlock Project Tellabs, Inc. Purdue University Computer ...

    The Sherlock Project Tellabs, Inc. Purdue University Computer ...

    Times New Roman Default Design Microsoft Word Document Microsoft Excel Worksheet The Sherlock Project Purdue University, Department of Computer Sciences Sponsored by Tellabs, Inc. Overview The CS 406/407 Courses: Fundamentals of Software Engineering The Project: Sherlock System and Human Resource...