Lecture 6 - Linear Relationships, Scatter Plots, Covariance - Unit 2: Regression Flashcards
Suppose that you are told the class median for
the Quiz 1 grades was 80% and the IQR was
5%.
Assuming that π2 is exactly in the middle of
π1 and π3, then one quarter of the class
obtained a grade between 77.5 and 82.5%.
A) TRUE
B) FALSE
B) False
Half the class would have grades between 77.5 - 82.5%
Which numerical measure(s) cannot be
approximated by looking at a boxplot?
A) Mean
B) Median
C) Range
D) Interquartile range
A) Mean
Linear Relationships
Relationship Between Two Variables
bivariate data where two variables are measured per unit
π₯ is called the explanatory variable (independent variable)
- explains HOW the response changes from unit to unit
π¦ is called the response variable (dependent variable)
- Measures an outcome of a study; captures the outcome of interest
Linear Relationships - Variables
The typical amount of calories a person consumes per day and that
personβs percent of body fat
higher calories = higher body fat
x - explanatory variable
- number of calories
y - response variable
- % body fat
Linear Relationships -Variables
Inches of rain in the growing season and the yield of corn in bushels
per acre
more rain = higher yield
x - explanatory variable
- amount of rain
y - response variable
- yield of corn
Linear Relationships
General Approach
- Make a scatter plot of all (π₯, π¦) coordinates.
- Compute numerical measures that describe the linear relationship.
-Covariance
-Correlation - Make predictions.
-Regression line
-R-squared
Scatter Plots
Scatter Plot and Relationships
Chapter 3
Scatter Plots
What do we notice?
Flipper length increases,
Body mass increases
Positive Linear Relationship
Scatter Plots
Interpreting Scatter Plots
After plotting two variables on a scatter plot, we describe the overall pattern of the relationship. Specifically, we look for:
- Form: linear, curved, clusters, no pattern
- Direction: positive, negative, no direction
- Strength: how closely the points fit the βformβ
And clear deviations from that pattern (i.e., outliers of the
relationship).
Numerical Measures of Linear Relationships
Covariance, π π₯π¦
Covariance, π π₯π¦
How does this work?
π₯1= 1 is BELOW itβs mean.
π¦1 = 1 is BELOW itβs mean.
Covariance, π π₯π¦
Determine the signs for the other three grids.
Covariance, π π₯π¦
Covariance Calculation
Covariance, π π₯π¦
Question