Unit 3- Bivariate Data Flashcards

1
Q

What is bivariate data and what associations do we identify and describe?

A

Bivariate data is data that has two variables and is therefore collected in pairs.
We focus on: associations between two numerical values or categorical values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are explanatory vs response variables.

A
  1. Explanatory variable: these are independent and are used to explain or predict a difference in the response variable.
  2. Response variable: these are dependant and is explained by the explanatory variable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are scatterplots and their purpose?

A

Scatterplots display two variables with scattered data points. This allows us to identify if there is a clear association.

Associations:
- directions are positive or negative
- form is linear or non-linear
- strength is weak, moderate, strong.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Explain Pearson’s correlation coefficient.

A

This is (r).
- shows the strength of a relationship.
- only valid for linear relations.
-ve or +ve shows the direction.
- closer to 1 or -1 is perfectly strong.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the strength categories of Pearson’s correlation coefficient.

A

(for +ve or -ve)
0.75-1 strong
0.5-0.75 moderate
0.25-0.5 weak
less than 0.25 no association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain the coefficient of determination.

A

This is (R^2)
- the proportion of the total variation that can be explained by the linear relationship.
- e.g. R^2 value of 0.94 means 94% of variation is explained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Explain least-squares regression line.

A

(Line of best fit.)
given by y=a+bx
y= response variable
a= y int
b= gradient
x= explanatory variable

to find b, b=r x (s(y)/s(x))
r= correlation coefficient
s(y)= sample standard deviation of y vals
s(x)= sample standard deviation of x vals

to find a, a= y(mean)-b(mean) times x(mean)
y= mean of y vals
b= slope
x= mean of x vals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain residual plots.

A

These verify if it is appropriate to fit a linear model to bivariate data.
- this is a graph of the residuals against the explanatory variable.
- a good residual plot is when the dots are evenly spread and no pattern is evident.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Explain Association vs Causation.

A

Correlation does not imply causation.
- just because two variables might display an association, it does not guarantee a cause-and-effect relationship.
- both variables may be responding to a third variable
- or this is simply a coincidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the equations for a residual Plot?

A

(residual is explanatory on x axis and residuals on y axis)

First find predicted y values: Use regression formula and sub the actual x values into the x in the equation and find the y.

Residuals: actual y value - predicted y value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is an extrapolation vs interpolation?

A

Extrapolation is outside the dataset. Interpolation is inside of this.

Try to do both in exam. If it is inside dataset use your graph and look at where a value would be. Otherwise use the regression eqn and sub this in.

If it says use graph, then use a diagram to find a value. Otherwise just use algebra. Its good to do both though.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we see our reasonability?

A

Use extrapolation and interpolation and see if the graph matches the algebra.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you calculate r and R squared on the scientific calculator?

A

Mode- [2] stat- [2] A+BX (insert data into table)

[AC]- [1] shift- [3] r

to find R^2 just square this value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly