Chapter 7: Scatterplots, Associations, and Correlation Flashcards

1
Q

Define ‘Scatterplots’.

A

Shows the relationship between 2 quantitative variables measured of the same cases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define ‘Associations’.

A
  • Direction: A positive direction or association means that, in general, as one variable increases, so does the other. When increases in onw variable generally correspond to decreases in the other, the association is negative.
  • Form: The simplest form is straight, but you should certainly describe other patterns around the underlying relationship.
  • Strength: A scatterplot is said to show a strong association if there is a little scatter around the underlying relationship.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define ‘Outlier’.

A

A point that does not fit the overall pattern seen in the scatterplot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define ‘Response variable, explanatory variable, y-variable, x-variable’.

A

In a scatterplot, you must choose a role for each variable. Assign the response to the y-axis the response variable that you hope to predict or explain. Assign to the x-axis the explanatory or predictor variable that accounts for, explains, predicts, or is otherwise responsible for the y-variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define ‘Correlation coefficient’.

A

A numerical measure of the direction and strength of a linear association
r = sum (Zx Zy) / (n-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define ‘Common response’.

A

Changes in both x and y are caused by a lurking variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define ‘Lurking variable’.

A

A variable not present in our analysis that may influence our understanding of the relationship between x and y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define ‘Confounding variables’.

A

Variables whose effects on the response variable, y, are entangled and difficult to distinguish.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define ‘Re-expression’.

A

We re-express data by taking the log, the sqrt, the reciprocal, or some other mathematical operation of all values of a variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define ‘Ladder of Powers’.

A

Places in order the magnitude of effects that many re-expressions have on the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some features of the correlations, r?

A
  • The sign of the correlation gives the direction of the relationship.
  • -1<=r<= 1 ; A correlation of 1 or -1 is a perfect linear relationship. A correlation of 0 indicates that there is no linear relationship.
  • Correlation has no units, so shifting or scaling the data, standardizing, or even swapping the cariables has no effect on the numerical value.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Is a large correlation a sign of causal relationship?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the assumptions and conditions of correlation?

A
  • Quantitative variables Condition
  • Straight Enough Condition
  • No outliers Condition
How well did you know this?
1
Not at all
2
3
4
5
Perfectly