Unit 12: Correlation Flashcards

1
Q

explain the difference between t and z test

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q
  • 1-sample t-test
A

Have the value
you’re testing
against (population
mean) but NO
population SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

z-test

A

have population
mean & SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Correlations are between what kinds of variables

A
  • between two continuous variables
  • between a dichotomous variable and continuous one
  • between an ordinal variable and a continuous one

not looking at group differences, but looking at the associations between variables.
* If two variables are correlated it means that they co-vary.
* Does not imply causation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

response variable

A

dependent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

explanatory variable

A

independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A researcher would like to know if a mother’s height
can explain how tall her child will be. Which is the
response variable?

a. child’s height
b. mother’s height
c. father’s height

A

a. child’s height

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what do we use correlations for

A

Two variables that correlate means that as one variable changes, so
does the other. They co-vary.

  • A statistically significant correlation indicates that a relation is present
  • Null hypothesis: there is no correlation between the variables (or the correlation between the variables is 0)
  • Correlations are very flexible
  • When two variables are correlated…
  • The correlation coefficient quantifies what is common between variables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The Scatter Plot

A

Shows the relationship between two quantitative
variables measured on the same individuals.

  • The values of one variable appear on the horizontal axis,
    and the values of the other variable appear on the vertical
    axis.
  • Each individual corresponds to one point on the graph.

The scatter plot is a visual representation of data, plotting two data distributions in one figure (i.e., two values or scores for each individual)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what does a scatter plot line mean

A
  • The amount of scatter in the points that are plotted suggests the strength of
    the relationship between variables.
  • A positive relationship emerges when the data scatters from the lower left to
    the upper right.
  • A negative relationship emerges when the data scatters from the upper left to
    the lower right
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

After plotting two variables on a scatterplot, we describe the relationship by
examining the form, direction, and strength of the association. We look for an
overall pattern …

A
  • Form: linear, curved, clusters, no pattern
  • Direction: positive, negative, no direction
  • Strength: how closely the points fit the “form” or how scatter versus close
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
A

negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
A

zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

how do we interpret scatterplots? explain negative and positive association

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Correlation Values

A
  • Correlations range from -1.0 to +1.0.
  • Values closer to +/- 1 are considered perfect correlations.
  • A positive correlation is indicated by a positive value
  • A negative correlation is indicated by a negative value
  • The correlation coefficient is a measure of the direction and
    strength of a linear relationship.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you get the correlation coefficient?

A
17
Q

The Strength of a Correlation

A

The sign of the relationship between two variables has nothing to do
with its strength.
* Rule of thumb to determine the strength of a correlation (Visual
Statistics, 2009):
* 0 to .3 are considered “weak” correlations
* .3 to .7 are considered “moderate” correlations
* .7 and above are considered “high” correlation

18
Q

Correlation: properties of r

A
  1. -1 ≤ r ≤ 1
  2. The sign indicates the direction of association
  3. positive association: r > 0
  4. negative association: r < 0
  5. no linear association: r  0
  6. The closer r is to ±1, the stronger the linear association
  7. r has no units and does not depend on the units of
    measurement
  8. The correlation between X and Y is the same as the
    correlation between Y and X
19
Q

The Coefficient of Determination

A
  • The proportion of either variable explained by the other variable.
  • This value is the significant Pearson correlation value squared: r2xy
  • For r=.7; r2 = .49
  • 49% of variation in x explained by y OR
  • 49% of variation in y explained by x
20
Q

why is correlation useful

A

Establishing reliability and validity
* Test-retest reliability
* If you just run a t-test between the two occasions, you may end up finding a statistical
difference even on a reliable exam. There is something called “testing effect” where
people may end up doing better the second time they do it.
* However, even if they do end up doing better the second time around, if the exam is
reliable, the two occasions should have strong correlation.
* How different constructs are related to each other

21
Q

The Pearson Correlation

A

The most commonly-used correlation value is the Pearson Correlation
(formally the Pearson Product Moment Correlation), with the following characteristics (think assumption checks):
* The correlation is bivariate—there are just two variables involved.
* Both variables are measured on at least an interval scale (i.e., continuous data).
* The variables have a linear relationship.
* No significant outliers.
* The sampling distribution to which the data belong is normally distributed.
* (Usually satisfied when your sample size is large)

22
Q

Bivariate correlations

A

refer to the relationship between two variables.
* There can be correlations between:
* nominal variables (think dichotomous variables),
* ordinal variables,
* interval/ratio variables,
* and variables of different data scales.

23
Q

The Point-Biserial Correlation

A
  • The point-biserial correlation- relationship between dichotomous
    variable and continuous variable
  • Dichotomous variable must be coded 0 and 1
  • Additional assumption:
  • Equal variances in Y for each of the categories of the dichotomous variable
  • Calculated the same way
24
Q
A
25
Q
A
26
Q

What happens when you fail the assumption
checks?

A
  • Spearman Rho and Kendall’s tau
  • No need to be “normal”
  • Ordinal scale is okay!
  • Not too many ties
  • Kendall’s tau-b corrects for ties
  • Puts them into rank order and looks for monotonic relations
  • Positive: Lower on variable A is associated with lower on variable B
  • Negative: lower on 1 variable is associated with higher on another variable
27
Q

Spearman Rho

A

It assesses how well the relationship between two variables can be described using a monotonic function

28
Q

Kendall’s tau

A

Kendall’s Tau is a non-parametric measure of relationships between columns of ranked data. The Tau correlation coefficient returns a value of 0 to 1, where: 0 is no relationship, 1 is a perfect relationship

29
Q

monotonic function

A

A monotonic function is a function which is either entirely nonincreasing or nondecreasing

30
Q
A