Correlation Flashcards

1
Q

Why would we compute a partial correlation

A

.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why would we compute a semi-partial correlation?

A

.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What’s the main difference between semi and partial correlation?

A

.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which correlation is larger or further away from zero, and why?:

A

.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What kind of variables is X and Y in correlations?

A

X and Y variables in correlations is random- beyond the experimenter’s control and subject to sampling error in both.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the goal of correlations?

A

To express the degree of relationship between X and Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define univariate information.

Provide an example.

A

Univariate information deals with 1 variable varying with itself. Not looking at the relationship between 2 variables yet.

We use the general sums of squares information (i,e., x vary with x; y vary with y).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain the direction and strength of correlation.

A

It is bounded by -1 and +1. Zero indicates no relationship. The relationship gets larger in strength as we go from 0 to +1 or -1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Conceptually define SSCP - what does SSCP tell us?

A

We are taking the cross product of 2 deviations. It tells us how X and Y varies together.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Conceptually define SS

A

It is the raw measure of variability.

The deviation of x times the deviation of x…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Conceptually define covariance

A

.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Conceptually define variance and SD

A

Variance is the SS over df.

SD is the average deviation from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the conceptual formula for pearson r?

A

r = degree to which X and Y vary together/degree of which X and Y vary individually

or

r = covariability of X and Y/variability of X and Y seperately

SSCPxy/sqrt of SSxSSy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why would we assess scatterplots before we access numbers?

A

Since pearson r doesn’t show curvilinear graphs, we look at scatterplots to show us the trend and outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does pearson correlation measures specifically?

A

The degree of and direction of Linear Relationships between 2 variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

X is to predictor as Y is to

A

Y is to outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define bivariate

A

How variables vary with each other rather than separately.

18
Q

Looking at bivariate information, what indicates the positive or negative direction?

Why can’t the denominator of the correlation statistics be negative?

A

Looking at the bivariate information, the SSCP or (x-xbar)(y-ybar) indicates the + or - direction.

The denominator is always positive due to the square rooting of the SSx and SSy.

19
Q

In a pearson r correlation formula, what indicates the direction?

A

The SSCP. A positive SSCP = positive correlation, and negative SSCP = negative correlation.

20
Q

Why is correlations considered the standardized relationship?

A

Because it’s bounded by -1 and +1.

21
Q

Define covariance in regards to correlations.

A

Covariance is the stepping block to correlations - it is the unstandardized relationship between our 2 variables and also get variance along with relationship.

22
Q

Where in a matrix do we see covariance?

A

The off-diagonal values will be covariances and the diagonal values are variances.

23
Q

If the scatterplot looks about a football, what might the correlation be?

A

About .50.

24
Q

If the scatterplot looks like a wider football, what might the correlation be?

A

About .30… less correlation.

25
Q

What does the correlation effect size indicate?

What is the notation?

How is it interpreted?

A

little r squared = tells us the proportion in X that can be explained by Y.

Ex: If r squared is .26, we would indicate that 26% of the number of doctor visits can be explained by attitudes of drug use.

26
Q

Correlations are standardized metrics and tells us how 2 variables are related to each other in a standardized way - we can calculate this correlation based on previously standardized information… what is this previously standardized information? And how will we get the scores?

A

Z-scores.

We convert all of our x and y values into z-scores and we can just do covariance of the z-scores (ZxZy or SSCP) over n-1.

27
Q

Why is it important to calculate covariances (like z-scores)?

A

It is important to calculate a covariance that are unstandardized because in estimations, covariances are used.

Just like computing sd, we start with variance and square root it to get the sd…

28
Q

What is the hypothesis testing notation for pearson correlation?

A

p = 0; no linear relationship

p ≠ 0; there is a relationship

29
Q

What is the df for a correlations test? Why is this the case?

A

The df for r is n-2 because we have 2 means (a mean for x and a mean for y).

30
Q

What is n in a correlations test?

A

The number of individuals (rows).

31
Q

If the critical value is .632 and our r obtained is .51, do we reject or fail to reject the null?

A

We fail to reject the null.

32
Q

Why would sample size affect the strength of a correlation?

A

It could be not powerful enough to detect the correlation. The larger the n, the smaller the critical value number.

33
Q

What is the statistical sentence for pearson correlation if n = 10, df = 8, r = .51, CV = .632.

A

r (df) = .51, p >.05

r (8) = .51, p >.05

We fail to reject the null

34
Q

In a covariance, does a large number indicate the magnitude of the relationship?

A

No, it doesn’t indicate strength, just the direction (pos. or neg.), because covariances are unbounded (there’s nothing to compare the magnitude to).

35
Q

What does a covariance near zero indicate?

What does a correlation near zero indicate?

A

They both indicate there is no linear relationship.

However, a correlation near zero doesn’t mean there isn’t a relationship at all… just no LINEAR relationship.

36
Q

Restricting the range of data (removing extremes) will do what to a correlational scatterplot?

In real life application, when would this happen?

A

The correlation or the degree of relationship gets weaker, smaller.

If we restrict age for example, like only getting college kids instead of 25 to 70 y.o.

37
Q

What’s wrong with selecting for extremes in a correlation?

A

Researchers take the mid-point and say anything above or lower is too high or too low and dichotomize continuos variables - artificially inflating the relationship and with the low group and high group on the continuos measure, they attempt to do t-tests that are easier to interpret by the masses… but really, just to get a significant correlation.

38
Q

What happens when a value is extreme on X and extreme on Y?

A

It is artificially inflated and makes the correlation much stronger (an outlier).

39
Q

What happens when a value is extreme on Y but in the middle of X?

A

The correlation decreases - reduced the relationship.

40
Q

How does trend of the base model affect the relationship due to an outlier?

A

If an outlier is near the trend of the base model (i.e., if my base model is headed into a positive direction), and the outlier on x is also in a positive direction relative to the trend, it will barely affect the model in a positive direction. However, if the trend is positive, but the outlier on x is negative, it brings the base model down by a lot. (pg 11).

41
Q

What is the diagonals of correlation matrix?

A

1s… a variable that is correlated with itself is 1.

42
Q

If I’m asked to compute the variance of x, what will I do?

A

I am essentially solving for x-xbar squared, so I will need to compute the mean of the observations, then solve for the sum of squares and divide it by n-1.