chapter 7/week 6 Flashcards

correlational research

1
Q

correlational research

A

used to describe the relation between two or more naturally occurring variables

ex: Is income related to age? Is self confidence related to GPA? Is depression related to anxiety?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

covary

A

vary or change together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

correlation coefficient

A

a statistic that indicates the degree to which two variables are related to one another in a linear fashion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

pearson correlation coefficient (r)

A

the most commonly used measure of correlation

Ranges from -1.00 to +1.00

Most commonly used; always this just the name is not said

The magnitude or numerical value of a correlation expresses the strength of the relation between the two variables
When r = .00 the variables are not related
A correlation of 0.78 indicates that the variables are more strongly related than does a correlation of 0.30
Magnitude is unrelated to the sign of the correlation; two variables with a correlation of 0.78 are just strongly related as two variables with a correlation of -0.78.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

sign of the correlation coefficient

A

indicates the direction of the relation between the two variables

Variables can either positively or negatively related

positive correlation
negative correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

positive correlation

A

a direct, positive relation between two variables; as one increases the other increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

negative correlation

A

an inverse, negative relation between two variables; as one variable increases the other decreases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

perfect correlation vs no correlation

A

When there is a perfect correlation all of the data will fall into a straight line
- Should never happen; something is wrong

no correlation – scattered points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

when r = .00

A

A correlation of .00 indicates that there is no linear relation between the two variables. However, there could be a curvilinear relation between them.

Ex: anxiety and performance – not linear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the components of the equation to calculate r?

A

X and y: participants score

Σxy: multiply each participant’s x- and y-scores together, then sum these products across all participants

(Σx)(Σy): sum all participants’ x-scores, sum all participants’ y-scores, then multiply these two sums

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Coefficient of determination

A

r^2 or the square of the correlation coefficient

proportion of variance in one variable that is accounted for by the other variable
- It ranges from 0 to 1
- closer to 1, better you can calculate the variance in your target variable
- indicator of systemic variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

If the correlation between children’s and parent’s scores is .40

A

Coefficient of determination is (0.16). That is 16% of the variance in children’s IQ scores can be accounted for by their parent’s scores

In other words, 16% of the total variation in children’s IQ scores is systematic variance that is variance related to the parent’s IQ scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Statistical significance

A

a finding that is very unlikely to be due to error variance; exists when a correlation coefficient calculated on a sample has a very low probability of being zero in the population

A correlation coefficient is statistically significant when the correlation calculated on a sample has a very low probability of being zero in the population from which the sample came

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What probability level (we call this alpha) is very low?

A

It depends on your field of study, prior research and how much risk you want to take of being wrong

In psychology we generally pick an Alpha level of 0.05. We think 5% is very low.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what must you do before you run any statistical test

A

you must first determine your alpha level, which is also called the “significance level.” this is the test probability of making a wrong decision (Type I error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Directional hypothesis

A

predicts the direction of correlation (pos/neg)

17
Q

Nondirectional hypothesis

A

predicts that two will be correlated, doesn’t specify the direction

18
Q

Statistical Significance of r is affected by several factors!!!!!!!!!!

A

sample size
effect size
significance level

19
Q

sample size (affects SS)

A

i.e., power
An increase in your sample will increase your power, that is your ability to find a significant result

20
Q

effect size (affects SS)

A

i.e., coefficient of determination

If the effect size ss larger, you are more likely to find a significant result

21
Q

significance level (affects SS)

A

i.e., alpha level

If the alpha level (i.e., .05 or .01 or .001) is set at a higher level, you are more likely to find a significant result

22
Q

3 factors that distort correlation coefficient

A

Restricted range
Outliers
Reliability of measures

23
Q

Restricted Range

A

data in which participants’ scores are confined to a narrow range of the possible scores on a measure

Having a restricted range artificially lowers correlations below what they would be if the full range of scores were present
Ex – sampling one school to see if ses correlates to grades
- Restricted range; will not see a correlation because everyone in that school lives in the same zip code and has the same ses status
- Need to sample more schools from other areas

24
Q

Outliers

A

A score is considered an outlier if it is more that 3 standard deviations away from the mean

On line outliers
off line outliers

25
Q

on line outliers

A

fall in the same pattern as the rest of the data and tend to artificially inflate r

26
Q

off line outliers

A

fall outside of the pattern of the rest of the data and tend to artificially deflate r

27
Q

Reliability of Measures

A

The more unreliable a measure is, the lower its correlations with other measures will be

If the true correlation between college aspirations and SAT score is 0.45, but you use an aspiration scale that is unreliable, the obtained correlation will not be 0.45 but rather near .00.

28
Q

Correlation and Causality

A

Correlation does not mean causation

Even when two variables have a perfect correlation of 1.00, we cannot conclude that one variable causes the other variable

29
Q

criteria for inferring causality

A

covariation
directionality
extraneous variables

Correlational research satisfies the first (and sometimes the second) criterion, but never the third

30
Q

Covariation

A

changes in one variable are associated with changes in the other variable; same as correlation (i.e., high school GPA → SAT score)

31
Q

Directionality

A

the presumed causal variable preceded the presumed effect in time (i.e., smoking → lung cancer)

32
Q

Extraneous variables

A

all other variables that may affect the relation between the two target variables are controlled or eliminated (think discrimination and depression?)

33
Q

Spurious correlation

A

a correlation between two variables that is not due to any direct relationship between them but rather to their relation to other variables

Some other variance (z) may cause both x and y

Ex: students who are highly depressed do not do well in class, and they may try to relieve depression by drinking

34
Q

Partial correlation

A

the correlation between two variables with the influence of one or more other variables statistically removed

If a partial correlation between two variables (with the influence of a third variable removed) is significantly lower than the Pearson correlation between the two variables, then the correlation between them is at least partly due to the third variable

For example, the correlation between motivation and test performance may be lower because of a third variable, such as study time. In other words, partial correlation between motivation and test performance is a function of study time

35
Q

Other indices of correlation

A

Remember Pearson correlation coefficient (r) si only appropriate when you have a continuous variable (i.e., interval or ratio scale)

spearman rank-order correlation
phi coefficient
point-biserial correlation

36
Q

Spearman rank-order correlation

A

correlation used when variables are measured on an ordinal scale, that is when the numbers reflect the rank ordering of participants on some attribute

Ex: correlation between starred reviews and box office reviews

37
Q

Phi coefficient

A

correlation used when both variables are dichotomous (i.e., nominal with categories)

Ex: correlation between gender and HS dropout

38
Q

Point-biserial correlation

A

correlation used when only one of the variables is dichotomous

Ex: gender and IQ