Chapter 6 Flashcards

1
Q

Correlational Study

A

Examines the relationship between 2 or more measured variables (not manipulated or controlled by experimenter)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Correlation

A

Statistical technique used to measure and describe the relationship between 2 variables

You can correlate any two variables as long as they are numerical (meaning they can be represented by numbers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why do we use a correlation coefficient?

A

It is used to make a prediction (if two variables are related, we can use one variable to predict the other; example: SAT scores and college success)
To measure reliability (Test-retest, alternate forms; example: is that dependable friend going to pick you up at 2am at the airport)
To measure validity (Are the two variables really related?; example: SAT and ACT scores related to college grades)

BUT. IT IS NOT A MEASURE OF CAUSALITY

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

All correlations range from? And what does this number mean?

A

-1.00 to +1.00

This absolute value shows strength of relationship
Higher the absolute number, the stronger the relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is perfect correlation?

A

+/- 1.00 is the strongest possible relationship

The graph of a perfect correlation is just one straight line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the sign of the correlation tell you?

A

It tells us the directionality of the relationship of any two variables, X and Y

If the sign is positive: (the variables change in the same direction)
As X is increasing, Y is increasing
As X is decreasing, Y is decreasing

If the sign is negative: (the variables change in opposite directions)
As X is increasing, Y is decreasing
As X is decreasing, Y is increasing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the correlation coefficient?

A

r, it is reflected by a spread. The fatter the oval the lower the correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What kind of line will r have if it equals zero?

A

It will be horizontal, because there is no correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Pearson correlation coefficient

A

r= the Pearson coefficient
r measures the amount that the 2 variables (X&Y) vary together taking into account how much they vary apart
It is a ratio

r= (degree to which X and Y vary together) / (degree to which X and Y vary separately)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Sum of Products of Deviations (SP)

A
Definitional Formula
SP= The sum of (X-X bar)(Y-Y bar)
Computational Formula
SP= The sum of XY - ((sum of x, times the sum of Y) / n)
N is the number of (X,Y) pairs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

r squared

A

percentage of variance in Y accounted for by X

This ranges from 0 to 1 (POSITIVE ONLY)
you can not have a negative percentage, because squaring anything is positive

This number is a meaningful proportion (unlike the Pearson’s r)
It has a similar idea to effect size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the limitations of Pearson’s r?

A
  1. Correlation does not mean causation
  2. Strength of the relationship
    (Pearson’s doesn’t give directly interpretable strength of relationship, the r squared (coefficient of determination ))
    3.Outliers (extreme scores)
    (scores with extreme X and/or Y value can drastically effect Pearson’s r)
  3. Restriction of range
    (restricted range of measured values can lead to inaccurate conclusions about the data;
    finding no correlation when there really isn’t one
    finding a correlation when there really is one)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is regression?

A

Fitting a line to the data using an equation in order to describe and predict data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Simple regression

A

Uses just two variables (x and y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Multiple regression

A

one y and many x’s. You’re still predicting one outcome, but comparing it to multiple causations
Multiple regression has a lot more external validity. Meaning that It is most comparable to the real world.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Linear regression

A

fits data to a straight line

17
Q

Curvilinear regression

A

involves using geometry AND calculus to come up with a solution

18
Q

From geometry we know:

A

That we can describe any line by an equation
Slope = change in Y per unit change in x
y intercept = where line crosses the Y axis (when X = 0)
statistics notation : y hat = bX + a,
it is like y = mx + b

19
Q

Y hat

A

Is the predicted value of Y, given a certain value of X

20
Q

Strong, moderate, weak correlation?

A
Strong= 0.8
Moderate= 0.4
Weak= 0.2
21
Q

What does r = 0.0 look like?

A

The best fit line is a horizontal line. And dots are basically everywhere

22
Q

What is the Pearson Correlation Coefficient?

A

It is r.
It measures the amount that the two variables (X and Y) vary together taking into account how much they vary apart.

r= (degree to which X and Y vary together) / (degree to which X and Y vary separately)

23
Q

Standard Error of the Estimate

A

It is an estimate of how consistent it would be if we resampled over and over again. The amount of sampling variation there would be for Beta (slope).

Example:

If SE(B) = 0.2 and (B) by itself is 1. Then B is 5 standard errors away from zero. That means B = 1 is pretty far away from zero.