Test 2 (pg 147-170) Flashcards

1
Q

As a research method, __________ allow you to describe the relationship between two measure variables.

A

correlational designs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A ___________ helps by assigning a numerical value to the observed relationship.

A

correlation coefficient (descriptive statistics)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In addition to describing a relationship, correlations allow us to ________ from one variable to another.

A

make predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Correlations vary in their _______.

A

magnitude

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

_________ is an indication of the strength of the relationship between two variables (the strength of the relationship).

A

Magnitude

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the different types of correlational relationships?

A

positive, negative, none, or curvilinear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The magnitude or strength of a relationship is determined by the ________ describing the relationship.

A

correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A __________ is a measure of the degree of relationship between two variables.

A

correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A correlation coefficient can vary between ____ and _____.

A

-1.00 and +1.00

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The _______ the relationship is between the variables, the closer the coefficient is to either -1.00 or +1.00.

A

stronger

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The _______ the relationship is between the variables, the closer the coefficient is to 0.

A

weaker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

We typically discuss correlation coefficients as assessing a ______, ______ or ______ relationship, or no relationship at all.

A

strong, moderate, or weak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

+or - .70 - 1.00 =

A

a strong relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

+ or - .30 - .69 =

A

a moderate relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

+or- .00 - .29

A

no relationship (.00) to weak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A correlation coefficient of either -1.00 or +1.00 indicates a __________.

A

perfect correlation, the strongest relationship possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What would it mean if the correlation for height in weight in a group of 20 people was perfect (+1.00)?

A

This would mean that the person with the highest weight would also be the tallest person, the person with the second highest weight would be the second tallest person, and so on down the line. (and the height and weight decreased by a set amount each time.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

If there was a perfect negative correlation in the 20 person height/weight study, what would that mean?

A

This would mean that they person with the highest weight was the shortest, and so on, and that the height increased and the weight decreased by a set amount for each individual.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

A correlation coefficient of _______ represents a perfect relationship, while a coefficient of ______ indicates no relationship.

A

+ or - 1.00 = perfect

0.00 = none

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

A ________ is a figure that graphically represents the relationship between two variables.

A

scaterplott

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

In a ______, two measurements are represented for each participant by the placement of a marker.

A

scatterplot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

The stronger the correlation, the _______ the data points cluster around an imaginary line through their center.

A

more tightly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

When there is a perfect correlation, where do the data points fall on a scatterplot?

A

the data points all fall on a straight line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Describe what a positive relationship would look like on a scatterplot.

A

The majority of the data points fall along an upward angle, from lower left corner to upper right corner. The relationship is linear, and the stronger the correlation, the closer to the imaginary straight line the points will be.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Describe what a negative relationship would look like on a scatterplot.

A

The majority of the data points fall along an downward angle, from upper left corner to lower right corner. The relationship is linear, and the stronger the correlation, the closer to the imaginary straight line the points will be.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

A negative correlation indicates that an ______ in one variable will lead to an _______ in the other variable.

A

increase, decrease, or decrease, increase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

A positive correlation indicates that an ______ in one variable will lead to an _______ in the other variable.

A

increase, increase, or decrease, decrease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Describe what no relationship would look like on a scatterplot.

A

the data points are scattered in a random fashion. The correlation coefficient would be 0 or very close to 0 (the example in the book was -.09)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

A correlation coefficient of 0 usually indicated no meaningful relationship. However it is also possible for a correlation coefficient of 0 to indicate a _________.

A

curvilinear relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

A graph represents the relationship between psychological arousal (the x-axis) and performance (y-axis). The individuals perform better when they are moderately aroused than when arousal is either very low or very high. The correlation coefficient for these data is also very close to 0, being -.05. What type of relationship is this? What does the scatterplot look like?

A
  • curvilinear relationship (so the correlation coefficient does not represent the high correlation seen in the scatterplot)
  • the points are tightly clustered in an inverted U shape
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

correlation coefficients tell us only about _______ relationships.

A

linear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q
List the type of relationship for the following examples:
smoking and cancer
Intelligence and weight
mountain elevation and temperature
memory and age
A

smoking and cancer = positive
Intelligence and weight = none
mountain elevation and temperature = negative
memory and age = curvilinear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q
Which of the following correlation coefficients represent the weakest relationship between the two variables?
-.59
\+.10
-1.00
\+.76
A

+.10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Explain why a correlation coefficient of 0.00 or close to 0.00 may not mean that there is no relationship between the variables.

A

A correlation coefficient of .00 or close to .00 may indicate no relationship or a weak relationship. However, if the relationship is curvilinear, the correlation coefficient could also be .00 or close to this. In this case, there is a relationship between the two variables, but because the relationship is curvilinear, the correlation coefficient does not truly represent the strength of the relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

The most common error made when interpreting correlations is assuming that the relationship observed is ______ in nature - that a change in variable A causes a change in variable B.

A

causal ***correlations simply identify relationships - the do not indicate causality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is wrong with the statement “Lets stop drug use in schools by making sure they can all read.”

A

illiteracy does not cause drug use, Even if there is a strong positive correlation between drug use and illiteracy.
(the assumed causality and directionality)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

______ is the assumption that a correlation indicates a causal relationship between the two variables.

A

causality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

_____ is the inference made with respect to the direction of a causal relationship between two variables.

A

directionality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

The _______ is the problem of a correlation between two variables being dependent on another (third) variable.

A

third-variable problem

40
Q

When we interpret a correlation, it is also important to remember that although the correlation between the variables may be very strong, it may also be that the relationship is the result of some ________ that influences both of the measured variables.

A

third variable

41
Q

What is the well known third variable problem study conducted by social scientists and physicians in Taiwan?

A

The researchers attempted to identify the variables that best predicted the use of birth control - a question of interest to the researchers because of overpopulation problems in Taiwan. they collected data on various behavioral and environmental variables and found that the variable most strongly correlated with contraceptive use was the number of electrical appliances in the home. (So people with more electric appliances used contraceptives more), whereas those with fewer electrical appliances tend to use contraceptives less. The third variable was education - the more educated a person was, the more likely they were to use contraceptives, and the more likely they were to have better jobs and make more money, thus being able to afford more appliances.

42
Q

It is possible statistically to determine the effects of a third variable by using a correlational procedure known as ________.

A

partial correlation

43
Q

________ is a correlational technique that involves measuring three variables and then statistically removing the effect of the third variable from the correlation of the remaining two variables.

A

Partial correlation

44
Q

With__________, if the third variable is responsible for the relationship between two variables that appear to correlate, then the correlation should disappear when the effect of the third variable is removed, or partialed out.

A

partial correlation

45
Q

_________ is a variable that is truncated and has limited variability. (the variable does not vary enough)

A

restrictive range

46
Q

Colleges that are very selective, such as Ivy league schools, would have a _________ on SAT scores - they only accept students with very high SAT scores. Thus, in these situations, SAT scores are not a good predictor of college GPAs because of the “______” on the SAT variable. (If SAT scores have limited range, the correlation between SAT and GPA appears to decrease.

A

restrictive range

47
Q

________ - arguing that a well-established statistical trend is invalid because we know a “person who” went against the trend.

A

Person-who argument

48
Q

______ is the most commonly used correlation coefficient when both variables are measured on an interval or ratio scale.

A

Pearson product-moment correlation coefficient (Pearson’s r)

49
Q

In the Pearson’s r, ____ is the statistical notation we use to report this correlation coefficient.

A

r

50
Q

The development of the Pearson’s r correlation coefficient is typically credited to ______ who published his formula for calculating r in 1895. (even though Edgeworth had thought of a similar formula 3 years earlier, he had published it deep in a statistical paper that was very hard to follow so no one discovered it until years later).

A

Karl Pearson

51
Q

To calculate Pearson’s r, we begin by converting the raw scores on the two different variables to the _________. (one example of this would be a z-score)

A

same unit of measurement

52
Q

A _____ represents the number of standard deviation units a raw score is above or below the mean.

A

z-score

53
Q

High raw scores are always above the mean and have ______ z-scores. Low raw scores are always below the mean and have _______ z-scores.

A

positive

negative

54
Q

After calculating a z-score the next step in calculating Pearson’s r is to calculate what is called a ______.

A

cross-product

55
Q

A ________ is the z-score on one variable multiplied by the z-score on the other variable.

A

cross-product

56
Q

If a correlation is strong and positive, what would the z-score be?

A

the positive z-score on one variable would go with a positive z-score on the other variable. Negative z-scores on one variable go with negative z-scores on the other variable,

57
Q

If both z-scores used to calculate the cross-product are positive, than the cross-product will be ______.

A

positive

58
Q

If both z-scores used to calculate the cross-product are negative, than the cross-product will be ______.

A

positive (two negatives multiplied to each other = a positive)

59
Q

The mean of all of the cross-products will tell you the ______.

A

correlation coefficient

60
Q

r = (E zxzy) / (N)

The sum of all z-scores(x) and z-scores(y) divided by the number of participants

A

Pearson’s r formula

61
Q

z = (X-ú) / (ó)
x is the raw score to be standardized
u is the mean of all the scores
ó is the standard deviation of the entire population

A

z-score

62
Q

When calculating a correlation coefficient, we should have at least 10 participants per variable. If there are two variables, how many individuals would we need?

A

10 participants per variable (thus, with two variables, we need a minimum of 20 individuals)

63
Q

The ________ is a measure of the proportion of the variance in one variable that is accounted for by another variable; calculated by squaring the correlation coefficient.

A

coefficient of determination

64
Q

In the height and weight example, the coefficient of determination tells us how much variation in weight is accounted for by the variation in height. Squaring the correlation coefficient of +94, we get r2 = .8836, or 88.36% of the variance in weight can be accounted for by the variance in height - a very high coefficient of determination.
TRUE OR FALSE

A

TRUE

65
Q

The type of correlation coefficient used depends on the ________ .

A

type of data collected in the research study.

66
Q

Pearson’s correlation coefficient is used when both variables are measured on an ________.

A

interval or ratio scale.

67
Q

_________ is the correlation coefficient used when one (or more) of the variables is measured on an ordinal (ranking) scale.

A

Spearman’s rank-order correlation coefficient

68
Q

To use the Spearman’s correlation coefficient, if one of the variables is interval or ratio in nature, it must be _________ before the calculations are done.

A

ranked (converted to an ordinal scale

69
Q

______ means having only two possible values, such as gender.

A

dichotomus

70
Q

_______ is the correlation coefficient used when one of the variables is measured on a dichotomous nominal scale and the other is measured on an interval or ratio scale.

A

Point-biserial correlation coefficient

71
Q

_________ is the correlation coefficient used when both measured variables are dichotomous and nominal.

A

phi coefficient

72
Q

Pearson- Both variables must be ______.

A

interval or ratio

73
Q

Spearman - Both variables are _______.

A

ordinal (ranked)

74
Q

Point-biserial - one variable is ______, and on variable is ________.

A

interval or ratio

nominal and dichotomous

75
Q

Phi - both variables are _________.

A

nominal and dichotomous

76
Q

In a recent study, researchers were interested in determining the relationship between gender and the amount of time spent studying for a group of college students. Which correlation coefficient should be used to assess this relationship?

A

In this study, gender is a nominal scale, and the amount of time spent studying is ratio in scale. Thus, a point-biserial correlation coefficient is appropriate.

77
Q

If i wanted to correlate class rank with SAT scores for a group of 50 individuals, which correlation coefficient would I use?

A

Because class ranks are an ordinal scale of measurement and SAT scores are measured on an interval/ratio scale, you would have to convert SAT scores to an ordinal scale and use the Spearman correlation coefficient.

78
Q

The correlation procedure allows us to predict from one variable to another, and the degree of accuracy with which we can predict depends on the __________.

A

strength of the correlation

79
Q

A tool that enables us to predict an individual’s score on on variable based on knowing one or more other variables is ________.

A

regression analysis

80
Q

__________ is a procedure that allows us to predict an individual’s score on one variable based on knowing one or more other variables.

A

regression analysis

81
Q

If you are working in a human resources office, and you want to predict how well future employees might perform based on test scores and performance measures, you could use regression analysis to make such prediction by developing a ________.

A

regression equation.

82
Q

The _______ is the best-fitting straight line drawn through the center of a scatterplot that indicates the relationship between the variables.

A

regression line

83
Q

__________ involves determining the equation for the best-fitting line for a data set.

A

Regression analysis

84
Q

The formula for a linear regression analysis is :

A

y = bx+a (same a slope, y=mx+b, just different symbols)
y is the predicted value on the y variable, b is the slope of the line, x represents an individuals score on the x variable, and a is the y-intercept.
b = slope(standard deviation for y divided by the standard deviation for x)
a = Y - b(X)

85
Q

A more advanced use of regression analysis is known as ______ which involved combining several predictor variables in a single regression equation. With this type of analysis, we can assess the effects of multiple predictor variables (rather than a single predictor variable) on the dependent measure.

A

multiple regression analysis

86
Q

The magnitude of a correlation coefficient is to ______ and the type of correlation is to _________.

A

absolute value; sign

87
Q

The _______ should be used when both variables are measured on an ordinal scale.

A

spearman

88
Q

Drew is interested in assessing the degree of relationship between belonging to a greek organization and the number of alcoholic drinks consumed per week. Drew should use the ______ correlation coefficient to assess this.

A

Point-biseral

89
Q

student Test #1 Test #2
1 First First
2 Second Third
3 Third Second
4 Fourth Forth
5 Fifth Fifth

A

Spearman

90
Q

An ________ is used to measure things by ranking and doesn’t necessarily imply equal distances between the rankings. Medals in the Olympics (Gold, Silver, Bronze) and socioeconomic status (upper, middle, lower) are examples of “ “. For example, you are rating which ice cream flavor out of three (chocolate, vanilla, and strawberry) you like best. You love chocolate but only like vanilla and strawberry a little bit. Using an “ “, your rankings would be 1) chocolate, 2) vanilla, and 3) strawberry although the distance between chocolate and vanilla is much larger than the distance between vanilla and strawberry.

A

ordinal

91
Q
student      Test #1       Test #2
1                  100            Happy
2                 90             Happy
3                 80             Happy
4                 70             Sad
5                 60             Sad
A

Point-Biseral

92
Q
student      Test #1       Test #2
1                  Pass           Pass
2                 Pass           Pass
3                 Pass           Fail
4                 Fail             Fail
5                 Fail             Pass
A

Phi

93
Q

A _______ is a measurement scale that identifies things using a word. Also known as a qualitative scale, items are usually organized by their category or name. There is no order or ranking in this type of scale. Some examples of nominal categories are gender (male and female) and ethnicity (Caucasian, African-American, Hispanic, Asian, etc.). For example, in an experiment using using student volunteers several nominal scales could be used to create different books. The students could be categorized by gender, age, or what year in school they are.

A

Nominal Scale

94
Q

IV string & DV numeric –> __________ –> Pearson R

A

Dummy variable coding

95
Q

IV string & DV string –> _________`

A

Cramer’s V

96
Q

Hypo: as hrs violent tv increase, acts of aggression tend to increase. We would use …

A

Pearson r

97
Q

A way of assigning numerical values to a categorical variable so that it reflects class membership

A

dummy variable coding