Week 10: Tests of Relationships - Correlation; Regression Flashcards

1
Q

Test difference (is group A different that group B/does this treatment cause this outcome) for what type of outcomes?

A

continuous outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Test proportions (is group A different that group B/does this treatment cause this outcome) for what type of outcomes?

A

categorical outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you test relationships between two groups?

A
  1. Measure correlation between A and B

2. Fit a regression model for A and B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Example: Is the male group different from the female group by their BMI level?

A

Test the difference of the mean BMI between the groups of male and female

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Example: Is the gender linked to a certain age group?

A

Compare the proportions of being older than 50 yrs between the groups of male and female

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Example: What is the relationship between BMI and DBP at baseline?

A

Does DBP increase as BMI increase?

Test relationship between DBP and BMI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When you look at the relationship between two variables -

A

Correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you calculate correlation of data?

A
  1. Draw a scatter plot to visualize

2. Compute a correlation coefficient (r) to quantify

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If r = 0.96, how is weight loss and exercise time correlated?

A
  • Weight loss is highly correlated with exercise time

- Weight loss increases as exercise time increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

way of visualizing the relationship between two variables -

A

scatter plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

T/F Scatter plot can visually clarify the strength and shape of a relationship

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

T/F Correlation coefficient r is a measure of a linear association between two variables in the range of 0 and 1

A

False, in the range of -1 and 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

T/F The sign of r indicates the direction of the correlation

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

T/F The absolute value of r indicates the strength of the correlation

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
Interpretation of Correlation Coefficient r:
0-.25 = 
.25-.5 = 
.5-.75 = 
.75-1 =
A
0-.25 = Little to no relationship
.25-.5 = fair
.5-.75 = moderate to good
.75-1 = good to excellent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

T/F Correlation Coefficient r values can be used as strict cutoff points.

A

False, values should NOT be used as strict cutoff points, as they are affected by:

  1. sample size
  2. measurement error
  3. the types of variables being studied
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

correlation coefficient r is a measure of what type of relationship only?

A
  • Linear relationship only

- A curvilinear relationship won’t be described accurately using the linear correlation coefficient r

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

For interpretation, look at what two things to determine if association in the set of data is linear or curvilinear?

A
  1. r value

2. scatter plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the most commonly reported measure of correlation?

A

Pearson (produce-moment) correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

When is it appropriate to use Pearson (produce-moment) correlation coefficient?

A

when X and Y are continuous variables with underlying normal distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a nonparametric analog of the Pearson r?

A

Spearman rank correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

When is it appropriate to use the spearman rank correlation coefficient?

A

when X and Y are ordinal variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

When would you use phi coefficient?

A
  • appropriate for use when both X and Y are dichotomous variables (2 variables)
  • special case of the Pearson correlation coefficient, given only two values of X and Y
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

When would you use The point biserial correlation coefficient?

A
  • appropriate when a dichotomous X is correlated with a continuous variable Y
  • a special case of the Pearson correlation coefficient
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Example: What coefficient would you use?
The data representing the motor and verbal skills in a group of 60 adults with traumatic brain injury. Scores are graded pass or fail, and assign 1 to pass and 0 to fail.

A

Phi coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Example: What coefficient would you use?
The data represents the developmental scores on tests of proximal (reaching) and distal (prehensile skill) behaviors in 12 normal infants, 30 weeks of age

A

Pearson (Product-Moment) Correlation Coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Example: What coefficient would you use?
The data representing the ratings of elbow flexor spasticity (resistive force in kilograms) for patients who have had a stroke on the right (1) or left (0) sides

A

Point Biserial Correlation Coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Example: What coefficient would you use?

The data represents the scores of verbal and reading comprehension for a sample of 10 children with learning disability

A

Spearman Rank Correlation Coefficient

29
Q

Precautions in Using Correlation Coefficient:

Nonlinearity -

A
  • Pearson correlation coefficient is all about linear relationship
  • A strong curvilinear relationship may be identified as no correlation (r=0)
30
Q

Precautions in Using Correlation Coefficient:

Outlier -

A

A single point can have a large influence on the correlation

31
Q

Precautions in Using Correlation Coefficient:

Interpretation is subjective -

A
  • No hard and fast rules that determine an r value is strong, moderate or weak
  • Interpretation should base on the nature of the data, the purpose of the research and the researcher’s knowledge of the subject matter
32
Q

Precautions in Using Correlation Coefficient:

Causation -

A
  • No causal relationship can be determined based on the correlation coefficient value
  • Correlation of A and B is the same as correlation of B and A
  • Correlation cannot be used to establish a cause-and-effect situation
33
Q

When you look at the relationship between two variables in a cause-and-effect situation?

A

Regression

34
Q

How do you calculate a regression line?

A
  1. Draw a scatter plot with a regression line to visualize

2. Compute a coefficient of determination (R2) to quantify

35
Q

Ex: if r = 0.96; R2 = 0.92, what is the relationship between weight loss and exercise time?

A
  • Weight loss is highly correlated with exercise time

- Exercise time predicts the Weight loss

36
Q

Regression used to predict values of one variable to another: x-> y
x =
y =

A
x = independent (predictor/explanatory) variable 
y = dependent (outcome/response) variable
37
Q

used to examine the causal relationship of the two variables, X and Y, that are linearly related?

A

Linear regression

y=a +bX

38
Q

used to examine the causal relationship of the two variables, X and Y, when Y is binary?

A

Logistic regression

y=a+bX

39
Q

Example: Linear or logistic regression

Mobility and functional characteristics could predict whether a person did or did not have a history of falls.

A

Logistic

40
Q

Example: Linear or logistic regression
The data consisting of the systolic blood pressure (Y) and age (X) in a sample of 10 women. Base it to predict a woman’s blood pressure by knowing her age

A

Linear

41
Q

Example: Linear or logistic regression

Physical/psychological function as predictors of successful return to work on year after traumatic brain injury

A

Logistic

42
Q

Logistic regression reports odds ratio (OR) to estimate what?

A

the odds of membership in the target group linked to the predictor variable

43
Q

T/F Confidence intervals can also be determined for each OR for logistic regression

A

True

44
Q

A significant OR will not contain what within its CI?

A

null value of 1.0

45
Q

How to determine if assumptions for regression analysis have been met?

A

Patterns of residuals (Y-axis) plotted against predicted scores (X-axis).

46
Q

If Patterns of residuals (Y-axis) plotted against predicted scores (X-axis) provide horizontal band, what does that mean?

A

demonstrates that assumptions for linear regression have been met

47
Q

If Patterns of residuals (Y-axis) plotted against predicted scores (X-axis) provide curvilinear pattern, what does that mean?

A

indicates nonlinear relationship.

48
Q

Coefficient of determination R2

A

measure of proportion, indicating the accuracy of prediction based on X

49
Q

what does R^2 represent?

A

percentage of the total variance in the Y scores that can be explained by the X scores

50
Q

1-R^2 represents?

A

percentage of the total variance in Y not explained by the X scores

51
Q

Example: For the regression of blood pressure on age, r=0.87; R2=0.76

A
  • 76% (R2) of the variance in systolic blood pressure explained by the variance in age
  • 24% (1-R2) of the variance in systolic blood pressure not explained by the variance in age
52
Q

What do the researchers use correlation for?

A

Identify the relationship between two variables

53
Q

What do the researchers use regression analysis for?

A

Predict the effect of one variable upon another

54
Q

T/F Correlation can be used to establish a cause-and-effect relationship

A

False, Regression not correlation

55
Q

T/F A hard and fast rule exists for its interpretation in terms of strong, moderate or weak correlation.

A

False, no hard and fast rule

56
Q

T/F It is important to include all the data points to compute the correlation coefficient correctly

A

False

57
Q

T/F A strong linear relationship can be identified as a high correlation coefficient value

A

True

58
Q

T/F Correlation coefficient is the ratio of risks of exposure vs. non-exposure groups

A

False, Odds ratio not correlation

59
Q

T/F Coefficient of determination (R2) is the percentage of variance in the predictor variable accounted for by the outcome variable

A

False, variance of outcome variable accounted for by the predictor variable

60
Q

T/F Coefficient of determination (R2) is the proportion of variance in the outcome accounted for by the predictor variable

A

True

61
Q

T/F Correlation coefficient (r) value ranges from 0 to +1

A

False, -1 to +1

62
Q

T/F Within the same study, the correlation coefficient (r) value can be computed by squaring the coefficient of determination (R2) value

A

False, the opposite

63
Q

T/F Coefficient of determination (R2) value ranges from -1 to 1

A

False, 0-1

64
Q

Which of the following statistical procedure is proper to use when you want to study the relationship between the two variables, being interested in knowing which would have caused the other?

A

Regression analysis

65
Q

T/F Logistic regression is used to predict a continuous outcome variable

A

False, linear regression -> continuous and logistic -> categorical

66
Q

T/F Coefficient of determination (R2) is a good measure for an index of predicted variance

A

True

67
Q

T/F Coefficient of determination (R2) is a measure of linear association between two variables

A

False, r is

68
Q

T/F Linear regression is used to predict a binary outcome variable

A

False, continuous outcome variable