Tests of Relationships- Correlation; Regression Flashcards

1
Q

The questions below test _________ or __________.

  • Is group A different from group B?
  • Does this treatment cause this outcome?
A

Difference or proportions depending on the outcome variable type.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q
  • Test difference for _________ outcomes.

- Test proportions for ________ outcomes.

A
  • continuous

- categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The questions below test _____________.

  • What is the relationship between A and B?
  • Does variable A increase with variable B?
A

relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Examples:
Which tests would be appropriate for the following questions?

  1. ) Is the male group different from the female group by their BMI level?
  2. ) Is the gender linked to a certain age group?
  3. ) What is the relationship between BMI and DBP at baseline?
A
  1. ) Test the difference of the mean BMI between groups of male and female.
  2. ) Compare the proportions of being older than 50 yrs between the groups of male and female.
  3. ) Test relationship between DBP and BMI.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Tests of Differences (parametric vs non-parametric):
-What 2 tests are used for two group independent comparison? Which is used for parametric data? Which is used for non-parametric data?

  • What 2 tests are used multi group independent comparison? Which is used for parametric data? Which is used for non-parametric data?
  • What 2 tests are used two group paired comparison? Which is used for parametric data? Which is used for non-parametric data?
  • What 2 tests are used for multi group paired comparison? Which is used for parametric data? Which is used for non-parametric data?
A
  • Independent t-test, Mann-Whitney U Test
  • Independent t-test = parametric
  • Mann-Whitney U Test = non-parametric
  • ANOVA, Kruskal-Wallis H Test
  • ANOVA = parametric
  • Kruskal-Wallis H Test = non-parametric
  • Paired t-test, Wilcoxon Signed Rank Test
  • Paired t-test = parametric
  • Wilcoxon SIgned Rank Test = non-parametric
  • Repeated Measures ANOVA, Friedman Test
  • Repeated Measures ANOVA = parametric
  • Friedman Test = non-parametric
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Tests of Proportions (parametric vs non-parametric):
-What 2 tests are used for two group independent comparison? Which is used for data that isn’t sparse? Which is used for sparse data?

  • What 2 tests are used for multi group independent comparison? Which is used for data that isn’t sparse? Which is used for sparse data?
  • What 2 tests are used for two group paired comparison? Which is used for data that isn’t sparse? Which is used for sparse data?
  • What 2 tests are used for multi group paired comparison? Which is used for data that isn’t sparse? Which is used for sparse data?
A
  • Chi-Square, Fisher’s Exact
  • Chi-Square = data not sparse
  • Fisher’s Exact = data sparse
  • Chi-Square, Fisher’s Exact
  • Chi-Square = data not sparse
  • Fisher’s Exact = data sparse
  • McNemar Test, McNemar Exact Test
  • McNemar Test = data not sparse
  • McNemar Exact Test = data sparse
  • Stuart-Maxwell Test, Generalized Stuart-Maxwell
  • Stuart-Maxwell Test = data not sparse
  • Generalized Stuart Maxwell = data sparse
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Tests of Cerrelation; Regression:

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
  • Correlation is when you look at the ____________ between two variables.
  • Draw a ______________ to visualize
  • Compute a ________________ to quantify
A
  • relationship
  • scatter plot
  • correlation coefficient (r)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

_____________ is a way of visualizing the relationship between two variables. Each point represents the intersection of a pair of related observations. They can visually clarify the _______ and shape of a relationship.

A
  • Scatter plot

- strength

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

________ _________ is a decimal number in the range of -1 to +1 and is a measure of linear relationships between two variables.

A

Correlation coefficient (r)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  • What is a perfect positive correlation value?

- What is a perfect negative correlation value?

A

+1

-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

With correlation coefficient:

  • The sign of r indicates __________.
  • The absolute value of r indicates __________.
A
  • direction

- strength

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
  • How do we interpret correlation coefficient (r) values?

- Should these values be used as strict cutoff points? Why or why not?

A
  • 0-0.25 = little or no relationship
  • 0.25-0.50 = fair
  • 0.50-0.75 = moderate to good
  • 0.75-1 = good to excellent

-No, because they are affected by sample size, measurement error, and the types of variables beinig studied.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Correlation coefficient is a measure of ________ relationship only. It cannon be used for _____________ relationship.

A
  • linear

- curvilinear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are 4 types of correlation coefficients?

A
  • Pearson (Product-Moment)
  • Spearman Rank
  • Phi
  • Point Biserial
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q
  • _________ Correlation Coefficient is the most commonly reported measure of correlation.
  • It is appropriate when X and Y are __________ variables with underlying ________ distributions.
A
  • Pearson

- continuous, normal

17
Q
  • __________ Correlation Coefficient is a __________ analog of the Pearson (r).
  • It is appropriate for use when X and Y are ________ variables.
A
  • Spearman Rank, non-parametric

- ordinal

18
Q
  • _________ Correlation Coefficient is a special case of the Pearson (r), given only ____ values of X and Y.
  • It is appropriate for use when both X and Y are ___________ variables.
A
  • Phi, two

- dichotomous

19
Q
  • _____________ Correlation Coefficient is a special case of the Pearson (r).
  • It is appropriate when a ___________ X is correlated with a ___________ variable Y.
A
  • Point Biserial

- dichotomous X, continuous Y

20
Q

What are some precautions in using Correlation Coefficient?

A

Nonlinearity
-A strong curvilinear relationship may be identified as no correlation.
Outlier
-A single point can have a large influence on the correlation.
Interpretation is subjective
-No hard and fast rules that determine an (r) value is strong, moderate, or weak
-Interpretation should be based on the nature of the data, purpose of the research, and the researcher’s knowledge of the subject matter
Causation
-No causal relationship can be determined based on the (r) value
-Correlation of A and B is the same as correlation of B and A
-Correlation cannot be used to establish a cause-and-effect situation.

21
Q
  • Regression is when you look at the relationship between two variables in a ______________ situation.
  • Draw a _________ with a _________ line to visualize.
  • Compute a ________________ to quantify.
A
  • cause-and-effect
  • scatter plot, regression line
  • coefficient of determination (R²)
22
Q

___________ is used to predict values of one variable from another:
-(X -> Y) where X is the ___________ (predictor/explanatory) variable, and Y is the ___________ (outcome/response) variable.

A
  • Regression

- independent,dependent

23
Q

What are the 2 types of regression?

A
  • Linear

- Logistic

24
Q
  • __________ Regression is used to examine the causal relationship of the two variables, X and Y, that are linearly related.
  • Fits a regression line Y=a+bX through the points and estimate a and b
A

Linear

25
Q
  • __________ Regression s used to examine the causal relatonship of the two variables, X and Y, when Y is ______.
  • It reports the _________ to estimate the odds of membership in the target group linked to the predictor value.
  • ______________ can also be determined for each OR.
  • a significant OR will not contain the null value, 1.0, within its confidence interval.
A
  • Logistic, binary
  • odds ratio (OR)
  • Confidence Intervals (CI)
26
Q

Assumptions for Regression:

  • examine a plot of ________ where residual = Y-Ŷ
  • the residual scores will be randomly dispersed close to zero
  • Figure: patterns of resduals (Y-axis) plotted against predicted scores (X-axis).
    • _________ band demonstrates that assumptions for linear regression have been met.
    • residuals ________ as predicted values increase.
    • Curvilinear pattern indicates __________ relationship.
A

-residuals

  • horizontal band
  • increase
  • nonlinear
27
Q
  • The ______________ is the square of the Correlation Coeffecient (r).
  • R² represents the percentage of total variance in the Y scores that can be explained by the X scores.
  • 1-R² is the percentage of the total variance in Y not explained by the X scores.
  • A measure of proportion, indicating the accuracy of prediction based on __.
A
  • Coeffecient of Determination (R²)

- X

28
Q

Tests of Relationships (correlation; regression):

A

1

29
Q

Quiz 1:
What do the researchers use correlation for?

  1. ) Test the differences between two variables
  2. ) Compare the proportions between two variables
  3. ) Predict the effect of one variable upon another
  4. ) Identify the relationship between two variables
A

4.) Identify the relationship between two variables

30
Q

Quiz 2:
What do the researchers use regression analysis for?

  1. ) Test the differences between two variables
  2. ) Compare the proportions between two variables
  3. ) Predict the effect of one variable upon another
  4. ) Identify the relationship between two variables
A

3.) Predict the effect of one variable upon another

31
Q

Quiz 3:
Which of the following is a correct statement about correlation analysis?

1.) A strong linear relationship can be identified as a high correlation coefficient value
2.) It is important to include all the data points to compute the correlation coefficient correctly
3.) Correlation can be used to establish a cause-and-effect relationship
4.) Correlation coefficient is the ratio of risks of exposure vs. non-exposure groups
5.) A hard and fast rule exists for its interpretation in terms of strong, moderate or weak correlation

A

1.) A strong linear relationship can be identified as a high correlation coefficient value

32
Q

Quiz 6:
Which of the following is a correct statement about the correlation coefficient (r) and the coefficient of determination (R2)?

  1. ) Coefficient of determination (R2) is the percentage of variance in the predictor variable accounted for by the outcome variable
  2. ) Correlation coefficient (r) value ranges from 0 to +1
  3. ) Within the same study, the correlation coefficient (r) value can be computed by squaring the coefficient of determination (R2) value
  4. ) Coefficient of determination (R2) is the proportion of variance in the outcome accounted for by the predictor variable
  5. ) Coefficient of determination (R2) value ranges from -1 to 1
A

4.) Coefficient of determination (R2) is the proportion of variance in the outcome accounted for by the predictor variable

33
Q

Quiz 7:
Which of the following statistical procedure is proper to use when you want to study the relationship between the two variables, being interested in knowing which would have caused the other?

  1. Chi square test
  2. Student’s t-test
  3. Correlation analysis
  4. Analysis of variance
  5. Regression analysis
A
  1. Regression analysis
34
Q

Quiz 8:
Which of the following is a correct statement about regression analysis?

  1. Logistic regression is used to predict a continuous outcome variable
  2. Coefficient of determination (R2) value ranges from -1 to +1
  3. Coefficient of determination (R2) is a measure of linear association between two variables
  4. Coefficient of determination (R2) is a good measure for an index of predicted variance
  5. Linear regression is used to predict a binary outcome variable
A
  1. Coefficient of determination (R2) is a good measure for an index of predicted variance
35
Q

Summary for Tests of Relationships; Correlation, Regression (1):

  • The association between two variables can be measured using ___________ ____________
  • _________ (product-moment) correlation coefficient measures the linear correlation between two continuous variables X and Y
  • __________ _____ correlation coefficient is a nonparametric analog of the Pearson r and also appropriate for use when X and Y are ordinal variables
  • _____ ________ correlation coefficient is appropriate for use when X is dichotomous and Y continuous
  • ____ coefficient is appropriate for use when both X and Y are dichotomous variables
A
  • correlation coefficients
  • Pearson
  • Spearman Rank
  • Point Biserial
  • Phi
36
Q

Summary for Tests of Relationships; Correlation, Regression (2):

  • Correlation coefficient value r quantitatively describes the ________ and __________ of a relationship between two variables
  • The prediction of an outcome from a variable can be tested using __________ analysis
  • ________ regression examines the causal relationship of X to Y when Y is continuous
  • _________ regression examines the causal relationship of X to Y when Y is dichotomous
  • The coefficient of determination R2 quantitatively describes the percentage of the total variance in the Y scores that can be explained by the __________
A
  • strength and direction
  • regression
  • Linear
  • Logistic
  • X scores