module 10 Flashcards

1
Q

Correlation tests are trying to determine if there is an ______ between variables

A

association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

t or f: correlation can imply causation

A

false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is the strength of association between two numerical variables measured by

A

pearson’s correlation coefficient, used to evaluate a sample correlation coefficient against a null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

for the purposes of hypothesis testing, the population parameter is given the greek letter ____

A

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The correlation coefficient can take on values anywhere from ⍴=____ to ⍴=_____

A

⍴=-1 to ⍴=1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

p=-1 means a perfect ______, a value of ⍴=0 means ____, and a value ⍴=1 means a perfect _____

A

negative correlation, no association, positive correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Bivariate normal distribution

A

extension of the normal distribution for two numerical variables that allows for an association between them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what are the steps for conducting a correlation test

A
  • define null and alt hypotheses
  • establish null distribution
  • conduct stat test
  • draw scientific conclusion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

the null and alt hypotheses for correlation test if its directional
Ho: ⍴=____
Ha: ⍴≠____

A

HO: ⍴=0
HA: ⍴≠0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

the null and alt hypotheses for a strong or weak correlation test
Ho: ⍴____0 (or ⍴____0)
Ha: ⍴____0 (or ⍴___0)

A

Ho: ⍴≤0 (or ⍴≥0)
Ha: ⍴>0 (or ⍴<0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the null distribution for a correlation test?

A
  • t-distribution
  • therefore, correlation tests are special cases of single sample t tests
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

______ the null hypothesis if the observed score is greater than the critical score (i.e., tO>tC) or if the p-value is smaller than the Type I error rate (i.e., p<⍺).

A

reject

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

______ the null hypothesis if the observed score is less than or equal to the critical score (i.e., tO≤tC) or if the p-value is larger or equal to the Type I error rate (i.e., p≥⍺).

A

fail to reject

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

For a correlation test, the scientific conclusion depends on _______ in the hypotheses.

A

directionality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

t or f
For non-directional hypotheses, the conclusion is either:

Reject the null hypothesis and conclude that there is evidence of an association between the two numerical variables.
Fail to reject the null hypothesis and conclude that there is no evidence of an association between the two numerical variables.

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

For directional hypotheses, the conclusion is either….

A

For directional hypotheses, the conclusion is either:

Reject the null hypothesis and conclude that there is evidence of a positive (or negative) association between the two numerical variables.
Fail to reject the null hypothesis and conclude that there is no evidence of a positive (or negative) association between the two numerical variables.

17
Q

what does r refer to in relation to correlational tests

A

the statistical test used to evaluate a sample correlation coefficient against a null hypothesis.

18
Q

linear regression test

A

determines if changes in one variable can predict changes in another variable

19
Q

in linear regression, one variable is the ____ and the other is the _____

A

predictor (independent), response (dependent)

20
Q

Sampling error is considered to only occur in the _____ variable and not in the ______ variable

A

response, predictor

21
Q

the _____ variable is always the variable you want to make predictions about

A

response

22
Q

Linear regression assumes that the relationship between the numerical variables is described by ______

A
  • y=a+bx
  • b=slope
  • a=intercept
23
Q

3 components to the statistical model

A
  • systematic component (makes predictions, the equation)
  • random component (probability distribution for sampling error, normal distribution for the response variable)
  • link function (connects systematic component to the random component)
24
Q

what do you need to estimate in order to fit the statistical model to data

A

the intercept and slope that best explains the data (a and b in the equation)

25
Q

residual difference

A
  • (r)
  • difference between observed data and predicted value
  • how far apart your actual data point and the line of the slope are
26
Q

residual variance

A
  • average squared residual value across all data points
  • aka sum of squares (SSQ)
27
Q

how is residual variance calculated

A
  • Calculate the residual for each data point
  • Take the square of each residual
  • Sum the squared residuals across all data points
  • Divide by the degrees of freedom, which are df=n-2
28
Q

n

A

number of sampling units

29
Q

how to minimize the sums of squares/residual variance

A

vary the slope and intercept parameters

30
Q

systematic component

A

mathematical relationship that connects the predictor variable to the response variable

31
Q

how many hypotheses can be tested for linear regression

A

two, one for each parameter (intercept and slope)

32
Q

varying the intercept changes the _____ and varying the slope changes the ____

A

height of the line, relationship between variables

33
Q

directional null and alt hypotheses for intercept

A

HO: a≤βa
HA: a>βa

34
Q

non directional null and alt hypotheses for intercept

A

HO: Intercept is not different from a reference value, or a=βa in symbols

HA: Intercept is different from a reference value, or a≠βa in symbols

35
Q

directional null and alt hypotheses for slope

A

HO: b≤βb
HA: b>βb

36
Q

non directional null and alt hypotheses for slope

A

HO: Slope is not different from a reference value, or b=βb in symbols
HA: Slope is different from a reference value, or b≠βb in symbols

37
Q

four assumptions to linear regression

A
  • linearity (response can be described by a linear combo of the predictor variable)
  • independence (predictor variable should be independent of each other)
  • normality (normally distributed, shapiro wilks test)
  • homoscedasticity (should b similar across range of predictor variable)
38
Q

what is the shapiro wilks test used for

A
  • evaluate the assumption of normality quantitatively, if the residuals are normally distributed or not
39
Q

t or f: There is no 1-to-1 map between violations of assumptions and trustworthiness of the statistical conclusions.

A

true