Hypothesis Testing 2 Flashcards

1
Q

what is analysis of variance (ANOVA)?

A

Most effective way of analysing data

Multiple variations

possible to use wrong analysis of variance

Inaccurate conclusions result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What equation is used to compare more than two poulation means

A

To compare more than two population means.
Ho: 𝜇1= 𝜇2= 𝜇3= 𝜇4= … … 𝜇n ;n= population

Ha : At least one mean differs from others.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 2 types of ANOVA tests?

A

One-Way ANOVA (F-test)
- Only on independent variable tested

N-way ANOVA
- Two independent variables tested

Analysis of Covariance (ANCOVA)
- if independent variables are all categorical, some of the independent variables are categorical and some are continuous ANCOVA will be more appropriate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a post-hoc test used for?

A

used after statistically significant results are found and where the differences came from

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the different types of post-hoc tests?

A

Methods

fisher PLSD

Student-newman-keuls (SNK)

Duncans multiple range test

Scheffé’s test

Dunnett’s test

Bonferroni

for more info go to slide 7 lect 8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the 3 assumptions of analysis of variance ?

A

Normally distributed data
- natural variability of measurement is normally distributed

Homogeneity of variance
- although means may vary from group to group the variance is relatively constant in all treatment groups

Treatment effects are additive
- effect of treatment is assumed to have a quantity either positive or negative to the control

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the kruskal-Wallistest?

A

Nonparametric test.

Allows to compare more than two population

Null Hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Friedman’s test?

A

Nonparametric Test

Three or more measures or experimental conditions.

Can be used in repeated measures.

Nonparametric version of two-way ANOVA

need random sample

need 1 independent and 1 dependant variable

minimum of 12 participants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what does larger sample sizes mean and what value does p need to be to reject a null hypothesis?

A

Larger sample size leads to accurate estimates.

P< 0.05 – Null hypothesis is rejected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you calculate the sample size tests?

A

Unpaired t-test

N = (Zα + Zβ)^2 2σ^2/d^2

Where:
N = required sample size
d = size of difference to be detected

Zα= z value corresponding to chosen alpha level
* Obtained from z tables
* For example, z = 1.96 for alpha = 0.05 and 2-tailed test
* Recall that alpha defines protection against type I errors

Zβ= z value corresponding to chosen beta level
* Obtained from z tables
* For example, z = 0.84 for beta = 0.20 and 2-tailed test
* Recall that beta defines protection against type II errors
* Power is 0.8 (80%) when beta = 0.20

σ = population standard deviation
* usually estimated from previous experiments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you calculate the power of a statistical test?

A

Zβ = (√ N.d/ √ 2. σ) – Zα

Where:
N = actual sample size
d = difference actually detected

Zα = z value corresponding to chosen alpha level
Obtained from z tables as previously described

Zβ = z value corresponding to actual beta level

σ = actual standard deviation

Beta is then the probability read from z tables

Power (%) = 100 (1 – Beta)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of correlation?

A

To determine whether the relationship between two variables is
statistically significant.

To determine whether the relationship is positive or negative.

To determine what proportion of the variability in one variable can be accounted for by the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how are variables named and how are correlations established?

A

Correlations establish whether two variables have a linear relationship

Variables:
Y = dependent, outcome or response variable
X = independent, predictor or explanatory variable
X1, X2 = no clear
dependent or independent

variable Convention used in this chapter:
Upper case letter = variable
Lower case letter = individual measurements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is pearsons parametric correlation coefficient?

A

Product moment correlation coefficient
r = ∑xy/√(∑x^2∑y^2)

Where:

∑x^2 is the sums of squares of the X values

∑y^2 is the sums of squares of the Y values

∑^xy is the sum of products of the individual
pairs of X and Y values

r = 1 for perfect linear relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how do you interpret pearsons correlation coefficient?

A

r ranges from -1, through 0, to 1

r = +1
line from bottom left corner to top right corner on graph
strong positive correlation

r= -1
line from top left to bottom right on graph
strong negative correlation

r= +0.59
less scatter than 0 more that -1 and +1
weak positive correlation

r=0
lots of scatter no correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the limitation for the pearsons correlation coefficient?

A

Only applies to linear relationships.

For large samples, weak correlations may be statistically significant despite being of little practical value.

Coefficient of determination:
- Equals r^2
- Proportion of variation in
dependent variable that is
accounted for by variation
in independent variable.

Only homogenous groups should be included in the analysis.

A statistically significant correlation does not necessarily imply a ‘causal’
relationship.

17
Q

What is the bivariate normal distribution?

A

The marginal distributions of x and y are normally distributed

This distribution is defined by the:
- mean of the dependent
variable
- mean of the independent
variable
- standard deviation of the
dependent variable
- standard deviation of the
independent variable
- population correlation
coefficient

18
Q

What are other correlation coefficient methods?

A

Spearman’s nonparametric correlation coefficient.

Kendall’s nonparametric correlation coefficient.

Bland and Altman’s intra-class correlation coefficient.