Chpt 13 Flashcards
What does ANOVA stand for
Analysis of Variance
what is the F- ratio
is the ratio of 2 variables
What does ANOVA allow us to do?
allows us to compare multiple pops and even subgroups of these pops
- how two groups interact with each other quantitatively
What question does ANOVA help us answer
do all 3 means come form a common population
- we are not asking if they were exactly equal. we are asking if each mean likely came from the larger overall population
What is the Null hypothesis for ANOVA
HO= M1 = M2 = M3
What is the problem with using pairwise comparison for 3 pop means
the type I error will compound with each t-test
95% confidence = (.95)(.95)(.95) = .857
so, a (or critical value) would be come 1 - .857 =
143
Type 1 error rate went from 5% (0.05) to 14.3%
what is partitioning
separating total variance into its component parts
- we do this by using ANOVA
What is the variability between the means
distance from overall mean
if the variability between the means (distance from overall mean) in the numerator is relatively Large compared to the variance within the samples the ratio will be
much larger than 1
- the samples mostly likely do NOT come from a common pop
- reject Ho that means are equal
What is the variability within the samples called
internal spread (the denominator)
If the F ratio is similar/similar what does this tell us
- Fail to reject Ho
- means are fairly close to overall mean and/ or distributions overlap a bit
If the F ratio is Small /Large
- Fail to reject Ho
- the means are very close to overall mean and/or distributions melt together
What is the formula for f ratio
B/W / W/in or Among / Around
variance b/w + Variance w/in (error variance) =
Total variance
Factor definition
independent variable (ie. assembly method)
What are the required assumptions for ANOVA
- normally distributed
- distributions must be independent
- the variance of the response variable (Qsquared) is the same for all pops
What are the steps to ANOVA
- Calculate sample mean for each pop
- calculate overall mean for all pops (add up all means / # of means)
- Estimate the variance (Xbar 1- Overall mean)squared / n-1
- compute the sum of squares b/w treatments
- computer mean squares b/w treatments
- calculate sum of squares due to error
- calculate the mean squares due to error
- Setup the ANova table
- calculate f-ratio and p-value
what is SSTR stand for
sum of squares b/w treatments
what is MSTR
mean square b/w treatments
what is SSE
sum of squares due to error
What is MSE
mean square due to error
what is the formula for SSTR
sum (# of sample)(pop1 mean - overall mean)sqaured (do for each set of pops)
What is the formula for MSTR
SSTR/k-1
What is the formula for SSE
(# of samples)(Variance of pop1) + (# of samples) (Variance of pop2) + (# of samples)(Variance of Pop3)
What is the formula for MSE
SSE/nr-k
What is the F-ratio formula
MSTR/MSE
what is k-1
3 pops - 1 = degrees of freedom
What is nr-k
total # of sample for all 3 pops (ex. each contain 5 samples than nr = 5x3
k = total pops (in this case 3)
so df = 15 - 3
What is Fishers LSD
remember ANOVA tells us if at least 2 of the groups are different from each other
- Fisher’s LSD tests 2 specific groups against each other
what does LSD stand for
LEast Significant Difference
What is the formula for Fisher’s LSD
t a/2 x square root of MSE (1/n1 + 1/n2)
what is t a/2
critical value using within degrees of freedom and alpha / 2
What do you compare LSD to
(xbari - xbarj)
- reject if (xbar i - xbarj) is greater than or equal to LSD
- do this for each group
LSD is used to determine
where the differences occur
what is the null hypothesis for LSD
HO: Mi = Mj
what is the test statistic for LSD
t = (xbar i - xbarj) / square root of MSE (1/ni + 1/nj)
What is the rejection rule for LSD - pvalue approach
reject HO if Pvalue is less than or equal to a (CV)
what is the rejection rule of LSD - cv approach
reject HO if t is less than or equal to - t a/2 or
t is greater than or equal to t a/2
what is the rejection rule of LSD - cv approach
reject HO if t is less than or equal to - t a/2 or
t is greater than or equal to t a/2
what is the degrees of freedom for LSD and t- distribution
t a/2 is based on a t-distribution with nT-k degrees of freedom
what is T???
LSD and Confidence intervals - if the confidence interval includes the value 0
we cannot reject Ho, that the pop means are equal
If the LSD confidence interval does not include the value 0
we can conclude there is a difference in pop means
- do not reject Ho
what is a Comparisonwise Type1 Error rate
indicate the level of significance associated with a single pairwise comparison a = 1-.95 = 0.05
What is a experimentwise type 1 error rate
Prob we will not make a type 1 error for all 3 tests
.95)(.95)(.95
- this gets larger the more groups you have
What is the experimentwise type 1 erorr rate denoted as
aEW
What is the experimentwise type 1 erorr rate denoted as
aEW
How do we control the overall experimentwise error rate
use Bonferroni Adjustment
What is the Bonferroni Adjustment
we use smaller comparisonwise error rate for each test
What is the formula for Bonferroni adj
aEW / C (C to test c pairwise comparisons)
ex.
a = 0.05 / 3 pops = 0.017
What are some other procedures we could use to control the overall experimentwise error rate
- Tukey’s procedure
2. Duncan’s multiple range test
When is randomized block design used
useful when the experimental units are homogenous
What do we use if exeperimental units are heterogenoeous
Blocking is often used to form homogenous groups
Problem with Randomized block design? (double check this is what it is referring to)
can arise whenever differences due to extraneous factors (ones not considered in the experiment) cause the MSE term to become too LARGE
- this can cause the f-value to be small, signaling no difference among treatment means when in fact a difference exists
HOw do you compute f-ratio for randomized block design
F = MSTR/MSE
In our example what would the workstation be
the factor of interest
in our example of randomized block design what would the controllers be
the blocks
what would the treatments be in a randomized block design
the pops
- 3 treatments (or pops) associated with workstation factor correspond to the 3 workstation alternatives
what would the treatments be in a randomized block design
the pops
- 3 treatments (or pops) associated with workstation factor correspond to the 3 workstation alternatives
What is the randomized aspect
is the random order in which the treatments (systems) are assigned to controllers
- 6 controllers were selected at random and assigned to operate each of the systems
- a follow up interview and a medical exam of each controlelr in the study provided a measure of stress for each controller on each system
What is SST = for randomzied block design
SST = SSTR + SSBL + SSE
What does k represent in randomized block design
the # of treatments
What does b represent in randomized block design
of blocks
What does nT represent in randomzied block design
total sample size (nT = kb)
What are the steps in randomized block design
- compute SST (total sum of squares)
- Compute SSTR (Sum of squares due to treatments)
- Compute SSBL Sum of Squares due to blocks
- Compute SSE (sum of squares due to error)
What is the formula in randomized block design for SSE
SSE = SST - SSTR - SSBL
What is the formula in randomized block design for SST
sum (Xbar - total block mean) squared
What is the formula in randomized block design for SSTR
(# in sample){sum (treatment mean - block mean)squared
What is the formula for SSBL
(# of pops) [(block mean - total block mean) squared]
What does SSBL mean
sum of squares due to blocks
What is SST
Total sum of squares
what is the degrees of freedom for SSTR
k- 1 ( # of pops - 1)
what is the degrees of freedom for SSBL in randomized block design
b-1 (# of blocks -1)
What is the degrees of freedom for SSE in randomized block design
(k-1)(b-1)
What is the degrees of freedom for SST in randomized block design
nT-1
read notes - i left some out
ready notes i left some out
Describe a factorial experiment
an exerimental design that allows simultaneous conclusions about 2 or more factors
Why use Factorial
used becasue the experimental conditons include all possible combinations of hte factors
Give an example of a Factorial Experiment
study involving (GMAT)
- scores range form 200 to 800
higher scores imply higher aptitude
- to impreove the GMAT scores, consider 3 prep programs
- each program has 3 treatments (the program they are in business, Engineering, Arts)
- second factor - whether a student’s undergrad affects the GMAT score (college)
What would be if we have 3 treatments (prep programs for GMAT) combinations in factorial design if we have 2 factors
factor 1 - the prep program
factor 2 - college attended
3 x 3 = 9 treatment combinations
What is replications
the sample size of 2 for each treatment combination indicates we have 2 replications
What is the formula in Block design for SST
sum (sample - overall mean)sqaured (for all samples)
What is the formula in block design for SSTR
of blocks [(treatment mean - overall treatment mean) sqaured) + (Treatment mean#2 - overall treatment mean) squared) + (treatment mean #3 - overall treatment mean) Squared)