Stats Flashcards
z-test
variance is known
z = (ybar - mu)/(sigma/sqrt(n))
z = (ybar1 - ybar2)/sqrt(sigma1^2/n1 + sigma2^2/n2)
t-test
variance is not known
t = (ybar - mu)/(s/sqrt(n))
t = (ybar1 - ybar2)/(sp*sqrt(1/n1 + 1/n2)), sp = pooled std dev
CLT
Zn = (sum(xi) - n*mu)/(sigma*sqrt(n)) -> N(0, 1) as n -> infinity
95% confidence interval for the mean
y +/- z(SE Mean)
SE Mean = s/sqrt(n)
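A minimal Python sketch tying the z statistic, t statistic, and confidence interval together; the sample `y`, hypothesized mean `mu0`, and known `sigma` are hypothetical values:

```python
import numpy as np
from scipy import stats

y = np.array([9.8, 10.2, 10.1, 9.9, 10.4, 10.0])  # hypothetical sample
mu0, sigma = 10.0, 0.2                             # hypothesized mean, known sigma

n = len(y)
ybar = y.mean()

# z statistic (variance known)
z = (ybar - mu0) / (sigma / np.sqrt(n))

# t statistic (variance unknown, s estimated from the data)
s = y.std(ddof=1)
t = (ybar - mu0) / (s / np.sqrt(n))

# 95% CI: ybar +/- z * (SE Mean), SE Mean = s/sqrt(n)
se_mean = s / np.sqrt(n)
zcrit = stats.norm.ppf(0.975)
ci = (ybar - zcrit * se_mean, ybar + zcrit * se_mean)
print(z, t, ci)
```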
ANOVA
SS(Treatments), df = a-1, MS = SS/df, F = MS(Treatments)/MS(E)
SS(E), df = N-a, MS = SS/df
SS(T), df = N-1
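A small Python sketch computing the one-way ANOVA table entries above by hand; the data matrix (a treatments by n replicates) is hypothetical:

```python
import numpy as np

# hypothetical data: rows = treatments (a of them), columns = replicates (n each)
y = np.array([[24., 28., 37., 30.],
              [37., 44., 31., 35.],
              [42., 47., 52., 38.]])
a, n = y.shape
N = a * n

grand = y.sum()
ss_total = (y**2).sum() - grand**2 / N
ss_treat = (y.sum(axis=1)**2).sum() / n - grand**2 / N
ss_error = ss_total - ss_treat

df_treat, df_error, df_total = a - 1, N - a, N - 1
ms_treat, ms_error = ss_treat / df_treat, ss_error / df_error
F = ms_treat / ms_error
print(ss_treat, ss_error, ss_total, F)
```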
a in ANOVA
number of treatments
n in ANOVA
number of blocks
i in ANOVA
treatment
j in ANOVA
block
residual
yij - average(yi)
3 model adequacy checking graphs
(1) normal prob plot
(2) predicted values plot
(3) time series plot
normal prob plot
checks normality of residuals and catches outliers; if violated, transform the response
x = residual
y = normal % probability
predicted values plot
checks homogeneity of variance; address by controlling nuisance factors, randomizing, or transforming
x = predicted yi
y = residual
time series plot
tests independence
x = run order time
y = residual
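A sketch of the three diagnostic plots above, assuming hypothetical `fitted` values and `residuals` from some fit:

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats

# hypothetical residuals and fitted values from an ANOVA/regression fit
rng = np.random.default_rng(1)
fitted = np.repeat([30., 37., 45.], 4)
residuals = rng.normal(0, 2, size=fitted.size)
run_order = np.arange(1, fitted.size + 1)

fig, ax = plt.subplots(1, 3, figsize=(12, 3.5))
stats.probplot(residuals, plot=ax[0])            # (1) normal probability plot
ax[1].scatter(fitted, residuals)                 # (2) residuals vs. predicted values
ax[1].axhline(0); ax[1].set_xlabel("predicted"); ax[1].set_ylabel("residual")
ax[2].plot(run_order, residuals, marker="o")     # (3) residuals vs. run order/time
ax[2].axhline(0); ax[2].set_xlabel("run order"); ax[2].set_ylabel("residual")
plt.tight_layout(); plt.show()
```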
tests for equality of variance
(1) Bartlett's test
(2) modified Levene's test
Box-Cox
selects a power transformation (lambda) for the response
Contrasts
(1) orthogonal
(2) Scheffé - contrasts don't need to be specified in advance
Comparing Means
(1) Fisher LSD - does not control the overall error rate
(2) Tukey's test - controls the overall error rate
(3) Dunnett's test - when comparing treatments to a control
Determining sample size
(1) operating characteristic (OC) curves
(2) specifying a standard deviation increase to detect
Random Effects Model
factor levels are randomly selected from a larger population of levels
Randomized Complete Block Design (RCBD)
- blocks represent a restriction on randomization
- controls a nuisance factor
SS(treat)
(1/b)*sum(yi.^2) - y..^2/N
SS(block)
(1/a)*sum(y.j^2) - y..^2/N
SS(E)
SS(T) - SS(Treatments) - SS(Blocks)
SS(T)
sum(yij^2) - y..^2/N
df for RCBD
SS(Treatments), df = a-1
SS(Blocks), df = b-1
SS(E), df = (a-1)(b-1)
SS(T), df = N-1
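A sketch computing the RCBD sums of squares and F statistic above; the treatments-by-blocks data matrix is hypothetical:

```python
import numpy as np

# hypothetical RCBD data: rows = treatments (a), columns = blocks (b)
y = np.array([[ 9.3,  9.4,  9.6, 10.0],
              [ 9.4,  9.3,  9.8,  9.9],
              [ 9.2,  9.4,  9.5,  9.7],
              [ 9.7,  9.6, 10.0, 10.2]])
a, b = y.shape
N = a * b
correction = y.sum()**2 / N

ss_total = (y**2).sum() - correction
ss_treat = (y.sum(axis=1)**2).sum() / b - correction   # (1/b)*sum(yi.^2) - y..^2/N
ss_block = (y.sum(axis=0)**2).sum() / a - correction   # (1/a)*sum(y.j^2) - y..^2/N
ss_error = ss_total - ss_treat - ss_block

df = {"treat": a - 1, "block": b - 1, "error": (a - 1) * (b - 1), "total": N - 1}
F = (ss_treat / df["treat"]) / (ss_error / df["error"])
print(ss_treat, ss_block, ss_error, ss_total, F)
```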
Latin Square
- blocking in 2 directions
- 2 restrictions on randomization
- disadvantage: small error df; compensate by replicating (e.g., repeating the square with additional operators)
Latin Square setup
SS(Treatments), df = p-1
SS(Rows), df = p-1
SS(Columns), df = p-1
SS(E), df = (p-1)(p-2)
SS(T), df = p^2 - 1
Crossover
- eliminates the time-period effect
- may still have a residual (carryover) effect between periods
Graeco-Latin Square
blocks in 3 directions
Main effect
sum(A+)/2 - sum(A-)/2
Interaction
diff(A’s at B+)/2 - diff(A’s at B-)/2
SS(A)
(1/(bn))*sum(yi..^2) - y...^2/(abn)
SS(int)
(1/n)*sum(yij.^2) - y...^2/(abn) - SS(A) - SS(B)
df for factorial design
A: a-1
B: b-1
AB: (a-1)(b-1)
Error: ab(n-1)
Total: abn - 1
SS(blocks)
(1/(ab))*sum(y..k^2) - y...^2/(abn)
SS(A) for factorial
[a + ab - b - (1)]^2/(4n)
n is number of replicates
4 represents 2^2, would be 8 for 2^3
SS(T) df for factorial
4n - 1
Main effect for factorial
A = (1/(2n))[a + ab - b - (1)]
2 represents 2^2, would be 4 for 2^3
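A sketch showing the contrast, main effect, SS, and coded regression coefficient for a 2^2 design; the treatment-combination totals and replicate count are hypothetical:

```python
import numpy as np

# hypothetical 2^2 totals over n replicates, in standard order (1), a, b, ab
n = 3
one, a, b, ab = 80., 100., 60., 90.    # treatment-combination totals

contrast_A = a + ab - b - one
effect_A = contrast_A / (2 * n)        # main effect: contrast / (2^(k-1) * n)
ss_A = contrast_A**2 / (4 * n)         # SS(A): contrast^2 / (2^k * n)
coef_A = effect_A / 2                  # coded regression coefficient = effect / 2
print(effect_A, ss_A, coef_A)
```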
Coefficient for regression
effect/2 (half the effect estimate, in coded units)
R^2
ss(model)/SS(Total)
Orthogonality
(1) equal number of + and - signs in each column
(2) sum of elements in column = 0
(3) I * col -> unchanged
(4) products of any 2 columns yields a column already on table
VIF
1/(1 - Rj^2), where Rj^2 comes from regressing predictor j on the other predictors
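A sketch computing VIFs from a hypothetical model matrix by regressing each predictor on the others:

```python
import numpy as np

# hypothetical model matrix X (columns = predictors, no intercept column)
rng = np.random.default_rng(0)
x1 = rng.normal(size=20)
x2 = 0.8 * x1 + rng.normal(scale=0.5, size=20)   # deliberately correlated with x1
X = np.column_stack([x1, x2])

def vif(X, j):
    # regress column j on the remaining columns, then compute 1/(1 - R^2)
    y = X[:, j]
    others = np.delete(X, j, axis=1)
    A = np.column_stack([np.ones(len(y)), others])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    r2 = 1 - resid @ resid / ((y - y.mean()) @ (y - y.mean()))
    return 1 / (1 - r2)

print([vif(X, j) for j in range(X.shape[1])])
```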
Types of error
- standard error (for regression coefficient)
- pure (from replication)
- lack of fit (from pooling)
- residual (PE + LOF)
Dispersion effect
look at ranges
Half normal
plot of absolute effect estimates; active effects fall off the straight line
Defining relation
I = …
Design generator
A = BC (aliasing)
Resolution
length of the shortest word in the defining relation
Family
I = +/- ABC
Confirmation Experiment
set the factors at chosen levels, run, and compare the result to the regression model's prediction
Choosing a design
highest resolution
Number of treatment combinations
2^(5-2) = 8
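A sketch generating the runs of a 2^(5-2) fraction, assuming the generators D = AB and E = AC (one possible choice, not the only one):

```python
import itertools
import numpy as np

# base 2^3 design in factors A, B, C
base = np.array(list(itertools.product([-1, 1], repeat=3)))   # 2^3 = 8 runs
A, B, C = base.T

# generators (hypothetical choice): D = AB, E = AC
D, E = A * B, A * C

design = np.column_stack([A, B, C, D, E])
print(design.shape)   # (8, 5): 2^(5-2) = 8 treatment combinations for 5 factors
```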
Folding
change the signs of all factors; words of odd length switch sign in the defining relation
Combined defining relation
copy the + words; the - words drop out, but pairwise products of - words remain
Aliases
1/2([i] + [i]’)
Plackett-Burman
a different class of resolution III designs
- run size must be a multiple of 4
- non-regular (complex aliasing)
- non-geometric
- not flexible - cannot be represented as a cube
Super saturated
construct from a Plackett-Burman design: sort on the last column, then delete all rows with a - (or +) in it
- k > N - 1 (more factors than available df)
k
number of factors
Treatment design
- know how design is confounded
- prevent nuisance variables
- signal what we know and don’t know
Experimental design
- Randomize to prevent bias
- Figure out execution
Estimate correct alias
- prior knowledge of system
- interaction plot
- p-values for each individually
- run the other half-fraction
Empirical vs Mechanistic
model derived from data vs. model from a theoretical law
Regression
describes association only; no statement of effect, not causal
Missing data point
the regression can still be fit, but the estimates change slightly
Standard dev versus Confidence Interval
Variability in raw data versus variability in means
Prediction interval
interval for a single future observation (e.g., a confirmation run); wider than the CI on the mean
Lack of fit
how well the fitted model tracks the data; compares lack-of-fit variation to pure error
2 error terms for regression
pure, lack of fit
Response Surface Methodology
sequential process, method/path of steepest ascent
Procedure for method of steepest ascent/descent
(1) fit a 1st order model
(2) check error, interactions, quadratic effects (curvature)
(3) set a base step, e.g. delta(x1) = 1 for the factor with the largest |coefficient|; step the other factors in proportion to their coefficients
(4) convert the coded steps back to natural units
(5) test with new factor levels and keep stepping until the response stops improving
(6) perform a new factorial with the region of exploration centered around the best point
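A sketch of step (3), computing the coded path of steepest ascent from hypothetical first-order coefficients:

```python
import numpy as np

# hypothetical coded first-order model: yhat = b0 + b1*x1 + b2*x2
b = np.array([0.775, 0.325])          # b1, b2 from the fitted model

# base step of 1 coded unit in the factor with the largest |coefficient|,
# other factors stepped in proportion to their coefficients
base = np.argmax(np.abs(b))
step = b / abs(b[base])               # e.g. delta_x1 = 1, delta_x2 = b2/b1

# centers of the next few runs along the path (in coded units)
path = np.array([k * step for k in range(1, 6)])
print(step)
print(path)
```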
Why use center point?
- provide an error estimate without replicating the factorial runs
- check for curvature
- add df for error
Central composite design
- nF factorial runs, nC center-point runs, 2k axial (star) runs
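A sketch assembling CCD runs for a hypothetical k = 2, using the rotatable axial distance alpha = nF^(1/4):

```python
import itertools
import numpy as np

k = 2                                               # number of factors (hypothetical)
factorial = np.array(list(itertools.product([-1, 1], repeat=k)))  # nF = 2^k runs
alpha = len(factorial) ** 0.25                      # rotatable choice: alpha = nF^(1/4)
axial = np.vstack([s * np.eye(k) for s in (alpha, -alpha)])        # 2k axial runs
center = np.zeros((5, k))                           # nC center-point runs (5 here)

ccd = np.vstack([factorial, axial, center])
print(ccd.shape)                                    # (2^k + 2k + nC) runs
```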
Sequential central composite design
(1) 1st order design shows lack of fit (curvature)
(2) add axial points to allow quadratic terms to be fit
Rotatable CCD
- a desirable property for the fitted second-order model
- prediction variance is the same for all points equidistant from the design center
Box-Behnken
- each run has at least one factor at its center level
- all design points equidistant from the center point, leading to equal variance
- spherical, no points at the vertices (corners)
If you need a “-“ value for time
- don't collect it; treat it as a missing value
- change the other factor levels -> shift the design
- treat it as a constrained region - use a D-optimal design
- inscribed CCD (scaled to fit inside the box)
- face-centered CCD -> axial points placed on the cube faces (alpha = 1)
Evolutionary operation
- constant monitoring and improving
- slight changes
- more data to find smaller differences
- runs over a longer period of time, so lurking variables can creep in
Mixture
- factor levels (component proportions) are not independent; they sum to 1
- lattice simplex
- centroid simplex
Lattice Simplex
{p, m}
p = number of components in the mixture (e.g., sugar, cream)
m = number of equally spaced proportion levels minus 1; each component takes the values 0, 1/m, ..., 1 (e.g., m = 3 gives 0, 1/3, 2/3, 1)
p = 3 means a 2-D simplex (triangle); m = 2 means 3 points along each edge
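A sketch enumerating the {p, m} simplex-lattice points for the {3, 2} case mentioned above:

```python
from itertools import product
from fractions import Fraction

def simplex_lattice(p, m):
    # all p-component mixtures whose proportions are multiples of 1/m and sum to 1
    levels = [Fraction(i, m) for i in range(m + 1)]
    return [pt for pt in product(levels, repeat=p) if sum(pt) == 1]

points = simplex_lattice(3, 2)     # {3, 2} lattice: 6 blends on the triangle
print(len(points), points)
```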
Centroid simplex
2^p - 1 runs
Lattice vs. centroid
lattice is more flexible than centroid
Axial blends
check blends along the component axes, in the interior of the simplex
Model Adequacy
re-checked on the second pass, after the model has been refined