Exam 4 Flashcards

1
Q

define Analysis of Variance

A

Analysis of variance is a technique that allows us to compare two or more populations of interval data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Analysis of variance is: three things….

A

Analysis of variance is:
 an extremely powerful and widely used procedure
 a procedure which determines whether differences exist between population means
 a procedure which works by analyzing sample variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

example of a one-way analysis of variance

A

Examples: Accident rates for 1st, 2nd, and 3rd shift Expected mileage for five brands of tires

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

assumptions of a one-way analysis of variance

A

Populations are normally distributed
Populations have equal variances
Samples are randomly and independently drawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

x is the ________ variable, and its values are __________.

A

x is the response variable, and its values are responses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

xij refers to the _________, ___________

A

xij refers to the observation, treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Each population is a _______ ________.

A

Each population is a factor level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Population classification criterion is called a _______

A

Population classification criterion is called a factor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

hypothesis of One-way ANOVA H0

A

H0: All population means are equal

i.e., no factor effect (no variation in means among groups)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

hypothesis of One-way ANOVA H1

A

H1: At least one population mean is different
i.e., there is a factor effect
Does not mean that all population means are different (some pairs may be the same)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Since µ1 = µ2 = µ3 = µ4 is of interest to us, a statistic that measures the proximity of the sample means to each other

A

between-treatments variation. It is denoted SST, short for “sum of squares for treatments”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

SSE (_____ ___ _______ ___ ______) measures the ______-_______ _________.

A

SSE (Sum of Squares for Error) measures the within-treatments variation. measure of the amount of variation we can expect from the random variable we’ve observed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

MST stands for

A

mean square for treatments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

MSE stands for

A

Mean square for errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

ANOVA: in the F table….

A

numerator degrees of freedom determine the column

denominator degrees of freedom determine the row

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

ANOVA: Degrees of freedom (for F crit)

A

df1=k-1 (MST), df2=n-k (MSE)

17
Q

ANOVA: Treatments df

A

treatments df: k-1 (k=treatments)

18
Q

ANOVA: Error df

A

Error df: n-k (n= #of obs, k=treatments)

19
Q

ANOVA: Total df

20
Q

What is a multinomial experiment?

A

Unlike a binomial experiment which only has two possible outcomes (e.g. heads or tails), a multinomial experiment:

•Consists of a fixed number, n, of trials.
   	• Each trial can have one of k outcomes, called cells.
• Each probability pi remains constant.
• Our usual notion of probabilities holds, namely:
	p1 + p2 + … + pk = 1
   	• Each trial is independent of the other trials.
21
Q

what is a Chi-squared goodness of fit test used for

A

How “close” are the observed values to those which would be expected under the fitted model

22
Q

how are degrees of freedom calculated for a X^2 test of a contingency table?

23
Q

what is ei=npi

A

expected frequency

24
Q

which test statistic measures the similarity of the expected and observed frequencies?

A

chi-squared goodness of fit test

25
when performing a test of a contingency table, what should you do if the expected frequency isn't given?
row total x column total / n
26
CSGOF: observed frequency, fi, comes from where
actual number from the problem
27
CSGOF: expected frequency is calculated as:
n x pi
28
CSGOF: Delta (difference) is calculated as:
( fi - ei ) Observed minus expected
29
CSGOF: Summation Component is calculated as:
( fi - ei )^2 / ei (observed minus expected) / expected
30
The Chi-squared test of a contingency table is used to:
The Chi-squared test of a contingency table is used to: • determine whether there is enough evidence to infer that two nominal variables are related, and • to infer that differences exist among two or more populations of nominal variables. In order to use these techniques, we need to classify the data according to two different criteria.
31
Chi-Squared test of a contingency table expected value calculation
eij = (row total x column total) / n n=number of obs
32
chi-squared distribution with ________________ degrees of freedom
chi-squared distribution with (r – 1)(c – 1) degrees of freedom
33
degrees of freedom chi-squared test of a contingency table
( r - 1 ) ( c - 1 ) r=rows c=columns
34
chi-squared test of a contingency table Rule of 5
In a contingency table where one or more cells have expected values of less than 5, we need to combine rows or columns to satisfy the rule of five.
35
chi-squared test of a contingency table example
120 females, 12 were left handed, 108 were right handed | 180 males, 24 were left handed, 156 were right handed
36
chi-squared test of a contingency table expected value
Expected value = (row total x column total) / grand total