Exam 4 Flashcards

1
Q

define Analysis of Variance

A

Analysis of variance is a technique that allows us to compare two or more populations of interval data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Analysis of variance is: three things….

A

Analysis of variance is:
 an extremely powerful and widely used procedure
 a procedure which determines whether differences exist between population means
 a procedure which works by analyzing sample variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

example of a one-way analysis of variance

A

Examples: Accident rates for 1st, 2nd, and 3rd shift Expected mileage for five brands of tires

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

assumptions of a one-way analysis of variance

A

Populations are normally distributed
Populations have equal variances
Samples are randomly and independently drawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

x is the ________ variable, and its values are __________.

A

x is the response variable, and its values are responses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

xij refers to the _________, ___________

A

xij refers to the observation, treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Each population is a _______ ________.

A

Each population is a factor level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Population classification criterion is called a _______

A

Population classification criterion is called a factor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

hypothesis of One-way ANOVA H0

A

H0: All population means are equal

i.e., no factor effect (no variation in means among groups)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

hypothesis of One-way ANOVA H1

A

H1: At least one population mean is different
i.e., there is a factor effect
Does not mean that all population means are different (some pairs may be the same)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Since µ1 = µ2 = µ3 = µ4 is of interest to us, a statistic that measures the proximity of the sample means to each other

A

between-treatments variation. It is denoted SST, short for “sum of squares for treatments”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

SSE (_____ ___ _______ ___ ______) measures the ______-_______ _________.

A

SSE (Sum of Squares for Error) measures the within-treatments variation. measure of the amount of variation we can expect from the random variable we’ve observed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

MST stands for

A

mean square for treatments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

MSE stands for

A

Mean square for errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

ANOVA: in the F table….

A

numerator degrees of freedom determine the column

denominator degrees of freedom determine the row

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

ANOVA: Degrees of freedom (for F crit)

A

df1=k-1 (MST), df2=n-k (MSE)

17
Q

ANOVA: Treatments df

A

treatments df: k-1 (k=treatments)

18
Q

ANOVA: Error df

A

Error df: n-k (n= #of obs, k=treatments)

19
Q

ANOVA: Total df

A

n-1

20
Q

What is a multinomial experiment?

A

Unlike a binomial experiment which only has two possible outcomes (e.g. heads or tails), a multinomial experiment:

•Consists of a fixed number, n, of trials.
   	• Each trial can have one of k outcomes, called cells.
• Each probability pi remains constant.
• Our usual notion of probabilities holds, namely:
	p1 + p2 + … + pk = 1
   	• Each trial is independent of the other trials.
21
Q

what is a Chi-squared goodness of fit test used for

A

How “close” are the observed values to those which would be expected under the fitted model

22
Q

how are degrees of freedom calculated for a X^2 test of a contingency table?

A

r-1 x c-1

23
Q

what is ei=npi

A

expected frequency

24
Q

which test statistic measures the similarity of the expected and observed frequencies?

A

chi-squared goodness of fit test

25
Q

when performing a test of a contingency table, what should you do if the expected frequency isn’t given?

A

row total x column total / n

26
Q

CSGOF: observed frequency, fi, comes from where

A

actual number from the problem

27
Q

CSGOF: expected frequency is calculated as:

A

n x pi

28
Q

CSGOF: Delta (difference) is calculated as:

A

( fi - ei ) Observed minus expected

29
Q

CSGOF: Summation Component is calculated as:

A

( fi - ei )^2 / ei (observed minus expected) / expected

30
Q

The Chi-squared test of a contingency table is used to:

A

The Chi-squared test of a contingency table is used to:
• determine whether there is enough evidence to infer that two nominal variables are related, and
• to infer that differences exist among two or more populations of nominal variables.

In order to use these techniques, we need to classify the data according to two different criteria.

31
Q

Chi-Squared test of a contingency table expected value calculation

A

eij = (row total x column total) / n n=number of obs

32
Q

chi-squared distribution with ________________ degrees of freedom

A

chi-squared distribution with (r – 1)(c – 1) degrees of freedom

33
Q

degrees of freedom chi-squared test of a contingency table

A

( r - 1 ) ( c - 1 ) r=rows c=columns

34
Q

chi-squared test of a contingency table Rule of 5

A

In a contingency table where one or more cells have expected values of less than 5, we need to combine rows or columns to satisfy the rule of five.

35
Q

chi-squared test of a contingency table example

A

120 females, 12 were left handed, 108 were right handed

180 males, 24 were left handed, 156 were right handed

36
Q

chi-squared test of a contingency table expected value

A

Expected value = (row total x column total) / grand total