Stat - Exam #3 Flashcards

Question

What is a FACTOR?

Answer 1

A classification variable used to separate data into several columns; -EX: Age, gender, species, class, etc; -LEVELS of a factor are the possible categories -EX: Factor = Classification Levels= Freshman, Sophomore

Answer 2

Conducts ONE hypothesis test to find out whether all three populations are EQUAL to each other;

Answer 3

- At least ONE population means is DIFFERENT from the others; - Cannot tell if only one population mean s different or if all are different

Answer 4

-F-test to find out which is closer to the truth

Answer 5

A stat test to determine whether two VARIANCES are equal; | -Tests for the equality of two variances

Answer 6

H0: sigma1 = sigma2; H1: sigma1 >/= sigma2

Answer 7

-Determines which Hypothesis is closer to the truth and follows the f-distribution; F0= (sigma1)^2 / (sigma2)^2

Answer 8

1. The total area under the curve equals 1; 2. The range starts at ZERO and goes to POSITIVE infinity; 3. Is SKEWED to the RIGHT

Answer 9

-Infinitely number of F-curves, but each F-curve has TWO degrees of freedom (F_alpha,df_n,df_d)

Answer 10

* *From the F-statistic = 1. The first degree of freedom is the degrees of freedom of the variance in the NUMERATOR; 2. The second degree of freedom is is the variance in the DENOMINATOR

Answer 11

-The computer will provide the P-value and use the P-value to make a conclusion

Answer 12

1. Simple random samples; 2. INDEPENDENT samples; 3. NORMAL populations; 4. EQUAL standard deviations

Answer 13

- Robust to moderate violations of assumptions; - Most robust when number of observations in each column is EQUAL = BALANCED sample; - Data do not have to distributed exactly normal; - Standard deviations do not have to be exactly equal as long as they are close by the "Rule of 2"

Answer 14

The ratio of the largest sample standard deviation to the smallest sample standard deviation is LESS than 2 **sigma_big/sigma_small

Answer 15

1. Normal shape = used KS-stat; | 2. Equality of the population variances = Levene’s Stat

Answer 16

By dividing the variation of the means b the variation within within each sample; -Bigger the number, the easier to see the means are different **F(0) = (variation b/w means/variation w/n sample)

Answer 17

1. an OVERALL test to see whether there is strong evidence of differences among the population means; 2. a detailed follow-up analysis to decide which of the population means differ and to estimate how large the differences are (=POST HOC Tests)

Answer 18

- LITTLE F-values = DO NOT REJECT the null: | - LARGE F-values = REJECT the null

Answer 19

- Uses the population variance between the population means to estimate the population variance (sigma^2); - Will NEVER know the true value of the population parameter, so must estimate a value for the population variance from the data by substituting the sample variance of the sample averages; - Calculate the variation between sample by treating the samples AVERAGES as data points and calculating their sample variance (similar to SUM of SQUARES) and multiply by the number of observations

Answer 20

Variation betweens samples = (n_1)(S^2) n_1 = number of observations; S^2 (of x-bar) = sample variance of the sample averages

Answer 21

- One assumption of ANOVA is all populations have SAME pop. variance (sigma^2); - Combine all observations in populations and determine the variance of this ONE column of data = POOLED population variance; - Estimate if from a sample by using the POOLED SAMPLE VARIANCE; **s^2_pooled = sigma^2

Answer 22

the F-stat!; **F_0 = (ns^2_x-bar)/(s^2_pooled)

Answer 23

- Calculates several sum of squared terms, then divides by the appropriate degrees of freedom to get an average of the sum of squared terms; - Called the “sum of squares” and the “means squares"

Answer 24

-Used to calculated to calculate the F-stat

Answer 25

- A measure of of the variation of the combined data around its sample average, called the GRAND MEAN (x-double bar); - How the computer measures the entire variation in a sample; - Breaks this variation into two components, sum of squares for the MODEL and sum of squares for the ERROR

Answer 26

SS(total) = SS(model) + SS(error)

Answer 27

- Estimate of the “variation between means” ; | - Sum of squares for the model is all so called the sum of squares for the TREATMENT (SSTr)

Answer 28

- Estimate of the “variation within the samples”; | - Sum of squares of the errors is the best measure of the population variance

Answer 29

- Estimate of the “variation within the samples”; | - Sum of squares of the errors is the best measure of the population variance

Answer 30

- Only increases the sum of squares ; * *Need to normalize the variation in the observations by dividing by the DOF of each variation, and this gives TWO estimates of the population variance needed for the F-stat

Answer 31

(n-1); the degrees of freedom for the MODEL and degrees of freedom for the ERROR add up to the TOTAL

Answer 32

*Mean Squares of Model; - DOF= (k-1) - Formula (SSM/(k-1))

Answer 33

*Mean Squares of Error; - DOF = (n-k) ; - Formula = SSE/(n-k)

Answer 34

*Mean Squares of Total; - DOF = (n-1); - Formula = SST/(n-1)

Answer 35

The info needed to calculate the F-STAT **F_0 = (MSM/MSE) ``` MSM = variation in samples MSE = variation within samples ```

Answer 36

The info needed to calculate the F-STAT **F_0 = (MSM/MSE) ``` MSM = variation in samples MSE = variation within samples ```

Answer 37

*P-Value!!

Answer 38

H0: u1 = u2 = u3; H1: At least ONE population mean is different from the others

Answer 39

A conclusion is made based on the value of the TEST STAT

Answer 40

*Small = mean that all pop. means are EQUAL

Answer 41

*Large = means that the pop means are farther apart than would be considered reasonable due to sample variation

Answer 42

*Large = means that the pop means are farther apart than would be considered reasonable due to sample variation

Answer 43

The variable that can be used to predict the value of the second variable; —Can cause the second variable or be strongly related to the second variable; — May come first temporally

Answer 44

The variable whose value can be explained by a first variable; — May come second temporally

Stat - Exam #3 Flashcards

(68 cards)