exam 3 Flashcards

1
Q

Define ANOVA

A

Analysis of variance between groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why use ANOVA?

A

To test more than two groups as opposed to T-test which test only for two groups. The difference between the two groups is that ANOVA compares the difference of more than 2 means while T test compares only the difference between 2 means.
.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Objective of ANOVA

A

Test for the difference between the means of two or more groups on one factor/dimension/variable. Each group is only tested once.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Question to ask when using ANOVA

A

is there a difference between the means of the different groups? And is this difference more than we would expect by chance?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Hypothesis and ANOVA

A

Null Hypothesis: There is no difference between groups on variable/measure X except that which is expected by chance.
H0: u1 = u2 = u3
Research Hypothesis: There is a difference between groups on variableX that is more than what is expected by chance; There is at least one difference that is significant. H1: x1 ≠ x2 ≠ x3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Hypotheses and ANOVA notes

A

–(this does not tell us which one of the three differences is responsible for the rejection of the null. It could be one of the three, it could be all of the three)
–“More than what is expected by chance” – we interpret this to mean that it is due to the grouping variable.
Non-directional. All ANOVAs are non-directional.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Types of ANOVA we will use

A

1) Simple Analysis of Variance/one-way analysis of variance

2) Factorial design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Simple analysis variance-ANOVA

A

Where there is one factor or one treatment variable i.e. group membership, this is also called one-way analysis of variance because there is only one grouping dimension.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Factorial design-ANOVA

A

Factorial ANOVA (e.g. 3x2): Effect of exercise (high, medium, low impact) on weight loss by gender.
–Factor 1 (Independent Variable)Treatment (high/medium/low impact exercise)
–Factor 2 (Independent Variable)Gender (male/female)
–Weight Loss Outcome (Dependent Variable)
–Questions to be answered: Main effect of exercise? Main effect of gender? Interaction between exercise & gender?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

COMPUTING ANOVA

A

•F Test Statistic

F = MSBetween / MS Within

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Logic behind this ratio: (test statistics)

A
  • Mean squares are estimates of variance
  • Within Group Variance is Due to Chance (individual difference)
  • Between Group Variance is due to the grouping category (Independent Variable)
  • An increasing F value…
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

STEPS to compute ANOVA

A
1)A statement of Null and Research Hypotheses
Null H0:  u1 = u2 = u3
 Research H1:  x1 ≠ x2 ≠ x3
2)Level of risk .05 (always)
3)Test statistic  ANOVA 
F =  MSBetween / MS Within
4)Compute the test statistics value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

•The F ratio definition:

A

Ratio of variability between groups to variability within groups. To get this we compute the sum of squares for each group of variability between groups; within groups; and the total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

explain between Groups:

A

The sum of the differences between the mean of all scores and the mean of each group’s score, squared. (How different is each group’s mean from the overall mean?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

explain Within Groups:

A

The sum of the differences between each individual score in a group and the mean of each group, squared. (How different is each score in a group from the group mean?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Total in ANOVA means:

A

The sum of the between-group and within-group sum of squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Total sum of squares in ANOVA means:

A

Sum between –group and within-group sum of square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Degrees of freedom definition

A

approximation of the sample or the group size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

There are 2 sets of degrees of freedom for ANOVA

A
  • between group k-1 (k equals the number of groups)– groups minus 1
  • within group N-k (N equals the total sample size) - total number of people in groups
20
Q

Post-Hoc Comparisons after fact

A

Each mean is compared with each other mean, type I error is controlled by SPSS (Bonferonni)
Use if you get a significant F; it gives you more info on where the groups lie; and helps you know where that difference is.

21
Q

results Interpretation: (ANOVA)

A

State out what test you were performing, give the results and then interpret it.

22
Q

LINEAR REGRESSION

A

1) Regression: is a statistical prediction. It happens when data used on past events such variable correlations1 are applied to future events given the knowledge of only one variable.

23
Q

explain Prediction

A

using a set of previously collected data, to calculate how correlated the variables are with one another.

24
Q

prediction requirements

A

we need an already established correlation and we need to have data on one of the variables. The higher the coefficient, the more accurate the prediction is of one variable from the other based on that correlation.

25
Q

Purpose

A

Regression uses our knowledge of relationships between variables to predict the value of one variable from another

26
Q

regression line

ne

A

•(RG): reflects our best guess as to what score on the Y variable (dependent variable/when you will dye) would be predicted by a score on the X variable (independent variable/amount of cigarettes smoked). It is the line drawn based on the values in the regression equation
allows us our best guess at estimating

27
Q

What is another name for the regression line?

A

the line of best fit, because it minimizes the distance between each individual point and the regression line minimizes the error in prediction.

28
Q

What is an error in prediction

A

Directly related to correlation, it is the distance between each individual data. QuestionWhat would the line look like if the correlation was perfect?

29
Q

What is the role of the Regression equation?

A

It allows us to plot this line correctly.

The equation that defines the points and the line that are closes to the actual scores.

30
Q

Regression equation

A

Y’ = bX + a
a&b could be calculated from data
Y ‘= dependent variablethe predicted score or criterion
X = independent variablethe score being used as the predictor (you know this value)
b = the slope direction and “steepness” of the line
a = the interceptpoint at which the line crosses the y-axis
X Y X2 Y2 XY

31
Q

How to calculate the Slope and Intercept From your Data

A

B=∑ XY– [(∑ X* ∑Y)/n] / ∑X2 – [(∑ X)2/n]

A = ∑ Y – b(∑ X) / n

32
Q

How good is our prediction?

A
  • Look at the absolute value of the correlation our prediction is based on
  • Look at the difference between the predicted value (Y’) and the actual value (Y) from the data set.–>error of estimate
33
Q

What is an error of estimate?

A

the difference between the predicted value (Y’) and the actual value (Y) from the data set.

34
Q

What is a MULTIPLE REGRESSION?

A

Predicting an outcome from 2 or more variables rather than one.

35
Q

Multiple regression equation

A

Y’ = bX1 + bx2 + a
X1value for the first independent variable
X2value for the second independent variable
bregression weight for each variable

36
Q

Rules for the new independent variables

A
  • should be adding something unique to the equation
  • should be correlated to dependent variable (should share something in common)
  • The 2 (or more) variables should be independent from each other, but related to the outcome variable.
37
Q

What is a Chi Square?

A

non-parametric tests that allow you to determine if what you observe in a distribution of frequencies (i.e. rates, percentages, looking at fair allocation) is what would be expected by chance alone.

38
Q

Types of Chi-Square

A

One Sample vs. Two Sample Test: Number of Dimensions

i.e. average GPA and average GPA by college level

39
Q

how do you Compute a Chi Square

A

You can easily compute what you would expect by chance

Compares what is observed in the data by what would be expected by chance

40
Q

CHI SQUARE equation

A

X2 = ∑ (O-E)2/ E

∑=summation sign
X2 = Test statistic
O = Observed Frequency
E = Expected Frequency (By chance)

41
Q

STEPS TO COMPUTE Chi Square

A
  • Null and Research Hypothesis (Words and Symbols)
  • Compute the Test Statistic
  • Determine if it is significant or not
  • Write a 2-3 sentence results section, including a numerical X2 statement.
42
Q

Hypotheses

A

Null hypothesis H0: P1 = P2 = P3 percentage of occurrence in any one category
Research hypothesis  H1: P1  P2  P3states that there is a difference in the frequency/proportion of occurrences in each category

43
Q

How to Compute the Test Statistic?

A

Test Statistic
category Observed Expected Difference D squared D squared/E
Total Total=Obtained value

44
Q

Significant?

A

•Determine if it is significant or not
Degrees of freedom r-1 (r equals rows)
Obtained value exceeds the critical valuesignificantreject NULLaccept researchresults not due to chance
Obtained value below the critical valuenon-significantaccept NULLresults due to chance

45
Q

Write a 2-3 sentence results

A

In Chi square The Null hypothesis is we expected it to be chance
Do not say there will be equal; because it is already said use equal rates instead.
The research hypothesis  rates not due to chance

46
Q

numerical X2 statement.

A
x2(2) = 20.6, p < .05
•x2  represents the test statistic
•2 is the number of degrees of freedom
•20.6 is the obtained value
•p < .05 is the probability