FINAL REVIEW Flashcards

1
Q

BIVARIATE TABLE

A

a table that displays the joint frequency distributions of 2 variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

CELLS

A

the cross classification categories of the variables in a big aria the table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

X^2 (CRITICAL)

A

the score on the sampling distribution of ALL possible sample chi squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

X^2 (OBTAINED)

A

the test statistic as computed from SAMPLE RESULTS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

CHI SQUARE TEST

A

a non-parametric test of hypothesis for variables that have been organized into a bivariate table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

COLUMN

A

the vertical dimension of a bivariate table
- each column represents a score on the INDEPENDENT VARIABLE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

EXPECTED FREQUENCY (fe)

A

the cell frequencies that’d be expected in a bivariate table if the variables were INDEPENDENT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

GOODNESS OF FIT TEST

A

an additional use for chi square that tests the significance of the distribution of a single variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

INDEPENDENCE

A

the NULL hypothesis in the chi square test
- 2 variables are interdependent if, for all cases, the classification of a case on one variable has NO EFFECT on the probability that the case will be classified in any particular category of the second variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

MARGINALS

A

the row and column subtotals in the bivariate table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

NONPARAMETRIC

A

a “distribution free” test
- these tests don’t assume a normal sampling distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

OBSERVED FREQUENCIES (fo)

A

the cell frequencies actually observed in a bivariate table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

ROW

A

the HORIZONTAL dimension of the bivariate table, conventionally representing a score on the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

THE DECISION TO REJECT THE NULL HYPOTHESIS IS NOT SPECIFIC

A
  • means that only ONE statement in the model OR the null hypothesis is WRONG
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

WHAT LEVEL CAN A CHI SQUARE TEST BE CONDUCTED AT

A

can be measured at the NOMINAL LEVEL, which is the lowest level of measurement
- because it is NONPARAMETRIC, chi square requires NO ASSUMPTION at all about the shape of the population or sampling distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

THE ____ CERTAIN WE ARE OF THE MODEL, THE _____ OUR CONFIDENCE THAT THE NULL HYPOTHESIS IS THE FAULTY ASSUMPTION

A

more, greater

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

A “_____” OR EASILY SATISFIED MODEL MEANS THAT OUT DECISION TO ______ THE NULL HYPOTHESIS CAN BE MADE WITH EVEN GREATER CERTAINTY

A

weak, reject

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

CHI SQUARES FLEXIBILITY

A

can be used and conducted with any variables at ANY LEVEL OF MEASUREMENT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

WHAT ARE A BIVARIATE TABLES TWO DIMENSIONS

A
  • the horizontal (across) dimension (ROWS)
  • the vertical (up and down) dimension (COLUMNS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

ROW = ________
COLUMN = ___________

A

dependent, independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

SUBTOTALS THAT ARE ADDED TO EACH COLUMN AND ROW IS CALLED _________

A

marginals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

WHAT IS REPORTED AT THE INTERSECTION OF THE ROW AND COLUMN MARGINALS

A

the total number of cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

ON A TABLE, WHAT VARIABLE IS LISTED FIRST

A

the dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

THE CONCEPT OF INDEPENDENCE

A

the relationship between the VARIABLES, not between SAMPLES

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
WHAT IS THE NULL HYPOTHESIS FOR CHI SQUARE
that the variables are INDEPENDENT
26
IF THE NULL HYPOTHESIS IS _____ AND THE VARIABLES ARE _________, THEN THERE SHOULD BE _________ DIFFERENCE BETWEEN THE EXPECTED AND OBSERVED FREQUENCIES
true, independent, little
27
IF THE NULL HYPOTHESIS IS ______, THERE SHOULD BE _______ DIFFERENCES BETWEEN THE EXPECTED AND OBSERVED FREQUENCIES
false, large
28
THE GREATER THE ________ BETWEEN EXPECTED AND OBSERVED FREQUENCIES, THE _____ LIKELY THAT THE VARIABLES ARE ______ AND _____ LIKELY THAT WE WILL BE ABLE TO _______ THE NULL HYPOTHESIS
differences, less, independent, more, reject
29
X^2(OBTAINED) = Σ (fo - fe) ^2 / fe
calculation of chi square
30
FE = ROW MARGINAL X COLUMN MARGINAL / N
expected frequency formula of each cell
31
STEP ONE : MAKING ASSUMPTIONS AND MEETING TEST REQUIREMENTS
model : independent random samples - level of measurement is NOMINAL
32
STEP TWO : STATING THE NULL HYPOTHESIS
ho : the two variables are INDEPENDENT (h1 : the two variables are DEPENDENT)
33
STEP THREE : SELECTING THE SAMPLING DISTRIBUTION AND ESTABLISHING THE CRITICAL REGION
- the sampling distribution of sample chi squares are POSITIVELY SKEWED, with higher values of sample chi squares in the upper tail of the distribution - the critical region is established in the UPPER TAIL OF THE SAMPLING DISTRIBUTION
34
DF = (R-1)(C-1)
degrees of freedom formula
35
STEP FOUR : COMPUTING THE TEST STATISTIC
x^2 (obtained) = Σ (fo - fe) ^2 / fe
36
CHI SQUARE TEST OF STATISTICAL SIGNIFICANCE
tests the null hypothesis that the variables are INDEPENDENT in the population
37
IF WE _______ THE NULL HYPOTHESIS, WE ARE CONCLUDING, WITH A KNOWN PROBABILITY OF ERROR (determined by alpha level), THAT THE VARIABLES ARE ________ ON EACH OTHER IN THE POPULATION
reject, dependent
38
A _____ SMALL SAMPLE SIZE IS DEFINED AS ONE WHERE A _____ PERCENTAGE OF THE CELLS HAVE EXPECTED FREQUENCIES OF 5 OR LESS
small, high
39
CONSIDERING TWO VARIABLES SIMULTANEOUSLY
relationship between variables
40
RELATIONSHIP
2 variables are related if the distribution of cases in (or among) the values (or categories) of one variable differs depending on which value (or category) of the other variable is considered
41
WHEN IS THERE A RELATIONSHIP
when there are two variables - never when there’s one, three, four, five, etc
42
RELATIONSHIP BETWEEN
the distributions differ
43
NO RELATIONSHIP BETWEEN
distributions don’t differ
44
NULL
no relationship
45
OBSERVED FREQUENCIES
all we have is the null hypothesis of relationships - shows some coordination
46
EXPECTED FREQUENCIES
no coordination - “what we think is in the world”
47
X^2 (OBTAINED) = Σ (Fo - Fe)^2 / Fe
formula for x^2 - “how far is what i’m observing to what i expect”
48
BECOMES MORE NORMAL = __________
becomes a sampling distribution
49
PHI Φ
a chi square based measure of association - appropriate for nominally measured variables that have been organized into a 2x2 bivariate the table
50
MEASURES OF ASSOCIATION ARE DESCRIPTIVE STATISTICS THAT
summarize the overall strength of the association between 2 variables
51
WHATS COMPUTED TO MEASURE THE STRENGTH OF THE ASSOCIATION
phi Φ
52
Φ = √ X^2 / N
formula for phi Φ
53
MEASURES OF ASSOCIATION
number that talks about the extent to the measures
54
DISTRIBUTIONS NOT DIFFERING
0.0
55
DISTRIBUTIONS DIFFERING AS MUCH AS POSSIBLE
1.0
56
CORRELATIONS NEAR 0
weak
57
CORRELATIONS NEAR 1
stronger
58
LINEAR RELATIONSHIP
a relationship between 2 variables in which the observation points (dots) in the scatter gram can be approximated with a straight line
59
REGRESSION LINE
the simple, best fitting straight line that summarizes the relationship between 2 variables
60
SCATTERGRAM
graphic display device that depicts the relationship between 2 variables
61
SLOPE (b)
the amount of change in one variable per unit change in the other
62
TOTAL VARIATION
the spread of the Y scores around the mean of Y - equal to Σ(Yi - Ybar)^2
63
UNEXPLAINED VARIATION
the proportion of the total variation in Y that’s NOT accounted for by X
64
Y INTERCEPT (a)
the point where the regression line crosses the y axis
65
THE STATISTICAL TECHNIQUES OF CORRELATION AND REGRESSION ARE MORE APPROPRIATELY USED WITH HIGH QUALITY, PRECISELY MEASURED VARIABLES AT THE _____ ______ ________
interval ratio level
66
WHICH SCORE IS ARRAYED ALONG THE HORIZONTAL AXIS
independent (X) value
67
WHICH SCARES ARE ALONG THE VERTICAL AXIS
dependent (Y) variables
68
2 REASONS WHY SCATTERGRAMS ARE USED
1. provide at least impressionistic information about the existence, strength, and direction of the relationship of linearity 2. the scattergram can be used to predict the score of. case on one variable from the score of that case on the other variable
69
WHERE WOULD ALL DOTS LIE IN A PERFECT ASSOCIATION
all dots would lie on the regression line
70
THE ________ OF THE BIVARIATE ASSOCIATION CAN BE JUDGED BY OBSERVING THE SPREAD OF DOTS AROUND THE _______ ______
spread, regression line
71
AS ___ INCREASES, ___ ALSO INCREASES
X, Y
72
IF THE RELATIONSHIP HAD BEEN NEGATIVE, THE REGRESSION LINE WOULD HAVE SLOPED IN THE _________ DIRECTION TO INDICATE THAT ______ SCORES ON ONE VARIABLE WERE ASSOCIATED WITH ____ SCORES ON THE OTHER
opposite, high, low
73
THE OBSERVATION POINTS OR DOTS ON A SCATTERGRAM ___________.
must form a pattern that can be approximated with a straight line
74
IF THE RELATIONSHIP IS NON LINEAR, YOU MIGHT NEED TO TREAT THE VARIABLES AS IF THEY WERE _______ RATHER THAN _______ IN LEVEL OF MEASUREMENT
ordinal, interval ratio
75
THE MEAN OF ANY DISTRIBUTION OF SCORES IS THE POINT AROUND WHICH _____________
the variation of the scores, as measured by squared deviations, is minimized
76
Σ(Xi - XBar) ^2
variance of x
77
IF THE REGRESSION LINE IS DRAWN SO THAT IT TOUCHES EACH _____________, IT WOULD BE THE STRAIGHT LINE THAT COMES AS CLOSE AS POSSIBLE TO ALL THE SCORES
conditional mean of Y
78
CONDITIONAL MEANS ARE FOUND BY
summing all Y values for each value of X and then dividing by all numbers of the cases
79
THE Y INTERCEPT, OR THE POINT WHERE THE REGRESSION LINE CROSSES THE Y AXIS
a
80
THE SLOPE OF THE REGRESSION LINE, OR THE AMOUNT OF CHANGE PRODUCED IN Y BY A UNIT CHANGE IN X
b
81
SCORE OF INDEPENDENT VARIABLE
x
82
THE POINT AT WHICH THE REGRESSION LINE CROSSES THE VERTICAL, OR Y, AXIS
y intercept (a)
83
THE LEAST SQUARES REGRESSION LINE IS THE AMOUNT OF CHANGE PRODUCED IN THE DEPENDENT VARIABLE (Y) BY A UNIT CHNGW IN THE INDEPENDENT VARIABLE (X)
slope (b)
84
IF THE VARIABLES HAVE A ______ ASSOCIATION, THEN CHANGES IN THE VALUE OF X WILL BE ACCOMPANIED BY SUBSTANTIAL CHANGES IN THE VALUE OF Y, AND THE SLOPE (b) WILL HAVE A _________ VALUE
strong, high
85
THE ________ THE EFFECT OF X ON Y, THE _______ THE VALUE OF THE SLOPE (b)
weaker, lower
86
IF THE TWO VARIABLES ARE UNRELATED, THE LEAST SQUARES REGRESSION LINE WOULD BE PARALLEL TO THE X AXIS, AND SLOPE (b) WOULD BE
0.0, line would have NO slope
87
B = COV(X,Y) / VAR (X)
formula for slope (b)
88
VERTICAL ABOVE MEAN
x
89
DEVIATIONS FROM THE MEAN
“how far are my dispersions from the mean?”