Stats - Selecting Stats Tests Flashcards

1
Q

Best measure for central tendency with similar/close scores or values vs best measure for extreme scores or values

A

Mean is better with similar/close values
Median is better when there are extreme scores/values (i.e., $10,000,000 homes) or substantial percentages of maximum scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Idiographic

A

single subject design/approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Nomothetic

A

group approaches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s Normative data

A

data that can be compared both within & across subjects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ipsative data

A

results from a forced-choice format (can only describe relative strengths/interests within a subject & not used for comparison across subjects)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When conducting a ABAB design you should be concerned with:

A

failure of DV to return to baseline

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What statistical test has nominal data collected for one independent variable?

A

Chi-Square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What statistical test has nominal data collected for two independent variables?

A

Multiple Sample Chi-Square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What statistical test has interval or ratio data collected for one group of subjects?

A

T-Test for Single Sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What statistical test has interval or ratio data collected for two correlated groups of subjects?

A

T-Test for Matched or Correlated Samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What statistical test has interval or ratio data collected for two independent groups of subjects?

A

T-Test for Independent Samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What statistical test has interval or ratio data collected for more than two groups of subjects are compared to one independent variable

A

One-Way ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What concept refers to the number of possible variations in outcome that can be obtained

A

Degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What’s the formula to measure the degrees of freedom in a single sample chi-square?

A

df = # columns - 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What’s the formula to measure the degrees of freedom for multiple sample chi-square?

A

df = (# rows - 1) x (# columns - 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What formula measures the degrees of freedom for the T-Test for Single Subjects?

A

df = N (# of subjects) - 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What formula measures the degrees of freedom for the T-Test for Matched or Correlated Samples?

A

df = # pairs -1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

With One-Way ANOVA, what are the 3 different ways of calculating degrees of freedom?
1. total degrees of freedom
2. degrees of freedom between groups
3. degrees of freedom within groups

A
  1. df total = N-1
  2. df between groups = # groups - 1
  3. df within groups = df total - df between groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What’s the formula to calculate the expected frequency in a Chi-Square?

A

expected frequency = N (sample size) / total # of cells
ex: survey 200 people to see which genders (male or female = 2) vote for Democrat or republican (= 2)
200 / (2 x 2) = 200 / 4 = 50 (expected frequency)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the formula to calculate the expected frequency of a Chi-Square when data is given for each cell?

A

(Sum of Row x Sum of Column) / N (sample size)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What’s the risk of running multiple t-tests instead of one-way ANOVA? *will be on test

A

increases the probability of making a type 1 error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What’s the possibility of making a Type 1 error if alpha = 0.05 and you run 6 t-tests?

A

6 (t-tests) x 0.05 = 0.30

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What does it mean if the F-Ratio equals or approximates 1.0 vs if it’s above 2.0, what do they mean with significance?

A

1.0 = there is no significance
2.0 = there is significance (one group did better than the other group/stronger your results are)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

When the ANOVA is found to be significant, with there being differences between groups, it’s not clear which one of the groups is significantly different from which others; what type of testing is then required?

A

Post-Hoc Testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Which one of the post-hoc tests (Scheffe & Tukey vs Fisher’s LSD) provides the least protection from a Type II (2) error, and which offers the most protection?

A

Least protection: Scheffe & Tukey
Most protection: Fisher’s LSD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Which one of the post-hoc tests (Scheffe & Tukey vs Fisher’s LSD) provides the least protection from a Type I (1) error, and which offers the most protection?

A

Least protection: Fisher’s LSD
Most protection: Scheffe & Tukey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What’s the downside of Scheffe & Tukey protecting you from Type 1 Errors?

A

The chance of a Type II error increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What test do you use when groups are being compared on 2 independent variables (e.g., sex & treatment)

A

Two-way ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What test has the advantage of permitting analysis of each independent variable’s main effects (significance) but also allows analysis of interaction effects?

A

Two-way ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

When running a two-way ANOVA, why are 3 F-ratios obtained?

A

one for each of the 2 independent variables and then another for the interaction effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How do you know if there was a main effect of a two-way ANOVA when there’s a table?

A

Add up the totals of each column and see if they are the same or different numbers when comparing the totals. If they are different, then that means there was a main effect; if they are the same, there was no main effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Easy way to see if there were interaction effects with a two-way ANOVA table

A

Add up two squares that are diagonal from one another, then add up two other squares diagonal from each other; if the sum of each pair of diagonals is different, you can conclude there are interaction effects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

When do you use trend analysis as a follow-up to ANOVA, with linear or non-linear data?

A

To analyze non-linear data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Statistics that depict relationships between variables (X & Y) are termed ________, while statistics that predict are termed _________ or _________

A

relationships = correlations
predict = regressions or analyses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What are the two broad categories of tests of relationship & prediction (hint - variate) and how many variables (X &Y) are in each test

A

Bivariate: 2 variables (1 X & 1 Y)
Multi-variate: involves several Xs and one or more Ys

36
Q

Which variable (X & Y) is the predictor while the other is the criterion

A

X (Predictor): This is the independent variable that you believe influences or predicts changes in the other variable

Y (Criterion): This is the dependent variable, the one that is being predicted or explained by the predictor (the outcome)

e.g., studying the relationship between hours of study (X) and exam score (Y), you might predict the exam score (Y) based on the number of study hours (X)

37
Q

What does a correlation coefficient test do?

A

Sees whether there’s a relationship between X & Y

38
Q

A statistical test with the word regression or analysis means we are looking for…

A

a prediction/to predict something

39
Q

With a correlation coefficient, what does the negative - or the positive + tell us about the relationship between X & Y?

A

The direction

40
Q

With a correlation coefficient, what does the numerical value (between -1.0 & +1.0) tell us about the relationship between X & Y?

A

the strength

41
Q

With a correlation coefficient, what does a numerical value of 0 mean vs close value to either +1 or -1

A

0 = no relationship between the variables
close to +1 or -1 = perfect relationship between the variables

42
Q

What’s it called when you square the correlation coefficient?

A

Coefficient of determination

43
Q

What does the coefficient of determination represent? (e.g., the correlation between education & income = 0.50)

A

The amount of variability in Y that is shared with explained by or accounted for by X

e.g.,
- 0.50 (correlation coefficient squared = 0.25 (correlation of determination)
- 25% of the variability in income is explained by education
- this leaves 75% of the variability in income to be accounted for by other factors like motivation, connections, luck, etc.)

44
Q
  1. How is a simple linear regression (prediction) equation derived in a scatter plot?
  2. What does the computer use to calculate it?
A
  1. the line of best fit
  2. the least squares criterion
45
Q

The regression equation is Y= a + bX, what is the “a” and the “b”

A

a = constant/intercept which determines the height of the line
b = slope which determines the steepness of the line

46
Q

What are the 3 assumptions of a bivariate correlation, and what are they?

A
  1. a linear relationship between X & Y
    (you can put a line through the dots)
  2. homoscedasticity (similar spread of scores)
  3. unrestricted range of score (scores must go all the way from low to high)
47
Q

What calculates the correlation between X & Y when there’s a curvilinear relationship?

A

Eta
(old aunt Eta has a curved back)

48
Q

What type of data does X & Y need to be for a Pearson r statistical test?

A

Interval/Ratio Data

49
Q

What type of data does X & Y need to be for Spearman’s Rho or Kendall’s Tau statistical tests?

A

Ordinal or rank-ordered

50
Q

What two tests do you use if one of your variables is nominal (interval/ratio) and the other has a dichotomy (like gender)?

A

a Point-Biserial (true dichotomy) or a Bi-Serial (artificial dichotomy)

51
Q

You’re correlating a continuous variable and one dichotomous; which 2 statistical tests do you use?

A

a Point-Biserial (true dichotomy) or a Bi-Serial (artificial dichotomy)

52
Q

Trick to remembering that Bi-Serial goes with artificial dichotomy, vs Point-Biserial going with true dichotomy

A

Commercials saying “Buy cereal” and cereals are artificial food/flavours/sweeteners

53
Q

Which test (Phi vs Tetrachoric) goes with either two variables with true dichotomies or two with artificial dichotomies?

A

true dichotomies: Phi
artificial dichotomies: Tetrachoric

54
Q

Correlation concept to describe the most basic correlation with no extraneous variables affecting the relationship

A

Zero Order Correlation

55
Q

Correlation concept to describe when you examine the relationship between X & Y with the third variable removed that may be influencing or confounding X & Y

A

Partial or First-Order Correlation

56
Q

The correlation concept that describes when you examine the relationship between X & Y with the third variable removed from only one of the original variables

A

Part (Semipartial) Correlation

57
Q

What is a variable that influences the strength between the predictor and criterion?

A

Moderator variable

e.g., age would be the moderator variable with old age & smoking = stronger correlation vs young age & smoking = weaker correlation

58
Q

What is the variable that explains why there’s a relationship between the predictor and the criterion?

A

Mediator

e.g., if education explains the link between SES and smoking, when you control the mediator (education), then the relationship of X & Y becomes almost zero

59
Q

What test involves the correlation between two or more IVs (Xs) and one DV (Y), where Y is always interval or ratio data and at least 1 X is also interval or ratio data

A

Multiple R (or multiple correlation)

60
Q

How do we get the coefficient of multiple determination?

A

squaring R (multiple R score)

61
Q

What’s the concept/equation prediction equation to make a prediction when we have 2 or more Xs and one criterion?

A

Multiple regression equation
Y = a + b1X1 + b2X2 +b3X3

62
Q

Multicollinearity is a problem associated with what multivariate test and what does it mean

A
  • associated with multiple regression
  • it happens when the predictors are highly correlated with one another
63
Q

What is the ideal outcome with correlations in a multi-regression:
1. between the predictors (Xs)
2. between each predictor & the criterion (Y)

A
  1. low correlation between predictors
  2. moderate to high/strong correlation between each predictor and the criterion
64
Q

Is multiple regression a compensatory or non-compensatory technique?

A

Compensatory technique

65
Q

Difference between the 2 subtypes of multiple regression, stepwise regression & hierarchical regression

A
  1. Stepwise regression: the computer decides which variable to put in first, second, third, etc., based on the strength of the correlation
  2. Hierarchical regression: the researcher controls the analysis, adding in the order based on the theory the researcher is using
66
Q

Canonical R is the relationship between how many Xs and how many Ys?

A

two or more IV’s (Xs) and two or more DVs (Ys)

67
Q

The discriminant function analysis is used with how many Xs and how many Ys, and what does the Y variable have to be?

A

two or more predictors (Xs)
one criterion (Y) - a nominal (categorical)

68
Q

What type of analysis is used to predict nominal/categorical criterion (Y) based on nominal/categorical predictors (Xs)?

e.g., type of graduate school (X1) & sex (X2) predicting the likelihood of passing or failing the EPPP

A

Loglinear or logit analysis

69
Q

Path analysis and structural equation modelling are ways, using correlational techniques, are ways to determine _______ relationships between variables

A

Causal relationships

70
Q

Factor analysis and cluster analysis are tests of ________

A

Structure

71
Q

What are tests of structure for?

A

when a researcher is interested in discovering which variables in the set best fit together or form coherent subsets

72
Q

What test is used to reduce a large number of variables into a smaller number of factors?

A

Factor analysis

73
Q

What does an Eigenvalue (characteristic root) tell you in a factor analysis? and if it’s less than 1.0 what does that mean?

A

Eigenvalue tells you the strength of the factor
less than 1.0 is considered NOT significant

74
Q

The factor loadings of a factor analysis are interpreted and to make the factors more distinct and interpretable, what is used?

A

Factor rotation

75
Q

What two types of rotations are used?

A

Orthogonal and oblique

76
Q

With orthogonal vs oblique rotations in factor analysis, which one relates in factors with correlation or no correlation

A

Orthogonal = no correlation
Oblique = correlationn

77
Q

Difference between principal component analysis vs principle factor analysis

A

Principal component analysis: the researcher has no prior hypotheses

Principle factor analysis: there’s an underlying theoretical basis before the analysis

78
Q

What analysis is involved in gathering data on various dependent variables and statistically looking for naturally occurring subgroups in the data?

A

Cluster Analysis

79
Q

What test has the main assumption that there’s independence of observations?

A

Chi-Square Test

80
Q

What type of tests have these three assumptions: data needs to be interval/ratio, homoscedasticity must be present, and the data must be normally distributed?

A

Parametric tests

81
Q

Is a Chi-Square test parametric or non-parametric?

A

non-parametric

82
Q

What helps researchers make inferences about causation & includes LISREL as a type of it?

A

Structural Equation Modelling

83
Q

What can test out many causal pathways involving multiple predictors and criterion variables?

A

Structural Equation Modelling

84
Q

Which one controls carryover effects when repeated measures are used vs which one controls for the effects for testing (Solomon four group vs the latin square)

A

Solomon four group = controls for the effects of testing

Latin-square: controls carryover effects when repeated measures are used

85
Q

A researcher would use LISREL, a form of structural equation modelling, in an attempt to:
1. test a causal model of relationships among variables
2. derives a causal model of relationships among variable

A

TEST a causal model of relationships among variables