Class Notes Flashcards

1
Q

if one level of IV is assigned via randomization

A

considered experimental – as consumer be keenly aware of which level is randomized – e.g., randomized on tx but not gender – 2 x 2 factorial design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

any time you adminster the same measure to the same participants (to collect data on DV) at a later date (at any point – could be 5 minutes apart)

A

Repeated measures design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

subsumed within repeated measures – generally speaking, typically more than 20 observations (e.g., clinical trials) no longer need 20 for statisical methods to hold (so need at least 10) – tend to occur in fixed intervals depends on what you’re collecting data on – logorythmic data collection - time period continues to get bigger (exponentially) between observations

A

Time-series design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Time-series design

A

subsumed within repeated measures – generally speaking, typically more than 20 observations (e.g., clinical trials) no longer need 20 for statisical methods to hold (so need at least 10) – tend to occur in fixed intervals depends on what you’re collecting data on – logorythmic data collection - time period continues to get bigger (exponentially) between observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Difference between repeated measures and time series design

A

number of observations (collected data on DVs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In non-equivalent group design one should

A

control with pre-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Threats that concern ______ relate to statistical conclusion validity

A

integrity of treatment itself

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Threats that concern ______ relate to internal validity

A

making comparisons between tx groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does randomization do?

A

“Ensure” that participant groups are equal prior to treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does randomization not do?

A

Ensure anything that happens after treatment – possibility of history effects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Important issues to look for in time series designs

A
  • change in intercept (level)
  • slope
  • stability of effect (continuous or discontinuous effect)
  • delayed vs immediate effect (instantaneous vs delayed – when you see effects taking place)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Standard error is based on

A

sample of samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What type of research is meta-analysis?

A

Ex-post facto

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Assessment is defined as

A
  • Overarching, sampling behavior

* In contrast to research when we assess people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Measurement is defined as

A
  • Establishing quantitative rules for assigning numbers to represent attributes of persons
    • Attributes of people, not to people
    • Distinction between observations and inferences
    • Must think: How representative is this of behaviors outside of this context?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Test, scales, and measures are defined as

A

Objective, quantitative measurement using standardized procedures; psychometric properties of scores essential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are Rating protocols?

A

Taxonomies, classification and rating systems done by an observer (usually)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is Evaluation?

A

Assessing the congruency between what is expected and actually occurs (formal to informal, may be quantitative) Chen, 1990

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is Clinical assessment?

A

Less formal, typically not fully standardized or quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is a Scale?

A
  • often used interchangeably (not always) with measure, questionnaire or test
  • Some say questionnaire is less formal
  • Assumed to be assessing a single construct or domain
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a construct?

A

Trait, domain, ability, latent variable, theta 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is theta?

A

Item Response Theory (IRT) uses this to talk about the construct itself – latent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the differing types of item responses?

A
  1. Dichotomous
  2. Polytomuos
  3. Graded responses
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are dichotomous item responses?

A

Two levels (true or false, yes or no, etc.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are polytomous item responses?

A

Three or more levels, often ordered but not always

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are graded item responses?

A

More than 2 ordered response options

All graded responses are polytomous but not all polytomous items are graded items

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is Classical Test Theory (CTT)?

A
  • Total sums of squares partitioned into true score variance vs error score variance
  • Partitioning variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is Modern Test Theory = Item Response Theory (IRT)?

A

Has to do with probability

  • What’s the probability that someone will respond in a certain way?
  • Not only assigning where the individual is on the construct
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is Standard Error of Measurement (SEM or SEm) ?

A
  • Estimate of extent to which an observed score deviates from true score
  • Create confidence interval
  • Probability that an individual’s true score lies within a range
30
Q

What is reliability?

A
  • How consistently does a scale measure what it is designed to measure?
31
Q

What are types of reliability?

A
  1. Test-retest
  2. Parallel forms
  3. Split halves
  4. Internal consistency
  5. Inter-rater
32
Q

Parallel forms

A
  • Administer once in different forms and see how they correlate
33
Q

Split-halves

A
  • See how one half of test correlates to other half within each participant
  • But need twice as many items (think statistical power and N)
34
Q

Three types of validity

A
  1. Content
  2. Criterion
  3. Construct
35
Q

What is content validity?

A
  • Extent to which items are representative of or sampled from content domain being measured
  • Most important in tests in classes
36
Q

What are limitations of content validity?

A
  • Often non-statistical approach is used – ask panel of experts
  • Often relies upon face validity - does this look like it tests what it’s supposed to test
37
Q

What are two types of criterion validity?

A
  1. Predictive

2. Concurrent

38
Q

What is predictive validity?

A

Before and after – look at outcomes

39
Q

What is concurrent validity?

A

Measure all items at same time

40
Q

What are limitations of concurrent validity?

A
  • Often use 1-item scales
  • Lack validity and reliability
  • Extremely limited variance
41
Q

What is construct validity?

A
  • How well scores measure a specific trait or construct

* Requires a priori specification and operationalization of the construct

42
Q

Steps in developing CTT test construction

A
  1. Qualitative research
  2. Explicate theorizing
  3. Define constructs
  4. Generate item pool
  5. Content validity study
  6. Derivation sample
  7. Cross-validation study
  8. Subsquent studies testing evidence of construct validity
43
Q

What are limitations of CTT?

A
  1. Cannot change anything about the scale
  2. Standard error of measurement is presumed constant across levels of the construct, items, and scores
  3. Statistics are sample dependent
44
Q

What are main characteristics of IRT?

A
  1. Probablisitc
  2. Statistical mathematical theory
  3. Scaling items as well as people on latent trait
  4. Wide variety of IRT models (dichotomous–more common– and polytomous)
45
Q

What are basic assumptions of IRT?

A
  1. Well-known constructs and well-established tests
    a. know dimensionality
    b. know validity
    c. know what’s correct and incorrect
  2. extensive item banks and data banks
  3. large samples (>3000)
46
Q

In contrast to assumptions of IRT, what is typical of CPY instrumentation?

A
  1. New scales
  2. Constructs are not well known or defined
    a. confounds
    b. unknown dimensionality
    c. ordered rating scale data (likert)
  3. non-existent item pool; no data banks
  4. small sample sizes (in best cases n = ~300-400)
47
Q

What is measurement invariance?

A

Cross-cultural applications of tests, scales, and measures

48
Q

4 questions to ask regarding measurement invariance

A

To what extent…

  1. Can a construct be conceptualized equivalently across cultures?
  2. is the same construct being measured equivalently across cultures?
  3. can mean scores be compared equivalently across cultures?
  4. can measures of association (correlation) be compared equivalently across cultures?)
49
Q

Multiple regression

A
  • Find lines that best fit the data instead of planes (as above)
  • Linear composite of IVs to best explain DVs
  • Combination of weights on IVs constitutes linear composites
49
Q

Confirmatory Factor Analysis (CFA)

A
  • forces the data into this model then assess how well the model fits the data
  • Also assess how scales correlate with one another (Orthagonal or Oblique - allows correlation)
50
Q

Exploratory Factor Analysis (EFA)

A
  • Misapplication of statistical procedure for measurement development
  • Analogous to doing “atheoretical” research
  • No constraints on data
  • See how many factors come out (“sem-magical”)
  • interpret the factors that come out based on the data that they’re based on
  • Do a CFA because we have implicit theorizing
51
Q

Things to look for in articles relating to measurement

A
  1. Look for ceiling/floor effects (Determine possible range for scale or subscale. How many items, what’s the rating scale? If mean + or - 1 SD is highly skewed – No longer have adequate prediction of probability of error when you have skewed scores)
  2. Type 1 and Type 2 error rates are not protected
    * “I adapted this test”
    * Changed the rating scale, items, instructions, order, etc. – changed anything
    * Computing cronbach’s alpha is insufficient to protect against these threats
    * Different rating scales for same scale/sub-scale
52
Q

Analysis of variance and multiple regression

A

are identical!

53
Q

Multiple regression =

A

1 DV, always univariate

54
Q

Multivariate =

A

multiple IVs and multiple DVs

55
Q

Multiple regression in linear model

A

y = a + b1x1+b2x2+…bkXk + e

56
Q

Multi-collinearity

A

cannot interpet the overlap of explained variance between variables in multiple regression

57
Q

Part correlation

A

unique variance explained by one predictor in the model, controlling for other predictors in the model

58
Q

If summing all zero order correlations and the value exceeds 1.0…

A

by definition you have profound multicolliniarity (x1 and x2 are highly correlated)

59
Q

Moderation

A
  • Answers for whom or when does this relation apply?
  • Affects magnitude and/or strength of relation
  • Easier to think about as high low, but better to use continuous
60
Q

Interactions refer to moderation or mediation effects?

A

Moderator

61
Q

If interaction term is significant…

A

main effects are meaningless

62
Q

Mediation

A

*How or why a relation exists

63
Q

Full mediation

A

Mediator variable completely accounts for relation between IV and DV

64
Q

Partial mediation

A

Explains much of, but not all, of the relation between IV and DV (IV and DV path alone still exists but is nearer to non-significance)

65
Q

MAXMINICON

A
  1. Maximize experimental variance
  2. Minimize error variance
  3. Control extraneous variance
66
Q

MSH = SSH/dfH

A

Mean square variance (want to increase)

67
Q

MSE = SSE/dfe

A

Mean square error (want to decrease)

68
Q

Maximize experimental variance by

A
  1. Ensure maximum variability in Y due to X

2. Make treatments as different as possible, but realistic

69
Q

Control extraneous variance a priori by

A
  1. Homogenize on the confounding variable (restriction of range)
  2. Match participants on all relevant conditions
  3. Randomly assign participants to treatment conditions
70
Q

Control extraneous variance a priori or post hoc by

A
  1. Build a blocking variable into the design to control the confound
  2. Covary any confounding variable (analysis of covariance)
71
Q

Minimize error variance by

A
  1. Block on any variable that is related to the DV but not related to the IV
  2. Covary on any variable that is related to the DV but not related to the IV
  3. Maximize the reliability of the measures used ( rtt)
  4. Increase the sample size (error and statistical power are a function of N)
  5. Use repeated measures designs instead of between groups designs