Prelim 2 prep Flashcards
What are the differences between True Score Theory, Generalizability Theory, and Item Response Theory?
What is the standard error of measurement?
What are confidence intervals and what do they tell us?
When confidence intervals increase in terms of percentage (i.e., 90% vs 95%), what does that do
to the range of scores it comprises?
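A minimal sketch of how the SEM and score confidence intervals relate, assuming the standard formula SEM = SD * sqrt(1 - reliability) and hypothetical test values (SD = 15, reliability = .90, observed score = 110):

```python
import math

def sem(sd, reliability):
    """Standard error of measurement: the SD of the error distribution
    around an observed score."""
    return sd * math.sqrt(1 - reliability)

def confidence_interval(score, sd, reliability, z):
    """Observed score +/- z * SEM."""
    e = z * sem(sd, reliability)
    return (score - e, score + e)

# Hypothetical IQ-style test: SD = 15, reliability = .90, observed score 110.
lo90, hi90 = confidence_interval(110, 15, 0.90, 1.645)  # 90% CI
lo95, hi95 = confidence_interval(110, 15, 0.90, 1.96)   # 95% CI
# The 95% interval is wider than the 90% one: higher confidence
# requires a larger range of scores.
```

This answers the question above directly: raising the confidence level (90% to 95%) widens the interval, because a larger z multiplier is applied to the same SEM.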
What does it mean if a test is valid?
What are the three main categories of validity?
What is: content validity, criterion-related validity, construct validity, ecological validity, external
validity, face validity?
What is the content validity ratio and how is it used to determine content validity of test items?
If more than half of a panel of experts rates an item as essential, the item has some content validity. CVR is positive when more than half say "essential," 0 when exactly half do, and negative when fewer than half do.
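Lawshe's CVR formula can be sketched as follows (panel sizes here are hypothetical):

```python
def content_validity_ratio(n_essential, n_panelists):
    """Lawshe's CVR: (n_e - N/2) / (N/2).
    +1 when all panelists rate the item essential, 0 when exactly
    half do, negative when fewer than half do."""
    half = n_panelists / 2
    return (n_essential - half) / half

# 8 of 10 panelists rate an item "essential":
content_validity_ratio(8, 10)   # 0.6
content_validity_ratio(5, 10)   # 0.0  (exactly half)
content_validity_ratio(3, 10)   # -0.4 (fewer than half)
```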
Name the three characteristics of a criterion
Relevant, valid, uncontaminated
What does it mean for a criterion to be uncontaminated?
The criterion must be independent of the test itself: e.g., an independent group of raters decides who performs well and who doesn't, and those ratings are then correlated with test scores. Contamination occurs when the criterion is itself based, in part, on the predictor test.
Define concurrent validity and predictive validity
Concurrent- degree to which a test score is related to some criterion measure obtained at the same time
Predictive- degree to which a test score predicts a criterion measure
What are false negatives, false positives, specificity, and sensitivity?
False negative- test indicates someone doesn't possess a trait when they actually do
False positive- test indicates someone possesses a trait when they actually don't
Specificity- a perfectly specific test never flags someone as having the trait when they don't (no false positives)
Sensitivity- a perfectly sensitive test identifies everyone who actually has the trait (no false negatives)
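A small sketch of computing the two indices from hypothetical screening counts:

```python
def sensitivity(true_pos, false_neg):
    """Proportion of people who have the trait that the test correctly flags."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Proportion of people without the trait that the test correctly clears."""
    return true_neg / (true_neg + false_pos)

# Hypothetical screening results: 40 true positives, 10 false negatives,
# 85 true negatives, 15 false positives.
sensitivity(40, 10)   # 0.8  -> 20% of trait-positive people are missed
specificity(85, 15)   # 0.85 -> 15% of trait-negative people are wrongly flagged
```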
What is incremental validity and what would be proof of its existence?
Extent to which adding a second or third predictor gives more information about a criterion
Evidence would be a meaningful increase in predictive power (e.g., a larger validity coefficient or R-squared) when the additional predictor is added to the existing predictors.
What is construct validity?
Extent to which a test measures a construct we are examining
Name and describe the several ways in which you can find evidence for construct validity.
- Test is homogeneous (items hang together, measuring a single construct)
- Test scores change with age, as the theory of the construct predicts
- Test scores change with experience or intervention (pretest-posttest changes)
- Distinct groups score differently (method of contrasted groups)
- Convergent evidence: scores correlate with other tests measuring the same construct
What is the difference between convergent and concurrent validity?
What is a factor analysis and how does an exploratory factor analysis differ from a confirmatory
one?
Factor analysis: a class of statistical procedures for identifying the underlying factors (dimensions) that account for the correlations among items or test scores.
Exploratory (EFA)- estimating or extracting factors, deciding how many to retain, and rotating them to an interpretable orientation, with no structure specified in advance
Confirmatory (CFA)- testing the degree to which a hypothesized factor model fits the data
Name and define the different types of rating error that can occur
Leniency error- the rater tends to rate everyone too generously
Severity error- the opposite; the rater rates everyone too harshly
Central tendency error- the rater avoids the extreme ends of the scale
Halo effect- a favorable overall impression of a person inflates the ratings the rater gives on all dimensions
What is test utility?
Usefulness or practical value of testing; whether using a test in a particular situation improves efficiency and helps us make better decisions
What are some of the costs of administering a test, and what are some costs of NOT
administering one?
Administering:
- buying
- supply of blank test protocols
- computer program to score the test
- paying to score the test
- hiring people to administer the test
- costs of doing business
Not administering:
- loss of confidence in the organization (an ultimate, reputational cost)
- missing a child abuser
- failing to diagnose a condition (e.g., when someone underreports symptoms in an interview)
Keep in mind the real-life example I discussed about how to think about the cost of testing when
doing evaluations.
????
What are the Taylor-Russell tables used for, and what three variables are considered when using them to decide if giving a test is "worth it"?
- They estimate the proportion of selected people who will be successful, given: the test's validity coefficient, the selection ratio (proportion of applicants hired), and the base rate (proportion who would succeed without the test)
Be able to name a few other tables (i.e., Naylor-Shine) and have a basic sense of how they work
(they could be multiple-choice option for instance)
Naylor-Shine tables give the expected increase in the mean criterion score of those selected, based on the test's validity coefficient and the selection ratio (no base rate needed). Expectancy tables/charts show the likelihood that someone in a given score range will reach a criterion level.
Name some different ways cut scores are determined.
Common methods include: the Angoff method (experts estimate the probability that a minimally competent person answers each item correctly), the known/contrasted groups method (set the cut where the score distributions of qualified and unqualified groups intersect), norm-referenced (relative) methods, and IRT-based methods.
What’s the difference between a fixed and relative cut score?
Relative (norm-referenced)- the actual score needed to meet the criterion changes with the group's performance (e.g., top 10%)
Fixed (absolute)- the required score is always the same, regardless of the group
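A hedged illustration of the difference, using a hypothetical top-half selection rule:

```python
def relative_cut(scores, top_fraction):
    """Relative cut score: pass the top `top_fraction` of examinees,
    so the required score shifts with each group's performance."""
    ranked = sorted(scores, reverse=True)
    k = max(1, int(len(ranked) * top_fraction))
    return ranked[k - 1]

FIXED_CUT = 70  # fixed cut score: always 70, regardless of the group

weak_group = [50, 55, 60, 65, 72, 80]
strong_group = [70, 75, 80, 85, 90, 95]
relative_cut(weak_group, 0.5)    # 65 -> top half of a weaker group
relative_cut(strong_group, 0.5)  # 85 -> same rule, higher bar
```

With the fixed cut of 70, only one person in the weak group passes but everyone in the strong group does; the relative cut passes the same fraction of each group.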
What is pilot work and why is it used?
Preliminary research surrounding the creation of a test prototype; used to experiment with and refine test items before the final version
Name some ways in which scales can be categorized or graded.
Age based
Grade based
Unidimensional vs. multidimensional
Categorical vs. dimensional
What are some scaling methods – and remember, they can overlap – so a categorical scale can
be graded “summatively,” etc.
Rating scales
Summative (Likert) scaling
Method of paired comparisons
Sorting tasks (comparative and categorical)
Categorical scaling
Guttman scale (items range from weaker to stronger expressions of the attitude, so agreeing with a stronger item implies agreement with all weaker ones)
What is the empirical vs analytical way of writing test items?
Analytical- write test questions you think will measure the qualities you want to measure
Empirical- find people with a problem, ask different types of questions, see how they respond
Why would we want to find seemingly arbitrary items for distinguishing one group from another (in other words, non-face-valid ones)?
Non-face-valid, empirically keyed items are harder for test takers to fake, and an item can reliably distinguish groups even when the reason it does so is not obvious (the logic behind empirically keyed inventories such as the MMPI).
Name some different ways items can be formatted.
Selected response
Constructed response
Computerized adaptive testing (with item branching)
What is computerized adaptive testing, and how does item branching work?
Computerized adaptive testing: an interactive, computer-administered test in which the items presented are tailored to the test taker's estimated ability level.
Item branching: the next item (or set of items) presented depends on the response to the previous one, so the test branches to harder material after correct answers and to easier material after incorrect ones.
Name and describe a few different ways in which items can be scored
Cumulative scoring- the higher the score, the more of the trait; points accumulate across items
Class/category scoring- responses place the test taker in a particular class or category (e.g., diagnostic)
Ipsative scoring- comparing a test taker's score on one scale with their own scores on other scales within the same test (intra-individual rather than normative)
What are the following: item-difficulty index, item endorsement index, item reliability index,
item discrimination index
Item-difficulty index: the proportion of test takers who answer the item correctly (higher values = easier item).
Item-endorsement index: the analogous statistic for tests without right/wrong answers; the proportion of test takers who endorse the item.
Item-reliability index: an indication of an item's internal consistency; the item's standard deviation multiplied by its item-total correlation.
Item-discrimination index (d): how well an item separates high from low scorers; the proportion answering correctly in the upper-scoring group minus the proportion in the lower-scoring group.
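A minimal sketch of how the difficulty and discrimination indices are typically computed, assuming dichotomous (0/1) item responses and hypothetical data:

```python
def item_difficulty(responses):
    """Item-difficulty (or, for non-ability tests, item-endorsement) index:
    the proportion of test takers answering correctly / endorsing the item."""
    return sum(responses) / len(responses)

def item_discrimination(upper_responses, lower_responses):
    """Discrimination index d: proportion correct in the high-scoring group
    minus proportion correct in the low-scoring group."""
    return item_difficulty(upper_responses) - item_difficulty(lower_responses)

# Hypothetical 0/1 responses to one item:
item_difficulty([1, 1, 1, 0, 1, 0, 1, 0])        # 0.625
item_discrimination([1, 1, 1, 1], [1, 0, 0, 0])  # 0.75 -> item separates groups well
```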
How are item characteristic curves useful?
An item characteristic curve graphs the probability of answering an item correctly (or endorsing it) as a function of the underlying ability or trait level. The curve's location shows the item's difficulty, its slope shows its discrimination, and flat or non-monotonic curves flag items that aren't working as intended.
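As a hedged sketch, the common two-parameter logistic (2PL) form of an ICC from item response theory can be computed like this (the parameter values are hypothetical):

```python
import math

def icc_2pl(theta, a, b):
    """Two-parameter-logistic item characteristic curve: probability of a
    correct response as a function of ability (theta), with discrimination
    parameter a and difficulty parameter b."""
    return 1 / (1 + math.exp(-a * (theta - b)))

# At theta == b the probability is .5; a steeper curve (larger a)
# discriminates better around that point.
icc_2pl(0.0, a=1.5, b=0.0)   # 0.5
icc_2pl(1.0, a=1.5, b=0.0)   # higher ability -> higher probability
icc_2pl(-1.0, a=1.5, b=0.0)  # lower ability -> lower probability
```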