Unit 2 Flashcards
Correlation Coefficient
r ranges from -1.0 to +1.0
-Sign indicates direction of association
-1.0 = Perfect, 0 = none
-.66 is considered high
-WK: Doesn’t imply causation
-STR: Predicts bhvr
Scatterplot
Graphic rep. of corr.
-Axes = units of the 2 Vs
-Visual that shows direction & magnitude of corr.
-Helps find outliers
Pearson’s Corr (P’s r)
Most used corr. measure
-Used w/ linear corr. & cont. data
Spearman’s Rho
Corr. calculated w/ rank-order data
-If rank between variables is similar, STR + corr.
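The two coefficients above can be sketched in plain Python; the study-hours/score data below is invented for illustration:

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson's r: for linear relationships between continuous variables."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on ranks (assumes no ties)."""
    rank = lambda v: [sorted(v).index(a) + 1 for a in v]
    return pearson_r(rank(x), rank(y))

hours = [1, 2, 3, 4, 5]       # hypothetical study hours
score = [52, 58, 63, 70, 71]  # hypothetical test scores
```

Because the ranks rise together perfectly here, rho comes out at exactly 1.0 even though r is slightly below 1.0.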
Restricted Rng
Scores are tightly clustered
-DEC corr. b/c of the ease of moving btwn ranks
-DEC variability
Regression
“Line of best fit” (regression line) makes predictions using corr. btwn 2 Vs
-Residual (Diff. btwn observed & predicted scores) stays at MIN
-STR of corr. = INC accuracy in predictions, DEC in error
-Observe 1 group and make predictions for another
-Tendency to overestimate relationship
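A minimal least-squares sketch of the "line of best fit" described above (all data invented); the slope and intercept are the ones that keep the squared residuals at a minimum:

```python
def fit_line(x, y):
    """Least-squares regression line y_hat = a + b*x (minimizes squared residuals)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / \
        sum((xi - mx) ** 2 for xi in x)
    a = my - b * mx
    return a, b

x = [1, 2, 3, 4]   # hypothetical predictor scores
y = [2, 4, 5, 8]   # hypothetical criterion scores
a, b = fit_line(x, y)
residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]  # observed - predicted
```

The residuals always sum to (essentially) zero; the stronger the corr., the smaller each individual residual.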
Standard Error of Est.
Gives margin-of-error info when estimating
Coefficient of Determination (CoD)
Correlation coefficient^2 (r^2)
Coefficient of Alienation
Measures non-association btwn Vs
-Square root of 1 - CoD
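Both quantities come straight from r; a toy calculation (the r = 0.80 is arbitrary):

```python
from math import sqrt

r = 0.80                    # hypothetical correlation
cod = r ** 2                # coefficient of determination: shared variance (~0.64)
alienation = sqrt(1 - cod)  # coefficient of alienation: non-association (~0.60)
```

So even an r of .80 leaves 36% of the variance unexplained.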
Shrinkage
Amount of DEC in predictive accuracy observed when the regression is applied to another group
Reliability
Consistency of scores
-Degree matters
-Key psychometric feat.
Rel. Coefficient
Corr. coefficient indicates rel.
Rel. (R) Tests
-Test-retest R.
-Alternate / Parallel forms R.
-Split-half / INT Consistency R.
-Inter-rater R.
Classical Test Theory
x = T + e
-x = Obt. score
-T = true score
-e = Random error
i.e., true score plus random error = obtained score
Systematic (NOT random) Error
Error that consistently affects scores in 1 direction (unlike random e)
Variance
Difference in scores from error & differences in ability
Sources of e (random error) in test scores
Test administration
-Test-taker Vs, ENVI, & administrator-related factors
Test construction
-Items used, item sampling/selection
Test scoring & interpretation
Different test = diff. e
Measurement of e & R
e DEC R & repeatability of psych test
Classical Theory
Unsystematic measurement e randomly influences
-Measurement e is random
-Mean e = 0
-True scores and e are not corr., rTe = 0
- e on different tests are not corr., r 12 = 0
Reliability & Classical T
Rel. = True score variance / Total variance
-Total variance = True score variance + e variance
-0 (coefficient) = Diff. due to e or chance
-1 = true difference
-0 or 1 = improbable
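The variance ratio above, with made-up variance components:

```python
true_var = 8.0   # variance from real differences in ability (hypothetical)
error_var = 2.0  # variance from random error (hypothetical)
reliability = true_var / (true_var + error_var)  # 0.8: 80% true difference
```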
Test-retest Method
1 test given to the same pple on 2 different occasions
-Shows corr. btwn 1st & 2nd scores for the same test
-H corr = stable test / has test-retest R.
-L corr. = Random e OR No rel., no stability
Things to Consider w/ Test-retest R.
Time btwn testing
-Usually days-wks depending on whether the V is expected to change
-Rapidly changing V = needs small time btwn tests, otherwise the person has changed
How much time is appropriate?
-Ability tests: Time needed to wear off practice & mem. effects
-practice effect = problem for academic & neuropsych settings
Test-retest = best when practice effects are absent/minimal
Alt./Parallel Reliability (Coefficient of Equivalence) Method
Two versions (V.a, V.b) of test given to 1 group of pple
-Group takes both tests
-V.b can be given immediately or w/ delay after V.a
Alt./Parallel Rel. Cont.
HIGH corr. btwn V.a & V.b = versions tap similar qualities/concepts
LOW corr. btwn V.a & V.b = concepts differ
-Maybe caused by item sampling or wording
-Use blueprint to prevent L corr.
STR & WK of Alt./Parallel R.
STR:
-DEC cheating/memory effect b/c subjects get different items w/ different form
*Practice effect possible b/c items on different forms are similar
WK:
-INC time, money, & effort to create new version of same test
Internal Consistency Reliability
Inter-corr. of items in same test
-Tests w/ heterogeneous items usually have low INT Consistency Rel.
INT consistency Rel. Method
1 test given to 1 group on 1 occasion
-Test is split in half, subtotals for each half are corr.
-Best: Odd-even, randomly, matched items
-Worst: 1st & 2nd half
Split-half
Yields corr. btwn two half tests, NOT rel. of full test
-A longer test w/ good-quality items is more rel. than a shorter test b/c it more fully samples the bhvr
*Is 1 pic representative of someone's appearance VS many pics?
Spearman-Brown Correction Formula
Estimates rel. of the full test by adjusting the split-half corr. UP
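A sketch of the standard Spearman-Brown formula (the 0.60 half-test corr. below is invented):

```python
def spearman_brown(r_half, n=2):
    """Projects rel. of a test n times as long; n=2 gives the full test
    from a split-half correlation."""
    return n * r_half / (1 + (n - 1) * r_half)
```

For example, `spearman_brown(0.60)` adjusts a .60 split-half corr. UP to .75 for the full-length test.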
Split-half Cont.
Many ways to do a split-half; how it's done affects the outcome
Coefficient Alpha
Mean of all possible split-half corr. of a test
-No correction needed
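In practice alpha is computed from item variances rather than by literally averaging every split; a minimal sketch of the standard formula (all scores invented):

```python
def cronbach_alpha(items):
    """alpha = k/(k-1) * (1 - sum(item variances) / variance of total scores).
    items: one list of scores per item, same people in the same order."""
    k = len(items)
    def var(v):  # population variance
        m = sum(v) / len(v)
        return sum((a - m) ** 2 for a in v) / len(v)
    totals = [sum(person) for person in zip(*items)]
    return k / (k - 1) * (1 - sum(var(i) for i in items) / var(totals))
```

For two perfectly correlated items the scores are fully consistent, so alpha comes out at 1.0.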
STR of HIGH INT Consistency
Items usually homogeneous, making scores easy to interpret
-W/ LOW INT Consistency, there is ambiguity & is harder to interpret
-Combine scores from many homogeneous subtests to measure complex variables (ex. INT)
Inter-rater (I-R) Rel. Method
2 Raters/scorers observe & assign scores to 1 group & calculate corr.
HIGH I-R rel. needs:
-Good operational def. of bhvr measured
-In depth training w/ feedback for rater
-Occasional refresher training
I-R Reliability Cont.
Low I-R Rel. may be from unstable characteristics
-Difference in sampling can DEC alt./parallel forms R
Factors that Affect Rel.
-Unstable characteristics affect test-retest R.
-Differences in item sampling of V.a & V.b affect Alt./Parallel Forms Rel.
-Heterogeneous items affect INT Consistency Rel.
-Restriction of Rng
*Scores clustered
*Too easy or hard tests
Understanding Rel. Coefficients
0.8-0.9+ is considered acceptable rel.
-0.8 R = 80% true difference, 0.2 = 20% random error
Item Response Theory (IRT)
Item characteristic curves (ICC), Relation of a personal trait w/ prob. of correctly scoring on a measure for said trait
-EX: Verbal ability & prob. of passing vocab test
-Look to notebook to understand how to read graph
*A is easiest, D is hardest, B&C moderately hard
Info Function
To what extent does an item discriminate among people?
-Certain items discriminate among those LOW on a trait
-Some items made to discriminate among those HIGH on a trait
*A in IRT tests those low
*D in IRT tests those high
Standard Error of Measurement (SEM)
e possible in a test
-Confidence intervals
-e is assumed random
-Rel DEC SEM
-68% chance the true score is within ±1 SEM of the obt score (and vice versa)
-SEM applied to score to interpret it
*SEM = SD * sqrt(1 - r)
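Plugging the formula into the 68% band above (the SD and reliability are hypothetical, IQ-style numbers):

```python
from math import sqrt

sd, r = 15.0, 0.91     # hypothetical scale SD & reliability coefficient
sem = sd * sqrt(1 - r) # standard error of measurement, ~4.5 here
obtained = 110
band_68 = (obtained - sem, obtained + sem)  # ~68% interval for the true score
```

Notice that as r rises toward 1.0, SEM shrinks toward 0 and the band tightens.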
Standard Error of the Difference
Error btwn 2 scores helps understand profile of results
Validity
How well a test measures what it's intended to
-How trustworthy is the conclusion drawn from test results?
-Info accumulates over time w/ clinical & rsch observations
The relationship btwn Rel. & Val.
R. DEC → Val. DEC
-More e
R. INC ≠ V. INC (high R. alone doesn't guarantee V.)
Cannot est. V w/out R.
Are there Valid tests?
No
-V. is population specific
-V needs to be documented for:
*Certain pop.
*Certain purpose
*Certain setting
-Name doesn’t matter, evidence does
Categories of Validity
Each has some overlap
-Content V.
-Criterion V.
-Construct V.
Evidence should be gathered at multiple points
Face V.
How relevant items are to laypersons
-Client understands why Q is being asked
*Type of socks you wear isn’t relevant to job interview at pizza place
-EX: Block design & Rorschach inkblot
Content V.
How well a test samples what it's trying to assess
-Relation btwn sample & Qs to be asked
-Hard to determine w/ poorly defined psych Vs
-Est. w/ agreement btwn 2 experts' ratings
-Watch for construct underrep. & Construct Irrelevant Diff.
Construct Underrep.
Test neglects to include key topics
-EX: If Uber driver didn’t have license
Construct Irrelevant Diff.
When test measures something irrelevant
-EX: Math test Qs reading comprehension
Criterion-Related-V (C-R-V)
How well a test measures IRL qualities/bhvr
-Criterion: Standard for eval. obt scores
-Has 2 subtypes: Concurrent & Predictive
False Positive & False Negative
F+: Test shows person has a quality they don’t
F-: Test shows person doesn’t have a quality they do have
ConCURRENT C-R-V
How well results reflect someone’s standing on a current IRL dimension
-EX: Dr.’s opinion & depression score
Predictive C-R-V
How well results predict someone's standing on an IRL dimension in the future
-EX: GPA
Using a Valid Test
Shortcut to gather info & save time, $, & effort
Validity Coefficient
Corr. coefficient indicating STR of relationships btwn test scores & criterion measure
-Rarely above 0.6
-EX: Corr. btwn depression score & Dr.'s rating
Criterion Contamination
Scorer for criterion also knows test scores
-Artificial corr. elevation
-EX: Prof. knows student’s GRE score & assigns grades
-Confirmation bias
Standard e of Estimate (SEE)
Stat indicating the degree of e for estimated scores
-Confidence interval of e
-High corr. btwn test & criterion, DEC SEE
Decision Theory
% of “hits” / true + & true - AND % of “misses” false + & false -
-Acceptable ratio dependent on nature of decision to be made
Construct
Unobservable, underlying, hypothesized trait of interest
-A lot of psych subjects involve these
Construct Validity
The extent to which a test adequately measures a theoretical construct/trait
-EX: What does watermelon sugar high?
Sources for evidence Construct Val.
Seven:
-Test homogeneity
-Appropriate Developmental Changes
-Theory-consistent Intervention Effects
-Theory-Consistent Group Differences
-Convergence Evidence
-Discriminant (Divergent) Evidence
-Factor Analysis
Test Homogeneity
INT consistency - How well a single trait is measured w/ test
Appropriate Dev. Changes
Expected changes in scores w/ age
-Principle of conservation
-EX: Grade lvl & reading score
Theory-Consistent Intervention Effects
Changes in scores in pre/post-test following known effective intervention
-EX: Known therapeutic intervention OR Bhvr changing w/ experience
Convergence Evidence
STR corr. btwn scores on new & older est. test on similar construct
-“og” log, hog BUT “int” hint, pint
Discriminant (Divergent) Evidence
Scores on measures of theoretically unrelated constructs should show no/low corr.
-EX: Dr. Skelly & Hawley bringing subs to the “all cookie” party
Factor Analysis
Data reduction tech. grouping items that have something in common
-Clusters of items (factors) are determined statistically; interpreting the factors is more subjective
-EX: Crockpot separates meats & veggies
Sensitivity
Accurately IDing patients w/ a particular disorder
Specificity
Accurately IDing patients w/out the disorder OR w/ a different one
-EX: Mini-mental exam screening for dementia in the elderly; cut-off set to ID those w/ the disorder and exclude those w/out it or w/ another
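Both rates fall out of a 2x2 table of test decisions vs. true status (all counts invented):

```python
# Screening outcomes vs. true diagnosis (hypothetical counts)
tp, fn = 45, 5   # w/ the disorder: correctly flagged vs. missed (false -)
tn, fp = 90, 10  # w/out it: correctly cleared vs. wrongly flagged (false +)

sensitivity = tp / (tp + fn)  # 0.9: hit rate among those WITH the disorder
specificity = tn / (tn + fp)  # 0.9: hit rate among those WITHOUT it
```

Moving the cut-off score trades one rate against the other, which is the hit/miss trade-off from the Decision Theory card.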
Extra Val. Concerns
Side effects & unintended consequences of testing
-EX: Going to therapy & having a reaction to being given the “batshit crazy” exam
*Value judgement & SOC consequences of tests
A Functional Perspective
Are actions of testing beneficial?
-Giving reading tests but not having a way to help
Test Utility
Practical concern for use of test - Is it useful?