Measurement Theory and Assessment 1 Flashcards

1
Q

Psychometrician

A

Specialist in psychology or education who develops and evaluates psychological tests

2
Q

Test

A

A standardised measure for sampling behaviour which describes it using categories or scores

3
Q

Characteristics of a test

A
  • standardised procedure
  • for a specific sample of behaviour
  • uses scores or categories
  • uses norms or standards
  • makes a prediction of non-test behaviour
4
Q

Norm-referenced test

A

Performance of the examinee is referenced to standardisation sample

5
Q

Criterion-referenced test

A

Determines where the examinee stands with regard to tightly defined educational objectives

6
Q

Assessment

A

Appraising/estimating the magnitude of one or more attributes in a person

7
Q

Group tests

A

Suitable to the testing of large groups of individuals simultaneously (e.g. pen-and-paper tests)

8
Q

Individual tests

A

Designed to be administered one-on-one

9
Q

Types of psychological tests

A
  • intelligence tests
  • aptitude tests
  • achievement tests
  • creativity tests
  • personality tests
  • interest inventories
  • behavioural procedures
  • neuropsychological tests
10
Q

Responsibilities of test publishers

A
  • publication and marketing issues
  • competence of test purchasers
11
Q

Responsibilities of test users

A
  • best interests of the client
  • confidentiality and the duty to warn
  • expertise of the test user
  • informed consent
  • obsolete tests and the standard of care
  • responsible report writing
  • communication of test results
  • consideration of individual differences
12
Q

Diagnostics

A

Getting to know a situation in order to be able to make a decision

13
Q

Psychodiagnostics

A

Getting to know an individual’s psychosocial functioning
- reliable and valid description of their psychosocial reality
- find possible explanations for problems
- test possible explanations

14
Q

Scientific diagnostics

A
  • ideally repeatable
  • ideally approach reality
15
Q

Uses of tests

A
  • problem analysis
  • classification and diagnosis
  • treatment planning
  • program/treatment evaluation
  • self-knowledge
  • scientific research
16
Q

Committee on tests and testing in The Netherlands (COTAN)

A

Criteria
- principles of test construction
- goal
- group
- function
- standardisation
- quality of test material
- quality of test manual
- norms: representative reference group
- reliability: consistency, repeatability
- validity: does the test assess what it aims to?

17
Q

Tests need to:

A
  • be relevant
  • be performed by qualified individuals
  • have role integrity
  • be confidential
  • have informed consent
  • be independent and objective
18
Q

Classical test theory

A

Test scores are influenced by two factors: consistency factors (the stable attribute being measured) and inconsistency factors (measurement error)

X = T + e
(X = observed score, T = true score, e = measurement error)
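As an illustration (not part of the deck), a short Python sketch with hypothetical numbers can simulate the X = T + e decomposition: because unsystematic error e is independent of the true score T, the variance of observed scores splits into true-score variance plus error variance, and their ratio behaves like a reliability coefficient.

```python
import random

random.seed(42)

# Each observed score X is a fixed true score T plus random,
# unsystematic error e with mean 0 (X = T + e).
true_scores = [random.gauss(100, 15) for _ in range(10_000)]
observed = [t + random.gauss(0, 5) for t in true_scores]

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

# Since e is independent of T, var(X) ≈ var(T) + var(e), so the
# ratio var(T) / var(X) estimates reliability (here ≈ 225/250 = 0.90).
reliability = variance(true_scores) / variance(observed)
print(round(reliability, 2))
```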

19
Q

Sources of measurement error

A
  1. item selection: choosing an instrument/parts of an instrument
  2. test administration: general environment aspects, countenance of an examiner
  3. test scoring: subjectively scored tests are vulnerable to mistakes/bias from the scorer
  4. systematic measurement error: consistent error where something unwanted is measured
20
Q

Correlation coefficient (r)

A

Degree of linear relationship between two sets of scores obtained from the same people

Range: -1.00 to 1.00
Positive correlation (r > 0) or negative correlation (r < 0)
The closer r is to 1 (as an absolute value), the stronger the relationship
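As an aside, the coefficient can be computed from paired scores with a minimal Python sketch (hypothetical data; a real analysis would typically use a statistics library):

```python
def pearson_r(xs, ys):
    """Pearson correlation between two paired lists of scores."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

# A perfectly increasing linear relationship gives r = 1;
# reversing one list gives r = -1 (up to floating-point rounding).
print(round(pearson_r([1, 2, 3, 4], [2, 4, 6, 8]), 4))   # 1.0
print(round(pearson_r([1, 2, 3, 4], [8, 6, 4, 2]), 4))   # -1.0
```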

21
Q

Test-retest reliability

A

Administering an identical test to the same sample group on a second occasion and correlating the two sets of scores

22
Q

Alternate forms reliability

A

Two tests are independently created to measure the same thing; they typically have the same (or similar) means and standard deviations; the correlation between scores on the two forms (from the same sample group) should be strong and positive

23
Q

Split-half reliability

A

Correlate scores from the 1st and 2nd half of a test to each other (instead of administering 2 tests)

24
Q

Spearman-Brown formula

A

Corrects for the underestimation of reliability when using split-half reliability
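The prophecy formula can be sketched in Python; the general form projects reliability for an n-fold change in test length, and n = 2 gives the usual split-half correction:

```python
def spearman_brown(r_half, n=2):
    """Project the reliability of a test lengthened n-fold.

    A split-half correlation reflects a test of only half the length,
    so with n = 2 this corrects the underestimate for the full test.
    """
    return n * r_half / (1 + (n - 1) * r_half)

# A half-test correlation of .70 corresponds to a full-test
# reliability of about .82.
print(round(spearman_brown(0.70), 2))  # 0.82
```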

25
Q

Coefficient alpha (Cronbach's alpha)

A

Mean of all possible split-half coefficients, corrected by the Spearman-Brown formula
Range: 0.00 to 1.00
Index of the internal consistency of the items; the tendency for items to correlate positively with one another

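For illustration, alpha can be computed from an item-by-person score matrix using the variance form of the formula (rather than averaging every split-half coefficient directly); the data below are hypothetical:

```python
def cronbach_alpha(items):
    """items: list of per-item score lists, one entry per examinee."""
    k = len(items)          # number of items
    n = len(items[0])       # number of examinees

    def var(xs):            # sample variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

    # Total test score per examinee, summed across items.
    totals = [sum(item[i] for item in items) for i in range(n)]
    return k / (k - 1) * (1 - sum(var(it) for it in items) / var(totals))

# Three items answered by four examinees (made-up scores).
scores = [[2, 4, 3, 5], [3, 5, 4, 5], [1, 4, 2, 4]]
print(round(cronbach_alpha(scores), 2))  # 0.97
```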
26
Q

Kuder-Richardson formula

A

Similar to Cronbach's alpha, but used for tests whose items have only two answer options (dichotomous scoring)

27
Q

Interscorer reliability

A

A sample of tests is independently scored by two or more examiners, and the scores from the examiners are correlated (a strong, positive correlation is expected)
Used for subjectively scored tests

28
Q

Systematic errors

A
  • either positive or negative
  • average measurement error is not 0
  • can be due to test construction or an inconsistency in the assessed construct
  • serve as a measure of validity: how well is the test measuring what it is supposed to?
29
Q

Unsystematic errors

A
  • random and unpredictable
  • both positive and negative
  • average measurement error is 0
  • not related to the true score
  • a measure of reliability: they affect the consistency of scores
30
Q

Raw score

A

The most basic information provided by a psychological test, e.g. how many questions were answered correctly

31
Q

Norm group

A

Sample of examinees, representative of the population for whom the test is intended

32
Q

Norm-referenced test

A

Results of an examinee are interpreted using the instrument's corresponding norms

33
Q

Measurements of central tendency

A
  • mean: the average; good for normally distributed data
  • median: the middle score; better than the mean when the distribution is skewed; used for percentiles
  • mode: the most common score; shows the peak of a skewed distribution
34
Q

Percentile

A

Percentage of people who scored below a specific raw score (e.g. a score of 25 at the 94th percentile means 94% of participants scored below 25)

35
Q

Standard score

A

Distance from the mean in standard deviation units, a.k.a. a z-score

36
Q

T-scores

A

Transformation of z-scores to avoid negative and decimal numbers
M = 50, SD = 10
T = 10z + 50

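The two conversions above can be sketched together in Python (hypothetical IQ-style numbers, M = 100, SD = 15):

```python
def z_score(raw, mean, sd):
    """Standard score: distance from the mean in SD units."""
    return (raw - mean) / sd

def t_score(z):
    """Rescale a z-score to M = 50, SD = 10 (T = 10z + 50)."""
    return 10 * z + 50

# A raw score one SD above the mean: z = 1.0, T = 60.0.
z = z_score(115, mean=100, sd=15)
print(z, t_score(z))  # 1.0 60.0
```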
37
Q

Stanine

A

Raw scores converted to a scale running from 1 to 9
M = 5, SD ≈ 2
Scores are ranked lowest to highest and then assigned to stanines by percentage:
1 > bottom 4%
2 > next 7%
3 > next 12%
4 > next 17%
5 > next 20%
6 > next 17%
7 > next 12%
8 > next 7%
9 > top 4%

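The percentage bands above can be turned into a lookup by cumulating them (4, 11, 23, 40, 60, 77, 89, 96); a sketch mapping a percentile rank to a stanine:

```python
import bisect

# Cumulative upper boundaries (in percentile-rank terms) of
# stanines 1-8; anything above 96 falls into stanine 9.
BOUNDARIES = [4, 11, 23, 40, 60, 77, 89, 96]

def stanine(percentile_rank):
    """Map a percentile rank (0-100) to a stanine of 1-9."""
    return bisect.bisect_left(BOUNDARIES, percentile_rank) + 1

print(stanine(50))  # 5 (the middle 20% band)
print(stanine(2))   # 1 (bottom 4%)
print(stanine(98))  # 9 (top 4%)
```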
38
Q

Random sampling

A

Each member of the population (or a subset thereof) has an equal chance of being selected

39
Q

Stratified random sampling

A

Create strata (groups) from the population based on certain demographics, then select the sample randomly from each stratum (can be proportional)

40
Q

Expectancy table

A

Shows the relationship between test scores and the expected outcome on a different, relevant task, e.g. scores on a scholastic aptitude test and subsequent college grade point average

41
Q

Criterion-referenced test

A

Compares the examinee's score to a predefined performance standard
Often used for educational purposes

42
Q

Most commonly used confidence intervals (CI)

A

68% CI > X ± 1.00 * SD
90% CI > X ± 1.65 * SD
95% CI > X ± 1.96 * SD
99% CI > X ± 2.58 * SD
X = score, SD = standard deviation

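A small Python sketch of these intervals, using the conventional two-sided normal multipliers (1.96 for 95%, 2.58 for 99%) and hypothetical IQ-style numbers:

```python
# Standard normal multipliers for common confidence levels.
Z = {68: 1.00, 90: 1.65, 95: 1.96, 99: 2.58}

def confidence_interval(score, sd, level=95):
    """Interval around a score, rounded to 2 decimals for display."""
    half_width = Z[level] * sd
    return (round(score - half_width, 2), round(score + half_width, 2))

# 95% interval around a score of 100 with SD = 15.
print(confidence_interval(100, 15, level=95))  # (70.6, 129.4)
```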
44
Q

Difference score (SEdiff)

A

Used to determine whether the difference between pre- and post-treatment scores is real (or due to the unreliability of the tests)

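As a sketch (assuming both tests are on the same scale): with SEM = SD * sqrt(1 - r) for each test, the standard error of the difference combines the two errors as SEdiff = sqrt(SEM1² + SEM2²) = SD * sqrt(2 - r1 - r2):

```python
def se_diff(sd, r1, r2):
    """Standard error of a difference between two scores.

    sd: shared standard deviation of the two score scales
    r1, r2: reliability coefficients of the two tests
    """
    return sd * (2 - r1 - r2) ** 0.5

# Hypothetical example: SD = 15, reliabilities .90 and .85.
print(round(se_diff(15, 0.90, 0.85), 2))  # 7.5
```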
45
Q

Relative norms

A

Goal: classify on a continuum
Interpretation: control group
Items should maximally discriminate

46
Q

Absolute norms

A

Goal: determine whether a criterion has been reached
Interpretation: previously determined criterion
Items should be relevant to the criterion
Especially used in education

47
Q

Norms

A

Summary of the distribution of characteristics in a representative sample
Need to be up-to-date:
- 15 years > outdated
- 20 years > unusable

48
Q

Summative assessment

A
  • used for selection, qualification, or prognosis
  • assessment of learning, e.g. a course exam
49
Q

Formative assessment

A
  • identifies strengths & weaknesses
  • aimed at instruction (compare with one's own scores or those of peers)
  • assessment for learning, e.g. polls & feedback on a report
50
Q

Flynn effect

A

IQ increases by 3 points every 10 years

51
Q

Validity

A

A test is valid to the extent that inferences made from it are appropriate, meaningful, and useful
Types of validity:
  • content validity
  • criterion-related validity
  • construct validity
52
Q

Content validity

A

The degree to which the content of a test is representative of the sample of behaviour/construct the test is designed to assess
  • affected by proper selection of items and thorough assessment of the construct
  • can be evaluated using an expert panel
53
Q

Face validity

A

Does the test look valid to test users, examiners, and examinees?
More a matter of social acceptability than a technical form of validity

54
Q

Criterion validity

A

Correlation between an examinee's test score and the behaviour/construct you want to predict
  • concurrent validity
  • predictive validity
55
Q

Concurrent validity

A

Assess the behaviour at approximately the same time (usually the same day), using both the predictor and criterion tests

56
Q

Predictive validity

A

Assess the behaviour at separate times (usually with a long period in between) in order to predict future behaviour: the predictor test is administered first, followed later by the criterion test

57
Q

Construct validity

A

The extent to which the test/measure accurately assesses what it is supposed to, measured by correlating the test with another test
  • convergent validity
  • discriminant validity
58
Q

Convergent validity

A

Assess the relationship between the main test's scores and those of a test that assesses the same construct
Ideally > a high, positive correlation

59
Q

Discriminant validity

A

Assess the relationship between the main test's scores and scores on an unrelated test (one that does not assess the same construct)
Ideally > a low or absent correlation

60
Q

Test construction process

A
  1. Defining the test
  2. Selecting a scaling method
  3. Constructing the items & analysis
  4. Revising the test
  5. Publishing the test
If the test is found to be inadequate after step 4, return to step 3

61
Q

Representative scaling methods

A
  • expert rankings
  • Likert scales
  • Guttman scales
  • Thurstone scales
  • absolute scales
  • empirical scales
62
Q

Item-difficulty index

A

Method for testing items
Proportion of examinees who get the item correct in a tryout; identifies items that should be altered or discarded from the test

63
Q

Item-reliability index

A

Method for testing items
Items should display internal consistency and a good correlation with total test scores

64
Q

Item-validity index

A

Method for testing items
Used to identify predictively useful test items: how well does each item contribute to the overall predictive validity?

65
Q

Item-characteristic curves

A

Method for testing items
Graphical display of the relationship between the probability of a correct response and the examinee's position on the underlying trait measured by the test

66
Q

Item-discrimination index

A

Method for testing items
Statistical index of how efficiently an item discriminates between people who obtain high and low scores on the entire test

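The difficulty and discrimination indexes above can be sketched in Python (made-up 0/1 response data; the discrimination variant shown compares pass rates in the top- and bottom-scoring groups on the whole test):

```python
def item_difficulty(responses):
    """responses: list of 0/1 correctness values for one item.

    Returns the proportion of examinees who got the item right.
    """
    return sum(responses) / len(responses)

def item_discrimination(upper, lower):
    """upper/lower: 0/1 responses for the item from the groups with
    high vs low total test scores. Positive values mean the item
    separates strong from weak examinees."""
    return item_difficulty(upper) - item_difficulty(lower)

print(item_difficulty([1, 1, 1, 0, 0]))                 # 0.6
print(item_discrimination([1, 1, 1, 1], [1, 0, 0, 0]))  # 0.75
```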
67
Q

Cross validation

A

Method for revising a test
Using the original regression equation in a new sample to determine whether the test still predicts the criterion well

68
Q

Validity shrinkage

A

Method for revising a test
Often, a test predicts the relevant criterion less accurately in a new sample

69
Q

Feedback from examinees

A

Method for revising a test
Receive feedback from the examinees in the tryout sample on the:
  • behaviour of examiners
  • testing conditions
  • clarity of exam instructions
  • convenience of using the answer sheet
  • perceived suitability of the test
  • perceived cultural fairness of the test
  • perceived sufficiency of time
  • perceived difficulty of the test
  • emotional response to the test
  • level of guessing
  • level/method of cheating by the examinee or others
70
Q

Factor analysis

A

Summarises the interrelationships among a large number of variables in a concise and accurate manner as an aid in conceptualisation

71
Q

CHC Theory broad ability factors

A
  • fluid intelligence/reasoning (Gf)
  • crystallised intelligence/knowledge (Gc)
  • domain-specific knowledge (Gkn)
  • visual-spatial abilities (Gv)
  • auditory processing (Ga)
  • broad retrieval (Gr)
  • cognitive processing speed (Gs)
  • decision/reaction time or speed (Gt)
72
Q

Sternberg's Triarchic Theory of Intelligence

A
  • componential (analytic) intelligence > executive processes
  • experiential (creative) intelligence > dealing with novelty
  • contextual (practical) intelligence > adaptation
73
Q

IQ tests measure...

A
  • problem-solving abilities
  • verbal abilities
  • global capacity vs specific mental functions
  • speed of response and thinking
74
Q

IQ tests do not measure...

A
  • learning competence
  • social competence
75
Q

IQ experts to know...

A
  • Galton: IQ as sensory keenness (speed)
  • Spearman: IQ as a global capacity (g) and specific factors (s)
  • Thurstone: IQ as 7 primary mental abilities
  • Luria: IQ as simultaneous vs successive processing
  • Guilford: IQ as the SOI model; added creativity; model consists of operations, contents, and products
76
Q

Cattell-Horn-Carroll (CHC) Theory (1968)

A
  • hierarchical structure of intelligence
  • stratum 3: overall capacity (g)
  • stratum 2: broad cognitive abilities
  • stratum 1: narrow cognitive abilities
77
Q

Gardner's Multiple Intelligences (1983)

A
  • critique of g > no underlying general factor exists
  • introduced multiple intelligences: people smart, music smart, etc.
  • found evidence in brain studies (localisation)
  • evolutionarily plausible
78
Q

Wechsler IQ: broad cognitive skill indexes

A
  • verbal comprehension index (VCI)
  • visual spatial index (VSI)
  • fluid reasoning index (FRI)
  • working memory index (WMI)
  • processing speed index (PSI)
79
Q

Wechsler IQ test: psychometric properties

A

Full-scale IQ: M = 100, SD = 15, range 55 - 145
Index IQs: M = 100, SD = 15, range 55 - 145
Individual subtests: M = 10, SD = 3, range 1 - 19
FSIQ alpha: 0.96 (SEM = 3)
FSIQ test-retest: 0.95