PSYC4022 Testing and Assessment Week Two Psychometrics Flashcards

1
Q

Test

A

“An objective & standardized procedure for measuring a psychological construct using a sample of behavior” (Guion, 1998)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Raw Score

A

Unmodified account of test performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Standardisation

A

involves administering a test to a representative sample for the purpose of establishing norms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Norms

A

Show the distribution of results for the sample from a certain population. Raw scores from the sample can be transformed to standard scores to enable the development of norms.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Normative Sample

A

Is the group of people whose performance on a particular test is analysed for reference in evaluating the performance of individual testtakers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Criterion

A

is a standard on which a judgement or decision is based.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Criterion Referenced Evaluation

A

A method of evaluation and a way of deriving meaning from test scores by evaluating an individual’s score with reference to a set standard. For example, a certain level GPA to gain entry into Honours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Distribution

A

a set of test scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Frequency Distribution

A

Where you tally the number of score occurances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Grouped Frequency Distribution

A

Where intervals are counted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Measures of Central Tendency

A

E.g. Mean, Medium, Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Measures of Variability/ Dispersion

A

Range, SD, Square Root of Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How good is a score of 46.5?

A

You’d need to know;

  1. What was the test? How did developers define the concepts
  2. What type of distribution/ test norm referenced or criterion referenced?
  3. Norms for test (if norm referenced) Cut off score if criterion referenced.
  4. Was it a raw score or a scaled score
  5. Mean for population; Standard deviation
  6. Reliability - not a chance happening
  7. How it is scored? What it is out of, what scale?
  8. Content and Construct Validity
  9. Qualifications/ Experience of the tester
  10. What population was sampled
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The Normal Curve is also called the…

A

Gaussian Curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

6 Types of Distribution Curves are…

A
  1. Normal
  2. Bi-Modal
  3. Positively Skewed
  4. Negatively Skewed
  5. J-shaped
  6. Rectangular
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What percentage of scores lie within 1, 2, 3, 4, 5 and 6 SDs of the mean?

A
  1. 34%
  2. 68%
  3. 82%
  4. 96%
  5. 98%
  6. 100%
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

A test is standardised when it has (3 Things);

A
  1. Supervised Administration
  2. Consistent conditions, instructions, wording and timing
  3. been administered to representative group from a target population (random, stratified or convenience)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Raw scores from the sample can be transformed into standard scores to enable the development of…

A

norms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The Ravens has a standard norm and a….

A

managerial norm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Which accounting firm developed the managerial norm for the Ravens?

A

Price Waterhouse Coopers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Standard Scores

A

Raw scores that have been converted from one scale to another, the latter scale having some arbitrary set mean and SD.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Z-score ** EXAM

A

(mean set at zero, and SD set at one), which results from the conversion of a raw score into a number indicating how many SD units the raw score is below or above the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

T-score ** EXAM

A

(mean set at 50 and SD set at 10 (50 plus or minus ten scale)). A scale that ranges from 5 standard deviations below the mean to 5 standard deviations above the mean. A raw score that falls at 5 SDs below the mean would be 0, and one that falls at the mean would be 50, and 5 above would be 100.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

From Z Scores to T Scores Formula

A

= z (10) + 50

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

The Mean and SD of IQ

A

M = 100, SD = 15

26
Q

What is the formula to work out a Z score from an IQ?

A

Z = (X-M)/SD

27
Q

What is the formula for measurement error?

A

X = T + e

28
Q

There are a number of sources of error. Name 3

A
  1. Test construction
  2. Test administration
  3. Test scoring and Interpretation
29
Q

Psychometrics

A

Area of Psychology concerned with the quality of tests/scales and items designed to measure psychological constructs

30
Q

Validity

A

The degree to which evidence supports the interpretation of test scores for their intended purposes.

31
Q

Construct Validity

A

Adequacy of operational definition of variables. Does the test measure what is purports to measure?

32
Q

There are two criteria used to judge the quality of tests/scales

A
  1. Reliability

2. Validity

33
Q

Nomological Network

A

Network of research evidence surrounding and supporting the validity of a construct. It supports, doesn’t PROVE a construct.

34
Q

Convergent Validity

A

If a measure has convergent validity, it should correlate with questionnaires that measure the same and/ or related constructs.

35
Q

Discriminant/ Divergent Validity

A

If a measure has discriminant validity, it should not correlate with questionnaires that measure different constructs or unrelated constructs.

36
Q

Reliability and Validity are usually measured with…

A

correlation co-efficient

37
Q

Exploratory Factor Analysis

A

Exploratory Factor Analysis is used to identify the factor structure of a construct, generally during test development.

38
Q

There are 4 types of Validity. They are;

A
  1. Face Validity
  2. Content Validity
  3. Population Validity
  4. Criterion Validity
    4a. Concurrent
    4b. Predictive
39
Q

Face Validity

A

The face-appearance that a test measures what it purports to measure.

40
Q

Content Validity

A

The extent to which a test measure what it was designed to measure.

41
Q

Population Validity

A

…is a type of external validity which describes how well the sample used can be extrapolated to a population as a whole.

42
Q

Criterion Validity (also criterion-related validity).

a. Concurrent and
b. Predictive

A

A judgement regarding how adequately a score or index on a test or other tool of measurement can be used to infer an individual’s most probabaly standing on some measure of interest (or criterion). a. Concurrent - at the time of testing. b. Predictive - at some future point in time.

43
Q

Reliability

A

The consistency or stability of a measure of behaviour

44
Q

What is the relationship between standard error of measurement and reliability?

A

The larger the SEM, the smaller the reliability.

45
Q

What is the formula for SEM

A

= SD*(SQRT 1-r)

46
Q

There are 4 types of Reliability. What are they?

A
  1. Test-Retest Reliability (.7 or .8 is acceptable)
  2. Inter-rater Reliability
  3. Internal Consistency (coefficient alpha)
  4. Parallel Forms
47
Q

Standard Error of Measurement (SEM)

A

Provides an indication of the dispersion of the measurement errors when you are trying to estimate true scores from observed scores.

48
Q

Varimax Rotation and Structural Equation Modelling are two methods of….

A

Factor Analysis

49
Q

Can a test have good reliability but poor validity?

A

Yes, using foot measurement as an IQ test

50
Q

Can a test have good validity but poor reliability?

A

Generally not, but you could have a one-off

51
Q

Can a test have good validity but poor utility?

A

Yes, e.g. drug patches on juvenile offenders.

52
Q

Can a test have good utility but poor reliability?

A

Probably not, that would be difficult

53
Q

Can a test have good utility but poor validity?

A

Yes, like the Myers-Briggs has questionable validity but is useful for on-boarding

54
Q

There are 7 stages of a meta-analysis. What are they?

A
  1. Articulate a research question
  2. Identify a relevant population
  3. Capture all the available evidence (studies) (see Cochrane Library and PRISMA framework)
  4. Form meaningful measure of effect size
  5. Pool studies in an appropriate manner and formulate a balance-of-evidence conclusion
  6. Look for sources of variation in effect size
  7. Look for possible bias in available evidence
55
Q

Exercise 1 on Measuring Ourselves. What did we learn?

A
  1. Definitions of concepts is important.
  2. Require agreed method of measuring
  3. Require time to administer instruments
  4. Validity and Reliability are important
  5. Correct choice of instruments
  6. Rely on openness - willingness for self disclosure
  7. Need to consider ethics at all times
  8. Variety of sources of information
  9. Cultural Differences
  10. Can we really measure all concept - implications
56
Q

Sources of Error - Test Construction (2 things)

A
  1. To what extent do items adequately sample the construct being assessed?
  2. To what extent are items clearly worded? Any ambiguities?
57
Q

Sources of Error - Test Administration (5 things)

A
  1. Distractions in the Environment
  2. Emotional State of test taker
  3. Attitude/Disposition of test administrator
  4. Interpersonal Issues between test taker and administrator
  5. Malfunctions of Instruments
58
Q

Name 1 possible error with online testing

A
  1. Internet Speed is an unknown variable. In a timing test, you loose precision.
59
Q

The WAIS-IV was stratified on a US sample according to the following;

A
  1. Geographic region: 4 Regions by census reports including MW, NE, SW….
  2. Race - White, African Americans, Hispanics, Asians and other racial groups.
  3. Age - 13 age groups represented (not all of equal range or size)
  4. Sex - Under 65 Equal Numbers, ; over 65 proportional to population
  5. Educational Level - 5 Educational Levels based on number of years completed
60
Q

What are the benefits of assessment? (4 Things)

A
  1. Diagnosis
  2. Differentiate
  3. Severity
  4. Monitor/ Malingering