Test theory and psychometrics Flashcards

1
Q

What must you know about a test to use it?

A
  • Name of test
  • Type of distribution
  • Definition of construct
  • Standardisation or raw?
  • Norms / cut offs / population
  • Evidence used to support measures - validity
  • M and SD
  • Out of
  • Experience of Tester
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a distribution?

A

A set of test scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are raw scores?

A

unmodified scores of test performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is frequency distribution?

A

a tally of the number of score occurrences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a grouped frequency distribution?

A

Scores that fall within intervals are counted (95-99 etc.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can we describe the distribution of test scores?

A

Using:
measures of central tendency
measures of variability/ dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the measures of central tendency?

A

mean, medium and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the measures of variability/ dispersion?

A

range, standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does SD mean?

A

The average distance from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is used to develop norms?

A

Standard scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a normative sample?

A

A group of people’s scores who are used as reference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Norms are?

A

test performance data for reference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a criterion?

A

A standard on which a judgement or decision is based: e.g. GPA x to get into a course

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How many scores in a normal distribution fall above and below the mean?

A

50:50

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How many scores in a normal distribution occur between the mean and 1 SD?

A

34%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How many scores in a normal distribution occur between +-1 SD? e.g. -1 SD to +1 SD

A

68% (34% on each side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How many scores in a normal distribution occur between +-2 SD? e.g. -2 SD to +2 SD

A

96% (48% on each side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How many scores in a normal distribution occur fall above 2SD?

A

2% (on each side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How many scores in a normal distribution occur fall between 1SD and 2SD?

A

14% (on each side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does 75th percentile mean?

A

higher than 75% of people (in the top 25%)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the limitations of using percentiles?

A

changes all of the time
may not give an accurate reading
if ordinal data it looses precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Why don’t we just give raw scores?

A

For broader meaning/ comparison

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the simplest example of a standard score?

A

z score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the formula for calculating a z score?

A

z = x - mean / SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is the mean for a set of z scores?
0
26
What is the SD for a set of z scores?
1 SD
27
What does a z score tell us?
How many SD the raw score is below or above the mean
28
Other than a z score what is another type of standardised score?
t score
29
What is the mean of a t score?
50 (half way between 0 and 100)
30
What is the SD of a t score?
10
31
What does a t score range between?
5 SD above and below the mean | e.g. 0 - 100
32
IQ mean and SD?
``` M = 100 SD = 15 ```
33
What does FSIQ stand for?
Full scale IQ
34
Why, if reporting scores to parents is it better to use a t score than a z score?
No negatives (0-100 vs. -2 to 2)
35
How do you calculate a t score?
T = (Z x 10) + 50
36
How would you calculate a standardised score on the IQ test?
x = (z x SD) + Mean
37
What does a true score assume?
That there is an ideal score that captures a degree of a construct
38
What does test theory say about true scores?
They are not possible to obtain (measurement error)
39
Test theory states someones x score =
True score + error
40
What are some sources of error in test scores?
* Test construction (validity, sample) * Test Administration (conditions, followed manual?, professionalism, computer used?, emotional state, relationship) * Scoring and interpretation (subjective responses)
41
What could we assume if there was no error in test scores/
* We'd get the same score twice * X would be the true score * Two parallel tests would get the same score * Wouldn't need a range
42
What is psychometrics?
The area of psychology interested in the quality of tests
43
What makes a good test?
Validity and reliability
44
What is validity?
supporting evidence for interpretation of scores | e.g. construct validity: measures what is is operationalised to measure
45
Why is a measure of IQ iffy when it comes to validity?
Can you really use a number to represent how much of that construct (IQ) you have?
46
What is a nomological network?
a collection of research surrounding and supporting the validity of a construct
47
What are two types of construct validity?
Convergent: same as other tests measuring same construct Discriminant: different to unrelated constructs
48
How many factors in the Weschler?
4 cognitive domains
49
What types of factor analysis are there?
Exploratory | Confirmatory
50
What are 4 other types of validity (other than construct validity)?
face content population criterion: predictive and concurrent
51
What is predictive (criterion) validity?
Using a test to select the most adequate person (e.g. air force pilot that costs a lot to train - want to predict if they will stay)
52
What is varamax rotation?
a way to figure out factors that are completely unrelated to each other (but not that useful)
53
What type of analysis is usually used for confirmatory analysis?
SEM
54
How is reliability assessed?
Correlation
55
What are the 4 types of reliability?
* Internal consistency (alpha: how well they hang together) * Test-retest (acceptable = .8) * Inter-rater * Parallel forms
56
What is an example of something that could get good reliability but poor validity?
Using foot size to measure intelligence • always get the same score • not a good reflection of the construct IQ
57
What is an example of something that could get good validity but poor reliability?
XXXX | needs to be valid to be considered reliable
58
What is an example of something that could get good validity but poor utility?
Something that is too expensive
59
What is an example of something that could get good utility but poor validity?
Myer briggs - questionable validity but useful for team building
60
SEM stands for
standard error of measurement
61
What is used because of the standard error of the mean?
Confidence intervals: e.g. 95% of the time, the true value will fall between x and x q
62
How do you calculate confidence intervals?
CI = Score +-(1.96 x SEM)
63
The larger the SE of the Mean?
The lower the reliability of the test
64
How do you calculate the SEMeasurment?
SD sq root of 1-r
65
If SD was 12 and r was .88 what would the SEMeasurment?
4.16
66
What statistics do you use in a meta-analysis?
effect size and SE
67
Why is height so easy to report?
* Don't have to rely on self report * Can observe directly * Have agreed definitions * Have agreed ways of measuring
68
What are some different types of distributions?
``` normal bimodal positively skewed negatively skewed j-shaped rectangular ```