Midterm Flashcards

1
Q

Many of the tasks that David Wechsler used in his WAIS, WAIS-R, WAIS-III, and WAIS-IV were adapted from what sources?

A

The Army Alpha, Army Beta, Army Performance Scale Examination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Updating the WAIS-IV’s theoretical foundations was achieved by considering which theoretical construct?

A

Phonological processing!

It was achieved by considering Fluid Reasoning, Working Memory, and Processing Speed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What was the major structural change implemented from the WAIS-III to the WAIS-IV?

A

The WAIS-IV eliminated the Verbal and Performance IQs, but the 4
indexes were retained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which Subtest in NOT new to the WAIS-IV?

A

Symbol Search

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which Subtests ARE new to the WAIS-IV?

A

Figure Weights, Visual Puzzles, and Cancellation are all new

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which WAIS-IV subtests offer process scores?

A

Digit Span, Block Design, and Letter-number Sequencing have process scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which index includes the subtest with the lowest loadings on the general (g) factor?

Note: General intelligence or general mental ability is denoted by g

A

Processing Speed (Coding, Symbol Search, and Cancellation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Flynn Effect?

A

The phenomenon that IQ test norms in the United States get out of date at the rate of about 3 points per decade!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or False? Analysis of ethnic differences on the WAIS-IV have shown that ethnicity accounts for more variance in IQ than socioeconomic status.

A

False!

Socioeconomic status and an array of other background, behavioral, and personal variables impact IQ far more than ethnicity alone

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Galton known for?

A

Galton developed the first comprehensive individual test of intelligence, composed of sensorimotor tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What was the FIRST Weschler test series?

A

Wechsler-Bellevue Intelligence Scale in 1939

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the Verbal Comprehension Index (VCI) Measure?

A
Crystallized Intelligence (Gc) and represents ability to 
reason with previously learned information.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the four subtests that make up the Verbal Comprehension Index (VCI)?

A

Similarities
Vocabulary
Information

Comprehension (not a core subtest)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What subtests make up the Perceptual Reasoning Index (PRI)?

A

Block Design
Matrix Reasoning
Visual Puzzles

Figure Weights (not a core subtest) 
Picture Completion (not a core subtest)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does PRI measure?

A

Measures visual processing and fluid reasoning (Gv-Gf), and represents ability to analyze and synthesize visual stimuli and reason with it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does Working Memory Index (WMI) measure?

A

Measures short-term memory (Gsm), and represents ability to
comprehend and hold or transform information in immediate
awareness and then use it in a few seconds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What subests make up Working Memory Index (WMI)?

A

Digit Span
Arithmetic

Letter-number Sequencing (not core)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What does Processing Speed Index (PSI) measure?

A

Measures Processing Speed (Gs) and represents ability to perform
simple tasks quickly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

When is the General Ability Index (GAI) used?

A

Good measure of global ability when FSIQ is not interpretable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

True or False? The examiner must memorize all directions in order to administer the test properly.

A

False! We can read the directions, but we still need to be very familiar with the test, procedures and prompts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

When applying the reverse rule, you must…

A

Go back to administer items until two consecutive items are correct, including the item with which you initially started

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Which core subtests require the use of a stopwatch?

A

Block Design, Arithmetic, Symbol Search, Visual Puzzles, Coding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is a Basal

A

Establishing a basal means that a test-taker has answered a set number of answers correctly so that they can continue with the rest of the test/subtest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a Ceiling

A

A ceiling is reached when a test-taker has answered an established number of questions incorrectly in a row.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

All of the following are principles of the intelligent testing philosophy:

A
  1. Subtests measure what the individual has learned
  2. Subtests are samples of behavior and are not exhaustive
  3. Test batteries are optimally useful when interpreted from a theoretical model
  4. IQ tests assess mental functioning under fixed experimental conditions
  5. Hypotheses generated from the test profile should be supported with data from multiple sources
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

If the FSIQ is not interpretable, then…

A

Determine if the GAI can be calculated and interpreted as a reliable and valid estimate of a person’s general intellectual ability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

An abnormally large discrepancy between PRI and VCI means it is…

A

Rare among the normal population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is Psychological Assessment

A

A comprehensive examination of psychological functioning that involves collecting, evaluating, integrating test results and collateral information, and reporting information about an individual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is Psychological Testing

A

An objective and standardized measure of a sample of behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Pillars of Assessment

A

Norm referenced tests (this is what the IQ test is)

Interviews

Observations

Informal Assessment Procedures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What are some differences between Assessments and Testing?

A

o Complexity
▪ Testing- Unidimensional
▪ Assessment- multidimensional

o Duration
▪ Testing- Few minutes, hours
▪ Assessment- Hours to days

o Cost
▪ Testing- Inexpensive, but can be expensive
▪ Assessment- professional costs

Purpose
▪ Testing- Decision making
▪ Assessment- Referral Question, problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Why is assessment important?

A

o Clinical versus statistical prediction controversy
o Clinical interviews are not really that reliable because we are all biased and human
o Statistical prediction consistently outperforms clinical judgment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Psychological tests are used for…

A

…measuring characteristics of humans that pertain to overt (observable) and covert behavior. They are a set of items that are designed to measure characteristics of human beings that pertain to behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is Binet known for?

A

He made the first representative sample intelligence test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Achievement tests

A

Measure previous learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Aptitude tests

A

measure potential for acquiring a specific skill

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Intelligence tests

A

measures an individual’s potential to solve problems, adapt to changing circumstances and profit from experience

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Personality tests

A

measure typical behavior, including traits, temperaments and dispositions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Statistics serve two important functions. What are they?

A

o Used for the purpose of description

o Use statistics to make inferences, which are logical deductions about events that cannot be observed directly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Norm-referenced tests

A

Evaluate individuals relative to normative groups that depend on demographics such as age and culture

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Norms

A

used to relate a score to a particular distribution for a subgroup of a population. Ex. Norms are used to describe where a child is (score) on some measure relative to other children of the same age (subgroup of the population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Criterion-referenced test

A

describes the specific types of skills, tasks, or knowledge of an individual relative to a well-defined mastery criterion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Descriptive Statistics

A

Provide a concise description of a collection of quantitative information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Inferential Statistics

A

Used to make inferences from observations of a sample to a larger group (population)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Measurement

A

Defined as the application of rules for assigning numbers to objects (rating scale for wine)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Magnitude

A
Represents quantity (A scale of height, we can say John is taller 
than Fred)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Equal Intervals

A

A scale has the property of equal intervals if the difference
between two points at any place on the scale has the same meaning
as the difference between two other points (A ruler, but not an IQ
test)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Absolute 0

A

Obtained when nothing of the property being measured exists

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Nominal Scales

A

▪ Not really scales
▪ For naming (such as naming teams)
▪ Can create frequency distributions, but not mathematical calculations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

Ordinal Scales

A

▪ For ranking
▪ Has the property of magnitude
▪ Can rank (1, 2, 3) from taller to shorter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

Interval Scales

A

▪ Property of equal intervals and Magnitude (temperature in degrees) ▪ But cannot reveal ratios
▪ Cannot have absolute zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

Ratio Scales

A

▪ Properties of magnitude, Equal Intervals, and Absolute 0

▪ Speed of travel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What are Frequency Distributions?

A

Displays scores on a variable or a measure to reflect how frequently each value was obtained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

Positive Skew

A

The tail goes off toward the higher or positive side of the X axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

Class Interval

A

The distance between each unit on the horizontal axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

Percentile Ranks

A

Answers the question, “what percent of the scores fall below a particular score”

You actually exclude the score of interest (it is a measure of relative performance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

Percentiles

A

The specific scores or points within a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

Mean

A

The average score or result of a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

Standard Deviation

A

a measure of how dispersed the data is in relation to the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

Variance

A

The average of the squared differences from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

Z-Score

A

the deviation of a score from the mean in standard

deviation units.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

Expectancy Effects

A

Beliefs held by people administering and scoring tests might also get
translated into inaccurate test scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

Data can sometimes be affected by what an experimenter expects to find. What is the name of this effect?

A

Rosenthal Effect!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

The Point Scale Concept

A

In an age-scale format, the arrangement of items had nothing to do with their content and subjects didn’t receive points for each task completed. In a point scale, points are assigned to each item so the test yields a total overall score and a score for each content area

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

Performance Scale Concept

A

Requires the subject to do something rather than just answer questions. This way, the test can directly compare an individua’s verbal and nonverbal intelligence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

To determine scaled scores you need:

A

Examinee’s chronological age

Subtest raw scores on record form

Table A.1 from scoring manual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

Converting Raw Scores to Scaled:

A

Transfer total scores for each subtest onto pg. 1 of record form

Find scaled score equivalent for each subtest using Table A.1 for appropriate age

Record each scaled score in corresponding column

68
Q

What is the chronological age range for WAIS-IV participants?

A

16-90

69
Q

What do IQ tests assess?

A

IQ tests assess mental functioning under fixed experimental conditions

70
Q

________ generated from the test profile should be supported with data from multiple sources

A

Hypotheses

71
Q

What are some weaknesses of the WAIS-IV?

A

● Block Design is the only hands-on subtest
● Ambiguity in some scoring criteria
● Annoying instructions on one subtest
● Some questionable scoring rules

72
Q

Strengths of the WAIS-IV include:

A

o Generally strong reliability
o A much more user-friendly Record Form
o A high-quality standardization sample

73
Q

What are the three independent traditions T. R. Taylor (1994) identified that have been employed to study the nature of human intelligence?

A

Psychometric Approach
Information-processing approach
Cognitive tradition

74
Q

Psychometric Approach

A

Examines elemental structure of test. It is the OLDEST approach.

75
Q

Information-processing approach

A

examines the processes that underlie how we learn and solve problems

76
Q

Cognitive tradition

A

focuses on how humans adapt to real-world demands

77
Q

Binet defined intelligence as the capacity to:

A

○ to find and maintain a definite direction or purpose,
○ to make necessary adaptations—that is, strategy
adjustments—to achieve that purpose, and
○ to engage in self-criticism so that necessary adjustments in strategy can be made.

78
Q

Binet’s contributions

A

Age differentiation - estimated the mental ability of a child in terms of his or her completion of the tasks designed for the average child of a
particular age, regardless of the child’s actual or chronological age.

General mental ability - the total product of the various separate and distinct
elements of intelligence

79
Q

Charles Spearman’s contribution

A

Intelligence consists of one general factor (g) plus a large number of specific factors, hence “Spearman’s g”.

80
Q

Factor Analysis

A

A method for reducing a set of variables or
scores to a smaller number of hypothetical
variables called factors

81
Q

What are the two basic types of intelligence?

A

Fluid and Crystallized

82
Q

What is Fluid Intelligence?

A

Abilities that allow us to reason, think, and acquire new knowledge. These abilities allow us to learn and acquire information.

83
Q

What is Crystallized Intelligence?

A

Represents the knowledge and understanding that we have acquired from previous learning.

84
Q

1905 Binet-Simon Scale (first version)

A

■ purpose was restricted to identifying mentally disabled children in the Paris school system
■ 30 items presented in an increasing order of difficulty
■ Idiot described the most severe form of intellectual
impairment, imbecile moderate levels of impairment, and
moron the mildest level of impairment

85
Q

What changed from the 1905 to the 1908 Binet-Simon Scale (second version)

A

Items were grouped according to age level
rather than simply one set of items of increasing
difficulty

86
Q

What is mental age based on?

A

A subject’s mental age is based on his or her performance compared with the average performance of individuals in a specific
chronological age group.

87
Q

1916 Stanford-Binet Intelligence Scale (third version of Binet scales)

A

Introduced the Intelligence Quotient (IQ)

Strength: the increased sample size clearly marked an improvement over the meager 50 and 203 individuals of the 1905 and 1908 Binet-Simon versions.

Limitation: entire standardization sample consisted
exclusively of white, native-Californian children.

88
Q

The concept of g refers to the ____.

A

view that one general mental ability factor underlies all intelligent behavior.

89
Q

What major concept did Binet introduce in the 1908 Binet-Simon scale?

A

mental age

90
Q

The standardization sample of the 1916 Stanford-Binet scale was inadequate in that _____.

A

it was comprised exclusively of white children from California

91
Q

A central problem of the 1937 revision of the Stanford-Binet scale was that _____.

A

different age groups showed significant differences in the standard deviation of IQ scores

92
Q

_____ is a standard score with a mean of 100 and a standard deviation of 16 (later 15) that was first introduced in the 1960 revision of the Stanford-Binet.

A

The deviation IQ

93
Q

The knowledge you have acquired through your academic studies would best be described in terms of _____.

A

crystallized intelligence.

94
Q

The gf-gc theory of intelligence is

A

a hierarchical model on which only the later versions of the Stanford-Binet are based.

95
Q

Administration of the modern Stanford-Binet requires examiners to continue testing until the _____.

A

examinee’s ceiling is reached

96
Q

What is the oldest approach to investigating human intelligence?

A

Psychometric

97
Q

Binet believed that human intelligence was expressed through?

A

Judgment, attention, and reasoning

98
Q

To support the notion of g, Spearman developed a statistical technique called

A

Factor analysis

99
Q

Those abilities that allow us to learn and acquire information can be referred to as

A

Fluid intelligence

100
Q

As used in the Stanford Binet Scale the deviation IQ is a standard score with a

A

Mean of 100 SD of 16

101
Q

Item

A

An item is a specific stimuli to which a person responds overtly; this response can be scored or evaluated (e.g. classified, graded on a scale, or counted). In simple terms, items are the specific questions or problems that make up a test.

102
Q

Psychological Test

A

A psychological test or educational test is a set of items that are designed to measure characteristics of human beings that pertain to behavior.

103
Q

Psychological Testing

A

Refers to all the possible uses, applications, and underlying concepts of psychological and educational tests. The main use of these tests, though, is to evaluate individual differences or variations among individuals. Such tests measure individual differences in ability and personality and assume that differences shown on the test reflect actual differences among individuals.

104
Q

Overt behavior

A

an individual’s observable behavior

105
Q

Covert behavior

A

takes place within an individual and cannot be directly observed

106
Q

-To deal with distribution problems of scores, psychologists make use of ___, which relate raw scores on test items to some defined theoretical or empirical distribution.

A

scales

107
Q

Individual Tests

A

Tests that can be given to only one person at a time

108
Q

Test administrator

A

The person giving the test

109
Q

Group Test

A

Is a test that can be administered to more than one person at a time by a single examiner, such as when an instructor gives everyone in the class a test at the same time.

110
Q

Achievement

A

previous learning. A test that measures or evaluates how many words you can spell correctly is called a spelling achievement test.

111
Q

Aptitude

A

refers to the potential for learning or acquiring a specific skill. A spelling aptitude test measures how many words you might be able to spell given a certain amount of training, education, and experience

112
Q

Intelligence

A

Refers to a person’s general potential to solve problems, adapt to changing circumstances, think abstractly, and profit from experience.

113
Q

Human ability

A

In view of considerable overlap of achievement, aptitude, and intelligence tests, all three concepts are encompassed by the term human ability.

114
Q

Personality Tests

A

are related to the overt and covert dispositions of the individual.

115
Q

Structured personality tests

A

provide a statement, usually of the “self-report” variety, and requires the subject to choose between two or more alternative responses such as “true” or “false”.

116
Q

Projective personality tests

A

are unstructured and either the stimulus (test materials) or the required response-or both-are ambiguous.

117
Q

Reliability

A

the accuracy, dependability, consistency, or repeatability of test results. In more technical terms, reliability refers to the degree to which test scores are free of measurement errors.

118
Q

Validity

A

refers to the meaning and usefulness of test results. More specifically, validity refers to the degree to which a certain inference or interpretation based on a test is appropriate.

119
Q

Interview

A

a method of gathering information through verbal interaction, such as direct questions. Not only has the interview traditionally served as a major technique of gathering psychological information in general, but also data from interviews provide an important complement to test results.

120
Q

Issues of Psychological Testing

A

Many social and theoretical issues, such as the controversial topic of racial differences in ability, accompany testing.

121
Q

_____ are when two or more tests are used in conjunction.

A

Test batteries

122
Q

Charles Darwin’s theory

A

higher forms of life evolved partially because of differences among individual forms of life within a species. The most adaptive characteristics survive at the expense of those who are less fit and that the survivors pass their characteristics on to the next generation.

123
Q

Sir Francis Galton

A

concentrated on demonstrating that individual differences exist in human sensory and motor functioning, such as reaction time, visual acuity, and physical strength.

124
Q

J. E. Herbart

A

developed mathematical models of the mind,he eventually used these models as the basis for educational theories that strongly influenced 19th century educational practices.

125
Q

E. H. Weber

A

attempted to demonstrate the existence of a psychological threshold, the minimum stimulus necessary to activate a sensory system.

126
Q

G.T. Fechner

A

devised the law that the strength of a sensation grows as the logarithm of the stimulus intensity.

127
Q

Who founded the science of psychology?

A

Wilhelm Wundt!

128
Q

G. Whipple

A

provided the basis for immense changes in the field of testing by conducting a seminar at the Carnegie Institute.

129
Q

Alfred Binet and T. Simons

A

developed the first major general intelligence test (Binet-Simon Scale, 1905)

130
Q

A ___ sample is one that comprises individuals similar to those for whom the test is to be used.

A

representative

131
Q

How did World War I influence psychological testing?

A

The war created a demand for large-scale group testing because relatively few trained personnel could evaluate the huge influx of military recruits.

132
Q

Robert Yerkes

A

developed two structured group tests of human abilities: The Army Alpha and the Army Beta : Army Alpha required reading ability whereas Army Beta measured the intelligence of illiterate adults.

133
Q

Traits

A

relatively enduring dispositions (tendencies to act, think, or feel in a certain manner in any given circumstance) that distinguish one individual from another. (eg. optimistic and pessimistic)

134
Q

The first structured personality test was the ____________.

A

Woodworth Personal Data Sheet

135
Q

The MMPI (Minnesota Multiphasic Personality Inventory)

A

It used empirical methods to determine the meaning of a test response and helped to revolutionize structured personality tests. It began a new era for structured personality tests.

136
Q

Inferences

A

logical deductions about events that cannot be observed directly

137
Q

Descriptive statistics

A

methods used to provide a concise description of a collection of quantitative information

138
Q

Inferential statistics

A

methods used to make inferences from observations of a small group of people known as a sample to a larger group of people known as a population

139
Q

Quartiles

A

points that divide the frequency distribution into equal fourths. The first quartile is the 25th percentile; the second quartile is the median, or 50th, percentile; and the third quartile is the 75th percentile.

140
Q

Median

A

a value or quantity lying at the midpoint of a frequency distribution of observed values or quantities, such that there is an equal probability of falling above or below it. It is also known as the 50th percentile.

141
Q

Interquartile Range

A

the interval of scores bounded by the 25th and 75th percentiles.

142
Q

Deciles

A

similar to quartiles except that they use points that mark 10% rather than 25% intervals.

143
Q

Stanine System

A

converts any set of scores into a transformed scale, which ranges from 1 to 9 (standardized to have mean 5, SD 2)

144
Q

Norms

A

the performances by defined groups on particular tests

145
Q

Tracking

A

the tendency to stay at about the same level relative to one’s peers (e.g., height and weight)

146
Q

Norm-referenced test

A

compares each person with a norm (more focused on comparison)

147
Q

Criterion-referenced test

A

describes the specific types of skills, tasks, or knowledge, that the test-taker can demonstrate, such as mathematical skills (more individualized)

148
Q

Reliability

A

extent to which measurement gets consistent results over repeated measurements (with same conditions each time)

149
Q

Classical Test Theory

A

Distribution of random errors assumed to be the same for all people –> Standard error of measurement (SEM)

150
Q

Item Response Theory

A

computer used to focus range of item difficulty to assess individual’s ability level

151
Q

Sources of Error/What Affects Reliability

A

Testing environment

Test itself (unrepresentative of what’s being tested)

Examinee (physical or mental health)

Scoring of the test

152
Q

Testing Reliability Methods

A

Test-Retest Method
Parallel Forms Method
Split-Half Method

153
Q

Test-Retest Method

A

Only applies to measures of stable traits

“Carryover effect” = when examinee remembers answers from first session systematic carryover is not a concern = when everyone’s scores increase

Practice effects (sharpened skills from previously taking test)

Time interval must be considered carefully

Consider events that occurred between test taking sessions

154
Q

Parallel Forms Method

A

comparison of two equivalent forms of a test that measure same thing (each test uses different items)

155
Q

Split-Half Method

A

Divides test into halves that are scored separately which are then compared
Spearman-Brown (not always advisable to use)

156
Q

Content validity evidence

A

considers the adequacy of representation of the conceptual domain the test is designed to cover.

157
Q

Construct underrepresentation

A

describes the failure to capture important components of a construct. (e.g., a test of mathematical knowledge included algebra, but not geometry)

158
Q

Construct-irrelevant variance

A

occurs when scores are influenced by factors irrelevant to the construct. (e.g., a test of intelligence might be influenced by reading comprehension, test anxiety, or illness.)

159
Q

Criterion validity evidence

A

tells how well a test corresponds with a particular criterion. Evidence is provided by high correlations between a test and a well-defined criterion measure.

160
Q

Predictive validity evidence

A

the forecasting function of tests is actually a type or form of criterion validity evidence.

161
Q

Concurrent validity evidence

A

applies when the test and the criterion can be measured at the same time.

162
Q

Validity Coefficient

A

The relationship between a test and a criterion that is expressed by a coefficient. coefficient tells the extent to which the test is valid for making statements about the criterion.

163
Q

A construct is defined as something built by ______.

A

mental synthesis

164
Q

Convergent Evidence

A

When a measure correlates well with other tests believed to measure the same construct.

165
Q

Discriminant Evidence

A

one type of evidence a person needs in test validation is proof that the test measures something unique

166
Q

To demonstrate discriminant evidence for validity, a test should have ____ correlations with measures of unrelated constructs, or evidence for what the test does not measure

A

low