final Flashcards
face validity
does a test appear to measure what it was designed to measure; lay-person judgement
how do content & face validity differ?
content involves systematic and technical analysis
face is more superficial
criterion validity
the extent to which a measure agrees with a gold standard; whether it matches a measure of some attribute or outcome that is of primary interest (criterion)
types of studies: criterion validity
predictive vs concurrent
predictive studies
take the test today and test the criterion some time down the road
drawbacks to predictive validity studies
time, money, issues from time lag
concurrent studies
test and criterion done at the same time
when should you use predictive vs concurrent studies?
if goal is prediction -> predictive
if goal is to determine current status -> concurrent
criterion contamination
when the criterion measures more dimensions than those measured by the test
do scores on the predictor influence criterion scores?
techniques for interpreting validity coefficients
(1) sig level; did not occur by chance (p value)
(2) coefficient of determination (R^2)
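A minimal sketch of the coefficient of determination; the correlation value is invented for illustration:

```python
# Sketch: interpreting a validity coefficient (value is illustrative).
r = 0.30  # hypothetical correlation between test and criterion

# Coefficient of determination: proportion of criterion variance
# accounted for by the test.
r_squared = r ** 2  # 9% of criterion variance explained
```

Even a modest r of .30 accounts for only about 9% of the variance in the criterion, which is why small coefficients must be interpreted carefully.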
what if your validity coefficient is small?
if a test provides info that helps predict criterion performance better than any other existing predictor the test may be useful even if coefficient is relatively small
linear regression
a mathematical procedure that allows us to predict values on one variable if we know values on the other
standard error of estimate
a stat that reflects the average amount of error in our predictions and that allows us to make confidence statements
decision theory models
when tests are used for making decisions such as personnel selection; factors other than the correlation between test and criterion are important
decision theory models: selection ratio
proportion of applicants needed to fill the position(s) (selected ÷ applicants)
decision theory models: base rate
proportion of applicants who can be successful candidates
model sensitivity
metric that evaluates the ability to detect true positives
A/(A+C)
A = true positive; C = false negative
model specificity
metric that evaluates the ability to detect true negatives
D/(B+D)
B = false positive; D = true negative
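The formulas above can be worked through on a small 2×2 table. The cell counts are invented:

```python
# Sketch: sensitivity and specificity from a 2x2 decision table.
# Cell labels follow the cards: A = true positive, B = false positive,
# C = false negative, D = true negative. Counts are made up.
A, B, C, D = 40, 10, 5, 45

sensitivity = A / (A + C)  # real positives the test catches
specificity = D / (B + D)  # real negatives the test catches

# The same table also yields the decision-theory quantities:
selection_ratio = (A + B) / (A + B + C + D)  # proportion selected
base_rate = (A + C) / (A + B + C + D)        # proportion truly successful
```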
evaluating validity coefficients
- look for changes in the cause of relationships
- what does the criterion mean?
- review the subject pop of validation study
- be sure sample size was adequate
- never confuse criterion with predictor
- check for restricted range on both predictor and criterion
- review evidence for validity generalization
- consider differential prediction
construct validity
extent to which evidence can be provided that test measures a theoretical construct
Campbell & Fiske’s types of validity evidence
convergent and discriminant
types of convergent evidence
(1) does test measure same thing as other tests used for same purpose
(2) does test correlate with specific variables that we can expect if it is doing its job
validation study
two or more constructs measured in two or more ways
what can validation studies tell us?
convergent and discriminant validity
homogeneity and unidimensionality
evidence of validity based on response process
involves an analysis of the fit between the performance and actions test takers actually engage in and the construct being assessed
e.g., interview, behavioural indicators (RT, eye gaze)
evidence based on consequences of testing
were the intended benefits of testing achieved?
ways of getting evidence of validity
(1) test content
(2) relations to other variables (criterion)
(3) internal structure
(4) response processes
(5) consequences of testing
factor analysis
any of several stat methods describing the interrelationships of a set of variables by statistically deriving new variables, called factors, that are fewer in number than the original set of variables
types of factor analysis
exploratory and confirmatory
if alpha is lower than expected, there might be ______ and you might want to do _____
heterogeneity
factor analysis
steps in factor analysis
(1) extraction (how many groups/factors?)
(2) rotation (average correlation between items and the factor itself)
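A sketch of the extraction step. One common heuristic (the Kaiser criterion) retains factors whose eigenvalue of the correlation matrix exceeds 1; the correlation matrix below is invented (three items all intercorrelating at r = .60, i.e. one underlying factor):

```python
import numpy as np

# Sketch of extraction: how many factors? (Kaiser criterion heuristic.)
# Invented correlation matrix: three items, pairwise r = .60.
r = 0.60
corr = np.array([[1.0, r,   r  ],
                 [r,   1.0, r  ],
                 [r,   r,   1.0]])

eigenvalues = np.linalg.eigvalsh(corr)        # ascending order
n_factors = int(np.sum(eigenvalues > 1.0))    # factors with eigenvalue > 1
```

Here only one eigenvalue exceeds 1, consistent with a single homogeneous factor; if a low alpha had signaled heterogeneity, more than one would typically emerge.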
purposes of assessment in education
- how well is a student learning?
- assess whether class, grade, school, district, region is learning content
- method to detect learning problems
- method for identifying giftedness
- determine if child is ready to move to next level
- assess teacher effectiveness
- determine readiness/placement in college, grad school, professional school
- credential exams
achievement test
assess learned information; evaluate the effects of a KNOWN or controlled set of experiences
what type of validity procedures does achievement testing rely on?
heavily on content validation procedures
aptitude test
assess ability to learn something; evaluate the effects of UNKNOWN or uncontrolled experiences
what type of validity procedures does aptitude testing rely on?
heavily on predictive criterion validation procedures
goal of classroom testing
measure the extent to which students have learned the facts, concepts, procedures, and skills that have been taught
effective classroom tests
students who have learned more will obtain higher scores and students who have learned less will obtain lower scores. to be an effective test, a test must consist of effective items
types of classroom achievement tests
constructed and selected
Bloom’s taxonomy: levels of understanding
(1) knowledge
(2) comprehension
(3) application
(4) analysis
(5) synthesis (becomes "create" and swaps places with evaluation in the revised taxonomy)
(6) evaluation
item difficulty index
right v wrong questions: percentage or proportion of test takers who correctly answer the item
item difficulty index: too hard
.0 -.2
item difficulty index: too easy
.9 - 1
item difficulty indices are:
sample dependent and after the fact
on constructed-response tests scored right/wrong, what is the optimal mean p value?
.50 (about half the class gets it right)
item discrimination: right and wrong Qs
Pt - Pb
item discrimination: good discriminatory
lower % of bottom quarter of class got it correct than top quarter of class
item discrimination: bad discriminator
bottom and top quarter of class did equally well on question
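The difficulty and discrimination cards above can be sketched with invented response data:

```python
# Sketch: item difficulty (p) and item discrimination (Pt - Pb) for one
# right/wrong item. Responses are invented: 1 = correct, 0 = wrong.
responses = [1, 1, 1, 0, 1, 0, 1, 1, 0, 1]

# Difficulty index: proportion of all test takers answering correctly.
p = sum(responses) / len(responses)  # between .2 and .9: acceptable

# Discrimination index: proportion correct in the top-scoring group
# minus the bottom-scoring group (group p values assumed here).
p_top, p_bottom = 0.9, 0.4
discrimination = p_top - p_bottom  # positive -> discriminates well
```

A discrimination near zero would mean the top and bottom quarters did equally well, marking a bad discriminator.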
Examples of achievement tests
(1) Wechsler individual achievement test
(2) stanford achievement test
(3) Iowa test of basic skills
(4) metropolitan achievement test
Wechsler individual achievement test (WIAT)
z-scores, percentile ranks, stanines
norms for grades and age
all ages (above 4)
45 min - 2 hours
- longer for adults than kids
gifted? learning difficulties?
high reliability
Stanford Achievement test
group-administered test
1923
K-12
math, writing expression, understanding of patterns, reading comprehension
high reliability
evidence for construct validity
Iowa test of basic skills
general achievement tests
K-8?
better for lower end of distributions?
shorter than others
metropolitan achievement test
classified as achievement test, but has some aptitude components
examples of diagnostic tests
(1) wide range achievement test 4 (the WRAT)
(2) peabody individual achievement test
(3) woodcock reading mastery test
(4) kaufman test of educational achievement
(5) canada quick individual achievement test
(6) canada french immersion achievement test (C-FIAT)
wide range achievement test 4 (the WRAT)
diagnostic test
basic academic skills
good for 5-98
individual admin
longer time frame for older people
readiness tests
intended to assess a child’s readiness to enter school or move forward
issues with readiness tests
(1) children change rapidly
(2) predictive ability is weak
(3) cultural/language biases
range rule
standard deviation should be around: (max response-min response)/4
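A quick sketch of the range rule, using a 1-to-7 rating scale as an illustrative example:

```python
# Sketch of the range rule: a rough check that a scale's SD is plausible.
# Example scale is illustrative: a 1-to-7 rating item.
max_response, min_response = 7, 1

expected_sd = (max_response - min_response) / 4  # rough SD estimate

# An observed SD far from this estimate may signal a data-entry or
# scoring problem worth investigating.
```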
examples of aptitude tests: cognitive ability
(1) otis-lennon school ability test
(2) cogAT
(3) SAT-I
(4) ACT
(5) GRE; GMAT; LSAT; MCAT
issues with grad school tests
predict success poorly and predict differentially for different groups
advantages and disadvantages of intelligence testing
advantages: helps identify/define problem
disadvantages: cultural bias, limited info
three research traditions
(1) psychometric, (2) information processing, (3) cognitive
binet: intelligence
tendency to take and maintain a definite direction, the capacity to make adaptations for the purpose of attaining a desired end and the power of auto-criticism
binet: principles of test construction
(1) age differentiation
(2) general mental ability
binet’s age differentiation
we should be able to distinguish between people (especially children) of different ages
IQ = MA/CA * 100
max mental age was 19.5 (problem)
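The ratio-IQ formula and its ceiling problem, worked through with illustrative ages:

```python
# Sketch of Binet's ratio IQ. Ages are illustrative.
mental_age = 10        # MA, from test performance
chronological_age = 8  # CA

iq = mental_age / chronological_age * 100

# The ceiling problem: with a maximum mental age of 19.5, ratio IQs
# necessarily shrink as chronological age rises past ~19.5.
adult_iq = 19.5 / 40 * 100  # a 40-year-old cannot exceed ~49
```

This is why Wechsler replaced the ratio IQ with a deviation score that compares the attained score to the expected mean for one's age.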
routing procedure
start test based on chronological age, administrator moves to more challenging items as appropriate
Stanford-Binet 5
intelligence test
appropriate for a broad range of 2 to 85+ years, providing one assessment for all ages (recommend waiting until school age)
provides comprehensive coverage of five factors of cog ability
(1) fluid reasoning
(2) knowledge
(3) quantitative reasoning
(4) visual-spatial processing
(5) working memory
assessed verbally and non-verbally
scores: full scale IQ, verbal IQ, nonverbal IQ, routing score (start point), individual scores for each scale (verbal and non-verbal)
goodenough-harris drawing test (G-HDT)
non-verbal intelligence test
group or individually administered
standardized
reliability ranges from high .60s to low .90s
wechsler: intelligence
aggregate or global capacity of the individual to act purposefully, to think rationally, and to deal effectively with his environment
wanted to focus more on adults (unlike Binet)
factors that influence performance on intelligence tests
(1) general intelligence, (2) general, (3) specific, (4) influencing factors
performance on intelligence tests: general
comprehend, follow direction, respond verbally, understand english
performance on intelligence tests: specific
concentration, memory, reasoning
performance on intelligence tests: influencing factors (not measured directly)
interests, occupation, confidence, arithmetic skills/knowledge
differences between Binet and Wechsler
point scale concept
inclusion of performance scale
challenging age differentiation of Binet
IQ = attained or actual score/ expected mean score for age
doesn’t max out like binet
WAIS-IV
qualification level: C
completion time; 60-90 min for core subtests
ages 16-90
IQ mean = 100; SD = 15
full scale IQ, 4 indices, individual subtests (e.g., arithmetic) with intellective and non-intellective components
pattern analysis
strengths and weaknesses
normative sample = 2200 (US)
high reliability and good evidence of validity
raven progressive matrices
one of the best known and most popular
can be administered to group or individuals
from 5 years of age to elderly
used throughout the world
respectable reliability coefficients: high .70 - .90
last revisions to the manual 1998 with impressive set of norms
has been tested with various cultural groups shown to historically score lower on binet and wechsler scales
culture fair intelligence tests & an example
one purpose of nonverbal and performance tests is to remove factors related to cultural influences that often disadvantage test takers’ performance
RPM (Raven’s progressive matrices) comes close to being culture fair
IPAT culture fair intelligence test
Cattell; pencil and paper (fluid intelligence in children)
Gardner theory of multiple intelligences
intelligence is not unitary; it is the ability to solve problems or to create products that are valued within one or more cultural settings
Gardner’s types of intelligences
(1) linguistic
(2) logical-mathematical
(3) spatial intelligence
(4) bodily-kinesthetic
(5) musical intelligence
(6) interpersonal
(7) intrapersonal
(8) naturalistic
who developed idea of emotional intelligence (EQ)?
Peter Salovey (Yale); followed up by Goleman
clusters within emotional intelligence
(1) abstract
(2) concrete
(3) social
emotional intelligence has its roots in ____
social intelligence
EQ includes:
(1) being aware of one’s own emotions
(2) able to manage one’s own emotions
(3) sensitive to the emotions of others
(4) able to respond to and negotiate with other people emotionally
(5) use one’s own emotions to motivate oneself
EQ allows us to:
regulate emotions and problem solve
what did Goleman include in EQ?
conscientiousness, self-confidence, optimism, communication, leadership and initiative
Examples of reasons for neuropsychological testing
dementia, alzheimers, concussion, brain injury, ALS, parkinson’s, stroke, epilepsy, brain tumour, infection
quick-and-dirty neuropsych assessment tool
glasgow coma scale (GCS)
neuropsychological testing
application of a set of standardized procedures designed to assess and quantify brain function as expressed in overt behaviour
leads to additional inferences regarding the covert processes of the brain
difference between neuropsych testing and general intelligence measures
neuropsych tests tend to be more highly specific in what they measure
components of neuropsych testing
- all (or at least a sig majority) of a patient’s relevant cog skills or higher order info processing skills should be assessed
- testing should sample the relative efficiency of the right and left hemispheres of the brain
- testing should sample anterior and posterior regions of cortical function (posterior mostly receptive)
- testing should determine the presence of specific deficits
- should determine the acuteness versus the chronicity of any problems or weaknesses
- testing should locate intact complex functional systems
- testing should assess affect, personality, and behaviour
- test results should be presented in ways that are useful in a school or work environment, to acute care or intensive rehabilitation facilities or to physicians
two conceptual approaches: neuropsych testing
(1) fixed battery approach
(2) non-fixed
example of a fixed battery approach to neuropsych testing
halstead-reitan neuropsych test battery
focuses on key behavioural correlates of brain function
non-fixed battery approach to neuropsych testing
use of a flexible combo of traditional psych and educational tests
e.g., boston process approach
can include qualitative stuff
conceptual model of brain-behaviour relationships
- sensory input
- attention and concentration
- learning and memory
- language
- spatial and manipulatory ability
- executive functions (logic, concepts, reasoning, planning, flexibility)
- motor output
example of motor function tests
finger tapping
grip strength
grooved pegboard
four factors of mental processing
(1) focus/execute, (2) sustain, (3) encode, (4) shift
advantages of interviews
- get unique information
- participants can elaborate
- personal and meaningful experience
- rapport and relationship building
- rich info; detail
disadvantages of interviews
- harder to analyze
- possible discomfort of participant
- not honest, not best performance
- time and resources
- introduction to bias
- individualized/subjective
- limited generalizability
types of interviews
(1) structured (highly)
(2) guided/semi-structured
(3) non-directive or unguided
initial intake interview
- demographic data
- reason for referral
- past medical history
- present med condition
- familial medical history
- past psych history
- past history with medical or psych professionals
- current psych conditions
potential biases in interviews
(1) confirmation bias
(2) self-fulfilling prophecy
(3) ethnocentrism
ineffective interviewing
judgmental and evaluative statements, probing questions, false reassurance
effective interviewing
attitude is warm and authentic, open-ended questions, measuring understanding
interviews: measuring understanding
levels 1-5 ?
sources of error in interviews
interview validity; interview reliability (length of session)
personality
an individual’s unique constellation of psych traits that is relatively stable over time
personality traits
distinguishable, relatively enduring ways in which one individual varies from another
personality types
a constellation of traits
continuum thinking is in contrast to this
personality assessment methods
(1) objective measures, (2) projective measures, (3) behavioural assessment
MMPI
purpose: to aid in diagnosis of psychopathology for individuals 14 years and older
developed for abnormal personality
566 true/false items
originally criterion keyed items
criterion keyed
way of developing items by how well they discriminate between different groups (e.g., psych pops vs non-psych pop)
validity (Messick)
“an integrated eval judgment of the degree to which empirical evidence and theoretical rationales support the adequacy and appropriateness of inferences and actions based on test scores or other modes of assessment”
the appropriateness or accuracy of the interpretation of test scores
threats to validity
(1) construct (internal) underrepresentation, (2) construct-irrelevant variance (external), (3) examinee characteristics, (4) test admin and scoring, (5) instruction and coaching
construct underrepresentation
not all aspects of construct are represented
relationship between reliability and validity
reliability is necessary but not sufficient for validity
reliability restricts validity coefficients
√reliability = max validity coefficient
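The ceiling relationship can be worked through with an illustrative reliability value:

```python
import math

# Sketch: reliability caps the validity coefficient. Per the card,
# max validity = sqrt(test reliability). Value is illustrative.
reliability = 0.81

max_validity = math.sqrt(reliability)

# A test with reliability .81 can correlate at most .90 with any
# criterion: reliability is necessary but not sufficient for validity.
```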