Assessment and Testing Flashcards
601
Appraisal can be defined as ________.
Pg. 367
the process of assessing or estimating attributes
_______ is a broad term which includes more than merely “testing clients.” It could include a survey, observations, assessments, performance, or even clinical interviews.
Pg. 367
Appraisal
A ______ is simply an instrument which measures a given sample of behavior.
Pg. 367
test
When we use the term ________ it merely connotes that a number or score has been assigned to the person’s attribute or performance.
Pg. 367
measure
________ is the study of psychological measurement and thus a helper who primarily administers and interprets tests often has the job title of psychometrician.
Pg. 368
psychometrics
An _______ counselor will always inform clients about the limitations of any test that they administer. Some evidence indicates that neophyte counselors are sometimes tempted to administer tests merely to boost their credibility.
Pg. 368
effective
602
A test can be defined as a systematic method of measuring a sample of behavior. Test format refers to the manner in which test items are presented. The format of an essay test is considered a ________ format.
Pg. 368
subjective
A _________ paradigm relies mainly on the scorer’s opinion. If the scorer knows the test taker’s attributes, the scorer’s “personal bias” can significantly impact the rating.
Pg. 368
subjective
In an “_________” test the rater’s judgement plays little or no part in the scoring process.
Pg. 368
objective
603
The NCE is an _______ test because the scoring procedure is specific.
Pg. 368
objective
604
A short answer test is a ________ test. The test taker can respond in any manner they choose to.
Pg. 369
free choice/response
Although testing is controversial, schools now employ psychoeducational tests ______ than at any time in history.
Pg. 369
more
605
The NCE and the CPCE would be examples of a _______ test.
Pg. 369
forced choice/recognition items
606
The _______ index indicates the percentage of individuals who answered each item correctly.
Pg. 370
difficulty
A 0.5 difficulty index (or difficulty value) would suggest that ______% of those tested answered the question correctly, while ______% did not.
Pg. 370
50%; 50%
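As a rough illustration of the difficulty index described above, the short Python sketch below computes the proportion of test takers who answered one item correctly. The function name difficulty_index and the sample responses are made up for this example and do not come from the source.

```python
def difficulty_index(responses):
    """Return the proportion of test takers who answered an item correctly.

    `responses` is a list of booleans, one per test taker (True = correct).
    """
    if not responses:
        raise ValueError("need at least one response")
    return sum(responses) / len(responses)


# Hypothetical item answered by ten test takers, five of them correctly.
item = [True, False, True, True, False, False, True, False, True, False]
p = difficulty_index(item)

# A difficulty index of 0.5 means 50% answered correctly and 50% did not.
print(f"difficulty index = {p:.1f}")
print(f"{p:.0%} correct, {1 - p:.0%} incorrect")
```

A higher value therefore means an easier item, since more of the group got it right.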
Most theorists agree that a “_________” provides a wide range of items that even a poor performer will answer correctly.
Pg. 370
good measure
607
Short answer tests and projective measures utilize free response items. The NCE and the CPCE use forced choice or so-called ________ items.
recognition
608
A true/false test has ________ recognition items.
Pg. 370
dichotomous
“________” simply means that you are presented with two opposing choices.
Pg. 370
dichotomy
When a test gives the person three or more forced choices, psychometricians call it a “__________”.
Pg. 370
multipoint item
609
A test format could be normative or ipsative. In the normative format _________.
Pg. 371
each item is independent of all other items
________ measures compare traits within the same individual; they do not compare a person to other persons who took the instrument.
Pg. 371
ipsative
610
A client who takes a normative test _________.
Pg. 371
can legitimately be compared to others who have taken the test
611
In an ipsative measure the person taking the test must compare items to one another. The result is that _________.
Pg. 372
you cannot legitimately compare two or more people who have taken an ipsative test
Since the ipsative measure does not reveal __________, comparing one person’s score to another is relatively meaningless. The person is measured in response to his or her own standard of behavior.
Pg. 372
absolute strengths
612
Tests are often classified as speed tests vs. power tests. A timed typing test used to hire secretaries would be ________.
Pg. 372
a speed test
A good timed _______ test is purposely set up so that nobody finishes it.
Pg. 372
speed
A “________” is designed to evaluate the level of mastery WITHOUT a time limit.
Pg. 372
power test
A ______ test is really a type of speed test, but a high percentage of the test takers complete it and it is usually more difficult and has a time limit (e.g., NCE).
Pg. 372
timed
614
An achievement test measures maximum performance or present level of skill. Tests of this nature are also called attainment tests, while a personality test or interest inventory measures __________.
Pg. 373
typical performance
615
In a spiral test ________.
Pg. 374
the items get progressively more difficult
616
In a cyclical test _________.
Pg. 374
you have several sections which are spiral in nature
**in each section the questions would go from easy to more difficult.
617
A test battery is considered _________.
Pg. 374
a horizontal test
In a ________, several measures are used to produce results that could be more accurate than those derived from merely using a single source.
Pg. 374
test battery
A _______ test would have versions for various age brackets or levels of education (e.g., a math achievement test for preschoolers and a version for middle school children). A _______ test measures various factors (e.g., math and science) during the same testing procedure.
Pg. 374
vertical; horizontal
618
In a counseling research study, two groups of subjects took a test with the same name. However, when they talked with each other they discovered that the questions were different. The researcher assured both groups that they were given the same test. How is this possible?
Pg. 375
The researcher gave parallel forms of the same test
When a test has two versions or forms that are interchangeable they are termed _________ of the same test.
Pg. 375
parallel or equivalent forms
619
The most critical factors in test selection are ________.
Pg. 375
validity and reliability
_______ refers to whether the test measures what it says it measures, while ________ tells how consistently a test measures an attribute.
Pg. 375
validity; reliability
Experts nearly always consider ________ the number one factor in the construction of a test. A test must measure what it purports to measure. ________ is the second most important concern.
Pg. 375
validity; reliability
A scale must measure body weight accurately if it is a ______ instrument. In order to be ______, it will need to give repeated readings which are nearly identical for the same person if the person keeps stepping on and off the scale.
Pg. 376
valid; reliable
There are _____ basic types of validity.
Pg. 376
five
______ validity (or rational or logical validity).
EX: An IQ test that did not sample the entire range of intelligence, just math, would have poor _______ validity.
Pg. 376
content; content
________ validity refers to a test’s ability to measure a theoretical construct like intelligence, self-esteem, artistic talent, mechanical ability, or managerial potential.
Pg. 376
construct
________ validity deals with how well the test compares to other instruments that are intended for the same purpose.
Pg. 376
concurrent
________ validity (aka empirical validity) reflects the test’s ability to predict future behavior according to established criteria.
Pg. 376
predictive
Concurrent validity and predictive validity are often lumped under the umbrella of “__________”, since concurrent validity and predictive validity are actually different types of criterion-related validity.
Pg. 376
criterion validity
________ validity simply tries to ascertain the social implications of using tests.
Pg. 376
consequential
622
A counselor peruses a testing catalog in search of a test which will repeatedly give consistent results. The counselor _________.
Pg. 377
is interested in reliability
True or False
A test can be reliable yet not valid.
Pg. 377
True
624
Construct validity refers to the extent that a test measures an abstract trait or psychological notion. An example would be _________.
Pg. 377
ego strength
Any trait you cannot “directly” measure or observe can be considered a ________.
Pg. 377
construct
In the real world physical measurements are ______ reliable than psychological ones.
Pg. 377
more
625
Face validity refers to the extent that a test ________.
Pg. 378
looks or appears to measure the intended attribute
_________ merely tells you whether the test looks like it measures the intended trait.
Pg. 378
face validity
________ validity has been used to describe the process by which a test is refined and becomes more valid as contradictory items are dropped. It also refers to a test’s ability to improve predictions when compared to existing measures that purport to facilitate selection in business or educational settings. When a test has this validity, it provides you with additional valid information that was not attainable via other procedures.
Pg. 378
incremental
_______ validity was popularized by industrial organizational psychologists who felt the procedure had merit, especially when utilized for smaller firms that did not hire a large number of workers. In this validity, the helper or researcher looks for tests that have been shown to predict each job element or component.
Pg. 379
synthetic
627
A new IQ test which yielded results nearly identical to other standardized measures would be said to have _________.
Pg. 379
good concurrent validity
________ validity (aka concurrent or predictive validity) answers the question of how well your test stacks up against a well-established instrument that measures the same behavior, construct, or trait.
Pg. 379
criterion
The relationship or correlation of a test to an independent measure or trait is known as _______ validity. This validity is actually a method used to assess a test’s construct/criterion validity by correlating test scores with an outside source.
Pg. 379
convergent
_________ validity means the test will NOT reflect unrelated variables. If this validity is evident, a counselor who is genuinely qualified to sit for a state licensing exam should score higher than a student who flunked an introductory counseling course.
Pg. 380
discriminant
628
When a counselor tells a client that the GRE will predict her ability to handle graduate work, the counselor is referring to _________.
Pg. 380
predictive validity
629
A reliable test is ______ valid.
Pg. 380
not always
630
A valid test is ______ reliable.
Pg. 381
always
631
One method of testing reliability is to give the same test to the same group of people two times and then correlate the scores. This is called _________.
Pg. 381
test-retest reliability
________ refers to the ability of a test score to remain stable or fluctuate over time when the client takes the test again.
Pg. 381
stability
The ________ procedure is only valid for traits such as IQ which remain stable over time and are not altered by mood, memory, or practice effects.
Pg. 381
test-retest
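The test-retest procedure above (same test, same group, two administrations, then correlate) might be sketched as follows. This is only an illustration: the helper pearson_r and the two score lists are hypothetical, and the Pearson correlation is assumed here as the correlation statistic because the source does not name one.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products of deviations from each mean.
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined when all scores are equal")
    return cross / (spread_x * spread_y)


# Hypothetical scores for the same five people on two administrations.
first_testing = [98, 105, 110, 121, 130]
second_testing = [100, 103, 112, 119, 131]

r = pearson_r(first_testing, second_testing)
print(f"test-retest correlation = {r:.2f}")
```

The closer the correlation is to 1.00, the more stable the scores were across the two administrations.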
632
One method of testing reliability is to give the same population alternate forms of the identical test. Each form will have the same psychometric/statistical properties as the original instrument. This is known as __________.
Pg. 381
equivalent or alternate forms reliability
633
A counselor doing research decided to split a standardized test in half by using the even items as one test and the odd items as a second test and then correlating them. The counselor __________.
Pg. 382
was testing reliability via the split-half correlation method
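The split-half method described above can be sketched in Python: total the odd-numbered items and the even-numbered items separately for each person, then correlate the two sets of half-test totals. The function names, the 0/1 item scoring, and the sample data are assumptions made for this illustration only.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cross / (spread_x * spread_y)


def split_half_r(item_scores):
    """Correlate odd-item totals with even-item totals.

    `item_scores` holds one list per person; each inner list is that
    person's item scores in test order (1 = correct, 0 = incorrect).
    """
    odd_totals = [sum(person[0::2]) for person in item_scores]   # items 1, 3, 5, ...
    even_totals = [sum(person[1::2]) for person in item_scores]  # items 2, 4, 6, ...
    return pearson_r(odd_totals, even_totals)


# Hypothetical results for five people on an eight-item test.
people = [
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0, 1, 0],
    [1, 1, 1, 0, 0, 1, 0, 0],
    [1, 0, 0, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]

r_half = split_half_r(people)
print(f"split-half correlation = {r_half:.2f}")
```

Each half here is only four items long, so the sample is far too small for a real reliability estimate; it is meant only to show the mechanics of the split.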
634
Which method of reliability testing would be useful with an essay test but not with a test of algebra problems?
Pg. 382
inter-rater/inter-observer
635
A reliability coefficient of 1.00 indicates _________.
Pg. 383
a perfect score which has no error
**this generally occurs only in physical measurement.
636
An excellent psychological or counseling test would have a reliability coefficient of _______.
Pg. 383
.90