Quiz's Flashcards
____ refers to the consistency of stability of test scores.
Reliability
_______ is usually considered the largest source of error in test scores.
Content Sampling
Factors that may affect reliability include
External and internal distractions
person grading the test
the time the test was administered
As a general rule ___ tests produce more reliable scores than _____ tests.
Longer tests, shorter tests.
____refers to the degree to which a test appears to measure what it is designed to measure
Face Validity
_____ refers to the appropriatness of accuracy of the interpretations of test scores.
Validity
In Classical Test Theory, the X represents_____ and the T represents ___
observed score, stable test taker characteristics (True Score).
When discriminant measures are used in validation studies, we expect ____.
Correlation of test sores with tests of a dissimilar construct.
As the reliability of a test score ________ the standard error of measurement ______.
Decreases, Increases.
A statistic called the _______ is used to describe the amount of prediction error resulting from the imperfect correlation between a test score and a criterion.
Standard error estimate.
General guidelines for writing test items include:
The avoidance of inadvertent cues to the answers.
When a test item has a discrimination index ______, it is considered to be acceptable by the chapter authors.
Greater than .30
When developing Maximum performance tests, it is best to arrange items.
From easiest to hardest.
A fill-in-the- blank question is a _____ item
Constructed- response
______ are reported as the most popular selected- response items.
Multiple Choice.
How many distracters is it recommended that one provide for multiple choice items?
3 to 5.
On a maximum performance test administered to 100 students, 60 students correctly answered item #4. The item difficulty index equals:
0.60
What is optimal Item Difficulty Index on a test consisting of only constructed response items?
0.50.
In order to determine the number of items to include on a test, one should consider:
Purpose of test
type of test
age of examinee
type of items.
Distracters are:
The incorrect responses on multiple choice items.
A ______ definition explains our construct at a theoretical level and may use many interpretive words.
Conceptual
As noted in the SNow and Hemel article, which of the following is an example that was listed as a caregiver report instrument of the child’s tempermant
Toddler Behavior Asessment Questionnaire (Carey Scales)
______ are designed by including extreme responses to traditional items that are seldom endorsed as present even in individuals with significant levels of psychopathology.
F-Scales
The use of general rules, principles, or abstract concepts to solve a problem not previously encountered involves objectives at the _____ Level.
Application.
What is always an undesirable attribute for a test?
Unnecessary length.
In which phase is it important to develop conceptual and operational definitions of the constructs you want to measure?
Test Conceptualization
According to Blooms Taxonomy, what is the simples level?
Knowledge.
______ is an attempt to mislead an examiner through inaccurate or incomplete responses or effort.
Response Bias
Proponents argue that children with hearing loss who recieve intensive early intervention have the following areas:
A better school performance
improved receptive language
less developmental delay.
What is the first task of a test developer
Identify the need in the field.