Midterm Flashcards
An individual test:
A) Involves a single examiner for two or more subjects.
B) Involves only tests of human ability.
C) Can be given to only one person at a time.
D) Involves more than one examiner for a single subject.
C
A group test:
A) Can be given to multiple people by one examiner.
B) Can be given to only three people at a time.
C) Involves a group of examiners for a single subject.
D) Involves only tests of human ability.
A
Previous learning can best be describe as:
A) achievement
B) aptitude
C) intelligence
D) ability
A
Are projective personality tests structured or unstructured?
Unstructured
The general potential to solve problems, adapt, and profit from experience is called…
Intelligence
A test that yields dependable and consistent results is…
Reliable
Which of the following is a group test of human ability for an adult that does NOT require the subject to be literate?
A) Army Alpha
B) Sequin Form Board
C) Army Beta
D) Wechsler-Bellevue Intelligence Scale
C
___ is always associated with a sampling process.
Error
Statistical procedures that summarize and describe a series of observations are called…
Descriptive statistics
Statistical procedures that allow one to make inferences about large groups by examining a smaller sample are called…
Inferential statistics
Scales of measurement differ from one another in terms of….
A) magnitude, absolute measurement, and equal intervals
B) magnitude, relative zero, and equal intervals
C) numbers, relative zero, and equal intervals
D) magnitude, absolute zero, and equal intervals
D
Which scale has the properties of magnitude, absolute zero, and equal intervals?
Ratio
A property of a scale that implies the complete absence of the measured attribute is called a(n)…
Absolute zero
Many people feel that the difference between an IQ of 100 and 105 is not the same as the difference between an IQ of 70 and 75. These people feel that IQ tests lack:
A) absolute zeroes
B) magnitudes
C) ratios
D) equal intervals
D
If a scale allows one to say whether a particular instance has more, less, or the same amount of an attribute as another instance, the scale is said to have…
Magnitude
Suppose you are in the 87th percentile on a test. This means:
A) you are among the top 13 students in the class.
B) 87% of the students got a score lower than yours.
C) you got 87% of the test items correct.
D) 87% of the students got a score higher than yours.
B
What do scatter diagrams do?
Help us visualize relationship between two variables
If the line that comes closest to all the points in a scatter diagram is perfectly straight, the correlation between two variables is…
Linear
The observation that Y decreases as X increases suggests:
A) positive correlation
B) no correlation
C) negative correlation
D) curvilinear correlation
C
Which of the following correlations represents the strongest relationship between two variables?
A) -.85
B) .01
C) .50
D) .80
A
When talking about errors in terms of psychological testing, we are referring to the fact that:
A) someone got an answer incorrect.
B) there is always some inaccuracy in the measurement.
C) the test was inappropriate for that particular group.
D) the score is too subjective to be accurate.
B
When creating a test, one generally uses a subset of items to represent a larger construct. This is known as:
A) a population parameter
B) a domain sampling
C) a sampling error
D) descriptive statistics
B
Classical test theory assumes that:
A) there are no errors in measurement.
B) each person has a true score.
C) observed score almost always reflect true ability.
D) errors of measurement are systematic.
B
Which type of validity is most important for educational tests?
Content
What is the relationship between reliability and validity?
A test can be reliable and lack validity, but a test cannot lack reliability and be valid.
The tendency for test takes to agree on most of the items is called a(n)…
Acquiescence response set
Distractors that are obviously incorrect…
A) lower the reliability of the test.
B) increase the reliability of the test.
C) have no impact on the reliability of the test.
D) reduce the likelihood of correct guessing.
A
True-false examinations use…
A dichotomous format
In multiple choice examinations, incorrect alternatives are called…
Distractors
What format do some personality tests use because it requires an absolute judgment?
Dichotomous
A test format that is typically used for attitude measurement is the…
Likert format
What testing method is popular for measuring self-rated pain?
Visual analogue scale
Which of the following is a disadvantage of true-false tests?
A) They are typically only useful with simple information.
B) They encourage memorization without understanding.
C) They are difficult to administer.
D) They encourage rapid responding.
B
Test takers who speak more than two languages should be given a test in…
The language with which the test taker is most comfortable.
Patterson and colleagues reported that in order to reduce administration errors, test administrators should…
Practice administration about 10 times.
Members of groups that are plagued by negative stereotypes tend to perform better academically when intelligence is described to them as…
Malleable and adaptable to environmental circumstances.
What has been proven to reduce the risk of stereotype threat?
Placing demographic questions at the END of the test, not the beginning.
Which of the following is a common theory regarding how stereotype threat weakens test performance?
A) Test takers rarely pay attention to the test administrator’s feelings and behaviors.
B) Test takers feel afraid to do well on the test.
C) Test takers take longer to read the test instructions to ensure they understand and rarely finish on time.
D) Test takers dedicate attention to themselves rather than the test.
D
The phenomenon in which a test taker exerts less effort to protect their self-worth when faced with the expectation that they may not perform well is referred to as…
Self-handicapping
Because situational variables can affect test scores, testing requires…
Standardized conditions
Social facilitation refers to…
The phenomenon that we tend to act like the models around us.
Good interview behavior is more a matter of ____ than ____.
A) social skills; experience
B) technique; attitude
C) attitude; skill
D) acquired abilities; personality style
C
One study found that the most important factor in a patient’s evaluations of the quality of an interview was…
Their perceptions of the interviewer’s feelings.
What is a test?
Measurement device used to quantify behavior and/or aid in the understanding and prediction of behavior
What are the three types of ability tests?
Achievement
Aptitude
Intelligence
What is reliability?
Accuracy, dependability, consistency of test results
What is validity?
Meaning and usefulness of test results
What are the four types of validity?
Construct
Content
Face
Criterion
What was Galton’s theory? And what 4 traits did he find?
He argued that some people have traits that make them more fit than others.
Sensory & motor function
Reaction time
Visual acuity
Physical strength
What’s the difference between Army Alpha and Army Beta?
Army Alpha –> recruits who could read
Army Beta –> recruits who could not read
What are test scores used for?
Understand behavior
Monitor change in behavior
Make comparisons between individuals
Predict future behavior
What is construct validity?
Measures construct of intended measure
What is content validity?
Upmost importance in educational testing (ex: quiz)!!!!
What is face validity?
Extent to which a measure APPEARS to have validity.
What is criterion validity?
Test correlates with similar strong tests
What is measurement?
Rules and procedures to transform quality of attribute into numbers
Do psychological tests have equal intervals?
No!
What is a nominal scale?
Scale that NAMES objects
(ex: 1 = male; 2 = female)
Is nominal scale an actual scale?
No! It is not a scale because it doesn’t measure anything.
What is an ordinal scale?
RANKS objects but difference between ranks has no meaning.
(ex: Comparing heights of students in class)
What is an interval scale?
Has magnitude and equal intervals but no absolute zero.
(ex: temperature)
What is a ratio scale?
Has magnitude, equal intervals, AND absolute zero
(ex: MPH)
How to measure correlation?
The bigger the number, the stronger its correlation is, regardless if the number is negative or positive.
(ex: -.85 has a bigger correlation than .70)
How do you determine a positive or negative skew?
The skew is the direction the tail goes. For example, positive skew starts at the top and goes down. Negative skew starts at the bottom and goes up.
Formula for percentile rank
Pr = B/N x 100
B = # of scales BELOW given score
N = TOTAL # of scores
What is a correlation coefficient?
Ranges from -1.00 to +1.00. The closer to -1/+1, the stronger the correlation is. 0 means it is the weakest.
What is regression?
Changes in one variable relate to changes in another variable
What is a regression line?
Line drawn that touches MOST of the data points
Will there always be a discrepancy between true ability and measurement ability?
YES!
What are the two assumptions of classical test theory?
- It assumes that each person does, in fact, have a true score.
- Error in measurement is random.
Six guidelines for writing test items (know at least 3 of these)
- Define clearly what you wish to measure
- Generate pool of items
- Avoid items that are exceptionally long
- Be aware of reading level of those taking the scale & the reading level of the items
- Avoid items that convey two or more ideas at the same time
- Consider using questions that mix positive and negative wording
What is dichotomous format?
Two choices for each question
(ex: T/F or Yes/No questions)
What is polytomous format?
More than two options for each question
(ex: multiple-choice test)
What is Likert format?
Degree of agreement; typically has 5 choices
What is category format?
“On a scale of one to ten”; typically has 10 choices
What are checklists format?
Provide a list of terms and the individual selects the most characteristics that fit themselves
What are Q-sorts?
Provide a list of adjectives that must be sorted into nine piles of increasing similarity to target person
What are the two basic methods/ways to analyze test items?
Difficulty (what % of people got an item correct) and discriminability (whether people who did well on particular item also did well on test)
Three factors that contribute to error in test administration
- Test administrator
- Test taker
- Test itself
What is the rule-of-thumb for training of test administrators?
Administrators should practice giving a minimum of 10 tests, so they are properly trained to give tests.
What are the two main objectives of intake interviews?
- Establishing rapport
- Gathering info
What are some good attitudes related to interview skills? (be able to list at least three)
Moods match
GEW (genuine, empathetic, warm)
Interviewer maintains control
Openness
What are some responses to avoid in an interview?
Why? questions
Hostile statements
False reassurance
Closed-ended questions (Yes/no questions)
What are some responses to keep the interacting flowing?
Verbatim playback (complete repeat of what the client said)
Paraphrasing
Transitional phrases
Summarizing & clarifying
What are three factors that affect interview validity?
Halo effect
General standoutishness
Misunderstanding cultural differences
What’s the problem with defining intelligence?
Can’t avoid strong positive correlation between socioeconomic status and IQ scores
Binet believed intelligence measured 3 abilities:
Judgment
Attention
Reasoning
Binet’s two major principles of test construction
- Age Differentiation
- General Mental Ability
1905 Binet-Simon Scale
-Had thirty questions of increasing difficulty
-Intellectual deficits were defined with unpleasant terms (idiot, imbecile, and moron)
1908 Binet-Simon Scale
-Introduced concept of mental age
-Used age scale format (items grouped according to age level)
1911 Binet-Simon Scale
3rd edition
1916 Stanford-Binet Scale
-1st edition of SB scale
-Concept of IQ was added
-Major problem: all participants were white
-Age range increased
1937 Stanford-Binet Scale
-Extended age range down to 2-year-old level
-Increased maximum possible mental age to 22 years & 10 months
-Performance items were added
-Problem: reliability coefficients were higher for older kids & lower for younger kids
1960 Stanford-Binet Scale
-Tried combing best forms of 1937 scale
-Tasks were selected based on best qualities from 1937 scale
-Used deviation IQ (mean of 100 & SD of 16)
1986 Stanford-Binet Scale
-“g” is made up of crystallized intelligence, fluid intelligence, and short-term memory
-Age scale format was eliminated
2003 Stanford-Binet Scale
-Most recent edition
-Added routing measures (verbal & nonverbal sections, preventing unnecessary items, and preventing having to start all the way over)
Psychometric properties of 2003 Stanford-Binet scale
-SD changes to 15 (from 16)
-Age range from 2-85+ years
-More diverse standardization
-Quite good reliability
What is the Five Factor Model of Intelligence? (2003)
- Fluid Reasoning
- Knowledge
- Quantitative Reasoning
- Visual/Spatial Reasoning
- Working Memory
Which test is the most reliable psychological test in the world?
IQ tests
What is the formula for Intelligence Quotient (IQ)?
IQ = MA/CA x 100
MA = mental age
CA = chronological age
Binet’s definition of intelligence
Ability to find & maintain direction (problem-solving)
Crystallized intelligence
Ability to pull out info already learned
Fluid intelligence
Ability to learn new information