Depa Reviewer (Mock Final Exam) Flashcards
Which of the following demonstrates systematic error in testing?
a) Random guessing by participants
b) A poorly calibrated scale consistently
overestimating weight
c) A participant misreading one test item
d) A scoring error on one question
b) A poorly calibrated scale consistently
overestimating weight
How often were examinations given in ancient China to evaluate work and promotion decisions?
a) Annually
b) Every three years
c) Every five years
d) Every ten years
b) Every three years
A diagnostic test has high sensitivity but low specificity. This means it is:
a) Good at identifying true positives but also generates false negatives.
b) Good at identifying true positives but also generates false positives.
c) Poor at identifying true positives but highly specific.
d) Accurate at diagnosing all cases without error.
b) Good at identifying true positives but also generates false positives.
A test consistently yields similar results over time but fails to measure what it intends to assess. What does this indicate?
a) High validity, low reliability
b) Low validity, high reliability
c) High reliability, high validity
d) Low reliability, low validity
b) Low validity, high reliability
Which of the following best demonstrates the concept of inter-rater reliability?
a) Two administrators scoring a test and achieving identical results
b) Administering the same test twice to the same group
c) Comparing scores from the first and second halves of a test
d) Measuring agreement between different
constructs in a test
a) Two administrators scoring a test and achieving identical results
A psychologist uses a test with a Likert scale ranging from 1 (strongly disagree) to 5 (strongly agree). This is an example of:
a) Nominal scaling
b) Ordinal scaling
c) Interval scaling
d) Ratio scaling
c) Interval scaling
A researcher finds that adding one more item to a scale improves its alpha coefficient significantly. This indicates the item has enhanced:
a) External validity
b) Internal consistency
c) Criterion validity
d) Test-retest reliability
b) Internal consistency
Which measure of variability shows the average amount each score differs from the mean?
a) Range
b) Variance
c) Standard Deviation
d) Interquartile Range
c) Standard Deviation
- When testing children, testing should begin:
a) Not longer than 5 to 10 minutes after the child arrives
b) When the test manual says it should begin
c) When he/she seems relaxed enough to give maximum effort
d) Almost immediately to prevent the child from developing fear of the tester
c) When he/she seems relaxed enough to give maximum effort
A standardized test uses a mean of 100 and a standard deviation of 15. What kind of scoring system is this?
a) Percentile rank
b) Z-scores
c) T-scores
d) Standard scores
d) Standard scores
Placement, screening, certification, and
selection are all examples of:
a) Diagnosis
b) Program evaluation
c) Classification
d) Research-based testing
c) Classification
A test developer correlates scores from the
odd-numbered and even-numbered items of a test. This checks:
a) Split-half reliability
b) Test-retest reliability
c) Internal consistency
d) Parallel forms reliability
a) Split-half reliability
Which civilization’s writings show early attempts
to categorize personality types?
a) Chinese
b) Greco-Roman
c) Indian
d) Mesopotamian
b) Greco-Roman
What is the focus of content validity?
a) Correlation with other measures
b) Predicting future outcomes
c) Representing the entire domain of a construct
d) Comparing to specific traits
c) Representing the entire domain of a construct
A psychometrician is best understood as:
a) An expert administrator of personality tests
b) A psychologist who has been trained from the scientist-practitioner model
c) A developer and evaluator of psychological tests
d) Any authorized user of assessment instruments
c) A developer and evaluator of psychological tests
A self-esteem test aligns with theories of self-worth and correlates with self-efficacy. This demonstrates:
a) Criterion-related validity
b) Predictive validity
c) Content validity
d) Construct validity
d) Construct validity
What does a negative skew in test scores
indicate?
a) Scores cluster around the mean
b) Most scores are below the mean
c) Most scores are above the mean
d) Scores are evenly distributed
c) Most scores are above the mean
Suppose a young girl answers correctly on 37 questions from a 50-item test but answers erroneously on 9 questions, leaving 2 questions blank. Suppose there are four alternatives per question. Using established principles of probability, what would be her corrected score?
a) 32
b) 34
c) 36
d) 37
b) 34
What is the primary difference between
descriptive and inferential statistics?
a) Descriptive statistics involve hypothesis testing, while inferential do not.
b) Inferential statistics summarize data, while descriptive statistics make predictions.
c) Descriptive statistics describe data, while
inferential statistics make predictions or inferences about a population.
d) Inferential statistics only use qualitative data, while descriptive use quantitative.
c) Descriptive statistics describe data, while
inferential statistics make predictions or inferences about a population.
What type of data is exemplified by the number of books a student owns?
a) Qualitative Data
b) Continuous Numbers
c) Discrete Numbers
d) Interval Data
c) Discrete Numbers
A researcher is examining the reliability of a test by splitting it into two halves and correlating the scores. What method is being used?
a) Test-retest reliability
b) Split-half reliability
c) Inter-rater reliability
d) Internal consistency reliability
b) Split-half reliability
An assessment tool that measures depression and anxiety with high correlation likely lacks:
a) Predictive validity
b) Convergent validity
c) Discriminant validity
d) Test-retest reliability
c) Discriminant validity
In ancient Egypt, counseling and psychotherapy was MOST probably carried out by:
a) Physicians
b) Pharaohs
c) Priests
d) Barbers
c) Priests
Which of the following is an essential step in the standardization of a test?
a) Use of identical stimuli with all examinees
b) Precise specification of oral instructions for subtests
c) Advice to the examiner as to how to handle queries from the examinee
d) All of the above
d) All of the above
A client refuses to take a psychological test due to privacy concerns. The psychologist should:
a) Administer the test anyway
b) Reassure the client of confidentiality and respect their decision
c) Terminate the session and refer the client to others
d) Offer an alternative test with explanation
b) Reassure the client of confidentiality and respect their decision
If a test consistently produces higher scores for one demographic group over another, this may indicate:
a) Poor standardization
b) Test fairness
c) Test bias
d) Construct validity
c) Test bias
In a(n) ___ test, the performance of each
examinee is interpreted in reference to a relevant standardization sample.
a) Individually-referenced
b) Group-referenced
c) Norm-referenced
d) Criterion-referenced
c) Norm-referenced
A test taker deliberately answers inaccurately to appear socially desirable. This is an example of:
a) Test bias
b) Response bias
c) Measurement error
d) Test invalidity
b) Response bias
Which scenario demonstrates concurrent
validity?
a) Comparing typing test scores with current typing speed
b) Using GRE scores for postgraduate school performance
c) Measuring anxiety and correlating it with stress levels
d) Designing a test that includes all relevant content areas
a) Comparing typing test scores with current typing speed
Uniformity of administration procedures is the definition of:
a) Criterion-referencing
b) Standardization
c) Norm-referencing
d) Reliability
b) Standardization
The purpose of norms is to
a) Establish an average performance
b) Indicate the prevalence of high and low scores
c) Determine deviations from expectation
d) All of the above
c) Determine deviations from expectation
In a psychological experiment, random
assignment helps ensure:
a) Construct validity
b) Test reliability
c) External validity
d) Internal validity
d) Internal validity
A culturally biased test likely fails to:
a) Measure what it intends to measure
b) Distinguish between individuals
c) Generalize across populations
d) Maintain test reliability
c) Generalize across populations
A test that measures a candidate’s ability to solve unfamiliar problems on the spot is likely assessing:
a) Emotional intelligence
b) Crystallized intelligence
c) Fluid intelligence
d) Social intelligence
c) Fluid intelligence
These tests are often used to predict success in an occupation, training course, or educational endeavor.
a) Intelligence
b) Personality
c) Aptitude
d) Achievement
c) Aptitude
To ensure fairness, a psychologist tests for
cultural bias in a test. This primarily enhances:
a) Predictive validity
b) Content validity
c) Test fairness
d) Criterion validity
c) Test fairness
What inspired the English East India Company in 1832 to adopt a system of testing for selecting employees?
a) Reports by American psychologists
b) Observations of the Chinese civil service testing system
c) Recommendations from French diplomats
d) Research on European education
b) Observations of the Chinese civil service testing system
What does a Z-score of 0 indicate?
a) The score is an outlier.
b) The score is equal to the mean.
c) The score is below the mean.
d) The score is above the mean.
b) The score is equal to the mean.
A researcher wants to compare the effectiveness of three different teaching methods on student performance. Which statistical test should they use if the dependent variable is measured on a ratio
scale?
a) Independent samples t-test
b) One-way ANOVA
c) Chi-square test
d) Mann-Whitney U test
b) One-way ANOVA
A correlation coefficient of -0.90 indicates:
a) A weak positive relationship
b) A weak negative relationship
c) A strong positive relationship
d) A strong negative relationship
d) A strong negative relationship
A psychologist wants to assess whether a new test for ADHD differentiates between diagnosed and non-diagnosed individuals. What type of validity is being evaluated?
a) Predictive validity
b) Concurrent validity
c) Discriminant validity
d) Content validity
c) Discriminant validity
By far the most common use of psychological tests is to:
a) Make decisions about persons
b) Diagnose mental and emotional disorders
c) Determine personality functioning
d) Evaluate learning disabilities
a) Make decisions about persons
A test developer collects data to create norms for a new test. What type of data collection is required?
a) Reliability testing
b) Standardization sample
c) Experimental group
d) Clinical trial
b) Standardization sample
A psychologist develops a test and validates it with several studies. What kind of validity is being established over time?
a) Concurrent validity
b) Face validity
c) Incremental validity
d) Ecological validity
c) Incremental validity
Presenting data could involve:
a) Drawing a bar graph to show monthly sales
b) Measuring the weight of individuals
c) Testing the effectiveness of a treatment
d) Categorizing survey responses
a) Drawing a bar graph to show monthly sales