Chapter 1: The Nature of Tests Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is a test?

A

Anastasi (1988)
• “Objective”
• “Standardized”
Measure of a sample of behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Three Elements of a Test

A
  1. Objectivity; decisions are not subjective to the examiner
  2. Standardization; procedures are uniform across all examiners
  3. Sample of behavior; from which we can draw a hypothesis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Domino & Domino propose three ways to consider tests

A
  1. As an experiment
    we are forced to focus on the standardization procedures, the elimination of confliction causes, experimental control, and generation of a hypothesis that can be tested
  2. As an interview
    allows you to interview many people without discrimination
  3. As tools
    best used in the hands of a trained professional and can be used for more than just psychology
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Testing versus Assessment

A

Assessment
• Documents (usually in measurable terms) and uses empirical data on the knowledge, skills, attitudes, and beliefs of a person
• Goal is to make improvements
• It is a procedure instead of a product

Test
• Examines a person’s knowledge of something to determine what that person knows or has learned
• Measures the level of knowledge or skill reached
• It is a product that measures a particular behavior or objective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Purposes of tests

A
  1. Classification
  2. Self-understanding
  3. Program evaluation
  4. Scientific inquiry
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Tests in Decision Making

A

This may be the criteria by which our test is judged.
• High stakes versus low stakes decisions
• How predictive is our test?
• Many times we want to use our test to make an inference to something else

o Most agree that major decisions such as college acceptances, should not be decided based on one test such as the SAT but the test data is the only sources on a student’s application that is objective. Interviews, grades, and letters of recommendation are all subjective.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Ethical Standards (in your textbook): APA ethics code
Six general principles:

A
  1. Competence
  2. Integrity
  3. Professional and scientific responsibility
  4. Respect for people’s rights and dignity
  5. Concern for others’ welfare
  6. Social responsibility
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Ethical Principles of Psychologists and Code of Conduct (Newer Version)
Five general principles:

A

Principle A: Beneficence and Nonmaleficence
Principle B: Fidelity and Responsibility
Principle C: Integrity
Principle D: Justice
Principle E: Respect for People’s Rights and Dignity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Standards for educational and psychological tests: Three most important areas

A

• Informed consent
• Confidentiality
• Privacy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Variables that can influence a testing situation

A

If a test is to be considered an experiment than we must ensure that experimental procedures are followed. Such as controlling extraneous variables and enforcing time limits.
Greater control = less extraneous variables
A multiple-choice exam is less effected by other variables than an essay exam

• Method of administration
can be altered by changing or disregarding instructions, by giving answers in a certain way, or by not following procedures
• Situational variables
subject feeling frustrated, discouraged, fatigued, hungry, etc.
• Experimenter variables
The experimenter will have certain characteristics that may affect the test such as age, gender, race, more or less sympathetic, warm or cold, more or less authoritarian, better or worse at establishing relationships, etc. This won’t affect the subject’s performance but could affect the experimenters’ interpretations. This depends on many factors and in general there does not seem to be a large influence.
• Subject variables
how warm or cold a participant acts toward the experimenter can affect the results. Masling believed that situational and interpersonal influences effect testing. Sattler and Theye (1967) concluded that: 1. Departures from procedures are more likely to affect “specialized” groups than “normal” groups, 2. Children seem to be more susceptible to situational factors, 3. Rapport is a crucial variable and experimenter experience is not important, and 4. Racial differences could have an effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Categories of Tests

A
  1. Is the test commercially published?
  2. Distinguished by their administration
  3. Medium – materials used
  4. Item structure
  5. Area of assessment
  6. Test function
  7. Score interpretation
  8. Self-report versus observer
  9. Maximal versus typical performance
    10.Age range
  10. Type of setting
    12.Scales of measurement
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
  1. Is the test commercially published?
A

Aka, a propriety test

§ Some tests are able for purchase through commercial companies, such as, the Stanford-Binet and Minnesota Multiphasic Personality Inventory

§ Once you have determined if a test is in fact commercially published, then you can consult with the Mental Measurements Yearbook (MMY) through a university. You may have to look through several versions to find the right test. Some tests are available through a computer service called the Bibliographic Retrieval Services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q
  1. Distinguished by their administration
A

§ Group vs. individual tests
· Group tests are test such as the SATs that are administered to many people at once, they are preferred by researchers needing to test many people in a small amount of time
· Individual tests are tests such as the Stanford-Binet test of intelligence, they are preferred by clinicians who deal with one person at a time and want observational data in addition to the test scores

§ Speed tests vs Power tests
· Speed tests have a time limit that effects performance. The time limits are usually set up so that only 50% of the applicants can attempt every item.
· Power tests are designed to measure how well you can do and therefore do not have a time limit or give ample time as to not effect performance. Time limits are set so that 90% of applicants can attempt all items.

§ Secure test
· Most tests that are used in industry are secure tests meaning that the tests usefulness would be compromised if it was made public.

§ Invasive test
· Asking about someone’s sexuality is more invasive than a test on math skills
· A test completed by the subject is more invasive than an observer report

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
  1. Medium – materials used
A

§ Paper-and-pencil

§ Performance

§ Physiological (polygraph)

§ Computer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
  1. Item structure
A

§ Where on the question is on the continuum objective to subjective

§ Objective is easy to score but reveals little about the individual while subjective is difficult to quantify but can be revealing and highly informative to the individual

§ Is the question verbal (vocabulary and math) or require performance?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q
  1. Area of assessment
A

§ Intelligence test, personality questionnaires, tests of achievement, career-interest tests, reading tests, neuropsychological, etc.
§ The MMY has 16 categories
§ Five major categories in this book:
· (1.) Personality tests
· (2.) Tests of cognitive abilities
· (3.) Tests of attitudes, values, and interest
· (4.) Tests of psychopathology, used by clinicians and the study of mental illness
· (5.) Tests that assess normal and positive functioning, such as creativity, competence, and self-esteem

17
Q
  1. Test function
A

§ To diagnose; disorders, depression
§ To make predictions; who will succeed in college, who will commit suicide
§ Selection procedures; who is accepted to college graduate school
§ Placement purposes; what Spanish class level to go into
§ Screening purposes
§ Certification; driving test

18
Q
  1. Score interpretation
A

§ Norm-reference: referring to the norms of a particular score meaning; comparing scores that on individual obtains with the scores of a group who took the same test
§ Criterion-reference: comparing the score to a decision rule called a criterion; such as driving tests because you either pass or failed based on predetermined criterion
· Tests can have one or both interpretations
§ Normative or Ipsative: does it reflect the behavior of others or just the individual
· Normative: Variety of answers, measure of the group
o Example: ranking six ice cream flavors individually 1-6 (could add up anywhere between 6 and 36)
· Ipsative: scores on a scale must sum to a constant, measure of the individual
o Example: ranking six ice cream flavors from first to last (1+2+3+4+5+6 will always equal 21 no matter what order you put them in)
§ Psychometrically or Impressionistically
· Psychometrically: the responses are scored, and the scores are interpreted based on available norms and/or research data
· Impressionistically: the tester looks at the responses carefully based on his/her expertise and creates a psychological portrait of the client
o Can be combined

19
Q
  1. Maximal versus typical performance
A

· Maximal Performance: how well a person can do, aptitude tests and achievement tests that have correct answers

· Typical Performance: how well they usually do, personality tests, attitude scales, opinion questionnaires that have no correct answer

20
Q
  1. Scales of measurement
A

§ The NOIR System
· Nominal Scales: used labels instead of numeric properties
o Examples: number on a jersey (used to identify has no value), religion, eye color
· Ordinal Scales: ranking (we do not know how far number 1 is from number 2)
o Example: ranking your favorite states 1 to 10
· Interval Scales: distance between scores is based on equal units with no zero point
o Example: temperature, IQ tests
· Ratio Scales: equal rations between intervals and has a true zero
o Example: Kelvin scale of temperature, reaction time, number of items recalled