GIGA PRACTICE Flashcards
2 main categories of tests
Ability tests vs Personality tests
Ability test def
Measures skills in terms of speed, accuracy, or both.
=> The faster or the more accurate your responses, the better your scores on a particular characteristic.
What are the 3 types of ability tests?
Achievement, Aptitude and Intelligence tests
Achievement test def
Measures previous learning.
- E.g. A test that measures or evaluates how many words you can spell correctly is called a spelling achievement test.
Aptitude test def
Measures potential for acquiring a specific skill.
- A spelling aptitude test measures how many words you might be able to spell given a certain amount of training, education, and experience.
Intelligence test def
Measures potential to solve problems, adapt to changing circumstances, and profit from experience.
Types of personality tests
Structured (objective) and Projective tests
Structured personality tests def
Provide self-report statements that require the subject to choose between two or more alternative responses, such as “True” or “False”; “Yes” or “No”.
Reliability def
Degree to which test scores are FREE OF MEASUREMENT ERRORS.
-> There are many ways a test can be reliable (e.g., test results may be reliable over time).
A psychological test must be (3)
(1) Objective: reflect reality - not what we want reality to be
(2) Reliable: provide us with the same reading anytime, use instrument under the same conditions
(3) Valid: measure what we want to measure
How do Psychological Tests differ from Other Measurement Tools? (2)
(1) Focus on intangible, theoretical CONSTRUCTS (e.g. psychological attributes) unlike tools measuring physical properties (e.g. rulers, scales).
(2) For most of them, you need to have some SPECIALIZED KNOWLEDGE for proper interpretation unlike physical measurements (e.g. ruler).
Construct def
Unobservable, theoretical abstract concept. Measured indirectly through behaviours, responses or test results
E.g. intelligence, anxiety, self-esteem
Defining Characteristics of Psychological Tests (5)
(1) Representative SAMPLE of behaviors
(2) OBSERVABLE and MEASURABLE actions
(3) Thought to measure a PSYCHOLOGICAL ATTRIBUTE
(4) Behavioral samples obtained under STANDARDIZED conditions
(5) Have RULES for SCORING.
A construct is hypothesized to explain _________________________________
the covariation between observed behaviors
Kinds of Purposes for Testing (4)
(1) Classification
(2) Promoting Self-Understanding and Self-Improvement
(3) Planning, Evaluation and Modification of Treatments and Programs
(4) Scientific Inquiry (Quantification, Hypothesis testing)
Types of scales (4)
Nominal, Ordinal, Interval, Ratio
Types of Norms (3)
(1) DEVELOPMENTAL Norms
(2) WITHIN-GROUP Norms
(3) CRITERION-REFERENCED Norms (= norms without a norm sample)
Developmental Norms def
Typical level of performance at each of the AGE groups or grade levels that make up the test’s target population.
-> Age-equivalent or grade-equivalent scores are assigned based on the MEDIAN RAW SCORE for that chronological age or grade level.
-> Median = TYPICAL score = norm
Within-Group Norms (3)
(1) Percentiles
(2) Z-scores
(3) Transformed standard scores
Standard Deviation def
A measure of the average distance of scores from the mean.
Transformed Standard Score formula
Bz + A
B = desired SD
A = desired Mean
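A minimal Python sketch of the Bz + A transformation (all numbers invented for illustration; B = 10 and A = 50 yield a T-score):
```python
# Sketch: raw score -> z-score -> transformed standard score (Bz + A).
# The norm-sample values below are invented for illustration.
import statistics

norm_sample = [12, 15, 18, 20, 25]   # hypothetical norm-sample raw scores
mean = statistics.mean(norm_sample)   # M = 18
sd = statistics.stdev(norm_sample)    # SD of the norm sample

x = 20                                # one person's raw score
z = (x - mean) / sd                   # standard (z) score

t_score = 10 * z + 50                 # Bz + A with B = 10, A = 50 (T-score)
print(round(z, 2), round(t_score, 1))
```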
Percentiles disadvantages (2)
(1) Magnifies differences near mean; minimizes differences at extremes
(2) Some common statistical analyses are NOT possible with percentiles
Standard score disadvantages (2)
(1) Unfamiliar to many non-specialists
(2) Interpretation difficult when distribution not normal
Criterion-Referenced Norms def
Evaluate performance relative to an absolute criterion or standard rather than performance of other individuals.
-> An absolute vs relative evaluation
Within-Group Norms: Criticisms (2)
(1) Only meaningful if the standardization (norm) sample is representative
(2) Within-group comparisons encourage competition
Requirement for Criterion-Referenced Norms
Define content of domain narrowly and specifically.
E.g. Driving skills, 8th grade math curriculum
Criterion-Referenced Norms: Issues (3)
(1) Can elements of performance be specifically defined?
-> Hard to clearly define what “good” or “bad” performance looks like.
-> Criterion-referenced norms require a clear standard (e.g., scoring 80% on a test to pass), but creating these standards can be challenging because it’s hard to decide what knowledge or skills are essential.
(2) Focus on minimum standards
-> e.g., “Did you pass?”
-> Ignore how much better one person is compared to others.
(3) Absence of relative knowledge
-> You don’t know how someone performs compared to others.
Developmental norms cons
Often interpreted inappropriately
-> Overgeneralization, misinterpreting median…
What is an elevated score?
A score at least 2 z-scores (i.e., 2 SDs) above the mean.
Properties of scales (3)
(1) Magnitude
(2) Equal Intervals
(3) Absolute 0
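For reference, the standard mapping of these properties onto the four scale types:
-> Nominal: none of the three; Ordinal: magnitude only; Interval: magnitude + equal intervals; Ratio: all three (magnitude, equal intervals, absolute 0).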
McCall’s T/T-score
Same as standard (z) scores, except that M = 50 and SD = 10.
Interquartile range
Interval of scores bounded by the 25th and 75th percentiles.
-> bounded by the range of scores that represents the middle 50% of the distribution.
Stanine system
Converts any set of scores into a transformed scale, which ranges from 1 to 9.
M = 5, SD = 2
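A minimal Python sketch using the common shortcut stanine ≈ 2z + 5, rounded and clipped to 1-9 (an approximation; exact stanines use fixed percentile bands):
```python
# Sketch: approximate stanine from a z-score (stanine ~ 2z + 5,
# rounded, then clipped to the 1-9 range).
def stanine(z: float) -> int:
    return max(1, min(9, round(2 * z + 5)))

print(stanine(0.0))   # 5 -> the mean falls in the middle stanine
print(stanine(1.2))   # 7
print(stanine(-3.0))  # 1 -> extreme scores are clipped at the floor
```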
Overselection
Selecting a higher percentage from a particular group than would be expected on the basis of the representation of that group in the applicant pool.
Tracking
Developmental norms. Tendency to stay at about the same level relative to one’s peers.
Big Data
Revolution in social science research.
= Data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process the data within a tolerable elapsed time.
Pearson Correlation Coefficient def
QUANTITATIVE description of the DIRECTION and STRENGTH of a straight-line relationship between 2 variables.
Correlation Coefficient Range
-1 to 1
We cannot use Pearson’s r for _____
Non-linear relationships
-> Non-linear relationships cannot be described, regardless of their strength.
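A quick Python illustration of this limitation (toy data): a perfect curvilinear relationship that Pearson’s r reports as roughly zero:
```python
# Sketch: Pearson's r misses a strong but non-linear relationship.
import numpy as np

x = np.linspace(-1, 1, 101)
y = x ** 2                    # y depends perfectly on x, but not linearly

r = np.corrcoef(x, y)[0, 1]   # measures straight-line association only
print(round(r, 4))            # ~0.0 -> r reports "no relationship"
```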
Classical Test Theory (CTT): Assumptions (4)
(1) Each person has a true score that would be obtained if there were no errors in measurement. Observed test score (X) = True test score (T) + Error (E)
(2) Measurement errors are random
(3) Measurement error is normally distributed
(4) Variance of OBSERVED scores = Variance of true scores + Error variance
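A short simulation sketch of assumptions (1)-(4) (all numbers invented; the last line previews the reliability coefficient as var(T)/var(X)):
```python
# Sketch: X = T + E, with random, mean-zero, normally distributed error.
import numpy as np

rng = np.random.default_rng(0)
true = rng.normal(100, 15, 100_000)   # T: true scores (SD = 15)
error = rng.normal(0, 5, 100_000)     # E: random error (mean 0, SD = 5)
observed = true + error               # X = T + E

print(observed.var())                 # ~250 = 225 + 25
print(true.var() + error.var())       # variance decomposition, assumption (4)
print(true.var() / observed.var())    # ~.90 = reliability coefficient
```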
A person’s true score def
The hypothetical or ideal measure of a person’s attribute we aim to capture with a psychological test.
=> FREE FROM ERROR
-> Expected score over an INFINITE number of independent administrations of the test
Mean error of measurement = ____
Errors are ____ with each other
True scores and errors are _______
0; UNcorrelated; UNcorrelated
Two tests are parallel if: (3)
(1) EQUAL observed score MEANS
-> Comes from the assumption that True scores would be the same
(2) EQUAL ERROR VARIANCE
(3) SAME CORRELATIONS with other tests
Random error characteristics (3)
(1) Random
(2) Cancels itself out
(3) Lowers reliability of the test
Systematic error characteristic
Occurs when source of error always increases or decreases a true score
-> DOESN’T LOWER RELIABILITY of a test since the test is RELIABLY INACCURATE by the same amount each time
Sources of Measurement Error (3)
(1) CONTENT Sampling Error
(2) TIME Sampling Error
(3) Other Sources of Error (e.g. observer differences)
Reliability Coefficient def
Proportion of VARIANCE in OBSERVED test scores accounted for by variability in TRUE scores.
Standard Error of Measurement (SEM) def
Amount of uncertainty/error expected in an individual’s observed test score.
**=> Corresponds to the SD of the distribution of scores one would obtain by repeatedly testing a person.**
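For reference, the standard CTT formula, with s = SD of observed scores and r = the reliability coefficient:
SEM = s × √(1 - r)
E.g., s = 15 and r = .91 -> SEM = 15 × √.09 = 4.5.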
Spearman-Brown formula def
Predicts the effect of lengthening or shortening a test on reliability.
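The formula, with r = current reliability and k = (new length) / (old length):
r_new = (k × r) / (1 + (k - 1) × r)
E.g., doubling (k = 2) a test with r = .70 gives (2 × .70) / (1 + .70) ≈ .82.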
Test reliability is usually estimated with what methods? (4)
(1) Test-retest
(2) Alternate (Parallel) Forms
(3) Internal consistency
(4) Interrater (agreement between raters)
Test-Retest method is an example of ____ sampling
time
-> Higher when the construct being measured is expected to be STABLE than when it is expected to CHANGE
Alternate (Parallel) Forms method is an example of ____ sampling
item
How High Should INTERNAL CONSISTENCY Coefficients Be? (*do not confuse with other coefficients)
Higher for “narrow” constructs
Lower for “broader” constructs
-> Very high may indicate insufficient sampling in the domain
E.g. Medium internal consistency is bad for a narrow construct (panic disorder), but not so bad for a broad construct (Neuroticism)
What’s the older approach used to estimate the internal consistency of a test?
Split-half method
What’s the contemporary approach used to estimate the internal consistency of a test?
CRONBACH’S ALPHA = AVERAGE OF ALL POSSIBLE SPLIT-HALF RELIABILITIES
Unaffected by how items are arranged in the test
-> Most general method of finding estimates of reliability through internal consistency.
(Kuder-Richardson also a possibility)
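A minimal Python sketch of the alpha computation from a persons × items score matrix (toy data invented for illustration):
```python
# Sketch: Cronbach's alpha = (k / (k - 1)) * (1 - sum(item vars) / var(total)).
import numpy as np

scores = np.array([   # rows = persons, columns = items (toy data)
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 3, 4],
])

k = scores.shape[1]                          # number of items
item_vars = scores.var(axis=0, ddof=1)       # variance of each item
total_var = scores.sum(axis=1).var(ddof=1)   # variance of the total scores

alpha = (k / (k - 1)) * (1 - item_vars.sum() / total_var)
print(round(alpha, 3))                       # high -> items covary strongly
```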
Kappa formula
Interrater Agreement
Proportion of the potential agreement following CORRECTION FOR CHANCE.
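The formula, with P_o = observed proportion of agreement and P_e = proportion of agreement expected by chance:
Kappa = (P_o - P_e) / (1 - P_e)
E.g., P_o = .80 and P_e = .50 -> Kappa = .30 / .50 = .60.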
Domain Sampling Model conceptualizes reliability as the ratio of the variance of the observed score on the _____ test to the variance of the _______.
shorter, long-run true score
Test-Retest Method: Problems
CARRYOVER EFFECTS: Occurs when the first testing session influences scores from the second session.
When there are carryover effects, the test-retest correlation usually ________ the true reliability.
OVERESTIMATES
-> This can happen because the participant REMEMBERS items or patterns from the first test, so their performance on the second test is less independent than it should be.
What method provides one of the most rigorous assessments of reliability commonly in use?
Parallel Forms Method
Problems with Split-Half method (2)
(1) The two halves may have different variances.
(2) The split-half method also requires that each half be scored separately, possibly creating additional work.
KR20 Formula
Equivalent of alpha for tests with dichotomous items (e.g. right/wrong)
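For reference, with k = number of items, p_i = proportion passing item i, q_i = 1 - p_i, and s² = variance of total test scores:
KR20 = [k / (k - 1)] × (1 - Σ p_i q_i / s²)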
Sources of measurement error: (3)
(1) Time sampling: The same test given at different points in time may produce different scores, even if given to the same test takers.
(2) Item sampling: The same construct or attribute may be assessed using a wide pool of items.
(3) When different observers record the same behavior: Different judges observing the same event may record different numbers.
How do we assess measurement error associated with item sampling?
Parallel forms, Internal consistency
What to Do about Low Reliability? (3)
(1) Increase the # of Items
(2) Throw out items that run down the reliability (by running a factor/discriminability analysis)
(3) Estimate what the true correlation would have been (CORRECTION FOR ATTENUATION)
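The correction-for-attenuation formula, with r_12 = observed correlation between the two measures and r_11, r_22 = their reliabilities:
r̂_12 = r_12 / √(r_11 × r_22)
E.g., r_12 = .30 with r_11 = r_22 = .60 -> r̂_12 = .30 / .60 = .50.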
Kappa stat range
-1 to 1 (in practice, usually 0 to 1).
Kappa = 0 is considered poor -> means the agreement is no better than chance (negative values mean worse than chance).
Kappa = 1 represents perfect, complete agreement.
When random error is HIGH on both tests, the correlation between the scores will be _____ compared to when the random error is ___.
lower; small
Difference Score def
Subtracting one test score from another
-> e.g., scores on two different attributes, or the same attribute measured at two points in time
Why are difference score unreliable?
Difference scores are unreliable because the random errors of both scores compound, while the shared true-score variance is subtracted out.
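One standard CTT formula for the reliability of a difference score (assuming the two tests have equal variances), with r_11, r_22 = the two reliabilities and r_12 = the correlation between the tests:
r_dd = [0.5 × (r_11 + r_22) - r_12] / (1 - r_12)
E.g., r_11 = r_22 = .80 and r_12 = .70 -> r_dd = (.80 - .70) / .30 ≈ .33, far below either test’s reliability.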
What do we mean when we say that “Validity is NOT a yes/no decision”
- It comes in degrees and applies to a particular USE and a particular POPULATION
- It is a process: An ongoing, dynamic effort to accumulate evidence for a sound scientific basis for proposed test score interpretations
3 Types of Validity
Content, Criterion, Construct
Subtypes of Criterion validity
Concurrent, Predictive
Subtypes of Construct validity
Convergent, Divergent
A test with high face validity may: (3)
(1) Induce cooperation and positive motivation before and during test administration
(2) Reduce dissatisfaction and feelings of injustice among low scorers
(3) Convince policymakers, employers, and administrators to implement the test
-> but sometimes a test with low face validity elicits more honest responses
Types of criteria (2)
Objective & Subjective criteria
Objective criterion
Observable and Measurable
E.g., Number of accidents, days of absence
Subjective criterion
Based on a person’s judgement
E.g., Supervisor ratings, peer ratings
What happens if the criterion measures FEWER dimensions than those measured by the test?
This decreases the evidence of validity because the criterion has UNDERREPRESENTED some important characteristics (= criterion DEFICIENCY).
Criterion contamination def
If the criterion measures MORE dimensions than those measured by the test