Study Cards Flashcards
What is the APGAR test? What does it measure?
Evaluates health of baby based on appearance, pulse, grimace, activity, respiration
Who is Alfred Binet? Why is he important?
French psychologist. Introduced the idea of intelligence testing
What is an operational definition?
The exact way a construct is measured, and what qualifies something as being in/out of a given category
What is an operational measure?
The exact way in which something is tested, and how it should always be tested (think procedure)
What is a normative group?
Aka reference group. The sample of the population used to attain a base/average score
What is a normal distribution? What is it used for?
A distribution that forms a bell curve when plotted, with the mean, median, and mode all equal.
Used as the assumption of the layout of datasets in a group
What are deviations?
The difference between the observed values and the mean
What was the first version of the Binet-Simon intelligence test? What did results show?
A group of children were asked to perform a series of tasks to assess the knowledge they had acquired
What were Binet’s original concerns with his intelligence test?
That it would be misused, and that children who were behind would be labeled “idiots” and unteachable.
What are some of Binet’s contributions to the natural and social sciences?
- The development of scales of measurement
- The formal operationalization of constructs
- The development of non-verbal intelligence tests
- The proposal that intelligence is both acquired and innate
- The operationalization of terms and concepts
- The development of mental age
- The idea and use of normative groups
- Established the dominance of psychology in the field of testing
Who is Francis Galton? What did he contribute to psychology?
He was a psychologist with a fascination for data collection and variability. He pioneered large-scale data collection
What is the law of error? Is it 100% true?
In any group or set of measurements, the errors (deviations) tend to cancel each other out, forming a normal distribution. It is not always true, but it is used as a working assumption
What are distributions of error (deviations) and how do you calculate them?
A deviation shows how far scores fall from the mean, commonly expressed on a scale from -3 to +3 standard deviations.
Observed score - mean = deviation
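The deviation calculation can be sketched in Python (the scores are hypothetical):

```python
# Deviation = observed score - mean (hypothetical scores for illustration).
scores = [4, 7, 10, 7, 12]
mean = sum(scores) / len(scores)          # 8.0
deviations = [x - mean for x in scores]   # one deviation per observed score
# Deviations always sum to zero around the mean.
print(deviations)       # [-4.0, -1.0, 2.0, -1.0, 4.0]
print(sum(deviations))  # 0.0
```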
What are the first 3 principles of psychometrics?
- Defining and operationalizing is central to understanding if a claim is justifiable - always ask how a construct is measured and defined
- Variability exists everywhere - this is the essence of the law of error
- There is always a normative group - ask who is the sample and who created the sample
How does Anne Anastasi define a psychological test? Define the different aspects
An objective and standardized measure of a sample of behaviour
Objective: free of bias, clearly defined, little to no interpretation
Standardized: everyone gets the same test and is measured the same way
Sample of behaviour: This should be how they would act regularly, but the sample may not be representative
How does Lee Cronbach define psychological tests? How does it compare to the aspects of Anastasi’s definition?
Psychological tests are a systematic procedure for comparing the behaviour of two people.
Systematic vs standardized and objective: Cronbach recognized that tests cannot be 100% objective
What is psychometrics according to Thurstone? (2 parts)
A construction of instruments and procedures for measurement
The development and refinement of theoretical approaches to measurement
What is a construct? And how do they relate to the definition of psychometrics?
A construct is any idea or concept we’d like to measure
A. Constructing tests to measure these constructs
B. The methods and approaches must be refined when measuring these constructs
What are the 4th and 5th principles of psychometrics?
- Most (if not all) test questions, in any format, are imperfect indicators of the construct being measured
- Assigning numbers to data imposes a relationship among indicators that may not be justifiable
What does it mean to measure something? What are the 4 main scales of measurement?
The assigning of numbers to individual scores in a systematic way, according to one or another rule or convention
1. Ratio
2. Interval
3. Nominal
4. Ordinal
Explain the 4 main scales of measurement
Ratio: Equal intervals with a true zero
Interval: Equal intervals with NO true zero
Nominal: a categorical form of organizing data
Ordinal: Determined rank or order, numbers have no value, intervals may be unequal
What is the 5th principle of psychometrics?
The leap of faith principle. By assigning numbers to data, you impose a relationship among indicators that might not be justifiable
What does a distribution measure in psychometrics?
The performance of the entire test
What are the 3 factors that ALWAYS affect variability?
Systematic effect, systematic bias, random effect
What is systematic effect?
It is the primary cause of the score. How much of the construct you have
What is systematic bias? Give an example
An effect that affects a subgroup. EX: a delayed train affects commuters
What is random effect? Give an example
Random factors that affect the score of an individual, but have no relationship to the construct. EX: poor sleep
What is the difference between a formal and an operational definition?
A formal definition defines the construct for what it is while an operational definition defines how it is measured
How does Plato’s allegory of the cave help us understand constructs?
It captures the challenges we face when measuring constructs that cannot be directly seen. The shadows, like symptoms, are all that is observed, and interpretations must be made
What is Novick’s classical test theory?
A person’s true score is different from their observed score (due to error)
How do you calculate true score (Novick)?
T = X +/- E (the observed score corrected for error)
How does Galton’s law of error play in classical test theory?
Error is just as likely to be positive as negative
What is item response theory?
An attempt to directly estimate an individual’s ‘true score’ by examining how individuals respond to questions - as a function of their ability
What does an item response graph show?
Shows the minimum required ability to get an answer correct
What is scientific model testing?
The evaluation of different approaches to find which one best explains the data in that case
How does Ockham’s razor fit with scientific model testing?
When two theories explain the data equally well, the simpler explanation is usually preferred
What is the definition of criterion validity?
Criterion validity is the correlating of scores with some external criterion that is relevant to the purpose of the test
What is scale validation?
The methods used to test validity
What are the features of scale validation according to Rulon?
- A test cannot be labeled as valid or invalid without respect to a given purpose
- Assessments of validity must include an assessment of the content of the instrument and its relation to the purpose
- Different forms of validity evidence are required for different types of instruments
- Some measures are obviously valid (face validity) and require no further study
What are the 4 main domains of validity?
- Content validity
- Structural validity
- External validity
- Item validity
What are the 3 types of content validity?
1. Domain representativeness
2. Domain relevance
3. Face validity
What is content validity?
The degree to which a test’s content represents the construct
The degree to which a test measures all aspects of a criterion
What is domain representativeness?
The extent to which the questions/tasks/etc. measure the entire domain
What is domain relevance?
The extent to which the questions are relevant to assessing the construct
What is inclusionary criteria?
The signs and symptoms that MUST be present to have the construct
What is exclusionary criteria?
The signs and symptoms that CANNOT be present to have the construct
What type of validity includes inclusionary and exclusionary criteria? What is the interaction?
Domain relevance, these criteria are considered more important or more relevant
What is face validity?
Whether the test APPEARS to measure a given construct
What is structural validity?
The components that a test measures
What are the 2 components of structural validity?
- Dimensionality
- Order
What is dimensionality?
The number of factors the questions can be attributed to (pieces of the cake)
What is order?
The number of tiers that are needed to explain how the different factors are interrelated (layers of the cake)
What are the 4 factors of external validity?
- Criterion validity
- Convergent and divergent validity
- Predictive validity
- Incremental validity
What is external validity?
The degree to which test scores are related to other constructs
What is criterion validity?
The extent to which test scores on questionnaire are related to some other outcome or condition
What is convergent validity?
The degree to which a measure is correlated with other measures of the same or related constructs
What is divergent validity?
The degree to which a measure does not correlate with measures of unrelated constructs
Explain the relationship chart of convergent and divergent validity?
Should converge: r>0.70 - good convergent validity, r<0.30 - poor convergent validity
Should diverge: r>0.70 - poor divergent validity, r<0.30 - good divergent validity
Anything in between is mild, and depends on the theory.
What does a multi-trait multi-method matrix show?
It shows the correlates of different traits and how well they converge to measure the same construct
How do you read the multi-trait multi-method matrix table?
The traits are listed down the side and along the top, grouped by test (method), and shows the correlation coefficient in the cross section of each individual trait
What are the factors of predictive validity? Define them
Concurrent (predicts a criterion measured at the same time) and prospective (predicts a criterion observed in the future) validity
What is incremental validity?
The degree to which a new (additional) measure adds to the prediction of a criterion - beyond what can be predicted by some other measure
What are closed format tests?
Tests that have preset answers that cannot be changed or elaborated
What does it mean to have a dichotomous response?
The answer can only be yes or no
What is a likert scale response style?
A range of replies (typically from strongly agree to strongly disagree) in which a person rates how much they agree with a statement
What does it mean if a test response is rank-ordered?
The subject must rank each statement (example: most important - least)
What are open format tests?
The questions do not have predetermined responses, allowing for elaboration
What are open ended questions?
Questions that allow the participants to come up with their own responses
What is a visual-analogue response style?
When the respondents rate their level of a construct on a continuous scale
What are anchors? - give an example
They are statements that help specify what each number refers to in the real world
EX: 1 = rarely or never
What is standard deviation?
The variability within a group - differences in individual scores
What is standard error?
Variability across distributions - differences between groups
What is estimated true score?
How ability and probability of correctness correlate
What is the mean - and the equation for it?
Mean: the average
Mean = the sum of the population scores / the number of scores
μ = Σx / N
What is the equation for standard deviation?
Stand. Dev = the square root of [the sum of (score - mean)² / the number of scores]
σ = √( Σ(x-μ)² / N )
What is variance - and the equation for it?
The differences in scores
Variance = the sum of (score - mean)² / the total number of scores
σ² = Σ(x-μ)² / N (the standard deviation squared)
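The mean, variance, and standard deviation formulas above can be checked with a short Python sketch (hypothetical scores):

```python
import math

# Population formulas from the cards: mu = sum(x)/N,
# variance = sum((x - mu)^2)/N, sigma = sqrt(variance).
scores = [2, 4, 4, 4, 5, 5, 7, 9]   # hypothetical scores
N = len(scores)
mu = sum(scores) / N                               # 5.0
variance = sum((x - mu) ** 2 for x in scores) / N  # 4.0
sigma = math.sqrt(variance)                        # 2.0
print(mu, variance, sigma)  # 5.0 4.0 2.0
```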
What is the line of best fit?
A line through a scatter plot that minimizes discrepancy between observed and predicted scores
Measures the degree of mis-fit between scores
What is a predicted score? How do you calculate it?
An estimated score for future tests
Predicted score = y-intercept + slope × X
Ŷ = b0 + b1X (equivalently Y = aX + b)
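A minimal Python sketch of fitting a line of best fit and producing a predicted score (the data are hypothetical; b0 and b1 follow the standard least-squares formulas):

```python
# Least-squares line of best fit: Y_hat = b0 + b1 * X (hypothetical data).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
n = len(xs)
x_bar = sum(xs) / n
y_bar = sum(ys) / n
# Slope: covariance of X and Y divided by variance of X.
b1 = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) \
     / sum((x - x_bar) ** 2 for x in xs)
b0 = y_bar - b1 * x_bar   # intercept passes through the means

def predict(x):
    return b0 + b1 * x

print(round(b1, 2), round(b0, 2))  # 0.6 2.2
print(round(predict(6), 1))        # predicted score for X = 6 -> 5.8
```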
What is effect size? How do you calculate it?
The magnitude of differences between groups
Effect size = (mean of group 1 - mean of group 2) / standard deviation
d = (x̄1 - x̄2) / s
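A short Python sketch of the effect size formula, assuming the pooled standard deviation is used for s (a common choice; the groups are hypothetical):

```python
import math

# Cohen's d as on the card: d = (mean1 - mean2) / s,
# with s taken as the pooled SD of two equal-sized hypothetical groups.
g1 = [10, 12, 14, 16, 18]
g2 = [8, 10, 12, 14, 16]
m1 = sum(g1) / len(g1)  # 14.0
m2 = sum(g2) / len(g2)  # 12.0

def sample_var(xs):
    # Sample variance (divide by n - 1).
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

s_pooled = math.sqrt((sample_var(g1) + sample_var(g2)) / 2)  # equal n
d = (m1 - m2) / s_pooled
print(round(d, 2))  # 0.63
```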
What is sensitivity? How do you calculate it?
Of the people who actually have the condition, how many were designated to have it
Sensitivity = A / (A+C)
What is specificity? How do you calculate it?
Of the people who don’t actually have the condition, how many were designated not to have it
Specificity = D / (B+D)
What is positive predictive value? How do you calculate it?
Of the positive results, how many actually have the condition
PPV = A / (A+B)
What is negative predictive value? How do you calculate it?
Of the negative results, how many really don’t have the condition
NPV = D / (C+D)
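All four values come from a single 2x2 table, as a Python sketch (the counts are hypothetical; A = true positives, B = false positives, C = false negatives, D = true negatives):

```python
# 2x2 table: A = true positives, B = false positives,
#            C = false negatives, D = true negatives (hypothetical counts).
A, B, C, D = 80, 10, 20, 90

sensitivity = A / (A + C)  # of those WITH the condition, fraction flagged
specificity = D / (B + D)  # of those WITHOUT it, fraction cleared
ppv = A / (A + B)          # of positive results, fraction truly positive
npv = D / (C + D)          # of negative results, fraction truly negative
print(sensitivity, specificity)      # 0.8 0.9
print(round(ppv, 2), round(npv, 2))  # 0.89 0.82
```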
What is base rate?
The guaranteed rate of prevalence in a population
What is a self report test?
A test completed by someone who reports their own experiences
What kind of test is the BDI? Key features
The Beck Depression Inventory is a self report test that measures depression
A unidimensional test; cutoff scores indicate a discrete condition; any combination of items can designate the presence of depression
What is an informant based test?
A test completed on behalf of someone else
What are projective tests?
Tests that measure SUBCONSCIOUS impulses, emotions, difficulties, etc
What are objective tests? Why were they created?
Tests that use standardized measures that allow little to no interpretation
Created to account for the limits of projective tests
What is the RORS?
A projective test in which the patient interprets inkblots (the Rorschach)
What is an aptitude test?
A test designed to measure individual aptitudes, attitudes, preferences, etc
What is the MBTI? Key features?
The Myers-Briggs Type Indicator is a self report measure of psychological preferences in how people see the world and make decisions
Measures innate aptitudes that are either mental or physical
What are structured tests?
Tests in which the questions and structure are predetermined, no changes or follow up can be made
What are semi structured tests?
Tests in which the procedure and questions are predetermined, but the doctor is able to add or remove questions at their discretion
What is the SCID? Key features?
The structured clinical interview for DSM is a semi structured test that helps clinicians assess the presence or absence of psychiatric symptoms to render formal diagnoses
It is semi structured, allowing for follow up and the adding/removing of questions
What is information variance?
The way in which questions are asked and how tests are presented changes the amount of information that comes out of a test
What is criterion variance?
How a doctor interprets the information to reach conclusions, which can produce differences between scores
What are personality tests?
Tests designed to assess personality characteristics
What is the NEO PI-R? Key features?
A test that measures the degree of OCEAN
- openness, conscientiousness, extraversion, agreeableness, neuroticism
Uses a likert scale for questions, multidimensional- assesses each personality characteristic based on multiple smaller factors
What is OCEAN in the NEO PI-R?
Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism
What is the MMPI? Key features?
Minnesota Multiphasic Personality Inventory. Designed to address concerns with existing self-report measures; assesses psychopathology and personality in a clinical setting, prioritizing criterion validity over face validity
What is the act frequency approach?
A measure of how behaviour and personality traits correlate
What is the BAI? Key features?
The Behavioural Acts Inventory. Designed to measure actions and behaviours to identify the correlates with personality
What are normative tests?
Tests designed to measure quantitative personality characteristics, comparing them to patterns of normality
What are the WAIS and WISC?
Intelligence tests for adults (WAIS) and children (WISC) which evaluate intelligence and cognitive ability
What are achievement tests?
Tests that measure developed skills or knowledge
What is the GRE? Key features?
Graduate Record Examination that measures the acquired knowledge of students
Evaluates verbal reasoning, quantitative reasoning, analytical writing, critical thinking, and knowledge
What makes a test reliable?
When it produces the same score continuously over time
What does reliability measure?
How close our observed score approaches the true score
What is the expected score?
An estimate of true score
How do you calculate true score?
T = E(X) - the true score is the expected (long-run average) value of the observed score X
What is the ‘fast move’ of classical test theory?
If error is uncorrelated with test scores, then error from two different tests is also uncorrelated, meaning errors from one test will be uncorrelated with the True Score of another test
What are the 5 types of reliability?
Test-retest, Inter-rater, Parallel forms, Split half, Internal consistency
What is test-retest reliability?
The ability for a test to produce consistent scores from one time to another
What is inter-observer reliability?
The degree to which different observers give consistent estimates of the same construct
What is parallel forms reliability?
The consistency of two separate but similar tests
What is split half reliability?
The consistency between two halves of the same test
What is internal consistency (reliability)?
The consistency of the results across items of a test
How do you estimate reliability?
By comparing two different groups of items
What are the ways you can estimate reliability? Explain
Within a single test - one part vs another part
Across multiple test - test 1 vs test 2
What is used to measure internal consistency?
Cronbach’s alpha (a) and Cohen’s Kappa (k)
How do you calculate Cohen’s kappa (k)?
(Observed agreement - chance agreement)/(1-chance agreement)
How do you calculate chance agreement?
[probability of ‘yes’ from Dr. A × probability of ‘yes’ from Dr. B] + [probability of ‘no’ from Dr. A × probability of ‘no’ from Dr. B]
How do you calculate observed agreement?
(‘Yes’ from both + ‘No’ from both) / N
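The kappa cards above can be sketched in Python (hypothetical counts; chance agreement is the sum of the two raters' matched 'yes' and 'no' probabilities multiplied together):

```python
# Cohen's kappa from a 2x2 agreement table between two raters
# (hypothetical counts): a = both yes, b = A-yes/B-no,
#                        c = A-no/B-yes, d = both no.
a, b, c, d = 20, 5, 10, 15
n = a + b + c + d

observed = (a + d) / n  # observed agreement: both yes + both no, over N
# Chance agreement: P(yes, A) * P(yes, B) + P(no, A) * P(no, B)
p_yes_A = (a + b) / n
p_yes_B = (a + c) / n
p_no_A = (c + d) / n
p_no_B = (b + d) / n
chance = p_yes_A * p_yes_B + p_no_A * p_no_B

kappa = (observed - chance) / (1 - chance)
print(round(kappa, 2))  # 0.4
```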
What is item analysis?
The analysis of how each individual item on a test performs
What assumptions are made when calculating true score?
That T = the average score on a test if taken repeatedly, that error is random and independent
What score would be ‘excellent’ for reliability?
a > 0.9
What score would be ‘good’ for reliability?
0.9 > a > 0.8
What score would be ‘acceptable’ for reliability?
0.8 > a > 0.7
What score would be ‘questionable’ for reliability?
0.7 > a > 0.6
What score would be ‘poor’ for reliability?
0.6 > a > 0.5
What score would be ‘unacceptable’ for reliability?
0.5 > a
What is item analysis?
The analysis of how each individual item performs and the correlation of individual items with the total score
What is item analysis used for?
To determine which items are the best measurement of a construct
What is item total correlation?
The correlation between scores on an individual item and the total test score
How do we calculate item total correlation?
Each individual’s score is averaged (across a ‘group’) for an item total. The average item totals are summed and averaged for a total score. This average item agreement is plotted against the total score to find r (total, item)
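A minimal Python sketch of an item-total correlation, using a plain Pearson r between one item's scores and the total scores (the respondents and items are hypothetical):

```python
import math

# Rows = respondents, columns = items (hypothetical 5 x 3 response matrix).
responses = [
    [1, 2, 1],
    [3, 3, 2],
    [4, 5, 4],
    [2, 2, 2],
    [5, 4, 5],
]
totals = [sum(row) for row in responses]  # each respondent's total score

def pearson(xs, ys):
    # Pearson correlation coefficient r.
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

item1 = [row[0] for row in responses]          # scores on item 1
print(round(pearson(item1, totals), 2))        # item-total correlation -> 0.98
```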
What is distinctiveness in item analysis?
When items are more highly correlated with one factor than with the others
What is the item response model?
A model of how the probability of choosing a given option depends on the level of the construct required to choose that option
How can the item response model be explained?
The amount of knowledge you need to get an answer right.
What are the features of item response curves?
Discriminability, difficulty, precision
What is discriminability in item response?
The slope - the region where changes in ability are most easily observed
When is discriminability better and worse?
Better: steep slopes
Worse: flattened regions
What is difficulty in item response?
How much of the construct is needed before you choose that option (answer the question correctly)
How do you observe difficulty?
Using the 0.5 threshold. The point on the x-axis at which the curve is at 0.5
What is more difficult and less difficult in item response?
More: when the slope is very shallow for a while, or it begins further down the x-axis
Less: when the slope begins early on the x-axis and/or is very steep right away
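A sketch of an item response curve, assuming a standard two-parameter logistic form (an illustration, not a specific model from the cards): the difficulty b is where the curve crosses the 0.5 threshold, and the discriminability a sets the slope:

```python
import math

# Two-parameter logistic item response curve (illustrative assumption):
#   P(correct) = 1 / (1 + exp(-a * (theta - b)))
# a = discriminability (slope), b = difficulty (ability where P = 0.5).
def p_correct(theta, a=1.5, b=0.0):
    return 1 / (1 + math.exp(-a * (theta - b)))

# At theta == b the curve is exactly at the 0.5 threshold,
# so b is the item's difficulty.
print(p_correct(0.0))            # 0.5
print(round(p_correct(2.0), 2))  # higher ability -> higher P of success
```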
What is precision in item response?
An estimate of your level of ability
How do you determine precision in item response?
Using the area under the curve, between -2 and +2 (95%)
What does the 95% precision tell us?
Based on the option picked, we can infer with 95% certainty that the person’s level of the construct falls within that region under the curve
In what ways can you describe a curve in item analysis?
Is it flat? Sharp?
Where is the peak (most common area)
Does one curve override another?
Is a curve high for too long?
What is principal components analysis (PCA)?
The examination of the degree to which individual items are related to one or more underlying dimensions of variation (factors)
What are the goals of PCA?
Variable reduction
Structural analysis
Why do we use PCA?
To reduce the redundancy in tests and see if the same construct can be better explained by a short form test
What is a factor pattern matrix?
A visual representation of the relation of items to the factor(s) on a test
What is the example of a factor pattern matrix that we have seen in class?
The red and blue squares of the NEO PI-R
Using a factor pattern matrix, how do we know if the items are good indicators of the factor(s)?
Strong blue squares
Using eigenvalues
What are Eigenvalues?
Numbers that show the proportion of variance that each factor contributes
What is a good eigenvalue?
Any above 1
When creating a short form, how do we know what eigenvalues to get rid of?
- Eigenvalues under 1, or past the point where the curve goes flat
- Items with the smallest correlations
- Items that correlate with multiple factors
How do we observe incremental validity?
By comparing two measures - an existing one and a new one - to a gold standard
How can incremental validity be represented?
Graphically, through models
What are the sections of a graphical representation of incremental validity?
Just the gold standard, measure 1, or measure 2
The single overlap: GS-M1, GS-M2, M1-M2
The total overlap
What is model testing in terms of incremental validity?
The ability to create a predicted score on the gold standard, based on observations on the other two+ measures
Based on model testing, how do we know if a test has incremental validity?
If adding this scale to the calculation of predicted score on the GS closes the gap between the predicted and observed score, there is incremental validity
What is the theory about adding measures when model testing for incremental validity?
The more tests you add, the closer you SHOULD be to the observed score on the GS
How do you evaluate predicted scores when model testing for incremental validity?
SSE = Σ(y - ŷ)²
Sum of squared errors = the sum of (observed - predicted score)²
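SSE can be computed directly in Python (the observed and predicted scores are hypothetical):

```python
# Sum of squared errors between observed gold-standard scores and a
# model's predicted scores (hypothetical values).
observed = [10, 12, 9, 14]
predicted = [11, 11, 10, 13]
sse = sum((y - y_hat) ** 2 for y, y_hat in zip(observed, predicted))
print(sse)  # 1 + 1 + 1 + 1 = 4
```

A smaller SSE means the model's predictions sit closer to the observed gold-standard scores, which is how adding a measure is judged for incremental validity.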
What are the two prediction models when model testing for incremental validity? Explain
- Benchmark - the existing tests vs the GS
- (Existing test + new test) vs GS- does adding your test contribute anything
How can model testing be represented by the line of best fit?
Data points are the observed scores on the GS
Each measure has its line of best fit
The space between a point and the line shows the discrepancy between observed and predicted scores
How can you make your measure look better in model testing? Why?
Compare it to a poor benchmark
If the benchmark does a poor job when compared to the GS, it will make your scale look better
What are the outcomes of model testing?
- Both measures have incremental utility, one is not better than the other - retain both
- One measure has more incremental utility than the other - keep the better measure
- The measures do not contribute uniquely - choose one
- The measures have completely unique proportions of variation - retain both
How would you write a comparison of two tests?
EX: GS = HRSD, M1 = CESD, M2 = BDI
The CESD accounts for variance in the HRSD above and beyond the variance accounted for by the BDI
What is confirmatory factor analysis?
Examining the structure of questionnaires and deciding which model best fits the data
What is used for confirmatory factor analysis?
Structural equation models
What is a structural equation model?
The imposition of a model on the data to evaluate fit
What is a latent variable?
The factors of a construct that cannot be directly observed, they are inferred using related questions
What are the observed indicators?
The questions
What are factor loadings in structural equation models?
Values that show how the latent variables relate to each other, and how the questions relate to the variables
In a visual SEM, what are the different parts?
Latent variables - circles - factors
Factor loadings - top r score - correlations
Error - bottom r scores
What is the saturated model of SEM?
Explanatory model in which EVERYTHING is related
The benchmark
What is the independence model in SEM?
A model in which none of the variables are correlated
For the saturated and independence models of SEM, r=what?
Saturated: r = 1
Independence (null): r = 0
Other: 1>r>0
How does dimensionality factor into SEM?
Models can be uni-factorial or multi-factorial
What is a uni-factorial model in SEM?
Only one latent variable (circle)
What is a multi-factorial model in SEM?
Multiple latent variables (circles)
What is a nested model in SEM?
A model within another
How do you calculate the fit of a model in SEM?
By comparing the discrepancy between predicted and observed values to find which pattern of correlations is actually close to what has been observed
What are the 3 big psychometric wrongdoings?
- Creating a test that does not account for the behaviours of the target population
- Not having enough items
- Not using a test how it was intended
What is an example of not accounting for the behaviour of the target population?
TeenScreen - used to identify teens at risk of suicide, but the at-risk teens typically don’t show up
Why are 1 item tests not a good measure?
A single response might be wrong, and there is nothing else to verify it against
What is an example of not using a test how it was intended?
Using the WISC to identify children that are gifted