static constructs (not changing) possible history effects (practice effects) may improve

tests you compare are shorter the test you will use odd/even split (or any way to split them in half.) Cronbach's alpha - all possible split test reliabilities. nothing less than .70 RULE OF THUMB

Does it seem to measure the construct? Ex: Smith & Sons Welding Job Application 1. Do you have any pets? (wrong) 2. How many years have you been welding? (correct)

Chapter 5 - Test 3 Flashcards by Sara Macias

Validity

Truth (Accuracy of measurements, assumptions, …)

How well did you know this?

Not at all

Perfectly

Reliability

Consistency (Accuracy of measurements, assumptions, …)

How well did you know this?

Not at all

Perfectly

You can have with no, but you cannot have without .

reliability, validity, validity, reliability

How well did you know this?

Not at all

Perfectly

Reliability Practices

Psychological Tests (may already be available)
Test-retest
Alternate form/Parallel Form
Internal Consistency (Inter-item)
a. split half (odd/even)
b. item/total
Inter-rater/Inter-judge/Inter-observer

How well did you know this?

Not at all

Perfectly

Test-Retest

static constructs (not changing)
possible history effects (practice effects)
may improve

How well did you know this?

Not at all

Perfectly

Alternate Form/Parallel Form

At the same time:
* make two tests (with the same mean and standard deviation)
* Hard to make “Equivalent” tests

How well did you know this?

Not at all

Perfectly

Internal Consistency

each item can be considered a mini test, so…(use correlation matrix for measuring each item = average inter-item correlation

How well did you know this?

Not at all

Perfectly

Split-half

tests you compare are shorter the test you will use
odd/even split (or any way to split them in half.)
Cronbach’s alpha - all possible split test reliabilities.
nothing less than .70 RULE OF THUMB

How well did you know this?

Not at all

Perfectly

Cronbach’s alpha

All possible split test reliabilities

How well did you know this?

Not at all

Perfectly

Inter-judge/Observer/rater

*Controls consistency
provides a measure of the degree to which two judges concur in their respective sorting of N items into K mutually exclusive categories.
Cohen’s Kappa

How well did you know this?

Not at all

Perfectly

External Validity

ability to generalize

How well did you know this?

Not at all

Perfectly

Internal Validity

ability to determine cause/effect

How well did you know this?

Not at all

Perfectly

Construct Validity

How well can you measure the construct you are interested in
* true score vs. measurement error (real value and fuzziness - errors)
* Observed score

How well did you know this?

Not at all

Perfectly

Types of Constructs

Face validity
Divergent Validity (Discriminant)
Convergent Validity
Content Validity
Predictive Validity (criterion validity)
Concurrent Validity (criterion validity)
Logical Argument

How well did you know this?

Not at all

Perfectly

Face Validity

Does it seem to measure the construct?

Ex: Smith & Sons Welding Job Application

Do you have any pets? (wrong)
How many years have you been welding? (correct)

How well did you know this?

Not at all

Perfectly

Divergent (Discriminant) Validity

Study These Flashcards

Does it not correlate highly with a similar but different construct?
(tell apart one construct from another)

Ex: IQ Test ——–>Level of education

Convergent Validity

Study These Flashcards

Does it correlate with another measure of the same construct?

Ex: Stanford-Binet IQ Test—> WAIS IQ Test (comparison to another test)

Content Validity

Study These Flashcards

Do we sample from the universe of elements that compose the construct?

Ex: American History Test
(Not asking all the questions on only one subject)

Predictive Validity

Study These Flashcards

Does it predict a future construct?

Ex: SAT Scores ——–> College Success

Concurrent Validity

Study These Flashcards

Does it measure a current construct?

Ex: SAT Scores ——–> High School GPA

Logical Argument

Study These Flashcards

Not empirical based

Ex: Does purchasing behavior translate into customer preference?

Coke sales are People like
higher than Pepsi —–> Coke better

External Validity

Study These Flashcards

ability to generalize

Internal Validity

Study These Flashcards

ability to determine cause and effect

construct validity

Study These Flashcards

how well you can measure the construct you are interested in

construct

true score vs measurement error observed score

Types of constructs

* Face validity * Discriminant validity (divergent) * convergent validity * content validity * predictive validity > criterion validity * concurrent validity > criterion validity * logical argument

Face Validity

Does it seem to measure our construct? Example: Smith & Sons Welding Job Application 1. Do you have any pets? 2. How many years have you been welding? > correct ****DEPENDS WHEN NEEDED*****

Divergent Validity (DIscriminant)

Does it correlate with another measure of the same construct? (comparison to another test) Example: Stanford Binet IQ test >>>>>WAIS IQ Test

Content Validity

Do we sample from the universe of elements that compose the construct? Example: School Exams (varying questions not focusing on one subject only.

Predictive Validity

Does it measure a future construct? Example: SAT Scores >>>>>>College Success

Concurrent Validity

Does it measure a current construct? Example: SAT Scores >>>>>>High School GPA

Logical Argument

Example: Does purchasing behavior translate into customer preference? Coke sales are People like higher than Pepsi Coke better ****NOT EMPIRICAL BASED*****

Chapter 5 - Test 3 Flashcards

(32 cards)