Research Design, Statistics, Tests, and Measurements Flashcards

0
Q

Hermann Ebbinghaus

A

Showed that higher mental processes could be studied empirically using experimental methodology By studying memory using nonsense syllables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

Wilhelm Wundt

A

Founded the first psychology laboratory in 1879 Believed that methodology couldn’t be used to study higher mental processes like memory, thinking, language Also believed that there could be no thought without mental image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Oswald Kulpe

A

Found that there could be imageless thought

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

James McKeen Cattell

A

Studied under Wundt; introduced mental testing to the U.S.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Binet & Simon

A

Collaborated to publish the Stanford-Binet test, first intelligence test. Also introduced the the concept of mental age (based on intellect)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

William Stern

A

Developed IQ: equation to compare mental age to actual age as a measure of intelligence/aptitude

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Terman

A

Revised the Binet-Simon test for use in the U.S. Became known as the Stanford-Binet Intelligence Test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Operational Definition

A

States how the researcher defines the variables so that they are measurable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True experiments, quasi-experiments, and correlational studies

A

True: random assignment, manipulate IV Quasi: no random assignment, no sufficient control on variables Correlational: do not manipulate the IV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Naturalistic observation

A

Researcher doesn’t intervene; measure natural behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Representative sample

A

Sample is a miniature version of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Random sample

A

Every population member has an equal chance of being selected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Stratified random sample

A

Relevant subgroups of the population are randomly sampled in proportion to size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Three common research designs

A

Between-subjects design, matched-subjects design, within-subjects design

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Between-subjects design

A

Each subject exposed to one level of each IV P’s randomly assigned to groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Matched-subjects design

A

Split subjects into groups while controlling for a given variable. Ex. Take the two students with the two top IQs and randomly assign each into the two groups. Then the next two highest IQs, and so on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Within-subjects design

A

Also called repeated-measures; each subject is exposed to all conditions, removing individual difference as a confound

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Control group design

A

A control and experimental group; one receives treatment and one does not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Nonequivalent group design

A

Doesn’t use random assignment; ex. Using one class for one teaching method and another class for another teaching method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Demand characteristics

A

Any cues given to subjects suggesting what the researcher expects of them; May influence he subjects’ behavior and skew results Ex. Placebo effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Hawthorne effect

A

Tendency to behave differently when they know they’re being observed. Using a control group that is also observed can control for the Hawthorne effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

External validity

A

Stronger external validity –> more generalizable to the general population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Descriptive v. Inferential statistics

A

Descriptive: organize, describe, quantify, and summarize data Inferential: generalize; estimate population characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Frequency distribution

A

Characteristic is graphed along the X axis and frequency along the line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Measures of central tendency
Mode, median, mean
25
Measures of variability
Range, standard deviation (average distance from mean), variance (Standard deviation squared)
26
Normal Distribution
27
z-score
Tells how many standard deviations a particular score is from the mean. Subtract the mean of the distribution from your score and divide by standard dev Negative z-score --\> falls below mean Positive z-score --\> fall above mean
28
Converting every score to z-scores
Mean is always 0, standard deviation is always 1
29
T-scores
T-score distribution has a mean of 50 and standard deviation of 10. Commonly used in test score interpretation
30
Skewed Distributions
Scores are not centered around mean/median. Mean/median are not the same Skewness is interpreted by tail
31
Correlation Coefficient
Descriptive statistic used to measure how related two variables are Direction and degree Can't show cause/effect Visualized on scatter plot, draw line of best-fit, coefficient is the slope of the line.
32
Factor analysis
Attempts to account for the interrelationships amoung variables by seeing how they hang together Ex. Variables A, B, C are highly correlated to one another but not to D, E, F AND D, E, F are highly correlated but not to A, B, C
33
Null hypothesis
Sample mean is the same as the population mean
34
If the observed difference is STATISTICALLY SIGNIFICANT...
Reject the null
35
Alpha level
Criterion of significance selected by the researchers
36
Errors in Testing Significance
37
Significance Testing Process
1. Formulate null and alt hyp. based on research hyp. 2. Decide on an alpha level (0.05) 3. Collect data 4. Perform significance test on data to obtain sig. level. Significance level tells us how likely our difference is due to chance. Large level --\> more likey due to chance. Smaller level --\> more likely caused by hypothesized effect. 5. Compare obtained level of sig. to alpha. If sig. is less than alpha --\> statistically sig. If level is greater than alpha --\> stat. insignificant. 6. Significant --\> reject null. Insignificant --\> accept null.
38
Types of Significance Tests
t-test, ANOVA, and chi-square
39
t-test
used to compare the means of two groups
40
ANOVA
Used to estimate how much group means differ from one another by comparing the between-group variance to the within-group variance using the F ratio. F ratio= Between-Group Variance Estimate OVER Within-Group Variance Estimate Factorial design: each level of IV occurs with each level of the other IVs Interaction: when effects of one IV are not consistent with all levels of he other IVs
41
Chi-square tests
For categorical rather than numerical data When summarizing cat. data --\> frequencies or proportions
42
Meta-analysis
Statistical procedure used to make conclusions on the basis of data from different studies
43
Norm-referenced Testing v. Domain-referenced Testing
Norm-referenced: assesses an ind.'s performance in terms of how they perform in comparison to others Problem: is specific on the population tested, which can (and often does) change Domain-Referenced: (criterion-referenced testing) concerned with what the ind. knows about a specific content domain; does not compare to peers, concerned only with ind. mastery
44
Reliability
Consistency with which a test measures whatever it is that it is intended to measure High reliability --\> test produces dependable, reproducible, and consistent measures
45
Standard Error of Measurement (SEM)
Index of how much, on average, we expect a person's score to vary from the score the person is really capable of (by ability) Desirable SEM = 0 but not possible in reality Speaks to the test's reliaiblity
46
Three methods used to establish the reliability of a test
test-retest, alternate-form, split-half reliability
47
test-retest method
consistent when administered to same people more than once
48
alternate-form method
consistency when examinees are given two different forms of the test at two different times
49
split-half reliability
test-takers have only one test, but it is divided into two equal halves Strong correlation coefficient between pairs? --\> high reliability
50
Validity and the 6 subtypes
Extent to which a test actually measures what it is supposed to measure (concerned with accuracy rather than consistency) Content, face, criterion concurrent, predictive, construct convergent, discriminate
51
Content Validity
Test's coverage of the particular think it seeks to measure ex. If a test is supposed to measure knowledge about a particular topic, test items should include questions about that topic.
52
Face validity
Does the test APPEAR to measure what it is supposed to measure
53
Criterion Validity
How well does the test predict an individual's performance on an established test designed to measure the same thing Cross validation: involves testing the criterion validity on a second sample
54
Construct Validity
Refers to how well performance on a test fits into the theoretical framework related to what you want to measure Ex. If a relationship between sociability and intelligence is well supported by research, then your test measuring sociability should also show high scores among people with high intelligence.
55
Convergent Validity
Shows construct validity
56
Discriminant validity
performance on the test is not correlated with other variables that it should not be related to Ex. test-taking experience
57
A test can show perfect reliability and very little...
Validity. But the oposite is not possible.
58
Four Types of Measurement Scales
NOMINAL (categorical) ORDINAL (rank) INTERVAL (actual number, like number correct on a test) RATIO (includes a true zero point that indicates absence of quality measured, like income)
59
Ability Tests
Aptitude: predict what an ind. can accomplish through training; predict future performance (like IQ) Achievement: assess what one knows/can do *now *
60
Wechsler tests
Two broad subscales: verbal and performance Verbal: based on information, vocabulary Performance: Tests of manipulative skill, eye-hand coordination, and speed Three major tests of intelligence: Wechsler Preschool and Primary Scale of Intelligence (WPPSI; preschool), Wechsler Intelligence Scale for Children (WISC; age 5-16), and Wechsler Adult Intelligence Scale (WAIS; 16+) Have all been revised: WPPSI-R, WISC-R, and WAIS-R WAIS-III is currently used for adult IQ testing
61
Personality Inventory
Self-rating device (consisting of 100-500 statements)
62
Minnesota Multiphasic Personality Inventory, MMPI
550 statements to which subjects resond T, F, or "cannot say" Yields ten clinical scales Used to aid in the assessment of various clinical disorders
63
Validity Summary Image
64
empirical criterion-keying approach
used by Hathaway and McKinley to develop the MMPI Tested thousands of questions and retained those that differentiated between patient and nonpatient populations
65
MMPI-2
Added content scales; formed using derived theoretical concerns To form each content scale, test authors selected items that "ought" to be related to self-esteem (by theory)
66
California Psychological Inventory (CPI)
personality inventory, based on MMPI Developed to be used in normal populations, age 13 and up (esp. high school and college students Consists of 20 scales, including 3 validity scales ~460 T/F items, measures personality traits (like sociability)
67
Projective Tests
Stimuli are ambiguous, and responses are not limited Test taker is asked to interpret ambiguous stimuli Scoring is subjective
68
Rorschach
Inkblot test 10 cards presented to subject in specific order with specific instructions; clinician then interprets results based on what they saw and the spontaneous remarks the person made
69
Thematic Apperception Test (TAT)
Morgan and Murray 20 pictures depicting scenes with ambiguous meanings Subject is asked to tell what is happening, describe what led to the event, and give an ending No standardized scoring
70
Blacky pictures
projective test devised for children 12 cartoonlike pictures that feature a small dog (Blacky) Developed depicting Blacky in situations designed to correspond to particular stage of psychosexual developement Subject is asked to tell stories about the pictures shown
71
Rotter Incomplete Sentences Blank
Sentence completion task; projective technique used by clinicians and researchers
72
Barnum effect
Tendency people have to accept and approve of the interpretation of personality you give to them Form of pseudovalidation
73
Interest Testing
Assesses an ind's interest in different lines of work Strong-Campbell Interest Inventory
74
Strong-Campbell Interest Inventory
Developed using an empirical criterion-keying approach Test takers are given list of interests; asked to indicate whether they like/dislike the interest listed A second section asks test-takers to indicate preference for one of two paired items Divides interests into 6 types: realistic, investigative, artistic, social, enterprising, and conventional (also called RIASEC)
75