Exam 2 Flashcards
for a study to be an experiment, it has to have…
- at least one manipulated variable
- at least one measured variable
control variable
a variable that an experimenter holds constant on purpose (besides the independent variables)
- not really variables because they do not vary
why experiments support causal claims through the 3 criteria
- establish covariance: changes in the independent variables are related to the changes in the dependent variable
- establish temporal precedence: The causal variable should come before the outcome variable
- establish internal validity: There are no other likely explanations for the relationship observed
placebo (control) groups
a group that is exposed to an inert treatment; comparison group may not need to be a control group (i.e., no treatment)
confounds
an unmeasured variable that influences both the supposed cause and the supposed effect
design confounds
an accidental second variable varies systematically along with the intended independent variable
selection confounds (selection effects)
when the kinds of participants in one level of the independent variable are systematically different from those in the other
- avoid selection effects with random assignment
- avoiding selection effects with matched groups
types of experimental design
- independent-groups designs (between-sujects)
- within-subjects designs
- pottest-only designs
- pretest/posttest designs
independent-groups designs (between-subjects)
Separate groups of participants are placed into different levels of the independent variable
- Ex. an experiment exploring how different amounts of sleep affect people’s reaction times → level 1: 3 hours of sleep, level 2: 8 hours of sleep
within-subjects designs
- Each person is presented with all levels of the independent variable
- One set of participants are tested more than once, and their scores are compared
- Ex. an experiment exploring how different amounts of sleep affect people’s reaction times → all groups do day 1 with 3 hours of sleep and day 2 with 8 hours of sleep
posttest-only designs
Participants are randomly assigned to independent variable groups and are tested on the dependent variable once
pretest/posttest designs
Participants are randomly assigned to at least two difference groups and are tested on the key dependent variable twice— once before and once after exposure to the independent variable
problems using pretest/posttest design
threat to internal validity
- Taking the pretest affects how participants do the posttest → testing threat
- Participants may get tired from a long study with a pretest and posttest
repeated-measures design
participants are measured on a dependent variable more than once, after exposure to each level of the independent variable
concurrent-measures design
participants are exposed to all the levels of an independent variable at roughly the same time, and a single attitudinal or behavioral preference is the dependent variable
advantages of within-groups design
- Participants in your groups are equivalent because they are the same participants and serve as their own controls
- Gives researchers more power to notice difference between conditions because there is less extraneous error in the measurement
- Requires fewer participants
disadvantages of within-groups design
- potential for order effects
- carryover effects
- might not be practical or possible
- experiencing all levels of the IV changes the way participants act –> demand characteristics
demand characteristics
subtle cues or aspects of an experiment that might unintentionally signal to participants what the study is about, leading them to change their behavior to fit that perceived expectation
order effects
When the sequence in which stimuli are presented to participants influences their responses; order of conditions can affect the results
when being exposed to one condition affects how participants respond to other conditions
carryover effects
- practice effect –> participants perform better during later treatment conditions because they’ve had time to practice and improve
- Fatigue effect → participants perform worse during later treatment conditions because they’re tired or fatigued
avoiding order effects
- full counterbalancing
- partial counterbalancing
solution to order effects
counterbalancing
interrogating causal claims with construct validity
How well were the variables measured and manipulated?
- dependent variables: check face validity, interrater reliability, and convergent vailidty
- independent variables: How well were they manipulated?
- manipulation check
interrogating causal claims with external validity
To whom or what can the causal claim generalize?
- generalizing to other people
- generalizing to other situations
interrogating causal claims with statistical validity
How well do the data support the causal claim?
- accuracy of the conclusions drawn from a study’s statistical analysis
- Is the difference statistically significant?: P-value < .05 usually considered statistically significant
- How large is the effect?: correlation coefficient (r), Cohen’s d
- confidence interval
interrogating causal claims with internal validity
Are there alternative explanations for the outcome?
- Were there any design confounds?
- If an independent-groups design was used, did they control for selection effects using random assignment or matching?
- If a within-groups design was used, did they control for order effects by counterbalancing?
manipulation check
extra dependent variable that researchers can include to convince them that their experimental manipulation worked
ex. A study comparing the effect of a serious lecture vs. a funny lecture on the memory of lecture information; Manipulation check - how funny was the lecture?
correlation coefficient (r)
indicates the strength of a linear association between two variables (association claim)
Cohen’s d
standardized effect size for measuring the difference between two group means
(group A mean - group B mean) / pooled standard deviation
- small (d = 0.2)
- medium (d = 0.5)
- large (d = 0.8)
confidence interval
a range of values, bounded above and below the statistics’s mean, that likely would contain an unknown population parameter
- when a study has a small sample and more variability, CI will be relatively wide (less precise)
- when a study has a larger sample and less variability, CI will be narrower (more precise)
six potential internal validity threats in one-group, pretest/posttest designs
- maturation
- history
- regression
- attrition
- testing
- instrumentation
maturation threat to internal validity
A change in behavior that emerges more or less spontaneously over time
- Spontaneous remission is a specific type of maturation
- E.g., people adapt to changed environments; children get better at walking, talking, reading, etc.; plants grow taller
preventing maturation threats
Include an appropriate comparison group
history threats to internal validity
- Something specific has happened between the pretest and posttest (not just time has passed)
- “Historical” or external factor that systematically affects most members of the treatment group at the same time as the treatment itself
- E.g., the rowdy boys started a swimming course and the exercise tired most of them out
preventing history threats
include a comparison group
regression threats to internal validity
- regression to the mean: the tendency of results that are extreme by chance on first measurement to move closer to the average when measured a second time
- Occurs only when a group is measured twice, and
- Only when the group has an extreme score at pretest
- E.g., the 40 depressed women might have scores exceptionally high on the depression pretest due to random effects, such as recent illness, family or relationship problems
preventing regression threats
include a comparison group
attrition threats to internal validity
- A reduction in participant numbers that occurs when people drop out before the end of the study
- Problem for internal validity when attrition is systematic— only a certain kind of participant drops out
preventing attrition threats
Remove the dropped-out participants’ scores from the pretest average too
testing threats to internal validity
A change in the participants as a result of taking a test (dependent measure) more than once
preventing testing threats
- No pretest
- Two different forms— one for pretest and one for posttest
- Include a comparison group
instrumentation threats to internal validity
- Occur when a measure instrument changes over time
OR - When a researcher uses different forms for the pretest and posttest, but the two forms are not sufficiently equivalent
- E.g., people judging the rowdy campers’ behavior became more tolerant of loud voices and rough-and-tumble play
preventing instrumentation threats
- Use a posttest-only design
- Ensure that the pretest and posttest measures are equivalent
- Counterbalance the versions of the test
three potential internal validity threats in any study
- observer bias
- demand characteristics
- placebo effects
observer bias
when researchers’ expectation influence their interpretation of the results
placebo effects
eople’s behaviors or symptoms respond not just to the treatment, but also to their belief in what the treatment can do to alter their situation