Readings Flashcards

1
Q

Difference between quasi-experimental and experimental design

A

Although the independent variable is manipulated, participants are not randomly assigned to conditions or orders of conditions. Because the independent variable is manipulated before the dependent variable is measured, quasi-experimental research eliminates the directionality problem (temporal precedence). But because participants are not randomly assigned— making it likely that there are other differences between conditions—quasi-experimental research does not eliminate the problem of confounding variables. In terms of internal validity, therefore, quasi-experiments are generally somewhere between correlational studies and true experiments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Non-equivalent Groups Design

A

between-subjects experiments are randomly assigned to conditions, the resulting groups are likely to be quite similar. In fact, researchers consider them to be equivalent. When participants are not randomly assigned to conditions, however, the resulting groups are likely to be dissimilar in some ways. For this reason, researchers consider them to be non-equivalent. A non-equivalent groups design, then, is a between-subjects design in which participants have not been randomly assigned to conditions.

when people are not randomly assigned to conditions, they are not equivalent and can have important differences between them.

For example, in a teaching non-equivalent group study confounding variables can include teachers’ styles, and even the classroom environments, might be very different and might cause different levels of achievement or motivation among the students.

Researchers do their best to make the groups as similar as possible. for example, sampling from the same school, choosing two classes with similar academic scores, classes with same-sex teachers and same teaching styles. These steps increase the internal validity of the study because it would eliminate some of the most important confounding variables. But without true random assignment of the students to conditions, there remains the possibility of other important confounding variables that the researcher was not able to control.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Pretest-Posttest Design

A

scores are measured before and after intervention but since the order of conditions is not counterbalanced it is typically not possible for a participant to be tested in the treatment condition first and then in an “untreated” control condition.

If the average posttest score is better than the average pretest score, then it makes sense to conclude that the treatment might be responsible for the improvement. Unfortunately, one often cannot conclude this with a high degree of certainty because there may be other explanations for why the posttest scores are better. One category of alternative explanations goes under the name of history. Other things might have happened between the pretest and the posttest. Perhaps an anti-drug program aired on television and many of the students watched it, or perhaps a celebrity died of a drug overdose and many of the students heard about it. Another category of alternative explanations goes under the name of maturation. Participants might have changed between the pretest and the posttest in ways that they were going to anyway because they are growing and learning. If it were a yearlong program, participants might become less impulsive or better reasoners and this might be responsible for the change.
Another alternative explanation for a change in the dependent variable in a pretest-posttest design is regression to the mean. This refers to the statistical fact that an individual who scores extremely on a variable on one occasion will tend to score less extremely on the next occasion. For example, a cricketer with a long-term batting average of 15 who scores 150 in an innings will almost certainly score lower in her next innings. Her score will “regress” toward her mean (average) score of 15. Regression to the mean can be a problem when participants are selected for further study because of their extreme scores. Imagine, for example, that only students who scored especially low on a test of fractions are given a special training program and then retested. Regression to the mean all but guarantees that their scores will be higher even if the training program has no effect. A closely related concept—and an extremely important one in psychological research—is spontaneous remission. This is the tendency for many medical and psychological problems to improve over time without any form of treatment. The common cold is a good example. If one were to measure symptom severity in 100 common cold sufferers today, give them a bowl of chicken soup every day, and then measure their symptom severity again in a week, they would probably be much improved. This does not mean that the chicken soup was responsible for the improvement, however, because they would have been much improved without any treatment at all. The same is true of many psychological problems. A group of severely depressed people today is likely to be less depressed on average in 6 months. In reviewing the results of several studies of treatments for depression, researchers Michael Posternak and Ivan Miller found that participants in wait-list control conditions improved an average of 10 to 15% before they received any treatment at all (Posternak & Miller, 2001)[2]. Thus one must generally be very cautious about inferring causality from pretest-posttest designs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Interrupted Time Series Design

A

A variant of the pretest-posttest design is the interrupted time-series design. A time series is a set of measurements taken at intervals over a period of time. For example, a manufacturing company might measure its workers’ productivity each week for a year. In an interrupted time series-design, a time series like this one is “interrupted” by a treatment. In one classic example, the treatment was the reduction of the work shifts in a factory from 10 hours to 8 hours (Cook & Campbell, 1979)[1]. Because productivity increased rather quickly after the shortening of the work shifts, and because it remained elevated for many months afterwards, the researcher concluded that the shortening of the shifts caused the increase in productivity. Notice that the interrupted time-series design is like a pretest- posttest design in that it includes measurements of the dependent variable both before and after the treatment. It is unlike the pretest- posttest design, however, in that it includes multiple pretest and posttest measurements.

This figure also illustrates an advantage of the interrupted time-series design over a simpler pretest-posttest design. If there had been only one measurement of absences before the treatment at Week 7 and one afterwards at Week 8, then it would have looked as though the treatment were responsible for the reduction. The multiple measurements both before and after the treatment suggest that the reduction between Weeks 7 and 8 is nothing more than normal week-to-week variation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Combination Designs

A

A type of quasi-experimental design that is generally better than either the non-equivalent groups design or the pretest-posttest design is one that combines elements of both. There is a treatment group that is given a pretest, receives a treatment, and then is given a posttest. But at the same time there is a control group that is given a pretest, does not receive the treatment, and then is given a posttest. The question, then, is not simply whether participants who receive the treatment improve but whether they improve more than participants who do not receive the treatment.
Imagine, for example, that students in one school are given a pretest on their attitudes toward drugs, then are exposed to an anti- drug program, and finally are given a posttest. Students in a similar school are given the pretest, not exposed to an anti-drug program, and finally are given a posttest. Again, if students in the treatment condition become more negative toward drugs, this change in attitude could be an effect of the treatment, but it could also be a matter of history or maturation. If it really is an effect of the treatment, then students in the treatment condition should become more negative than students in the control condition. But if it is a matter of history (e.g., news of a celebrity drug overdose) or maturation (e.g., improved reasoning), then students in the two conditions would be likely to show similar amounts of change. This type of design does not completely eliminate the possibility of confounding variables, however. Something could occur at one of the schools but not the other (e.g., a student drug overdose), so students at the first school would be affected by it while students at the other school would not.

Finally, if participants in this kind of design are randomly assigned to conditions, it becomes a true experiment rather than a quasi experiment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Key Takeaways

A

Quasi-experimental research involves the manipulation of an independent variable without the random assignment of participants to conditions or orders of conditions. Among the important types are non-equivalent groups designs, pretest- posttest, and interrupted time-series designs.
Quasi-experimental research eliminates the directionality problem because it involves the manipulation of the independent variable. It does not eliminate the problem of confounding variables, however, because it does not involve random assignment to conditions. For these reasons, quasi-experimental research is generally higher in internal validity than correlational studies but lower than true experiments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ABAB Reversal Design

A

Positive reinforcement of good behaviour and ignoring negative behaviour (differential reinforcement) is a good ABAB reversal design which reduces disruptive behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

single subject design

what is it?
what is it not?

A

Focuses on individuals, with a small n

Single-subject research is a type of quantitative research that involves studying in detail the behaviour of each of a small number of participants. Note that the term single-subject does not mean that only one participant is studied; it is more typical for there to be somewhere between two and 10 participants. (This is why single-subject research designs are sometimes called small-n designs, where n is the statistical symbol for the sample size.)

What is it not?
Before continuing, it is important to distinguish single-subject research from two other approaches, both of which involve studying in detail a small number of participants. One is qualitative research, which focuses on understanding people’s subjective experience by collecting relatively unstructured data (e.g., detailed interviews) and analysing those data using narrative rather than quantitative techniques. Single-subject research, in contrast, focuses on understanding objective behaviour through experimental manipulation and control, collecting highly structured data, and analysing those data quantitatively.
It is also important to distinguish single-subject research from case studies. A case study is a detailed description of an individual, which can include both qualitative and quantitative analyses. (Case studies that include only qualitative analyses can be considered a type of qualitative research.) i.e., little albert study or Sigmund Freuds work. Case studies can be useful for suggesting new research questions and for illustrating general principles. They can also help researchers understand rare phenomena, such as the effects of damage to a specific part of the human brain. As a general rule, however, case studies cannot substitute for carefully designed group or single-subject research studies. One reason is that case studies usually do not allow researchers to determine whether specific events are causally related, or even related at all. For example, if a patient is described in a case study as having been sexually abused as a child and then as having developed an eating disorder as a teenager, there is no way to determine whether these two events had anything to do with each other. A second reason is that an individual case can always be unusual in some way and therefore be unrepresentative of people more generally. Thus case studies have serious problems with both internal and external validity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Assumptions of Single-Subject Research

A

Again, single-subject research involves studying a small number of participants and focusing intensively on the behaviour of each one.

There are several important assumptions underlying single-subject research:

  1. It is important to focus intensively on the
    behaviour of individual participants:
    • Group research can hide individual
    differences and generate results that do
    not represent the behaviour of any
    individual
    • Single-subject designs can identify
    individual differences
  2. Sometimes it is the behaviour of a
    particular individual that is primarily of
    interest:
    • For example, changing the behaviour of
    a disruptive student.
    • Using single-subject design provides
    more direct and effective information
    than group-means analysis when we
    are interested in a specific behaviour in
    a participant.
3. It is important to discover causal 
    relationships through the manipulation of 
    an independent variable, the careful 
    measurement of a dependent variable, 
    and the control of extraneous variables.
      •	Single-subject designs are experimental 
        designs with good internal validity.
      •	Dependant variable is measures 
        multiple times (baseline-intervention- 
        baseline; effect is introduced and 
        removed to provide strong evidence 
        that behaviour change is due to IV and 
        not extraneous variables).
  1. It is important to study strong effects that
    have social importance.
    • Minimizing or removing harmful
    behaviours (i.e., treatments)
    • Social validity (real-world applications)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Who Uses Single-Subject Research?

A

• Herman Ebbinghaus’s research on memory
and Ivan Pavlov’s research on classical
conditioning are other early examples.
• B. F. Skinner clarified many of the
assumptions underlying single-subject
research and refined many of its
techniques.
• This work was carried out primarily using
non-human subjects—mostly rats and
pigeons. This approach, which Skinner
called the experimental analysis of
behaviour—remains an important subfield
of psychology and continues to rely almost
exclusively on single-subject research.
• Many researchers were interested in using
this approach to conduct applied research
primarily with humans—a subfield now
called applied behaviour analysis (Baer,
Wolf, & Risley, 1968)[5]. Applied behaviour
analysis plays an especially important role
in contemporary research on
developmental disabilities, education,
organisational behaviour, and health,
among many other areas.
• Most contemporary single-subject research
is conducted from the behavioural
perspective, it can in principle be used to
address questions framed in terms of any
theoretical perspective.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Key Take-Aways

Single Subjects

A

 Single-subject research—which involves
testing a small number of participants and
focusing intensively on the behaviour of
each individual—is an important alternative
to group research in psychology.
 Single-subject studies must be
distinguished from case studies, in which
an individual case is described in detail.
Case studies can be useful for generating
new research questions, for studying rare
phenomena, and for illustrating general
principles. However, they cannot substitute
for carefully controlled experimental or
correlational studies because they are low
in internal and external validity.
 Single-subject research has been around
since the beginning of the field of
psychology. Today it is most strongly
associated with the behavioural theoretical
perspective, but it can in principle be used
to study behaviour from any perspective.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

General Features of Single-Subject Designs

A

Common Features in Single Subject Designs:
1. First, the dependent variable (represented
on the y-axis of the graph) is measured
repeatedly over time (represented by the
x-axis) at regular intervals.

  1. Second, the study is divided into distinct
    phases, and the participant is tested under
    one condition per phase. The conditions
    are often designated by capital letters: A,
    B, C, and so on. Thus Figure 10.2
    represents a design in which the
    participant was tested first in one
    condition (A), then tested in another
    condition (B), and finally retested in the
    original condition (A). (This is called a
    reversal design and will be discussed in
    more detail shortly.)
  2. Another important aspect of single-subject
    research is that the change from one
    condition to the next does not usually
    occur after a fixed amount of time or
    number of observations. Instead, it
    depends on the participant’s behaviour.
    Specifically, the researcher waits until the
    participant’s behaviour in one condition
    becomes fairly consistent from
    observation to observation before
    changing conditions. This is sometimes
    referred to as the steady state strategy
    (Sidman, 1960)[1]. The idea is that when
    the dependent variable has reached a
    steady state, then any change across
    conditions will be relatively easy to detect.
    Recall that we encountered this same
    principle when discussing experimental
    research more generally. The effect of an
    independent variable is easier to detect
    when the “noise” in the data is minimised.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Reversal Designs

A

The most basic single-subject research design is the reversal design, also called the ABA design. During the first phase, A, a baseline is established for the dependent variable. This is the level of responding before any treatment is introduced, and therefore the baseline phase is a kind of control condition. When steady state responding is reached, phase B begins as the researcher introduces the treatment. There may be a period of adjustment to the treatment during which the behaviour of interest becomes more variable and begins to increase or decrease. Again, the researcher waits until that dependent variable reaches a steady state so that it is clear whether and how much it has changed. Finally, the researcher removes the treatment and again waits until the dependent variable reaches a steady state. This basic reversal design can also be extended with the reintroduction of the treatment (ABAB), another return to baseline (ABABA), and so on.
Why is the reversal—the removal of the treatment—considered to be necessary in this type of design? Why use an ABA design, for example, rather than a simpler AB design? Notice that an AB design is essentially an interrupted time-series design applied to an individual participant. Recall that one problem with that design is that if the dependent variable changes after the treatment is introduced, it is not always clear that the treatment was responsible for the change. It is possible that something else changed at around the same time and that this extraneous variable is responsible for the change in the dependent variable. But if the dependent variable changes with the introduction of the treatment and then changes back with the removal of the treatment (assuming that the treatment does not create a permanent effect), it is much clearer that the treatment (and removal of the treatment) is the cause. In other words, the reversal greatly increases the internal validity of the study.
There are close relatives of the basic reversal design that allow for the evaluation of more than one treatment. In a multiple- treatment reversal design, a baseline phase is followed by separate phases in which different treatments are introduced. For example, a researcher might establish a baseline of studying behaviour for a disruptive student (A), then introduce a treatment involving positive attention from the teacher (B), and then switch to a treatment involving mild punishment for not studying (C). The participant could then be returned to a baseline phase before reintroducing each treatment—perhaps in the reverse order as a way of controlling for carry-over effects. This particular multiple-treatment reversal design could also be referred to as an ABCACB design.
In an alternating treatments design, two or more treatments are alternated relatively quickly on a regular schedule. For example, positive attention for studying could be used one day and mild punishment for not studying the next, and so on. Or one treatment could be implemented in the morning and another in the afternoon. The alternating treatments design can be a quick and effective way of comparing treatments, but only when the treatments are fast acting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Multiple-Baseline Designs

A

There are two potential problems with the reversal design—both of which have to do with the removal of the treatment. One is that if a treatment is working, it may be unethical to remove it. For example, if a treatment seemed to reduce the incidence of self-injury in child with a developmental disability, it would be unethical to remove that treatment just to show that the incidence of self-injury increases. The second problem is that the dependent variable may not return to baseline when the treatment is removed. For example, when positive attention for studying is removed, a student might continue to study at an increased rate. This could mean that the positive attention had a lasting effect on the student’s studying, which of course would be good. But it could also mean that the positive attention was not really the cause of the increased studying in the first place. Perhaps something else happened at about the same time as the treatment—for example, the student’s parents might have started rewarding him for good grades.
One solution to these problems is to use a multiple-baseline design, which is represented in Figure 10.4. In one version of the design, a baseline is established for each of several participants, and the treatment is then introduced for each one. In essence, each participant is tested in an AB design. The key to this design is that the treatment is introduced at a different time for each participant. The idea is that if the dependent variable changes when the treatment is introduced for one participant, it might be a coincidence. But if the dependent variable changes when the treatment is introduced for multiple participants—especially when the treatment is introduced at different times for the different participants—then it is extremely unlikely to be a coincidence.
Notice that if the researchers had only studied one school or if they had introduced the treatment at the same time at all three schools, then it would be unclear whether the reduction in aggressive behaviours was due to the bullying program or something else that happened at about the same time it was introduced (e.g., a holiday, a television program, a change in the weather). But with their multiple-baseline design, this kind of coincidence would have to happen three separate times—a very unlikely occurrence—to explain their results.
In another version of the multiple-baseline design, multiple baselines are established for the same participant but for different dependent variables, and the treatment is introduced at a different time for each dependent variable. Imagine, for example, a study on the effect of setting clear goals on the productivity of an office worker who has two primary tasks: making sales calls and writing reports. Baselines for both tasks could be established. For example, the researcher could measure the number of sales calls made and reports written by the worker each week for several weeks. Then the goal-setting treatment could be introduced for one of these tasks, and at a later time the same treatment could be introduced for the other task. The logic is the same as before. If productivity increases on one task after the treatment is introduced, it is unclear whether the treatment caused the increase. But if productivity increases on both tasks after the treatment is introduced—especially when the treatment is introduced at two different times—then it seems much clearer that the treatment was responsible.
In yet a third version of the multiple-baseline design, multiple baselines are established for the same participant but in different settings. For example, a baseline might be established for the amount of time a child spends reading during his free time at school and during his free time at home. Then a treatment such as positive attention might be introduced first at school and later at home. Again, if the dependent variable changes after the treatment is introduced in each setting, then this gives the researcher confidence that the treatment is, in fact, responsible for the change.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Data Analysis in Single-Subject Research

A

In addition to its focus on individual participants, single-subject research differs from group research in the way the data are typically analysed. As we have seen throughout the book, group research involves combining data across participants. Group data are described using statistics such as means, standard deviations, Pearson’s r, and so on to detect general patterns. Finally, inferential statistics are used to help decide whether the result for the sample is likely to generalise to the population. Single-subject research, by contrast, relies heavily on a very different approach called visual inspection. This means plotting individual participants’ data as shown throughout this chapter, looking carefully at those data, and making judgements about whether and to what extent the independent variable had an effect on the dependent variable. Inferential statistics are typically not used.
In visually inspecting their data, single-subject researchers take several factors into account. One of them is changes in the level of the dependent variable from condition to condition. If the dependent variable is much higher or much lower in one condition than another, this suggests that the treatment had an effect. A second factor is trend, which refers to gradual increases or decreases in the dependent variable across observations. If the dependent variable begins increasing or decreasing with a change in conditions, then again this suggests that the treatment had an effect. It can be especially telling when a trend changes directions—for example, when an unwanted behaviour is increasing during baseline but then begins to decrease with the introduction of the treatment. A third factor is latency, which is the time it takes for the dependent variable to begin changing after a change in conditions. In general, if a change in the dependent variable begins shortly after a change in conditions, this suggests that the treatment was responsible.

The results of single-subject research can also be analysed using statistical procedures—and this is becoming more common. There are many different approaches, and single-subject researchers continue to debate which are the most useful. One approach parallels what is typically done in group research. The mean and standard deviation of each participant’s responses under each condition are computed and compared, and inferential statistical tests such as the t test or analysis of variance are applied (Fisch, 2001)[3]. (Note that averaging across participants is less common.) Another approach is to compute the percentage of non-overlapping data (PND) for each participant (Scruggs & Mastropieri, 2001)[4]. This is the percentage of responses in the treatment condition that are more extreme than the most extreme response in a relevant control condition. In the study of Hall and his colleagues, for example, all measures of Robbie’s study time in the first treatment condition were greater than the highest measure in the first baseline, for a PND of 100%. The greater the PND, the stronger the treatment effect. Still, formal statistical approaches to data analysis in single-subject research are generally considered a supplement to visual inspection, not a replacement for it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Key takeaways

A

 Single-subject research designs typically
involve measuring the dependent variable
repeatedly over time and changing
conditions (e.g., from baseline to
treatment) when the dependent variable
has reached a steady state. This approach
allows the researcher to see whether
changes in the independent variable are
causing changes in the dependent
variable.
 In a reversal design, the participant is
tested in a baseline condition, then tested
in a treatment condition, and then returned
to baseline. If the dependent variable
changes with the introduction of the
treatment and then changes back with the
return to baseline, this provides strong
evidence of a treatment effect.
 In a multiple-baseline design, baselines are
established for different participants,
different dependent variables, or different
settings—and the treatment is introduced
at a different time on each baseline. If the
introduction of the treatment is followed by
a change in the dependent variable on
each baseline, this provides strong
evidence of a treatment effect.
 Single-subject researchers typically
analyse their data by graphing them and
making judgements about whether the
independent variable is affecting the
dependent variable based on level, trend,
and latency.

17
Q

The Single-Subject Versus Group “Debate”

A

Single-subject research is similar to group research—especially experimental group research—in many ways. They are both quantitative approaches that try to establish causal relationships by manipulating an independent variable, measuring a dependent variable, and controlling extraneous variables. But there are important differences between these approaches too, and these differences sometimes lead to disagreements. It is worth addressing the most common points of disagreement between single-subject researchers and group researchers and how these disagreements can be resolved. As we will see, single-subject research and group research are probably best conceptualised as complementary approaches.

18
Q

single subject/group data analysis

A

Data Analysis
Some advocates of group research worry that visual inspection is inadequate for deciding whether and to what extent a treatment has affected a dependent variable:

  1. One specific concern is that visual
    inspection is not sensitive enough to
    detect weak effects.
  2. A second is that visual inspection can be
    unreliable, with different researchers
    reaching different conclusions about the
    same set of data
  3. A third is that the results of visual
    inspection—an overall judgment of
    whether or not a treatment was effective.
    —cannot be clearly and efficiently
    summarised or compared across studies
    (unlike the measures of relationship
    strength typically used in group research)
    Single-Subject Designs argue their use of
    the steady state strategy, combined with
    their focus on strong and consistent
    effects, minimises most of them. If the
    effect of a treatment is difficult to detect
    by visual inspection because the effect is
    weak or the data are noisy, then single-
    subject researchers look for ways to
    increase the strength of the effect or
    reduce the noise in the data by controlling
    extraneous variables (e.g., by
    administering the treatment more
    consistently). If the effect is still difficult to
    detect, then they are likely to consider it
    neither strong enough nor consistent
    enough to be of further interest. Many
    single-subject researchers also point out
    that statistical analysis is becoming
    increasingly common and that many of
    them are using this as a supplement to
    visual inspection—especially for the
    purpose of comparing results across
    studies (Scruggs & Mastropieri, 2001).
    Some advocates of single-subject
    research worry about the way that group
    researchers analyse their data:
  4. Specifically, they point out that focusing on
    group averages can be highly misleading.
    Again, imagine that a treatment has a
    strong positive effect on half the people
    exposed to it and an equally strong
    negative effect on the other half. In a
    traditional between-subjects experiment,
    the positive effect on half the participants
    in the treatment condition would be
    statistically cancelled out by the negative
    effect on the other half. The average for
    the treatment group would then be the
    same as the average for the control group,
    making it seem as though the treatment
    had no effect when in fact it had a strong
    effect on every single participant!
  5. But again, group researchers share this
    concern. Although they do focus on group
    statistics, they also emphasise the
    importance of examining distributions of
    individual scores. For example, if some
    participants were positively affected by a
    treatment and others negatively affected
    by it, this would produce a bimodal
    distribution of scores and could be
    detected by looking at a histogram of the
    data. The use of within-subjects designs is
    another strategy that allows group
    researchers to observe effects at the
    individual level and even to specify what
    percentage of individuals exhibit strong,
    medium, weak, and even negative effects.
19
Q

External Validity

A

The second issue about which single-subject and group researchers sometimes disagree has to do with external validity—the ability to generalise the results of a study beyond the people and specific situation actually studied. In particular, advocates of group research point out the difficulty in knowing whether results for just a few participants are likely to generalise to others in the population. Imagine, for example, that in a single-subject study, a treatment has been shown to reduce self-injury for each of two children with developmental disabilities. Even if the effect is strong for these two children, how can one know whether this treatment is likely to work for other children with developmental disabilities?

Again, single-subject researchers share this concern. In response, they note that the strong and consistent effects they are typically interested in—even when observed in small samples—are likely to generalise to others in the population. Single-subject researchers also note that they place a strong emphasis on replicating their research results. When they observe an effect with a small sample of participants, they typically try to replicate it with another small sample—perhaps with a slightly different type of participant or under slightly different conditions. Each time they observe similar results, they rightfully become more confident in the generality of those results. Single-subject researchers can also point to the fact that the principles of classical and operant conditioning—most of which were discovered using the single-subject approach—have been successfully generalised across an incredibly wide range of species and situations.

Single-subject researchers have concerns of their own about the external validity of group research. One extremely important point they make is that studying large groups of participants does not entirely solve the problem of generalising to other individuals. Imagine, for example, a treatment that has been shown to have a small positive effect on average in a large group study. It is likely that although many participants exhibited a small positive effect, others exhibited a large positive effect, and still others exhibited a small negative effect. When it comes to applying this treatment to another large group, we can be fairly sure that it will have a small effect on average. But when it comes to applying this treatment to another individual, we cannot be sure whether it will have a small, a large, or even a negative effect.

Another point that single-subject researchers make is that group researchers also face a similar problem when they study a single situation and then generalise their results to other situations. For example, researchers who conduct a study on the effect of cell phone use on drivers on a closed oval track probably want to apply their results to drivers in many other real-world driving situations. But notice that this requires generalising from a single situation to a population of situations. Thus the ability to generalise is based on much more than just the sheer number of participants one has studied. It requires a careful consideration of the similarity of the participants and situations studied to the population of participants and situations that one wants to generalise to (Shadish, Cook, & Campbell, 2002)[3].

20
Q

Single-Subject and Group Research as Complementary Methods

A

As with quantitative and qualitative research, it is probably best to conceptualise single-subject research and group research as complementary methods that have different strengths and weaknesses and that are appropriate for answering different kinds of research questions:
• Single-subject research is particularly good
for testing the effectiveness of treatments
on individuals when the focus is on strong,
consistent, and biologically or socially
important effects.
o It is also especially useful when the
behaviour of particular individuals is of
interest.
o Clinicians who work with only one
individual at a time may find that it is their
only option for doing systematic
quantitative research.
• Group research, on the other hand, is ideal
for testing the effectiveness of treatments
at the group level.
o Among the advantages of this approach is
that it allows researchers to detect weak
effects, which can be of interest for many
reasons. For example, finding a weak
treatment effect might lead to refinements
of the treatment that eventually produce a
larger and more meaningful effect.
o Group research is also good for studying
interactions between treatments and
participant characteristics. For example, if
a treatment is effective for those who are
high in motivation to change and
ineffective for those who are low in
motivation to change, then a group design
can detect this much more efficiently than
a single-subject design. Group research is
also necessary to answer questions that
cannot be addressed using the single-
subject approach, including questions
about independent variables that cannot
be manipulated (e.g., number of siblings,
extraversion, culture).
• Different research traditions:
o Researchers in the experimental analysis
of behaviour and applied behaviour
analysis learn to conceptualise their
research questions in ways that are
amenable to the single-subject approach.
o Researchers in most other areas of
psychology learn to conceptualise their
research questions in ways that are
amenable to the group approach.
o At the same time, there are many topics in
psychology in which research from the two
traditions have informed each other and
been successfully integrated. One
example is research suggesting that both
animals and humans have an innate
“number sense”—an awareness of how
many objects or events of a particular
type have they have experienced without
actually having to count them.
o Single-subject research with rats and birds
and group research with human infants
have shown strikingly similar abilities in
those populations to discriminate small
numbers of objects and events. This
number sense—which probably evolved
long before humans did—may even be the
foundation of humans’ advanced
mathematical abilities.

21
Q

Key Takeaways

A

 Differences between single-subject
research and group research sometimes
lead to disagreements between single-
subject and group researchers. These
disagreements centre on the issues of data
analysis and external validity (especially
generalisation to other people).
 Single-subject research and group
research are probably best seen as
complementary methods, with different
strengths and weaknesses, that are
appropriate for answering different kinds of
research questions.

22
Q

Multiple Dependant Variables
Measuring Different Constructs
Measuring the Same Construct

A

Measuring Different Constructs
§ When an experiment includes multiple
dependent variables, there is again a
possibility of carryover effects. For example,
it is possible that measuring participants’
moods before measuring their perceived
health could affect their perceived health or
that measuring their perceived health
before their moods could affect their
moods. So the order in which multiple
dependent variables are measured
becomes an issue.
§ Solutions:
o measure them in the same order for all
participants—usually with the most
important one first so that it cannot be
affected by measuring the others.
o Counterbalance, or systematically vary, the
order in which the dependent variables are
measured.

Measuring the Same Construct
§ Multiple dependent variables is to
operationally define and measure the same
construct, or closely related ones, in
different ways (i.e., converging operations).
§ The dependent variable, stress, is a
construct that can be operationally defined
in different ways. For this reason, the
researcher might have participants
complete the paper-and-pencil Perceived
Stress Scale and measure their levels of
the stress hormone cortisol. This is an
example of the use of converging
operations. If the researcher finds that the
different measures are affected by exercise
in the same way, then he or she can be
confident in the conclusion that exercise
affects the more general construct of
stress.
§ When multiple dependent variables are
different measures of the same construct—
especially if they are measured on the
same scale —researchers have the option
of combining them into a single measure of
that construct.
§ When researchers combine dependent
variables in this way, they are treating them
collectively as a multiple-response
measure of a single construct. The
advantage of this is that multiple-response
measures are generally more reliable than
single-response measures.
§ However, it is important to make sure the
individual dependent variables are strongly
correlated with each other by computing
an internal consistency measure such as
Cronbach’s α. If they are not correlated
with each other, then it does not make
sense to combine them into a measure of a
single construct. If they have poor internal
consistency, then they should be treated as
separate dependent variables.

23
Q

Manipulation Checks

A

§ When the independent variable is a
construct that can only be manipulated
indirectly—such as emotions and other
internal states—an additional measure of
that independent variable is often included
as a manipulation check. This is done to
confirm that the independent variable was,
in fact, successfully manipulated.
§ Manipulation checks are usually done at
the end of the procedure to be sure that
the effect of the manipulation lasted
throughout the entire procedure and to
avoid calling unnecessary attention to the
manipulation.
§ Manipulation checks become especially
important when the manipulation of the
independent variable turns out to have no
effect on the dependent variable. Imagine,
for example, that you exposed participants
to happy or sad movie music—intending to
put them in happy or sad moods—but you
found that this had no effect on the
number of happy or sad childhood events
they recalled. This could be because being
in a happy or sad mood has no effect on
memories for childhood events. But it
could also be that the music was
ineffective at putting participants in happy
or sad moods. A manipulation check—in
this case, a measure of participants’
moods— would help resolve this
uncertainty. If it showed that you had
successfully manipulated participants’
moods, then it would appear that there is
indeed no effect of mood on memory for
childhood events. But if it showed that you
did not successfully manipulate
participants’ moods, then it would appear
that you need a more effective
manipulation to answer your research
question.

24
Q

Key Takeaways

A

§ Researchers in psychology often include
multiple dependent variables in their
studies. The primary reason is that this
easily allows them to answer more
research questions with minimal additional
effort.
§ When an independent variable is a
construct that is manipulated indirectly, it is
a good idea to include a manipulation
check. This is a measure of the
independent variable typically given at the
end of the procedure to confirm that it was
successfully manipulated.
§ Multiple measures of the same construct
can be analysed separately or combined
to produce a single multiple-item measure
of that construct. The latter approach
requires that the measures taken together
have good internal consistency.

25
Q

Multiple IVs

Factorial Designs

A

Factorial Designs
§ It is also common for them to include multiple
independent variables.
§ It allows us to answer more research
questions. Schnall and colleagues were able
to conduct one study that addressed both
questions. But including multiple
independent variables also allows the
researcher to answer questions about
whether the effect of one independent
variable depends on the level of another.
This is referred to as an interaction between
the independent variables. Interactions are
often among the most interesting results in
psychological research.
§ In a factorial design, each level of one
independent variable (which can also be
called a factor) is combined with each level
of the others to produce all possible
combinations. Each combination, is a
condition in the experiment.
§ 3 × 2 factorial design, and there would be six
distinct conditions. Notice that the number of
possible conditions is the product of the
numbers of levels. A 2 × 2 factorial design
has four conditions, a 3 × 2 factorial design
has six conditions, a 4 × 5 factorial design
would have 20 conditions.
§ In principle, factorial designs can include any
number of independent variables with any
number of levels.
§ In practice, it is unusual for there to be more
than three independent variables with more
than two or three levels each.
§ This is for at least two reasons:
o First, the number of conditions can quickly
become unmanageable. For example,
adding a fourth independent variable with
three levels (e.g., therapist experience: low
vs. medium vs. high) to the current example
would make it a 2 × 2 × 2 × 3 factorial design
with 24 distinct conditions.
o Second, the number of participants required
to populate all of these conditions (while
maintaining a reasonable ability to detect a
real underlying effect) can render the design
infeasible.

Assigning Participants to Conditions
§ In a factorial experiment, the decision to take
the between-subjects or within-subjects
approach must be made separately for each
independent variable.
§ In a between-subjects factorial design, all of
the independent variables are manipulated
between different subjects.
§ In a within-subjects factorial design, all of the
independent variables are manipulated
within subjects.
§ Advantages/Disadvantages: the between-
subjects design is conceptually simpler,
avoids carryover effects, and minimises the
time and effort of each participant. The
within-subjects design is more efficient for
the researcher and controls extraneous
participant variables.
§ Mixed factorial design; it is also possible to
manipulate one independent variable
between subjects and another within
subjects.
§ Regardless of whether the design is between
subjects, within subjects, or mixed, as with all
experiments, the actual assignment of
participants to conditions or orders of
conditions is typically done randomly.

26
Q

Non-manipulated Independent Variables

A

§ In many factorial designs, one of the
independent variables is a non-manipulated
independent variable. The researcher
measures it but does not manipulate it.
§ First, non-manipulated independent
variables are usually participant variables
(private body consciousness,
hypochondriasis, self-esteem, and so on),
and as such they are by definition between-
subjects factors.
§ Second, such studies are generally
considered to be experiments as long as at
least one independent variable is
manipulated, regardless of how many non-
manipulated independent variables are
included.
§ Third, it is important to remember that
causal conclusions can only be drawn
about the manipulated independent
variable.

27
Q

Key Takeaways

A

§ Researchers often include multiple
independent variables in their
experiments. The most common approach
is the factorial design, in which each level
of one independent variable is combined
with each level of the others to create all
possible conditions.
§ In a factorial design, the main effect of an
independent variable is its overall effect
averaged across all other independent
variables. There is one main effect for each
independent variable.
§ There is an interaction between two
independent variables when the effect of
one depends on the level of the other.
Some of the most interesting research
questions and results in psychology are
specifically about interactions.

28
Q

Key Takeaways

A

§ Researchers often use complex
correlational research to explore
relationships among several variables in the
same study.
§ Complex correlational research can be
used to explore possible causal
relationships among variables using
techniques such as multiple regression.
Such designs can show patterns of
relationships that are consistent with some
causal interpretations and inconsistent with
others, but they cannot unambiguously
establish that one variable causes another.

29
Q

Correlational Studies With Factorial Designs

A

§ Factorial designs can also include only non-manipulated independent variables, in which case they are no longer experiments but correlational studies.
§ Consider a hypothetical study in which a researcher measures both the moods and the self-esteem of several participants—categorising them as having either a positive or negative mood and as being either high or low in self-esteem—along with their willingness to have unprotected sexual intercourse. This can be conceptualised as a 2 × 2 factorial design with mood (positive vs. negative) and self-esteem (high vs. low) as between-subjects factors. Willingness to have unprotected sex is the dependent variable. This design can be represented in a factorial design table and the results in a bar graph of the sort we have already seen. The researcher would consider the main effect of mood, the main effect of self-esteem, and the interaction between these two independent variables.
§ One must be cautious about inferring causality from correlational studies because of the directionality and third-variable problems.

30
Q

Assessing Relationships Among Multiple Variables

A

§ Most complex correlational research, however, does not fit neatly into a factorial design. Instead, it involves measuring several variables—often both categorical and quantitative—and then assessing the statistical relationships among them.
§ This approach is often used to assess the validity of new psychological measures.
§ When researchers study relationships among a large number of conceptually similar variables, they often use a complex statistical technique called factor analysis. In essence, factor analysis organises the variables into a smaller number of clusters, such that they are strongly correlated within each cluster but weakly correlated between clusters. Each cluster is then interpreted as multiple measures of the same underlying construct.These underlying constructs are also called “factors.”
§ For example, when people perform a wide variety of mental tasks, factor analysis typically organises them into two main factors—one that researchers interpret as mathematical intelligence (arithmetic, quantitative estimation, spatial reasoning, and so on) and another that they interpret as verbal intelligence (grammar, reading comprehension, vocabulary, and so on). The Big Five personality factors have been identified through factor analyses of people’s scores on a large number of more specific traits. For example, measures of warmth, gregariousness, activity level, and positive emotions tend to be highly correlated with each other and are interpreted as representing the construct of extraversion. As a final example, researchers Peter Rentfrow and Samuel Gosling asked more than 1,700 university students to rate how much they liked 14 different popular genres of music (Rentfrow & Gosling, 2008)[4]. They then submitted these 14 variables to a factor analysis, which identified four distinct factors. The researchers called them Reflective and Complex (blues, jazz, classical, and folk), Intense and Rebellious (rock, alternative, and heavy metal), Upbeat and Conventional (country, soundtrack, religious, pop), and Energetic and Rhythmic (rap/hip-hop, soul/funk, and electronica).
§ First, factors are not categories. Factor analysis does not tell us that people are either extraverted or conscientious or that they like either “reflective and complex” music or “intense and rebellious” music. Instead, factors are constructs that operate independently of each other. So people who are high in extraversion might be high or low in conscientiousness, and people who like reflective and complex music might or might not also like intense and rebellious music.
§ Second, factor analysis reveals only the underlying structure of the variables. It is up to researchers to interpret and label the factors and to explain the origin of that particular factor structure. For example, one reason that extraversion and the other Big Five operate as separate factors is that they appear to be correlated with different biological variables

31
Q

Exploring Causal Relationships

A

§ Another important use of complex correlational research is to explore possible causal relationships among variables. This might seem surprising given that “correlation does not imply causation”. It is true that correlational research cannot unambiguously establish that one variable causes another. Complex correlational research, however, can often be used to rule out other plausible interpretations.
§ The primary way of doing this is through the statistical control of potential third variables. Instead of controlling these variables by random assignment or by holding them constant as in an experiment, the researcher measures them and includes them in the statistical analysis.
§ Researchers dealt with these potential third variables, however, by measuring them and including them in their statistical analyses. They found that neither religiosity nor ethnicity was correlated with generosity and were therefore able to rule them out as third variables. This does not prove that SES causes greater generosity because there could still be other third variables that the researchers did not measure. But by ruling out some of the most plausible third variables, the researchers made a stronger case for SES as the cause of the greater generosity.
§ Many studies of this type use a statistical technique called multiple regression. This involves measuring several independent variables (X1, X2, X3,…Xi, where i is the total number of independent variables), all of which are possible causes of a single dependent variable (Y). The result of a multiple regression analysis is an equation that expresses the dependent variable as an additive combination of the independent variables. regression weights that indicate how large a contribution an independent variable makes, on average, to the dependent variable. Specifically, they indicate how much the dependent variable changes for each one-unit change in the independent variable.
§ The advantage of multiple regression is that it can show whether an independent variable makes a contribution to a dependent variable over and above the contributions made by other independent variables. As a hypothetical example, imagine that a researcher wants to know how the independent variables of income and health relate to the dependent variable of happiness. This is tricky because income and health are themselves related to each other. Thus if people with greater incomes tend to be happier, then perhaps this is only because they tend to be healthier. Likewise, if people who are healthier tend to be happier, perhaps this is only because they tend to make more money. But a multiple regression analysis including both income and happiness as independent variables would show whether each one makes a contribution to happiness when the other is taken into account.

32
Q

four validities

A

Four Big Validities
o Internal Validity:
o The purpose of an experiment, however, is to show that two variables are statistically related and to do so in a way that supports the conclusion that the independent variable caused any observed differences in the dependent variable. The logic is based on this assumption: If the researcher creates two or more highly similar conditions and then manipulates the independent variable to produce just one difference between them, then any later difference between the conditions must have been caused by the independent variable.
o An empirical study is said to be high in internal validity if the way it was conducted supports the conclusion that the independent variable caused any observed differences in the dependent variable. Thus experiments are high in internal validity because the way they are conducted—with the manipulation of the independent variable and the control of extraneous variables—provides strong support for causal conclusions.
o External Validity:
o The need for experiments to manipulate the independent variable and control extraneous variables means that experiments are often conducted under conditions that seem artificial.
o An empirical study is high in external validity if the way it was conducted supports generalising the results to people and situations beyond those actually studied. As a general rule, studies are higher in external validity when the participants and the situation studied are similar to those that the researchers want to generalise to and participants encounter everyday, often described as mundane realism.
o High psychological realism where the same mental process is used in both the laboratory and in the real world.
o Construct Validity:
o The quality of the experiment’s manipulations, or the construct validity.
o In evaluating this design, we would say that the construct validity was very high because the experiment’s manipulations very clearly speak to the research question (i.e., operationalisation).
o Statistical Validity:
o A common critique of experiments is that a study did not have enough participants. The main reason for this criticism is that it is difficult to generalise about a population from a small sample.
o Sample size reflects statistical validity and not external validity!
o The statistical validity speaks to whether the statistics conducted in the study support the conclusions that are made.
o Proper statistical analysis should be conducted on the data to determine whether the difference or relationship that was predicted was found. The number of conditions and the number of total participants will determine the overall size of the effect. With this information, a power analysis can be conducted to ascertain whether you are likely to find a real difference.
o Four validities: they have to be prioritised because we cannot be high in all areas. Most experiments are high in internal and construct validity but sacrifice external validity.

33
Q

Key take aways

A

Key Take-Aways
o Studies are high in internal validity to the extent that the way they are conducted supports the conclusion that the independent variable caused any observed differences in the dependent variable. Experiments are generally high in internal validity because of the manipulation of the independent variable and control of extraneous variables.
o Studies are high in external validity to the extent that the result can be generalised to people and situations beyond those actually studied. Although experiments can seem “artificial”—and low in external validity—it is important to consider whether the psychological processes under study are likely to operate in other people and situations.