Research Design and Statistics Flashcards

1
Q

[internal validity]
A valid causal inference requires satisfaction
of three criteria: (a) statistical association, (b) temporal precedence, and (c) ________

A

nonspuriousness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Spurious causes are ….

A

threats to internal validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

External validity is the extent to which the causal association can be ___________
to or across variations in study instances

A

generalized

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

An example of ___________, researchers sometimes use inadequate labels to describe study instances (e.g., label a treatment “progressive relaxation” when the treatment has many additional therapeutic components).

A

Construct Validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Randomized experiments are considered the gold standard for assessing ________.

A

causality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In _________ trials, an intervention’s effects are examined under real-world conditions.
Such trials often take place outside of academic settings (e.g., community
mental health centers).

A

effectiveness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In ______ trials, an intervention’s effects are examined under ideal circumstances,
particularly with respect to treatment implementation.

A

efficacy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

In _________ analyses, researchers
analyze outcome data from participants as a function of their original group assignment,
regardless of their level of exposure to treatment. The analysis is intended
to provide a conservative (and real-world) estimate of the treatment effect because
it is based on cases exposed to varying levels of treatment.

A

intent-to-treat

missing data is core problem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

single-case experiments are often designed to

increase _________

A

internal validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The _______ design is a single-case design that alternates the baseline (A) phase
(intervention absent) with an intervention (B) phase (intervention present). The
outcome of interest is assessed on multiple occasions within each phase.

A

ABAB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In _________ designs, replication of an effect is sought over multiple baselines,
which can reflect different behaviors, settings, and/or children (just to name
a few).

A

multiple baseline

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Although inferential statistical procedures can be used to analyze data from single case
experiments, it is more common for clinicians to rely on ____________ of the data. Visual inspection is often supplemented with descriptive statistics.

A

visual inspection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

clinicians can examine _______ by comparing the averagenfrequency of the outcome across different phases of the experiment (e.g., during
the A vs. B phases).

A

mean changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
Clinicians can also examine \_\_\_\_\_\_\_\_ in which they compare the last data point in an immediately prior phase to the first data point in
an immediately subsequent phase. If the latency of response is hypothesized to
be immediate (e.g., the behavior will reduce dramatically as soon as the intervention
is implemented), one might predict dramatic level changes between adjacent
(baseline-intervention) phases.
A

level shifts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Clinicians can also examine _______ (or functional
form) changes by examining the rate of behavior change in different phases. For
example, the behavior might increase in a fairly linear (i.e., constant) manner during
the initial A phase and become fairly stable during the initial B phase

A

slope

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Quasi-experimental studies are experiments that lack _____ of units
to conditions.

A

random assignment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

_____________ (also called passive observational studies) are conducted
when the researcher is not actively manipulating anything (like exposure to an
intervention).

A

Correlational studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

________ designs compare a group of participants who possess a certain characteristic
(e.g., diagnosis of attention deficit hyperactivity disorder [ADHD]) with
a group of participants who do not possess the characteristic.

A

Case–control

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q
In \_\_\_\_\_\_\_\_\_\_, an intact group (i.e., cohort) is followed over time to examine the
emergence of—and/or change—in some outcome of interest. These designs are classified
as longitudinal (also known as prospective) because individuals are assessed
on at least two occasions.
A

cohort designs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

If the multiple cohorts also differ in their age or some other salient developmental marker at the study’s inception, the study is called a __________
design. Such designs allow for the study of a longer developmental period over fewer
years of data collection because several developmental cohorts (e.g., toddlers, preschoolers,
and school-aged children) are embedded in the study.

A

cross-sequential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

______ is a threat to validity when naturally occurring changes are mistaken
for an intervention effect—when symptoms remit because of the passage of time
rather than the effects of an intervention.

A

Maturation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

____ is a threat to validity when some event (or constellation of events) occurs
during the study and impacts the results in a manner mistaken for an intervention
(e.g., pt exercising and it helps depression vs. treatment).

A

History

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

____—also known as regression to the mean—occurs when
extreme scores tend to revert back to the mean on a subsequent evaluation.
Statistical
regression is more plausible in single-group studies in which extreme
performers (e.g., severely depressed individuals) comprise the study sample.

A

Statistical regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

______ is a threat to validity when the pattern of participant drop-out impacts
the results in a way interpreted as an intervention effect.

A

Attrition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

____ is a threat to validity when exposing individuals to the pretest changes
them in ways that might be mistaken for an intervention effect.

A

Testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

______ is a threat to validity when the measurement tool changes and
impacts the results in a manner mistaken for an intervention effect.

A

Instrumentation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

____ occurs in multiple-group studies when systematic differences among
intervention groups can be mistaken for an intervention effect.

A

Selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

As such, the reliability of a measure is viewed as
the ratio of true score variance to ______ (True score variance is consistent
with consistency or dependability, concepts that are often invoked in discussions
of reliability.)

A

total variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Kuder–Richardson Formula 20—often

abbreviated KR-20—which can be used when the items are

A

dichotomous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

______—also known as internal structure—is the extent to which the
structure of the measure is consistent with the theorized factor structure of the construct.

A

Structural validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

____ matrix can be used to evaluate convergent and discriminant validity.

A

multitrait–multimethod

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

In ____, three
primary decisions involve (a) choosing a method of factor extraction, (b) choosing
a method of factor rotation, and (c) deciding on the number of factors to retain.

A

EFA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

In _______ rotations, the factors are
assumed to be uncorrelated. In oblique rotations, the factors are assumed to be
correlated—often the case in psychology.

A

orthogonal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Several _______ can help with this
last decision—including the chi-square test, root mean square error of approximation
(RMSEA), and standardized root mean square residual (SRMR). All fit indices
quantify (albeit in slightly different ways) how well the model-implied covariance
matrix reproduces the estimated population covariance matrix of the analysis
variables.

A

fit indices [CFA]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

One of the advantages of conducting
statistical analyses on_______ (as in the case of structural equation
modeling, described below), is the gain in statistical power that results when
measurement error is removed from the constructs of interest.

A

latent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

The ___ is used when (a) data are ordinal, or (b) when data are interval or ratio, but the
distribution is highly skewed (the median is less affected by skewness)

A

median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

In_______, the mean and median are identical.
In symmetrical
distributions with a single mode, the mean, median, and mode are identical

A

symmetrical distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

The interquartile range captures the middle 50% of the distribution and is
computed by subtracting the 25th percentile (first quartile) from the _____th percentile
(third quartile)

A

75

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

The standard deviation (SD) captures the average distance of scores from the

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

z = (x − M) / _______ .

A

SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Properties of the Normal z Distribution When a z-score conversion is used,
the resulting z distribution has the following properties: (a) the mean is ___; (b) the
standard deviation is 1; and (c) each z score represents the position of the score in
relation to the mean, in standard deviation units. In other words, a z score of −1.27
denotes a score exactly 1.27 standard deviations below the mean.

A

0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

percentiles can be computed by conversion of T to z: (T – 50)/______.

A

10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

__________: (a) make
more distributional assumptions (e.g., that the distribution is normal), (b) assume
data are measured on an interval or ratio scale, (c) are conducted on actual data
(as opposed to on ranks derived from data), and (d) allow researchers to test more
specific hypotheses about the populations from which they are drawn.

A

parametric statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

the null hypothesis specifies

that the two ________ are equal (mt = mc).

A

population means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

When the sample data
would occur relatively infrequently assuming the null hypothesis (e.g., the data
would occur less than 5% of the time if the null hypothesis were true

A

p < .05 (sig level; alpha)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

results are not declared statistically significant even though the null hypothesis
is false

A

Type II error (Beta)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Results were excepted even though the null was true

A

Type I error (alpha)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

_______ is the probability of correctly rejecting a false null hypothesis (i.e., finding an effect when one exists in the population)

A

Statistical power (1 -Beta)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What helps increase power?

A
Increase N;
Increase alpha;
Directional hypotheses;
Large effects;
More reliable measures
50
Q

When the 95% confidence interval does not contain zero the results are:

A

significant

51
Q
An example of \_\_\_\_\_\_\_\_\_: (a) no longer meeting diagnostic
criteria and (b) scoring two standard errors below a pretest score on one of the
primary study outcomes.
A

Clinical significance (vs statistical)

52
Q

two-tailed or nondirectional means:

A

(leaving open the possibility that the sample mean will be larger or smaller than
the population mean of 100).

53
Q

A Single-sample Z test is used when

A

The population SD is known (e.g., intelligence tests).

54
Q

A single-sample T test is used when

A

The population SD is unknown.

55
Q

A relevant effect size is C______, which is a standardized mean difference and is
computed as the ratio of the difference between the two sample means to the
pooled standard deviation.

A

Cohen’s d (.2 small, .5 med, .8 large).

56
Q

omega squared, ω2—reflect the proportion of variance
in the outcome that is explained by the factors. Interpretive guidelines for omega
squared are small = 0.01, medium = 0.06, and large = 0.15.

A

Effect size used in ANOVA

57
Q

In models for which
there are three or more levels of a factor (e.g., low, medium, and high levels of
stress), the test of the factor’s main effect is an _______ statistical test.

A

omnibus

58
Q

omnibus tests are typically followed
by a series of additional tests (e.g., comparing each pairs of means). These
more focused contrasts are often referred to as _____.

A

post hoc tests

59
Q

Conducting multiple
statistical tests raises the familywise type I error rate associated with the full set of
analyses, so we use ______.

A

Corrections (e.g., Bonferoni, Tukey)

60
Q

A ____________ can be used
to contrast two or more treatment groups (e.g., CBT, interpersonal therapy [IPT],
and control).

A

one-way between-subjects ANOVA

61
Q

A ___________ can be used to
examine a single cohort’s symptom levels over two or more assessments (e.g., pretest,
posttest, and follow-up measures for individuals exposed to a single intervention).

A

one-way within-subjects ANOVA

62
Q

What test design is this an example of:
one might be interested in examining whether ADHD diagnosis (present or absent)
and testing environment (quiet or noisy room) have an impact on performance.

A

two-way between subjects ANOVA

63
Q

What test design is this an example of:
exposing a group of children with ADHD to two levels of a psychostimulant drug dose
factor (e.g., 5 and 10 mg) crossed with two levels of a testing-environment
factor (e.g., quiet and noisy rooms). In other words, all children would be observed
under all four study conditions (e.g., 10 mg, quiet room) and performance would
be the dependent variable

A

two-way WITHIN subjects ANOVA

64
Q
\_\_\_\_\_\_\_\_\_ are used in two primary manners: (a) to increase statistical power
in randomized experiments (when the covariate is uncorrelated with intervention
conditions, but correlated with the dependent variable); and (b) to control for possible
confounding influences (i.e., controlling for variables associated with both
intervention conditions and the dependent variable) in nonrandomized designs.
A

ANCOVAs

65
Q

the multivariate ANOVA model (i.e.,

MANOVA model) allows for _______ to be analyzed in a single model.

A

multiple dependent variables

66
Q

In MANOVA, the actual analysis is performed on an optimized _________ (one that maximizes between group
differences while minimizing within-group differences). A number of test
statistics are generated by MANOVA (Pillais’s trace, Wilk’s lambda, Hotelling’s
trace, and Roy’s largest root).

A

linear combination of the multiple dependent variables

67
Q

Effect size is estimated by computing the square of the correlation (i.e., the coefficient
of determination), which is the proportion of shared variance between the
two variables. Common interpretative guidelines for r2 are as follows: small = .01,
medium = .09, and large = .25.

A

Correlation effect size (r2)

68
Q

Spearman’s
rank correlation coefficient and Kendall’s tau coefficient are both nonparametric
tests that are used when responses on the two variables are ________.

A

rank ordered

69
Q

unstandardized linear regression coefficient (b) meaning):

A

for every one-unit increase in x, Y changes by b units

70
Q

(Hierarchical regression analysis is sometimes confused with stepwise regression analysis,
which is an _______ approach to predictor entry used more often in exploratory
analyses.)

A

atheoretical

71
Q

Models that

do not include interaction effects are referred to as _______.

A

additive effects models

72
Q

The Mann–Whitney Test (also called the Mann–Whitney–Wilcoxon Test) is a nonparametric
alternative to the _________

A

independent samples t-test

73
Q

The Kruskal–

Wallis Test is a nonparametric alternative to the ______.

A

between-group ANOVA

74
Q

The Wilcoxon Signed Ranks Test is a nonparametric

alternative to the __________.

A

paired samples t-test.

75
Q

Testing for moderation is the same as testing for a _________
between a predictor and a moderator. For example, a researcher might test
whether an intervention effect is moderated by participant sex to see whether the
effects of the intervention on the outcome are stronger for men or women

A

statistical interaction

76
Q

A _______ is the mechanism through which a distal predictor operates in influencing an
outcome.

A

mediator

77
Q

________ centers on understanding participants’ lived experiences and
emphasizes subjective experience (e.g., understanding personal knowledge, motivations,
and perspectives).

A

Phenomenology

78
Q

The ultimate goal of ______ is to develop a theory (“grounded” in data)
about a concept of interest. This approach is used when current theory is lacking,
nonexistent, or incomplete.

A

grounded theory

79
Q

qualitative research data are often analyzed
using __________, a process of identifying and analyzing patterns or themes
within data. This can occur either deductively (i.e., starting from a particular theory
or hypothesis) or inductively (i.e., as in grounded theory above)

A

thematic analysis

80
Q

1) triangulation (the use
of multiple, varied sources of data, methods, and researchers in order to corroborate
results), 2) audits (the use of an external consultant to complete an independent
analysis), and 3)member checking (having participants in the study review and provide
feedback on the credibility of findings).

A

Checking reliability and validity in qualitative research

81
Q

The first step in ________ is to identify and engage stakeholders (e.g., administrators,
staff, clients, etc.). Next, a needs assessment might be conducted to assess
the relative priority of the needs, or “problems,” of a specific population in order
to determine where resources should be allocated

A

Program Evaluation

82
Q

Formative evaluations provide information to make needed changes early on, whereas _____ determine a program’s success once delivered.

A

summative evaluations

83
Q

CBA examines the balance of resources/
costs spent on a program compared to the benefits to answer the question “have
resources been well spent on this program?” This results in a benefit/cost ratio, the
worth of a program’s outcomes divided by the program’s costs, which can then be
compared to alternative programs. CBA is controversial in part because it assigns
monetary values to the benefits arising from a program.

A

Cost-benefit Analysis

84
Q

_______ sampling is when researchers collect data from individuals with specific
characteristics.

A

Purposive

85
Q

_______ sampling, a type of purposive sampling, involves participants
inviting others to participate in the study.

A

Snowball

86
Q

______ sampling uses
incentives to overcome possible biases that result from snowball and other chain-referral
sampling methods.

A

Respondent-driven

87
Q

In other words, unlike _______ variables which provide a causal explanation for the relationship between variables, moderator variables affect the strength of the relationship.

A

mediator

88
Q

_________ is useful for studying behaviors that occur infrequently, have a long duration, or leave a permanent record or other product (e.g., a completed worksheet or test).

A

Event sampling

89
Q

_______ is an alternative to behavioral sampling and is used when a goal of the study is to observe a behavior in a number of settings. It helps increase the generalizability of a study’s findings.

A

Situational sampling

90
Q

the independent variable (experimental variance);
• systematic error (error due to extraneous variables); and
• random error (error due to random fluctuations in subjects, experimental conditions, methods of measurement, etc.).

A

three factors that can cause variability in the study’s dependent variable.

91
Q

when a researcher includes an extraneous variable as an independent variable in a study, the extraneous variable is also known as a ________ variable

A

moderator

92
Q

________ is a particularly useful method in quasi-experimental research in which subjects cannot be randomly assigned to treatment groups.

A

Statistical control (such as using covariates)

93
Q

A study has internal validity when it allows an investigator to determine if there is a ____ relationship between independent and dependent variables.

A

causal

94
Q

Fatigue, boredom, hunger, and physical and cognitive development are
potential ______ effects that can limit a study’s Internal validity

A

maturational;
The best way to control maturation is to include more than one group in the study and randomly assign subjects to groups.

95
Q

History is controlled by including more than one group in the study and _________ subjects to groups.

A

randomly assigning

96
Q

The threat of _______ can be controlled by administering the DV measure only once as a posttest, by designing the measure in a way that minimizes memory and practice effects, or by including at least two groups in the study with all groups completing the pre- and posttests so that any difference between groups on the posttest

A

Testing

97
Q

_______ is controlled by including more than one group in the study and ensuring that all groups are subject to the same instrumentation effects, by using the same measuring devices and procedures with all subjects, and by making sure that measuring devices and procedures do not change during the course of the study.

A

Instrumentation

98
Q

The _________ threat is avoided by not including only extreme scorers in the study or by including more than one group and ensuring that all groups consist of subjects who are similarly extreme.

A

statistical regression

99
Q

______ is difficult to control, but pretesting can help determine if dropouts and non-dropouts differ with regard to their initial status on the DV.

A

Attrition

100
Q

Population validity is generalization to other people and ecological validity is generalization to other ______.

A

settings

101
Q

When a study’s results have been contaminated by _______, they cannot be generalized to people who have not been pretested. Pretest · sensitization is controlled by not administering a pretest or by using the Solomon four-group design, which allows an investigator to measure the impact of pretesting on both the external and internal validity of a research study.

A

pretest sensitization

102
Q

An interaction between _________ is often a problem when subjects are volunteers because volunteers tend to be more motivated than non-volunteers and, consequently, might be more responsive to the IV. In this situation, the study’s results
apply to volunteers but can’t be generalized to other people. The best way to eliminate this threat is to ensure that the sample is representative of the population of interest.

A

selection and treatment

103
Q

Research participants may respond to an independent variable in a particular way simply because they know their behavior is being observed, and this is known as _____

A

reactivity

104
Q

The behavior of subjects can also be altered by __________, which are cues in the experimental setting that inform subjects of the purpose of the study or suggest what behaviors are expected of them.

A

demand characteristics

105
Q

______can be controlled by using deception, unobtrusive (nonreactive) measures, or a single- or double-blind technique. When using a single-blind technique, subjects do not know which treatment group they have been assigned to; in a double-blind study, neither the subjects nor the experimenter know which group subjects have been assigned to.

A

Reactivity

106
Q

When a study involves exposing each subject to two or more levels of an independent variable (i.e., when the study utilizes a within-subjects design), the effects of one level of the independent variable can be affected by previous exposure to another level.

A

(Order Effects, Carryover Effects)

107
Q

The _______ designs are considered inappropriate when withdrawal of a treatment during the course of a research study would be unethical (e.g., when the treatment has successfully eliminated a self-injurious behavior)

A

reversal (ABAB)

108
Q

Because of its insensitivity to “outliers,·· the _____ is a useful measure of central tendency when a distribution contains one or a few extreme scores.

A

median

109
Q

One advantage of the _____ is that, of the three measures of central tendency, it is least susceptible to sampling fluctuations.

A

mean

110
Q

Regardless of the shape of the distribution of individual scores in the population, as the sample size increases, the sampling distribution of the mean approaches a normal distribution.
• The mean of the sampling distribution of the mean is equal to the population mean.
• The standard deviation of the sampling distribution of the mean is equal to the population standard deviation divided by the square root of the sample size:

A

Central Limit theorem

111
Q

The ________ is the foundation of inferential statistics. It is the sampling distribution that enables a researcher to make inferences about the relationship between variables in the population based on obtained sample data.

A

sampling distribution

112
Q

A _____ error is more likely when alpha is low, the sample size is small, and the independent variable is not administered in sufficient intensity.

A

Type II

113
Q

One tailed tests, when appropriate, and parametric tests vs non are helpful to increase _____.

A

Power

114
Q

The most effective way to maximize the robustness of a parametric test is to have an equal number of _______.

A

subjects in each group

115
Q

Numerator and denominator of F statistic are:

A

Mean Square Between / Mean Square Within

116
Q

ANCOVA and randomized block ANOVA serve to decrease ___________ variability in order to create a stronger test.

A

within group

117
Q

Put another way, the ___________ indicates the proportion of variability in Y that is
explained by, or accounted for by the variability in X. For example, if the correlation
coefficient for sales success and product knowledge is .60, then 36% (.60 squared= .36) of variability in sales success is accounted for by product knowledge.

A

squared correlation coefficient

118
Q

If the subscript contains two different letters or numbers (e.g., “xy”), it represents the correlation between two different variables. When the subscript contains the same letters or numbers (e.g., ‘‘xx”), it is a _________.

A

reliability coefficient

119
Q

Canonical correlation is an extension of multiple regression that is used when two or more continuous predictors are to be used to predict status on _______.

A

two or more continuous criteria

120
Q

________ analysis is also known as discriminant analysis and is the appropriate technique when two or more continuous predictors will be used to predict or estimate a person’s status on a single discrete (nominal) criterion.

A

Discriminant function; examines “hit rate” or number of correct classifications