Glossary Portney & Watkins Flashcards

1
Q

regression line

A

The straight line that is drawn on a scatter plot for bivariate data from the regression equation, summarizing the relationship between variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

sampling error

A

The difference between an observed statistic from a sample and the population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

analysis of covariance (ANCOVA)

A

Statistical procedure used to compare two or more treatment groups while controlling for the effect of one or more confounding variables (called covariates).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

positive predictive value

A

Estimate of the likelihood that a person who tests positive actually has the disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

curvilinear relationship

A

The relationship between two variables that does not follow a linear proportional relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

phi coefficient

A

A nonparametric correlation statistic for estimating the relationship between two dichotomous variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Type II error

A

An incorrect decision to accept the null hypothesis, concluding that no relationship exists when in fact it does.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

z-score

A

The number of standard deviations that a given value is above or below the mean of the distribution; also called a standardized score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

stratification

A

The grouping of individuals in a population into homogeneous groups on some characteristic prior to sampling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

sXhat

A

standard error of the mean

The standard deviation of ta distribution of sample means; an estimate of the population standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

audit trail

A

Comprehensive process of documenting interpretation of qualitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

epidemiology

A

Study of the distribution of disease in relation to person, place and time, and measures of risk associated with exposures to disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

nondirectional hypothesis

A

A research hypothesis (or alternative hypothesis) that does not indicate the expected direction of the relationship between variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

target population

A

The larger population to which results of a study will be generalized.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

grand mean

A

The mean of all scores across groups in an analysis of variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

longitudinal study

A

A study designed to collect data over time, usually for the purpose of describing developmental changes in a particular group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Newman-Keuls (NK) multiple comparison test

A

Amultiple comparison procedure, used following a significant analysis of variance. Also called the Student-Newman-Keuls (SNK) test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

nominal scale

A

Level of measurement for classification variables; assignment of “values” based on mutually exclusive and exhaustive categories with no inherent rank order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Q-sort

A

An analytic technique used to characterize attitudes, opinions, or judgments of individuals through a process of comparative rank ordering.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

treatment threshold

A

In clinical decision making, the point at which a decision is reached to treat the patient without first performing a diagnostic test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Wilcoxon signed-ranks test (T)

A

A nonparametric statistical procedure, comparing two correlated samples (repeated measures); analogous to the paired t-test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

z distribution

A

The standardized normal distribution, with a mean of 0 and a standard deviation of 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Bayes’ theorem

A

The calculation of the probability of one event based on the probability of another event; used to estimate posterior (posttest) probabilities based on prior (pretest) probabilities of a diagnostic outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

degrees of freedom (df)

A

Statistical concept indicating the number of values within a distribution ntat are free to vary, given restrictions on the data set; usualy n-1. For analysis of variance, dfe = error degrees of freedom; dfb = between groupus degrees of freedom; dft = total degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
σ
Standard deviation of a population σ2 is the population variance
9
variance
s2 a measure of variability in a distribution, equal to the square of the standard deviation
10
factor analysis
An exploratory multivariate statistical technique used to examine the structure within a large set of variables and to determine the underlying dimensions that exist within that set of variables.
10
s
standard deviation (s) A descriptive statistic reflecting the variability or dispersion of scores around the mean.
10
standard error of the mean
The standard deviation of a distribution of sample means; an estimate of the population standard deviation.
10
systematic review
Review of a clearly formulated question that uses systematic and explicit methods to identify, select and critically appraise relevant research.
11
convergent validity
An approach in construct validation, assessing the degree to which two different instruments or methods are able to measure the same construct.
11
eigen value
A measure of the proportion of the total variance accounted for by a factor in a factor analysis.
12
experimental event rate (EER)
The number of subjects in the experimental or treatment group who develop the outcome of interest.
13
split middle line
In single-case research, a line used to separate data points within one phase into two equal halves, reflecting the trend of the data within that phase. (See celeration line.)
13
backward selection
A process used in stepwise multiple regression that enters all independent variables into the equation, and then removes nonsignificant variables in successive steps, until all remaining variables are significant.
14
recall bias
The possible inaccuracy of recalling medical history or previous exposures; of particular concern in retrospective studies.
15
scale of measurement
same as level of measurement
16
purposive sample
A nonprobability sample in which subjects are specifically selected by the researcher on the basis of subjective judgment that they will be the most representative.
16
single-blind study
An experiment in which either the investigator or the subject is kept ignorant of group assignment, but not both.
17
statistical conclusion validity
The validity of conclusions drawn from statistical analyses, based on the proper application of statistical tests and principles.
18
known groups method
A technique for construct validation, in which validity is determined by the degree to which an instrument can demonstrate different scores for groups known to vary on the variable being measured.
18
ordinal scale
Level of measurement in which scores are ranks.
18
box plot
Also called box and whisker plot. A graphic display of a distribution, showing the median, 25th and 75th percentiles, and highest and lowest scores.
20
Guttman scale
same as cumulative scale?
20
publication bias
Tendency for researchers and editors to treat positive experimental results (finding an effect) differently from negative or inconclusive results (finding no effect), often with a preference for publication of positive findings.
22
bimodal distribution
A distribution having two modes.
23
reactive measurement
A measurement that distorts the variable being measured, either by the subject’s awareness of being measured or by influence of the measurement process.
24
posttest-only design
An experimental design in which only one measurement is taken following treatment.
25
line of best fit
The regression line, representing the relationship between two variables, usually plotted on a scatter diagram.
25
random selection
same as random sampling. Probability method of selecting subjects for a sample, where every subject in the population has an equal chance of being chosen.
25
testing effect
The effect that occurs when a test itself is responsible for observed changes in the measured variable.
26
true positive
A test result that is positive for those who do have the disease or condition of interest.
27
single-factor design
An experimental design involving one independent variable.
27
sum of squares
SS A measure of variability in a set of data, equal to the sum of squared deviation scores for a distribution: Σ(X-Xhat)2 The numerator in the formula for variance
28
frequency distribution
A list of values that occur in a distribution, with a count of the number of times each value occurs.
29
factor
1. A variable. 2. A set of interrelated variables in a factor analysis.
29
adjusted means
Means that have been adjusted based on the value of a covariate in an analysis of covariance.
30
standard error of the mean
sXhat The standard deviation of ta distribution of sample means; an estimate of the population standard deviation
32
internal validity
The degree to which the relationship between the independent and dependent variables is free from the effects of extraneous factors.
32
partial correlation
A statistical technique for establishing the correlation between two variables, with the effect of a third variable removed; also called a first-order correlation.
33
prospective study
A study designed to collect data following development of the research question.
33
sensitivity analysis
A procedure in decision making to determine how decisions change as values are systematically varied.
33
two-way design
An experimental or quasi-experimental study that involves two independent variables.
34
paired t-test
A parametric test for comparing two means for correlated samples or repeated measures; also called a correlated t-test.
34
specificity
A measure of validity of a screening procedure, based on the probability that someone who does not have a disease will test negative.
35
informed consent
An ethical principle that requires obtaining the consent of the individual to participate in a study based on full prior disclosure of risks and benefits.
37
*d*f
degrees of freedom. Statistical concept indicating the number of values within a distribution ntat are free to vary, given restrictions on the data set; usualy n-1. For analysis of variance, *df*e = error degrees of freedom; *df*b = between groupus degrees of freedom; *df*t = total degrees of freedom
38
r
Pearson product-moment coefficient of correlation (r) A parametric statistical technique for determining the relationship between two variables.
39
facets
In generalizability theory, specific conditions under which reliability of a measurement can be generalized
40
one-tailed test
A statistical test based on a directional alternative hypothesis, in which critical values are obtained for only one tail of a distribution.
40
between subjects design
An experimental design that is based on comparison between independent groups.
41
MDD
minimal detectable difference (MDD) That amount of change in a variable that must be achieved to reflect a true difference; the smallest amount of change that passes the threshold of error. Also called minimal detectable change (MDC).
41
alpha coefficient
same as cronbach's alpha
42
consecutive sampling
A form of nonprobability sampling, where subjects are recruited as they become available.
42
SnNout
When a test has high sensitivity, a negative test rules out the diagnosis.
43
incidence
The proportion of people who develop a given disease or condition within a specified time period.
43
skewed distribution
A distribution of scores that is asymmetrical, with more scores to one extreme.
45
agreement
same as percent agreement
46
false positive
A test result that is positive in a person who does not have the disease or condition of interest.
47
interrupted time-series design
A design involving a series of measurements over time, interrupted by one or more treatment occasions.
49
cross-sectional study
A study based on observations of different age or developmental groups at one point in time, providing the basis for inferring trends over time.
49
SEE
Standard error of the estimate: In regression analysis, an estimate of prediction accuracy; a measure of the spread of scores around the regression line.
50
on-protocol analysis
Analysis of data in an experiment based only on subjects who completed the study according to assigned groups. Also called completer analysis or on-treatment analysis.
52
double-blind study
An experiment in which both the investigator and the subject are kept ignorant of group assignment.
52
standardized score
see z-score
54
risk factor
A characteristic or exposure that potentially increases the likelihood of having a disease or condition.
54
snowball sampling
A nonprobability sampling method in which subjects are successively recruited by referrals from other subjects.
55
instrument all measure the same trait
Typically measured using Cronbach’s alpha.
56
logrank test
A statistical procedure for comparing two survival curves when censored observations are present.
57
triangulation
The use of multiple methods to document phenomena.
58
naturalistic inquiry
Qualitative observation and interaction with subjects in their own natural environment.
58
stepwise regression
An approach to multiple regression that involves a sequential process of selecting variables for inclusion in the prediction equation.
58
alternate forms reliability
Reliability of two equivalent forms of a measuring instrument.
59
prevalence
The number of existing cases of a disease or condition at a given point in time, expressed as a proportion of the total population at risk.
59
case-control study
A design in analytic epidemiology in which the investigator selects subjects on the basis of their having or not having a particular disease and then determines their previous exposure.
60
hazard function
The probability that a subject will achieve a specific outcome in a certain time interval.
60
SS
sum of squares A measure of variability in a set of data, equal to the sum of squared deviation scores for a distribution: Σ(X-Xhat)2 The numerator in the formula for variance
61
historical controls
Subjects from previous research studies that serve as controls for experimental subjects in a subsequent study.
61
ROC curve
Receiver operating characteristic curve
62
standard error of the estimate (SEE)
In regression analysis, an estimate of prediction accuracy; a measure of the spread of scores around the regression line.
63
likelihood ratio
In diagnostic testing, the ratio indicating the usefulness of the test for ruling in or ruling out a condition. (See negative likelihood ratio and positive likelihood ratio.)
63
*df*e
For analysis of variance, = error degrees of freedom
64
natural history
Longitudinal study of a disease or disorder, demonstrating the typical progress of the condition.
64
single-subject design
An experimental design based on time-series data from one or more subjects, with data compared across baseline and intervention phases. Also called single-case designs.
65
weighted kappa
An estimate of percentage agreement, corrected for chance, based on weights reflecting levels of seriousness of disagreements.
66
goodness of fit test
Use of chi square to determine if an observed distribution of categorical variables fits a given theoretical distribution.
66
quartile (Q)
Three quartiles divide a distribution of ranked data into four equal groups, each containing 25 percent of the scores.
66
sign test
A nonparametric statistical procedure for comparing two correlated samples, based on comparison of positive or negative outcomes; analogous to the paired t-test.
67
true negative
A test result that is negative for those who do not have the disease or condition of interest.
68
factorial design
An experimental design involving two or more independent variables, allowing for the interpretation of main effects and interaction effects.
68
histogram
A bar graph of a frequency distribution.
68
residual
Y - Yhat In regression analysis, the difference between the value of the dependent variable predicted by the regression equation and the actual value
69
ratio scale
The highest level of measurement, in which there are equal intervals between score units and a true zero point.
70
change score
Difference between two measurements taken at different times, typically between pretest and posttest or followup. Also called a gain score.
71
coefficient of determination (r2)
Coefficient representing the amount of variance in one variable (Y) that can be explained (accounted for) by a second variable (X).
72
generalizability
1. The quality of research that justifies inference of outcomes to groups or situations other than those directly involved in the investigation. 2. The concept of reliability theory in which measurement error is viewed as multidimensional and must be interpreted under specific measurement conditions.
72
repeated measure (repeated factor)
An independent variable for which subjects act as their own control; that is, all subjects are exposed to all levels of the variable. Also called a within subjects factor.
73
Pearson product-moment coefficient of correlation (r)
A parametric statistical technique for determining the relationship between two variables.
73
transformation
Mathematical conversion of a distribution to a different scale by a constant (such as square root or log) to change the shape or variance characteristics of the distribution.
73
two-way analysis of variance
An analysis of variance with two independent variables.
73
s2
variance a measure of variability in a distribution, equal to the square of the standard deviation
74
attribute variable
An independent variable with levels that cannot be manipulated or assigned by the researcher, but that represent subject characteristics (such as age and sex).
75
MCID
minimal clinically important difference (MCID) The smallest difference in a measured variable that signifies an important rather than trivial difference in the patient’s condition. The smallest difference a patient or clinician would perceive as beneficial, and that would result in a change in the management of the patient. Also called minimal clinically important change (MCIC) or minimally important change (MIC).
76
phi coefficient
rphi A nonparametric correlation statistic for estimating the relationship a dichotomy and a continuous variable on the interval or ratio scale
77
norm referencing
Interpretation of a score based on its value relative to a standard or “normal” score.
77
point biserial correlation
ppb A correlation statistic for estimating the relationship between a dichotomy and a continuous variable on the interval or ratio scale
78
continuous variable
A quantitative variable that can theoretically take on values along a continuum.
78
evidence-based practice
The application of clinical decision making for patient management based on research evidence, clinical expertise, patient values and preferences and clinical circumstances.
80
interquartile range
The difference between the first and third quartiles in a distribution, often expressed graphically in a boxplot.
81
limits of agreement
Index of reliability between alternate forms of an instrument.
82
LR -
negative likelihood ratio A ratio that indicates how much the odds of a disease are decreased if a diagnostic test is netagive Equals specificity/(1-sensitivity)
83
semantic differential
A technique used to measure attitudes by asking respondents to rate concepts on a 7-point scale which represents a continuum across two extremes.
84
common cause variation
Fluctuation in response resulting in random and expected variation in performance.
84
one-way design
An experimental or quasi-experimental design that involves one independent variable.
85
sampling distribution
A theoretical frequency distribution of a statistic, based on the value of the statistic over an infinite number of samples.
86
inductive reasoning
The logical process of developing generalizations based on specific observations or facts.
87
methodological research
Research designed to develop or refine procedures or instruments for measuring variables, generally focusing on reliability and validity.
88
Delphi survey
Survey method whereby decisions on items are based on consensus of a panel.
88
PV -
negative predictive value In diagnostic testing, the proportion of subjects who are correctly identified as not having the condition of interest
89
randomized controlled trial (RCT)
An experimental study in which a clinical treatment is compared with a control condition, where subjects are randomly assigned to groups. Also called a randomized clinical trial.
91
counterbalancing
Systematic alternation of the order of treatment conditions, to avoid order effects in a repeated measures design.
91
variable
A characteristic that can be manipulated or observed and that can take on different values, either quantitatively or qualitatively.
93
logistic regression
Multiple regression procedure where the dependent variable is a dichotomous outcome; predicts odds associated with presence or absence of the dependent variable based on the independent variables.
94
SRM
standardized response mean
95
Tukey’s honestly significant difference (HSD)
A multiple comparison test for comparing multiple means following a significant analysis of variance.
96
regression analysis
In a regression equation, the weight (b) assigned to the independent variable; the slope of the regression line.
96
rphi
phi coefficient A nonparametric correlation statistic for estimating the relationship a dichotomy and a continuous variable on the interval or ratio scale
98
dummy variable (coding)
In regression procedures, the assignment of codes to a nominal variable, reflecting the presence or absence of certain traits.
99
Postitive predictive value
PV + Estimate of the likelihood that a person who tests positive actually has the disease
101
negative predictive value
PV - In diagnostic testing, the proportion of subjects who are correctly identified as not having the condition of interest
102
level
1. The “value” or classification of an independent variable. 2. In single-case research, the magnitude of the target behavior; changes in level are associated with differences in magnitude between the end of one phase and the beginning of the following phase.
103
r2
coefficient of determination: Coefficient representing the amount of variance in one variable (Y) that can be explained (accounted for) by a second variable (X).
105
point estimate
A single sample statistic that serves as an estimate of a population parameter.
106
standard error of measurement (SEM)
A reliability measure of response stability, estimating the standard error in a set of repeated scores.
107
autocorrelation
Correlation of consecutive data points in a time series design.
108
primary source
Reference source that represents the original document by the original author.
109
vital statistics
Mortality and morbidity rates.
110
confounding variable
A variable that is more likely to be present in one group of subjects than another, and that is related to the outcome of interest, thereby potentially “confounding” interpretation of the outcome.
110
efficacy
Benefit of an intervention as tested under controlled experimental conditions, usually with a control group in a randomized controlled trial.
111
independent factor
An independent variable in which the levels represent independent groups of subjects.
113
stopping rule
In a sequential clinical trial, the threshold for stopping a study based on crossing a boundary that indicates a difference or no difference between treatments.
114
critical value
The value of a test statistic that must be exceeded for the null hypothesis to be rejected; the value of a statistic that separates the critical region; the value that defines a statistically significant result at the set alpha level.
115
proportional hazards model
see Cox's regression
116
sigma, uppercase
Read as: “the sum of.”
118
effect size
A statistical expression of the magnitude of the difference between two treatments or the magnitude of a relationship between two variables, based on the proportional relationship of the difference to the variance.
119
A-B-A Design
A single-case withdrawal design in which a second baseline phase is introduced
121
Boolean logic
In literature searches, the terms AND, NOT and OR used to expand or narrow search terms.
123
intrarater reliability
The degree to which one rater can obtain the same rating on multiple occasions of measuring the same variable.
124
crossover design
A repeated measures design used to control order effects when comparing two treatments, where half of the sample receives treatment A first followed by treatment B, and the other half receives treatment B first followed by treatment A.
125
content analysis
A procedure for analyzing and coding narrative data in a systematic way.
126
maturation effect
A threat to internal validity, in which changes occur in the dependent variable as a result of the passing of time.
127
ethnography
An approach to qualitative research in which the experiences of a specific cultural group are studied.
127
two-tailed test
A statistical test based on a nondirectional alternative hypothesis, in which critical values represent both positive and negative tails of a distribution.
128
gold standard
A measurement that defines the true value of a variable. In criterion-related validity, an instrument that is considered a valid measure and that can be used as the standard for assessing validity of other instruments. In diagnostic testing, a procedure that accurately identifies the true disease condition (negative or positive) of the subject.
129
open-ended question
A question on a survey (interview or questionnaire) that does not restrict the respondent to specific choices, but allows for a free response.
130
time-series design
A quasi-experimental design in which performance changes are assessed over time, prior to and following the administration of treatment.
130
chi square test (x2)
A nonparametric test applied to nominal data, comparing observed frequencies within categories to frequencies expected by chance.
131
reference standard
A value used as a standard against which to judge a criterion; may or may not be a gold standard. Used to judge criterion-related validity or diagnostic accuracy.
133
mixed design
A design that incorporates independent variables that are independent (between-subjects) and repeated (within-subjects) factors. Also called a split-plot design.
134
A-B Design
A single-case design with two phases: A represents the baseline phase, and B represents the intervention phase
134
PV +
positive predictive value Estimate of the likelihood that a person who tests positive actually has the disease
136
trend analysis
Part of an analysis of variance, used to assess trend within data taken over ordered intervals; can express data as linear, quadratic, cubic, and so on, reflecting the number of changes in direction in the data over time.
137
predictive value of a diagnostic test
Also called posterior probability. Depends on the pretest probability, and the test’s sensitivity and specificity.
139
random sampling
Probability method of selecting subjects for a sample, where every subject in the population has an equal chance of being chosen.
139
systematic sample
A probability sampling method where subjects are chosen from lists of population members using specified intervals, such as every 10th person.
141
q
Studentized range statistic, used in multiple comparison tests.
142
sampling bias
Bias that occurs when individuals who are selected for a sample overrepresent or underrepresent the underlying population characteristics
142
a
Alpha level: Level of statistical significance, or risk of Type I eror; maximum probablitlty level that can be achieved in a statistical test to reject the null hypothes.s Symobls a1 and a2 are used to denote level of significance for one- and two-tailed tests.
143
discriminant analysis
A multivariate statistical technique used to determine if a set of variables can predict group membership.
144
mode
A measure of central tendency representing the most commonly occurring score.
146
case-control study
A design in analytic epidemiology in which the investigator selects subjects on the basis of their having or not having a particular disease and then determines their previous exposure.
147
research hypothesis
A statement of the researcher’s expectations about the relationship between variables under study.
147
binomial variable is the same as
dichotomy
149
intercorrelations
A set of bivariate correlations for several variables within a sample.
151
unpaired t-test
A parametric test for comparing two means for independent samples; also called an independent t-test.
152
Gaussian distribution
same as normal distribution?
153
statistical hypothesis
sames as null hypothesis
154
predictive validity
A form of measurement validity in which an instrument is used to predict some future performance.
156
univariate analysis
Statistical procedures for analyzing one dependent variable.
157
celeration line
In single-case research, a line that divides the data points within a phase into two equal halves, indicating the trend of the data within that phase.
158
number needed to harm (NNH)
The number of patients that need to be treated to observe one adverse outcome.
160
Rasch analysis
Transformation of items on an ordinal scale to an interval scale, demonstrating the unidimensional nature of a scale.
161
experimenter effects (experimenter bias)
Biases that are present in research data because of behaviors, expectations, or attitudes of those collecting the data.
162
forward selection
A process used in stepwise multiple regression that enters variables one at a time into the equation based on the strength of their association with the outcome variable, until all statistically significant variables are included.
162
X2r
Freidman two-way analysis of variance by ranks: A nonparametric statistical procedure for repeated measures, comparing more than two treatment conditions of one independant variable; analogous to the one-way repeated measures analysis of variance.
163
extraneous variable
A variable that confounds the relationship between the independent and dependent variables.
165
posterior probability
same as posttest probability The probability of a condition existing after performing a diagnostic test; predictive value of a diagnostic test. Also called posterior probability. Depends on the pretest probability, and the test's sensitivity and specificity
165
pretest probability
The probability that a condition exists prior to performing a diagnostic test. Equal to the prevalence of the condition in a specified group of subjects. Also called prior probability.
166
centroid
A point determined from the intersection of two means of two dependent variables (X, Y), used in multivariate analysis.
167
interaction effect
The combined effect of two or more independent variables on a dependent
169
meta-analysis
Use of statistical techniques in a systematic review to integrate the results of included studies to determine overall outcome, usually based on effect size.
171
exempt review
Exemption from review of a proposal by an Institutional Review Board for projects that do not involve direct contact with subjects, presenting no risk.
172
Wilcoxon rank-sum test
see Mann Whitney U test
173
cohort study
An observational study design in which a specific group is followed over time. Subjects are classified according to whether they do or do not have a particular risk factor or exposure and followed to determine disease outcomes
174
SEM
standard error of measurement (SEM) : A reliability measure of response stability, estimating the standard error in a set of repeated scores.
176
paradigm
A set of assumptions, concepts or values that constitute a way of viewing reality within an intellectual community.
177
nonequivalent control group
A control group (or comparison group) that was not created by random assignment.
177
quadratic trend
A nonlinear trend, with one turn in direction.
178
power
(1-beta) The ability of a statistical test to find a significant difference that really does exist; the probability that a test will lead to rejection of the null hypothesis.
178
alpha level (a)
level of statistical significance, or risk of Type I eror; maximum probablitlty level that can be achieved in a statistical test to reject the null hypothes.s Symobls a1 and a2 are used to denote level of significance for one- and two-tailed tests.
180
measurement error
The difference between an observed value for a measurement and the theoretical true score; may be the result of systematic or random effects.
182
phenomenology
An approach to qualitative research involving the study of complex human experience as it is actually lived.
183
absolute risk increase (ARI)
The increase in risk associated with an intervention as compared to risk without the intervention (or control condition); the absolute difference between the control event rate (CER) and the experimental event rate (EER)
184
block
Level of an attribute variable in which subjects are homogeneous on a particular characteristic.
185
completer analysis
Analysis of data in a clinical trial only for those subjects who complete the study.
186
standardized residual
In a chi square test, the contribution of each cell to the overall statistic.
188
preference
In sequential clinical trials, the expression of which treatment is considered better within a sequential pair.
189
item-to-total correlation
Correlation of individual items in a scale with the total scale score; an indication of internal consistency.
190
Least squares method
A method of fitting a regression line to a set of bivariate data so as to minimize the sum of the squared vertical deviations of Y values around that line.
191
relative risk reduction (RRR)
The reduction in risk associated with an intervention relative to the risk without the intervention (control); the absolute difference between the experimental event rate and the control event rate divided by the control event rate.
192
prior probability
same as pretest probability
193
crude rate
A rate for a population that is not adjusted for any subset of the population.
193
mean (X)
A measure of central tendency, computed by summing the values of several observations and dividing by the number of observations.
195
secondary source
Reference source that represents a review or report of another’s work.
196
hypothesis
A statement of the expected relationship between variables.
196
µ
mu mean of a popluation
197
criterion-referencing
Interpretation of a score based on its actual value.
199
withdrawal design
In single-case research, a design that involves withdrawal of the intervention.
200
parameter
A measured characteristic of a population.
202
within-subjects design
A research design that incorporates only repeated measures.
204
dichotomy (dichotomous variable)
A nominal variable having only two categories, such as yes/no and male/female; a binomial variable.
205
split-plot design
(See mixed design.)
206
cumulative scale
A scale designed so that agreement with higher-level responses assumes agreement with all lower-level responses.
208
independent variable
The variable that is presumed to cause, explain or influence a dependent variable; a variable that is manipulated or controlled by the researcher, who sets its “values” or levels.
209
explained variance
Between-groups variance; that portion of the total variance in a data set that can be attributed to the differences between groups or treatment conditions.
209
multivariate analysis of variance (MANOVA)
An advanced multivariate procedure that provides a global test of significance for multiple dependent variables using an analysis of variance.
211
split-half reliability
A reliability measure of internal consistency based on dividing the items on an instrument into two halves and correlating the results.
212
order effects
The sequential effect of one subject being exposed to several treatments in the same order; potentially manifested as carryover or practice effects.
214
parametric statistics
Statistical procedures for estimating population parameters and for testing hypotheses based on population parameters, with assumptions about the distribution of variables, and for use with interval or ratio measures.
215
intention-to-treat
Principle whereby data are analyzed according to group assignments, regardless of how subjects actually completed the study.
216
statistic
A measured characteristic of a sample.
218
Scheffé’s multiple comparison test
A multiple comparison procedure for comparing means following a significant analysis of variance. Considered the most conservative of the multiple comparison methods.
219
odds ratio (OR)
Estimate of relative risk in a case-control study.
220
Spearman-Brown formula
The statistical procedure used to analyze split-half reliability; also called the Spearman-Brown prophecy formula.
221
ARIMA (autoregressive integrated moving average)
Statistical technique for analysis of data from time-series studies.
223
retrospective study
A study that analyzes observations that were collected in the past.
224
difference score (d)
The difference between two scores taken on the same individual
225
Accessible population
The actual population of subjects available to be chosen for a study. This group is usually a nonrandom subset of the target population.
227
rpb
point biserial correlation A correlation statistic for estimating the relationship between a dichotomy and a continuous variable on the interval or ratio scale
229
homogeneity of variance
An underlying assumption in parametric statistics that variances of samples are not significantly different.
230
standard deviation (s)
A descriptive statistic reflecting the variability or dispersion of scores around the mean.
232
positive liklihood ratio
LR + A ratio that indicates ow much the odds of a disease are increased if a diagnostic test is positive. Equals sensitivity/(1-specificity)
233
translational research
Clinical investigation with human subjects in which knowledge obtained from basic research is translated into diagnostic or therapeutic interventions that can be applied to treatment or prevention.
235
event rate
The proportion of subjects in a group in whom a specific event or outcome is observed. (See control event rate [CER] and experimental event rate [EER].)
236
grounded theory
An approach to collecting and analyzing data in qualitative research, with the goal of developing theories to explain observations and experience.
237
sensitivity
A measure of validity of a screening procedure, based on the probability that someone with a disease will test positive.
238
interval scale
Level of measurement in which values have equal intervals, but no true zero point.
239
history effect
A threat to internal validity, referring to the occurrence of extraneous events prior to a posttest that can affect the dependent variable.
240
exploratory research
Research that has as its purpose the exploration of data to determine relationships among variables.
241
contingency table
A two-dimensional table displaying frequencies or counts, with rows (R) and columns (C) representing categories of nominal or ordinal variables; also referred to as cross-tabulation.
242
blinding
Techniques to reduce experimental bias by keeping the subjects and/or investigators ignorant of group assignments and research hypotheses.
243
absolute risk reduction (ARI)
The reduction in risk associated with an intervention as compared to the risk without the intervention (or the control condition); the absolute difference between the experimental event rate (EER) and the control event rate (CER).
244
correlation
The tendency for variation in one variable to be related to variation in a second variable; those statistical procedures used to assess the degree of covariation between two variables.
245
predictive value (PV)
(see negative predictive value and positive predictive value)
246
secondary analysis
An approach to research involving the use of data that were collected for another purpose, usually for the purpose of testing new hypotheses.
247
reliability
The degree of consistency with which an instrument or rater measures a variable.
248
scatter plot
A graphic representation of the relationship between two variables.
249
canonical correlation
A multivariate correlation procedure, whereby two sets of variables are correlated
250
constant comparative method
Inductive process in qualitative research that calls for continual testing of a theory as data are examined.
251
difference score (d)
The difference between two scores taken on the same individual.
253
Bonferroni t
A post hoc method to compare means following an analysis of variance; based on planned comparisons; also called the Dunn multiple comparison procedure.
254
population
The entire set of individuals or units to which data will be generalized.
255
validity
1. The degree to which an instrument measures what it is intended to measure. 2. The degree to which a research design allows for reasonable interpretations from the data, based on controls (internal validity), appropriate definitions (construct validity), appropriate analysis procedures (statistical conclusion validity), and generalizability (external validity).
255
zero-order correlation
A bivariate correlation.
257
*X*2
chi-square test: A nonparametric test applied to nominal data, comparing observed frequencies within categories to frequencies expected by chance.
259
pretest-posttest design
An experimental design involving a pretest prior to intervention and a posttest following intervention.
260
intraclass correlation coefficient (ICC)
A reliability coefficient based on an analysis of variance; a generalizability coefficient.
261
within-groups variance
same as error variance
262
Cox’s proportional hazards regression
A regression procedure used when the outcome has not yet occurred (a censored variable). Used in survival analysis.
262
alphanumeric data
In data processing, the entry of values that contain symbols or letters.
264
control event rate (CER)
The number of subjects in the control group who develop the outcome of interest.
265
floor effect
A measurement limitation of an instrument whereby the scale cannot determine decreased performance beyond a certain level.
267
level of measurement
The precision of a scale based on how a characteristic is measured; nominal, ordinal, interval and ratio levels.
269
linear regression
The process of determining a regression equation to predict values of Y based on a linear relationship with values of X.
270
construct validity
1. A type of measurement validity; the degree to which a theoretical construct is measured by an instrument. 2. Design validity related to operational definitions of independent and dependent variables.
271
rs
Spearman's rank correlation coefficient a nonparametric correlation prodeure for ordinal data. Also called Spreaman's rho
272
statistical regression
same as regression towards the mean
273
treatment-received analysis
Analysis of subject data in an experiment according to the treatment subjects actually did receive, regardless of their original group assignment.
274
serial dependency
Correlation in a set of data collected over time, in which one observation can be predicted based on previous observations.
275
test-retest reliability
The degree to which an instrument is stable, based on repeated administrations of the test to the same individuals over a specified time interval.
277
area probability sample
A form of cluster sampling in which geographic areas serve as the units of analysis.
278
concurrent validity
A type of measurement validity; a form of criterion-related validity; the degree to which the outcomes of one test correlate with outcomes on a criterion test, when both tests are given at relatively the same time.
280
historical research
Research that seeks to examine relationships and facts based on documentation of past events.
280
*df*t
For analysis of variance, = total degrees of freedom
281
median
A measure of central tendency representing the 50th percentile in a ranked distribution of scores; that is, that point at which 50 percent of the scores fall below and 50 percent fall above.
282
face validity
The assumption of validity of a measuring instrument based on its appearance as a reasonable measure of a given variable.
283
quasi-experimental research
Comparative research approach in which subjects cannot be randomly assigned to groups or control groups are not used.
286
bivariate statistics
Statistics involving the analysis of two variables for the purpose of determining the relationship between them, for example, correlation.
287
main effect
The separate effect of one independent variable in a multifactor design.
288
Likert scale
A summative scale based on responses to a set of statements for which respondents are asked to rate their degree of agreement or disagreement.
288
negative likelihood ratio
NR - ## Footnote A ratio that indicates how much the odds of a disease are decreased if a diagnostic test is netagive Equals specificity/(1-sensitivity)
290
cut-off score
Score used as the demarcation of a positive or negative test outcome.
290
X-bar
mean of a sample
292
canonical correlation
A multivariate correlation procedure, whereby two sets of variables are correlated.
293
selection bias
A threat to internal validity in which bias is introduced by initial differences between groups, when these differences are not random.
295
minimal detectable difference (MDD)
That amount of change in a variable that must be achieved to reflect a true difference; the smallest amount of change that passes the threshold of error. Also called minimal detectable change (MDC).
296
stem-and-leaf plot
A graphic display for numerical data in a frequency distribution showing each value in the distribution.
297
operational definition
Definition of a variable based on how it will be used in a particular study; how a dependent variable will be measured, how an independent variable will be manipulated.
298
internal consistency
A form of reliability, assessing the degree to which a set of items in an instrument all measure the same trait. Typically measured using Cronbach's alpha
299
multiple comparison test
A test of differences between individual means following analysis of variance, used to control for Type I error.
300
Kaplan-Meier Estimate
A common method of determining survival time which generates a step function, changing the survival estimate each time a patient dies (or reaches the terminal event).
302
multistage sampling
same as cluster sampling?
303
clinical prediction rule
A statistical tool that quantifies the relative contribution of examination and history findings to determine a diagnosis, prognosis, or likely response to intervention.
304
cluster analysis
A form of probability sampling in which large subgroups (clusters) are randomly selected first, and then smaller units from these clusters are successively chosen; also called multistage sampling.
305
special cause variation
Fluctuation in response caused by known factors that results in nonrandom and unexpected performance.
306
multiple baseline design
In single-case research, a design for collecting data for more than one subject, behavior, or treatment condition. Baseline phases are staggered to provide control.
308
Cronbach’s alpha
Reliability index of internal consistency, on a scale of 0.00 to 1.00.
309
expedited review
Accelerated review of a proposal by an Institutional Review Board, based on minimal risk.
311
normative research
A descriptive research approach designed to determine normal values for specific variables within a population.
313
correlational research
A descriptive research approach that explores the relationship among variables without active manipulation of variables by the researcher.
314
polynomial regression
Regression procedure for nonlinear data.
316
inferential statistics
That branch of statistics concerned with testing hypotheses and using sample data to make generalizations concerning populations.
317
analysis of variance (ANOVA)
Statistical procedure appropriate for comparison of three or more treatment groups or conditions, or the simultaneous manipulation of two or more independent variables; based on the F statistic.
318
minimal clinically important difference (MCID)
The smallest difference in a measured variable that signifies an important rather than trivial difference in the patient’s condition. The smallest difference a patient or clinician would perceive as beneficial, and that would result in a change in the management of the patient. Also called minimal clinically important change (MCIC) or minimally important change (MIC).
320
Beta
Probability of making a Type II error.
321
convenience sampling
A nonprobability sampling procedure, involving selection of the most available subjects for a study.
322
survival analysis
Multivariate analysis to estimate survival time, or time to a defined outcome, based on probabilities that an individual will achieve the outcome.
323
refereed journal
Journals that utilize a peer review process to evaluate manuscript submissions as a basis for choosing which ones will be published.
324
McNemar test
A nonparametric statistical test for nominal level measures, for correlated samples; a form of the chi square test.
326
significance level
same as alpha level
327
directional hypothesis
A variable that can only be measured in separate units and that cannot be measured in intervals of less than 1.
328
Two standard deviation band method
A method of data analysis in single-subject research.
329
relative risk (RR)
Estimate of the magnitude of the association between an exposure and disease, indicating the likelihood that the exposed group will develop the disease relative to those who are not exposed.
331
effectiveness
Benefits of an intervention as tested under “real world” conditions, often using quasi-experimental methods.
332
nonparametric statistics
A set of statistical procedures that are not based on assumptions about population parameters, or the shape of the underlying population distribution; most often used when data are measured on the nominal or ordinal scales.
333
nonprobability sample
A sample that was not selected using random selection.
334
X-Xhat
The distance of a single data point from the mean of the distribution. The summ of the deviation scores for a given distribution will awlays equal zero.
336
between-groups variance
That portion of the total variance in a set of scores that is attributed to the difference between groups.
337
number needed to treat (NNT)
The number of patients that need to be treated to prevent one adverse outcome or achieve one successful outcome; the reciprocal of absolute risk reduction (ARR).
338
d
Difference Score: The difference between two scores taken on the same individual
339
nested design
A multifactor design in which one variable is not crossed with other variables.
340
criterion-related validity
A type of measurement validity; the degree to which the outcomes of one test correlate with outcomes on a criterion test; can be assessed as concurrent validity or predictive validity.
341
systematic error
A form of measurement error, where error is constant across trials.
342
pooled variance estimate
Estimate of population variance based on the weighted average of sample variances; used in the unpaired t-test when group variances are not significantly different (under conditions of homogeneity of variance).
344
Latin square
A matrix of columns and rows used to assign sequences of treatments to control for order effects.
345
confounding
The contaminating effect of extraneous variables on interpretation of the relationship between independent and dependent variables.
345
normal distribution (curve)
A symmetrical bell-shaped theoretical distribution that has defined properties; also called a Gaussian distribution.
346
method error (ME)
A form of reliability testing for assessing response stability based on the discrepancy between two sets of repeated scores.
347
ceiling effect
A measurement limitation of an instrument whereby the scale cannot determine increased performance beyond a certain level.
348
gain score
same as change score?
349
interrater reliability
The degree to which two or more raters can obtain the same ratings for a given variable.
350
false negative
A test result that is negative in a person who has the disease or condition of interest.
352
responsiveness
The ability of a test to demonstrate change.
354
error variance
That portion of the total variance in a data set that cannot be attributed to treatment effects, but that is due to differences between subjects.
355
washout period
In a crossover design, that period of time between administration of the two treatments, allowing effects of the experimental treatment to dissipate.
357
sequential clinical trial
Experimental research design that allows consecutive entrance to a clinical trial and continuous analysis of data, permitting stopping of the trial when data are sufficient to show a significant effect.
358
participant observation
A method of data collection in qualitative research in which the researcher becomes a participant in the group that is being observed.
360
random assignment
Assignment of subjects to groups using probability methods, where every subject has an equal chance of being assigned to each group.
361
discriminant validity
An approach in construct validation assessing the degree to which an instrument yields different results when measuring two different constructs; that is, the ability to discriminate between the constructs.
362
percent agreement
A reliability test for categorical variables, estimating the ability of researchers to agree on category ratings.
363
between subjects factor
An independent variable for which levels are applied to independent groups.
365
Yates’ correction for continuity
In the chi square test, a correction factor applied when expected frequencies are too small, effectively reducing the chi square statistic.
366
risk-benefit ratio
An ethical principle that is an element of informed consent, in which the risks of a research study to the participant are evaluated in relation to the potential benefits of the study’s outcomes.
367
sigma
Standard deviation of a population.
369
cross-tabulation
same as contingency table. A two-dimensional table displaying frequencies or counts, with rows (R) and columns (C) representing categories of nominal or ordinal variables; also referred to as cross-tabulation.
370
Hawthorne effect
The effect of subjects’ knowledge that they are part of a study on their performance.
371
confidence interval (CI)
The range of values within which a population parameter is estimated to fall, with a specific level of confidence.
372
observational study
A study that does not involve an intervention or manipulation of an independent variable.
373
rho
Correlation coefficient for a population.
374
probability sample
A sample chosen using randomized methods.
375
multivariate analysis
A set of statistical procedures designed to analyze the relationship among three or more variables; includes techniques such as multiple regression, discriminant analysis, factor analysis and multivariate analysis of variance.
377
sp2
Pooled variance estimate Estimate of population variance based on the weighted average of sample variances; used in the unpaired t-test when group variances are not significantly different (under conditions of homogeneity of variance)
378
expected frequencies
In a contingency table, the frequencies that would be expected if the null hypothesis is true; frequencies that are expected just by chance.
379
coefficient of variation (CV)
A measure of relative variation; based on the standard deviation divided by the mean, expressed as a percentage.
381
SpPin
When a test has high specificity, a positive test rules in the diagnosis.
382
post hoc comparisons
Multiple comparison tests that follow an analysis of variance.
383
within-subjects factor
same as a repeated measure
384
covariate
An extraneous variable that is statistically controlled in an analysis of covariance, so that the relationship between the independent and dependent variables is analyzed with the effect of the extraneous factor removed.
386
Deviation score (X-Xhat)
The distance of a single data point from the mean of the distribution. The summ of the deviation scores for a given distribution will awlays equal zero.
388
Σ
read as "the sum of"
389
active variable
An independent variable with levels that can be manipulated and assigned by the researcher.
390
R squared
Multiple correlation coefficient squared; represents the proportion of variance in Y explained by several independent variables in a multiple regression equation.
392
attrition (experimental mortality)
A threat to internal validity, referring to the differential loss of participants during the course of data collection, potentially introducing bias by changing the composition of the sample.
393
MANOVA
multivariate analysis of variance (MANOVA) An advanced multivariate procedure that provides a global test of significance for multiple dependent variables using an analysis of variance.
394
beta weight
In a multiple regression equation, the standardized weight for each independent variable.
395
1-beta
power. The ability of a statistical test to find a significant difference that really does exist; the probability that a test will lead to rejection of the null hypothesis.
396
regression toward the mean
A statistical phenomenon in which scores on a pretest are likely to move toward the group mean on a posttest because of inherent positive or negative measurement error; also called statistical regression.
397
residual
In regression analysis, the difference between the value of the dependent variable predicted by the regression equation and the actual value.
398
standardized response mean (SRM)
One approach to evaluating effect size with change scores. Calculated as the difference between pretest and posttest scores, divided by the standard deviation of the change scores.
399
Spearman’s rank correlation coefficient
A nonparametric correlation procedure for ordinal data. Also called Spearman’s rho.
400
Freidman two-way analysis of variance by ranks (X2r)
A nonparametric statistical procedure for repeated measures, comparing more than two treatment conditions of one independant variable; analogous to the one-way repeated measures analysis of variance.
401
randomized block design
An experimental design in which one independent variable is an attribute variable, creating homogeneous blocks of subjects who are then randomly assigned to levels of the other independent variable.
403
instrumentation effect
A threat to internal validity in which bias is introduced by an unreliable or inaccurate measurement system.
404
Type I error
An incorrect decision to reject the null hypothesis, concluding that a relationship exists when in fact it does not.
405
* df*b
For analysis of variance, = between groupus degrees of freedom
406
LR +
positive liklihood ratio A ratio that indicates ow much the odds of a disease are increased if a diagnostic test is positive. Equals sensitivity/(1-specificity)
407
content validity
A type of measurement validity; the degree to which the items in an instrument adequately reflect the content domain being measured.
408
external validity
The degree to which results of a study can be generalized to persons or settings outside the experimental situation.
409
Y - Yhat
residual In regression analysis, the difference between the value of the dependent variable predicted by the regression equation and the actual value
410
developmental research
A descriptive research approach designed to document how certain groups change over time on specific variables.
411
Levene’s test
A test of the equality of variances, used with the independent t test and the analysis of variance.
412
one-way analysis of variance
An analysis of variance with one independent variable.
413
cumulative incidence (CI)
The number of new cases of a disease during a specified time period divided by the total number of people at risk; the proportion of new cases of a disease in a population.
414
variance
A measure of variablity in a distribution, equal to the square of the standard deviation.
415
central tendency
Descriptive statistics that represent “averages” or scores that are representative of a distribution; includes mean, median, and mode.
416
t-test
A parametric test for comparing two means; also called Student’s t-test. (See paired t-test and unpaired t-test.)
417
Kruskal-Wallis one-way analysis of variance by ranks (H)
A nonparametric statistical procedure for comparing more than two independent groups representing levels of one independent variable; analogous to the one-way analysis of variance.
418
statistical significance
The term indicating that the results of an analysis are unlikely to be the result of chance at a specified probability level; rejection of the null hypothesis.
419
standarized residual
in a chi square test, the contribution of each scell to the overall statistic
420
trend
1. The shape of a distribution of scores taken over time, reflecting the distribution’s linearity or lack of linearity. 2. In single-case research, the direction of change in the target behavior within a phase or across phases.
421
dependent variable
A response variable that is assumed to depend on or be caused by another (independent) variable.
422
Pooled variance estimate
sp2 Estimate of population variance based on the weighted average of sample variances; used in the unpaired t-test when group variances are not significantly different (under conditions of homogeneity of variance)
423
censored observation
An observation whose value is unknown because the subject has not been in the study long enough for the outcome to have occurred; used to estimate survival curves.
424
percentile
The percentage of a distribution that is below a specified value. Data are divided person-years. The total number of years that a set of subjects have participated in a study, typically used when subjects begin and end their participation at different times.
425
Fisher’s exact test
A nonparametric procedure applied to nominal data in a 2 X 2 contingency table, comparing observed frequencies within categories to frequencies expected by chance. Used when samples are too small to use the chi-square test.
426
treatment arm
Another term for each independent group in a clinical trial.
427
deductive reasoning
The logical process of developing specific hypotheses based on general principles.
428
slope
1. In regression analysis, the rate of change in values of Y for one unit of change in X. 2. In single-case research, the rate of change in the magnitude of the target behavior over time.
429
statistical process control (SPC)
A method of charting production outcomes over time to identify and monitor variances; can be used as a method of analysis for single-subject designs.
430
point biserial correlation
A correlation statistic for estimating the relationship between a dichotomy and a continuous variable on the interval or ratio scale.
431
multicolinearity
The correlation between independent variables in a multiple regression equation, causing them to provide redundant information.