Definitions Flashcards

Question

What is a contingency table?

Answer 1

Table showing the frequencies of observations for two categorical variables such that sub-categories of one variable (exposure) are indicated in rows and sub-categories of the other variable (outcome) are indicated in columns

Answer 2

A numerical variable which can potentially take an infinite number of distinct values

Answer 3

Measure of association that indicates the degree to which variables change together; can be Pearson's (parametric) or Spearman's (non-parametric)

Answer 4

Judgement made as to the quality of published articles e.g. regarding whether the appropriate study design and statistical methods have been chosen

Answer 5

Study that examines the association between exposure and outcome at a particular point in time

Answer 6

Also known as an unadjusted association. Estimated association between exposure and outcome, before possible confounding variables are taken into account

Answer 7

Also known as an unadjusted association. Estimated association between exposure and outcome, before possible confounding variables are taken into account

Answer 8

Study of populations, especially with reference to size, density, mortality, fertility, growth, age distribution and the interaction of these with social and economic factors

Answer 9

The lower portion of a fraction used to calculate a rate or ratio

Answer 10

A study concerned with describing a variable in terms of time, place or person

Answer 11

Form of measurement bias that may occur when the outcome assessor is not blinded

Answer 12

A test performed to aid diagnosis of an outcome (usually a disease), often compared with a gold standard in reliability studies

Answer 13

A numerical value representing counts, which cannot take on any intermediate values

Answer 14

Pattern of association observed between exposure (dose) and outcome (response), including linear trend and threshold effects

Answer 15

Bias that may occur because an association observed between variables on an aggregate level does not necessarily represent the association that exists at an individual level

Answer 16

Study in which the unit of analysis is populations or groups of people, rather than individuals

Answer 17

The criteria that must be met by subjects eligible for inclusion in a study

Answer 18

Study of the distribution and determinants of health-related conditions or events in specified populations and the application of this study to the control of health problems

Answer 19

Approval that must be sought from a local or regional ethics committee before a randomised controlled trial can be undertaken

Answer 20

The conscientious, explicit and judicious use of current best evidence in making decisions about the care of individual patients

Answer 21

A variable whose influence on the outcome variable is of interest

Answer 22

Statistical test used to compare two unpaired dichotomous variables in small datasets

Answer 23

A statistical test used to compare distributions between three or more groups, when variables are matched and not normally distributed

Answer 24

A back transformation (antilog/exponential) of a mean value which has been calculated on logged data
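
As a rough illustration of this back-transformation, the sketch below logs the values, averages the logs, then exponentiates the result. The function name `geometric_mean` and the example values are assumptions made for illustration and are not part of the card.

```python
import math


def geometric_mean(values):
    """Exponentiate (back-transform) the mean of the natural-log values."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("needs at least one value, and all values must be positive")
    mean_of_logs = sum(math.log(v) for v in values) / len(values)
    return math.exp(mean_of_logs)


# Hypothetical skewed data: the arithmetic mean is 37, the geometric mean is about 10.
example = geometric_mean([1.0, 10.0, 100.0])
```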

Answer 25

Measurement method widely accepted as being the best available, often used in reliability studies

Answer 26

Measurement method widely accepted as being the best available, often used in reliability studies

Answer 27

Simple guide to assessing the evidence provided by different study designs: randomised controlled trials (highest level) -> cohort -> case-control -> cross-sectional -> ecological -> descriptive; the quality of evidence also depends on the quality of the study design and execution

Answer 28

Graphical representation of the frequency distribution of a continuous variable with areas of the bars representing the frequencies within each grouping interval

Answer 29

Outcome status for a defined subset of the population is ascertained at baseline and then linked to pre-existing historical data on exposure, usually from routine records, so that the cohort's experience of outcome risk can be reconstructed

Answer 30

Idea expressed in such a way that it can be tested and refuted

Answer 31

Statistical methods used to determine how likely observed differences in data are due to chance rather than real differences

Answer 32

Rate of occurrence of new cases of an outcome, which is dependent on the number of new cases, total number in the population, and the time interval of interest

Answer 33

Drawing conclusions about some unknown aspect of a population, based on statistics derived from a random sample from that population

Answer 34

Consent given by the subject or responsible person for participation in a study, usually a randomised controlled trial

Answer 35

Participants in a randomised controlled trial analysed according to their treatment group allocation, regardless of whether they completed the trial

Answer 36

An interaction between an exposure and confounder exists if the association between an outcome and exposure varies across the categories of the confounding variable

Answer 37

Point where a linear regression line crosses the y-axis i.e. value of the outcome when the exposure is zero

Answer 38

A measure of variability (the spread of data around the median): the distance between the lower quartile (25th centile) and the upper quartile (75th centile) of a distribution
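
A minimal sketch of this calculation, assuming the quartile convention used by default in Python's `statistics.quantiles` (other software may place the quartiles slightly differently). The function name and data are illustrative only.

```python
import statistics


def interquartile_range(values):
    """Distance between the lower (25th centile) and upper (75th centile) quartiles."""
    lower_quartile, _median, upper_quartile = statistics.quantiles(values, n=4)
    return upper_quartile - lower_quartile


# Hypothetical data: the quartiles fall on 2 and 6 under the default convention.
iqr = interquartile_range([1, 2, 3, 4, 5, 6, 7])
```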

Answer 39

A study where an investigator tests whether modifying or changing something ('intervening') alters the outcome, usually a randomised controlled trial or experimental study

Answer 40

Form of measurement bias where an interviewer inquires more deeply about exposure in those with the outcome compared to those without

Answer 41

Measures the agreement between two or more examiners or methods when the variables are both categorical; if one or more of the variables are ordinal, a modified version called the weighted kappa should be used
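
The sketch below shows one common form of this statistic, the unweighted (Cohen's) kappa for two examiners, which compares observed agreement with the agreement expected by chance. The card does not give a formula, so the formula, the function name and the example table are assumptions made for illustration.

```python
def cohen_kappa(table):
    """Unweighted kappa from a square table (rows: examiner A, columns: examiner B)."""
    size = len(table)
    total = sum(sum(row) for row in table)
    # Observed agreement: proportion of subjects on the diagonal.
    observed = sum(table[i][i] for i in range(size)) / total
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(size)) for j in range(size)]
    # Chance-expected agreement from the marginal totals.
    expected = sum(row_totals[i] * col_totals[i] for i in range(size)) / (total * total)
    return (observed - expected) / (1 - expected)


# Hypothetical ratings of 50 subjects: observed agreement 0.7, expected 0.5.
kappa = cohen_kappa([[20, 5], [10, 15]])
```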

Answer 42

A statistical test used to compare distributions between three or more groups when variables are unmatched and not normally distributed

Answer 43

Gives an indication of the agreement between two examiners or methods when the variables are unmatched and not normally distributed

Answer 44

Regression method appropriate for continuous outcomes that are approximately normally distributed, which produces regression coefficients

Answer 45

Equation derived from fitting a linear regression model, usually denoted y = a + bx

Answer 46

A diagrammatic presentation (usually overlaid on a scatter plot) of a linear regression equation

Answer 47

A type of dose-response effect whereby there is a systematic increase in the risk of outcome with increasing or decreasing level of exposure

Answer 48

Conversion of data to their natural log values, with the aim of achieving an approximate normal distribution

Answer 49

Regression method appropriate for binary outcomes, where the resulting regression coefficients can be exponentiated to produce odds ratios; multinomial and ordinal versions also exist

Answer 50

Bias due to subjects being lost over a follow up period, where the loss may be associated with the exposure and/or outcome

Answer 51

Statistical test used to compare distributions between two groups, when variables are unpaired and not normally distributed

Answer 52

Data that are not independent, often repeated measurements on the same person, or measurements from people who are related such as siblings or twins

Answer 53

Statistical test used to compare two paired dichotomous variables

Answer 54

Measure of the central tendency calculated by summing all values and dividing by the total number of observations

Answer 55

Measure of the central tendency calculated by summing all values and dividing by the total number of observations

Answer 56

Bias in how the exposure and/or outcome is measured or classified that results in different quality of information collected between those with and without the outcome; includes detection, interviewer & recall bias

Answer 57

Measure of central tendency, which is calculated as middle value when all the values are arranged in order, useful for summarising data that are not normally distributed

Answer 58

Use of observational studies to obtain an estimate of the causal effect of a modifiable exposure on an outcome, through identification of a candidate gene which is related to exposure

Answer 59

Statistical technique for combining estimates of exposure-outcome associations from more than one study, weighting according to size of the study

Answer 60

Measure of central tendency which is calculated as the most frequently occurring of all values, but is rarely used in epidemiology

Answer 61

Regression modelling when there is more than one exposure, or there is one exposure plus a number of confounders

Answer 62

Unordered categorical variable i.e. categories have no order to them

Answer 63

Set of tests based on ranking observations in order of magnitude and testing these rankings rather than the actual values of the observations, suitable for data which are not normally distributed

Answer 64

Statistically = data follow a normal distribution; clinically = the likely values for an individual

Answer 65

Continuous symmetrical frequency distribution where both tails extend to infinity and the shape is determined by the mean and standard deviation

Answer 66

Hypothesis that there is no association between outcome and exposure

Answer 67

The upper portion of a fraction used to calculate a rate or ratio

Answer 68

Values are numbers as opposed to categories

Answer 69

Study that does not involve any form of intervention, where the investigator just observes and records exposure and outcome information

Answer 70

Number of people with the outcome divided by the number of healthy people

Answer 71

Ratio of odds of outcome amongst exposed subjects to the odds of outcome amongst unexposed subjects
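
To make the ratio concrete, here is a small sketch that works from the four cells of a 2x2 table. The function names, argument names and counts are hypothetical.

```python
def odds(with_outcome, without_outcome):
    """Number with the outcome divided by the number without it."""
    return with_outcome / without_outcome


def odds_ratio(exposed_cases, exposed_healthy, unexposed_cases, unexposed_healthy):
    """Odds of outcome among the exposed divided by odds among the unexposed."""
    return odds(exposed_cases, exposed_healthy) / odds(unexposed_cases, unexposed_healthy)


# Hypothetical counts: odds of 20/80 among exposed and 10/90 among unexposed.
example_or = odds_ratio(20, 80, 10, 90)
```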

Answer 72

Number of people with the outcome divided by the number of healthy people

Answer 73

Ratio of odds of outcome amongst exposed subjects to the odds of outcome amongst unexposed subjects

Answer 74

An ordered categorical variable i.e. categories can take values that are ranked according to an ordered classification

Answer 75

(response variable/dependent variable/y-variable/case-control status/disease group) = variable whose association with an exposure is of interest

Answer 76

(response variable/dependent variable/y-variable/case-control status/disease group) = variable whose association with an exposure is of interest

Answer 77

Special case of matched data when there are only two groups

Answer 78

Numerical quantity measuring some aspect of a population e.g. the mean

Answer 79

Assume the data has an underlying distribution

Answer 80

Bias arising due to an unequal provision of care between the treatment and control group in a randomised controlled trial, which may occur if subjects and assessors are not blinded

Answer 81

Analysis restricted to those who completed a randomised controlled trial according to protocol, which defeats the main purpose of random allocation and may invalidate the results; it is preferable to use intention to treat analysis

Answer 82

Sum of the number of years that each individual has been under observation, sometimes used as a denominator for calculating incidence rates
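
As an illustration of using this sum as a denominator, the sketch below divides the number of new cases by the total follow-up time. The function name and the follow-up times are hypothetical.

```python
def incidence_rate(new_cases, years_observed):
    """New cases per person-year, given each individual's years under observation."""
    person_years = sum(years_observed)
    return new_cases / person_years


# Hypothetical cohort of four people followed for 11 person-years in total.
rate = incidence_rate(2, [2.0, 3.5, 1.0, 4.5])
```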

Answer 83

Graph used to represent a categorical variable; frequencies within each group of observations are represented by the areas of segments in a circular diagram

Answer 84

Inert medication or procedure i.e. a drug having no pharmacological effect, but intended to give patients the perception that they are receiving treatment for their complaint

Answer 85

Phenomenon whereby a patient's symptoms can be alleviated by an otherwise ineffective treatment, as they expect the treatment to work

Answer 86

Group of patients in a randomised controlled trial who receive no treatment other than standard care, i.e. placebo

Answer 87

Regression method appropriate for outcomes that are count variables; the resulting regression coefficients are usually exponentiated to produce rate ratios

Answer 88

Ability of a study to demonstrate an association between variables if one exists i.e. the probability of observing evidence against the null hypothesis if it is false

Answer 89

Amount of variation in the sample statistic, with greater variation indicating less precision

Answer 90

Total number of individuals who have an outcome at a particular time divided by the total population at risk at that time i.e. proportion with outcome at a particular time

Answer 91

Number of occurrences of an event divided by total number of observations

Answer 92

Healthy individuals are recruited (though some may already have the outcome at baseline), exposure status recorded, then subjects are followed up to see whether those who were exposed develop the outcome at a different rate to those who were not exposed

Answer 93

Tendency for studies that find associations between variables to get published, and those that do not find associations not to get published

Answer 94

Probability that the difference between exposure groups would be at least as big as that observed if the null hypothesis of no difference is true i.e. if the difference has arisen due to chance

Answer 95

Proportion of variance in one variable that is explained by the variation in another, given by the square of the correlation coefficient (R)

Answer 96

Sample drawn from a population such that all members of the population have an equal chance of being chosen

Answer 97

(/random allocation) In randomised controlled trials, allocation of subjects to either the intervention or control group by chance alone, to ensure that groups are similar with respect to the distribution of confounding factors

Answer 98

Study in which subjects are randomly allocated to either a treatment or control group, followed up, then outcomes measured and compared

Answer 99

Measure of variability which is the difference between the largest and smallest values in a distribution

Answer 100

Measure of the frequency of occurrence of an event e.g. incidence rate

Answer 101

Quantification of the association between an exposure and discrete/count outcome, calculated using Poisson regression models

Answer 102

Form of measurement bias occurring in retrospective studies, whereby recall of information is different in those with the outcome compared to those without

Answer 103

Measure of variability which indicates the amount of variation between individual observations in a sample and hence likely values for an individual in the population; can inform as to whether a patient is "clinically" normal.

Answer 104

Finds the best mathematical model to describe the outcome (y) with respect to the exposure (x)

Answer 105

(/slope) = Estimate of the change in outcome (y) for a unit change in exposure (x); in a linear regression this is the gradient of the linear regression line, denoted by b in the linear regression equation (y = a + bx)
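
A minimal sketch of estimating a and b, assuming the line is fitted by ordinary least squares (the card does not name the fitting method). The function name `fit_line` and the data are illustrative only.

```python
def fit_line(x, y):
    """Least-squares estimates of intercept a and slope b in y = a + bx."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sum of squares about the means.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b = sxy / sxx            # change in y per unit change in x
    a = mean_y - b * mean_x  # value of y when x is zero
    return a, b


# Hypothetical data lying exactly on the line y = 1 + 2x.
a, b = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
```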

Answer 106

Compares the measurements made by two or more examiners or methods with respect to the agreement between them

Answer 107

Alternative explanation for an exposure-outcome association, whereby the outcome causes the exposure rather than the other way around

Answer 108

Number of new cases with the outcome in a particular time period divided by the number of people who did not have the outcome at the outset i.e. proportion of new cases in a time period

Answer 109

Difference in risk between exposed and non-exposed groups

Answer 110

Risk of developing the outcome in the exposed group compared to risk of developing the outcome in the unexposed group
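
The comparison of risks between exposure groups can be sketched as follows, as both a ratio and a difference. All names and counts are hypothetical.

```python
def risk(new_cases, at_risk):
    """Proportion of those initially at risk who develop the outcome."""
    return new_cases / at_risk


def risk_ratio(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Risk in the exposed group divided by risk in the unexposed group."""
    return risk(exposed_cases, exposed_total) / risk(unexposed_cases, unexposed_total)


def risk_difference(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Risk in the exposed group minus risk in the unexposed group."""
    return risk(exposed_cases, exposed_total) - risk(unexposed_cases, unexposed_total)


# Hypothetical cohort: 20 of 100 exposed and 10 of 100 unexposed develop the outcome.
rr = risk_ratio(20, 100, 10, 100)
rd = risk_difference(20, 100, 10, 100)
```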

Answer 111

Mathematical process of deciding how many subjects should be included in a study, to be determined at the outset

Answer 112

Estimate of a population parameter, based on a sample drawn from that population

Answer 113

Process of selecting a number of subjects from all the subjects in the target population

Answer 114

Distribution of sample statistics if repeated samples were drawn from the same population

Answer 115

Graph used to present two continuous variables, whereby each point represents the exposure and outcome values for an individual

Answer 116

Random sample of individuals that have been selected from the target population

Answer 117

Systematic difference in the characteristics of the subjects selected randomly to take part in a study and those who are not

Answer 118

Proportion of sample with the outcome, who are correctly classified by a diagnostic test that has been compared to a gold standard (e.g. in reliability study)

Answer 119

Statistical test used to compare distributions between two groups, when variables are paired but not normally distributed

Answer 120

An asymmetrical frequency distribution, which is either positively skewed (long tail to the right) or negatively skewed (long tail to the left)

Answer 121

Proportion of sample without the outcome, who are correctly classified by a diagnostic test that has been compared to a gold standard (reliability study)
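
This proportion and its counterpart for those with the outcome can be sketched from the counts of a test compared against a gold standard. The function names and counts below are hypothetical.

```python
def sensitivity(true_positives, false_negatives):
    """Proportion of those with the outcome whom the test correctly classifies."""
    return true_positives / (true_positives + false_negatives)


def specificity(true_negatives, false_positives):
    """Proportion of those without the outcome whom the test correctly classifies."""
    return true_negatives / (true_negatives + false_positives)


# Hypothetical results: 50 people with the outcome, 100 without.
sens = sensitivity(true_positives=45, false_negatives=5)
spec = specificity(true_negatives=80, false_positives=20)
```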

Answer 122

Measure of variability, indicating how widely dispersed the individual observations are in a distribution

Answer 123

Measurement of the precision of the sample mean as an estimate of the population mean; the standard deviation of the sampling distribution of a sample statistic
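
For the sample mean, this quantity is commonly estimated as the sample standard deviation divided by the square root of the sample size. The sketch below assumes that estimator; the function name and data are illustrative only.

```python
import math
import statistics


def standard_error_of_mean(values):
    """Sample standard deviation divided by the square root of the sample size."""
    return statistics.stdev(values) / math.sqrt(len(values))


# Hypothetical sample of eight measurements.
sem = standard_error_of_mean([2, 4, 4, 4, 5, 5, 7, 9])
```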

Answer 124

Special case of the normal distribution where the mean is zero and the standard deviation is one

Answer 125

A p-value less than a specified level, usually 5%, suggesting that the null hypothesis can be rejected; although statistical results have traditionally been interpreted in this way, it is now considered preferable to avoid the term statistical significance

Answer 126

Science of collecting, summarising, analysing (e.g. estimating the strength of association between two variables) and interpreting data

Answer 127

Analysis undertaken separately in each of a number of subgroups

Answer 128

Subgroup of subjects from the selected sample that actually agree to take part in the study

Answer 129

Subdivision of the sample into groups

Answer 130

Statistical modelling of the time to an event which does not assume that rates are constant over time

Answer 131

Review of a clearly formulated question that uses systematic methods to identify, select and critically appraise relevant research

Answer 132

Collection of individuals about which it is of interest to draw inferences or to which results should generalise, often defined in terms of geographical location

Answer 133

Referring to time

Answer 134

Quantity calculated from the data which is used to assess the strength of evidence against the null hypothesis

Answer 135

Type of dose-response effect, whereby the risk of the outcome is only increased in subjects whose exposure is above or below a certain level

Answer 136

A statistical test used to compare means between two groups, when variables are approximately normally distributed; can be unpaired or paired test

Answer 137

Measure of variability, the variance is the square of the standard deviation

Answer 138

The same as the Mann-Whitney U test for two unpaired groups & sign test for two paired groups

Answer 139

Statistical test used to compare means between two groups, when variables are approximately normally distributed, and sample is not too small

(163 cards)