Statistics - definitions Flashcards

1
Q

Power of a study

A

Probability of correctly rejecting the null hypothesis (that there is no difference between treatment arm and placebo) when it is false (i.e. not making a type II error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Standard error

A

Used in the calculation of confidence interval – many sample means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Type I error + not affected by + increases with..

A

This is where the null hypothesis (that there is no difference between remdesivir and placebo) is incorrectly rejected (false positive result).
Termed alpha
Not affected by sample size as significance level determined in advance. The chance increases if the number of end points is increase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Type II error
determined by

A

The failure to reject the null hypothesis (that there is no difference between remdesivir and placebo) when it is false (also known as a false negative result).
Termed beta
Determined by sample size and alpha (type I error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

P Value

A

Probaility of obtaining a result by chance at least as extreme as te one that has actually observed, assuming that the null hypothesis is true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Properties of Normal distribution

A

Symmetry = Mean= median = Mode

68.3% of values lie within 1 SD of the mean
95.4% of values lie within 2 SD of the mean
99.7% of values lie within 3 SD of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

95% confidence interval

A

IF a repeat sample of 100 observations are taken from the same group 95 of them would be expected to lie in that range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Standard deviation

A

IS a measure of how much dispersion exists from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Parametric Data tests

A

Students t-test – paired or unpaired
Pearsons product-moment coefficient – correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Non-parametric tests

A

Mann-Whitney U test
Wilcoxon signed-rank test
Chi squared tests
Spearman, kendall rank- correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Endemic

A

Persistent, usual, or expected level of disease in a given population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Pandemic

A

Involves epidemics affecting a large number of people across many countries, continents or regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Validity

A

Refers to the extent to which something measures what it claims to measure. First major distinction is internal and external validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Reliability

A

Is the extent to which an experiment, test, or any measuring procedure yields the same result on repeat trials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Internal validity

A

Is the confidence that we can place in the cause and effect relationship in a study. It is the confidence that we have that the change in the independent variable caused the observed change in the dependent variable (rather than due to poor control of extraneous variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Internal validity
example

A

A researcher has suggested a causal link between a new designer drug and psychosis. During peer review of the research, this link is challenged by the fact that the temporal relationship between to the two has not been proven. What form of validity has been brought into question?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

External Validity

A

IS the degree to which the conclusions in a study would hold for other persons in other places and at other times i.e. its ability to generalize

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

external validity example

A

A PhD student conducted a small study which demonstrated that a new form of psychological therapy successfully alleviated symptoms of depression. They therefore made a claim in a review article that the new therapy is successful in treating depression. They were criticised for making this claim as the therapy had not been widely tested and so the results may not be generalisable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Content validity

A

Content validity refers to the extent to which a test or measure assesses the full content of a subject or area.

20
Q

content validity example

A

For example if a test is designed to help diagnose depression, it would have poor content validity if it only asked about psychological symptoms and neglected biological ones

21
Q

Face validity

A

Face validity refers to the general impression of a test. A test has face validity if it appears to test what it is meant to

22
Q

Criterion validity

A

Criterion validity concerns the comparison of tests. You may wish to compare a new test to see if it works as well as an old, accepted method. The correlation coefficient is used to test such comparisons

23
Q

Criterion validity (concurrent)

A

In concurrent validation, the predictor and criterion data are collected at or about the same time.

24
Q

Criterion validity (concurrent) example

A

An example could be testing a new, shorter test of intellectual functioning against a standard measure

25
Q

Criterion validity (predictive)

A

In Predictive validation, the predictor scores are collected first and criterion data are collected at some later/future point. Here you want to know if the test predicts future outcomes.

26
Q

Criterion validity (predictive) example

A

An example might be evaluating a new assessment method to select medical students. The test could be compared against the students performance at the end of year one to see if there is a correlation

27
Q

Construct validity

A

The extent to which a test measures the construct it aims to

28
Q

Construct validity (convergent)

A

A test has convergent validity if it has a high correlation with another test that measures the same construct

29
Q

Construct validity (divergent)

A

A test’s divergent validity is demonstrated through a low correlation with a test that measures a different construct

30
Q

Nominal data types

A

Observed values can be put into set categories which have no particular order or hierarchy. You can count but not order or measure nominal data (for example birthplace)

31
Q

Ordinal data types

A

Observed values can be put into set categories which themselves can be ordered (for example NYHA classification of heart failure symptoms)

32
Q

Discrete data types

A

Observed values are confined to a certain values, usually a finite number of whole numbers (for example the number of asthma exacerbations in a year)

33
Q

Continuos data types

A

Data can take any value with certain range (for example weight)

34
Q

Binomial data types

A

Data may take one of two values (for example gender)

35
Q

Interval data types

A

A measurement where the difference between two values is meaningful, such that equal differences between values correspond to real differences between the quantities that the scale measures (for example temperature)

36
Q

A Rate

A

is a quantity measured with respect to another measured quantity (e.g. 60 miles an hour).

37
Q

Attributable risk

A

is the rate in the exposed group minus the rate in the unexposed group. For example the attributable risk for lung cancer in smokers is the rate of lung cancer in smokers minus the rate of cancer in non smokers. Essentially it tells you what proportion of deaths in the exposed group were due to the exposure.

38
Q

Relative risk

A

is the risk of an event relative to exposure. It is also known as the risk ratio.

39
Q

Population attributable risk

A

can be described as the reduction in incidence that would be observed if the population were entirely unexposed. For instance how would the incidence of lung cancer change if everyone stopped smoking? It can be calculated by multiplying the attributable risk by the prevalence of exposure in the population.

40
Q

Attributable proportion

A

is the proportion of the disease that would be eliminated in a population if its disease rate were reduced to that of the unexposed group.

41
Q

False-negative rate

A

Would be the proportion of patients with a disease not picking up by testing

42
Q

False-positive rate

A

proportion of patients with a positive test who do not have the condition being tested for

43
Q

Hazard ratio

A

The hazard ratio (HR) is similar to relative risk but is used when risk is not constant to time. It is typically used when analysing survival over time

44
Q

Precision

A

The precision quantifies a tests ability to produce the same measurements with repeated tests.

45
Q

Confounding

A

Refers to a variable which correlates with other variables within a study leading to spurious results

46
Q

Confounding

A

Refers to a variable which correlates with other variables within a study leading to spurious results
Confounding occurs when there is a non random distribution of risk factors in the populations. Age, sex and social class are common causes of confounding.

47
Q

Confounding can be controlled by (design stage versus analysis stage)

A

In the design stage = randomisation

In the analysis stage of an experiment, confounding can be controlled for by stratification.