Critical numbers🔢 Flashcards
Categorical variables can be…
Binary, ordinal and nominal
Numeric variables can be…
Discrete or continuous
What is binary data?
Only two categories e.g positive/negative
What is ordinal data?
Categories with a natural order e.g stage of cancer
What is nominal data?
Categories with no natural or universally agreed order e.g blood group
What is discrete/count data?
Observations can only take certain numerical values e.g number of children
What is continuous data?
Observations can take any value within a range e.g age/height
What happens where continuous variables are categorised
The variable type switches from continuous to ordinal e.g age in years into age categories
Frequency definition
How often an event occurs in a population group at risk
What is the term for the number of existing cases in a population at a defined timepoint?
Prevalence
What is the term for the number of new cases in a population over a defined period?
Incidence
What is prevalence dependent on?
The incidence and duration of the event
True or false: the term risk can be used to quantify both desirable and undesirable outcomes
True
How do you calculate the proportion?
The number experiencing the event divided by the total (scale 0 to 1) e.g three type I diabetics in a sample of 1000 participants = 3/1000 = 0.003
How to calculate percentage from proportion?
Often x100 and reported as a percentage (scale 0 to 100%) e.g 0.003 x 100 = 0.3% of the sample had type I diabetes
How to convert the proportion to the number per quantity of people?
Multiply by the number of participants e.g 0.003 x 1000 = 3 cases per 1000 participants
How to calculate rates from number per quantity of people?
Divide it by the length of time e.g 10 deaths per 1000 people per year
How to calculate odds?
The number or proportion with an event divided by those without the event. e.g The odds of having type I diabetes in the previous example were
3/997 = 0.003 (using the actual participant counts)
0.003/0.997 = 0.003 (using the proportions)
What is the term for the difference in proportions between groups (subtraction)?
Risk difference
What is the term for the risk in one group divided by the risk in another group (division)?
Risk ratio AKA relative risk
What is the term for odds in one group divided by odds in the other (division)?
Odds ratio
What will the risk difference be when there is no difference?
Zero
What will the odds and risk ratios be if there is no difference?
1
What do risk/odds ratios >1 indicate?
A higher risk/odds in the group of interest
What do risk/odds ratios <1 indicate?
A lower risk/odds in the group of interest
How to calculate numbers needed to treat and what does this mean?
1/risk difference = x
This means x patients at risk would need to be treated with aspirin to prevent 1 additional case of hypertension.
How to calculate risk reduction?
Calculate the risk ratio then take away 1 from it and multiply by 100 to get the % reduction in risk
Why do we need both risk and odds?
Need it for probability
Odds = Probability/(1 − risk)
Odds have symmetry so Y is the inverse of the odds for the outcome not-y
Defining features of the ecological study design
Population-level data
Advantages of ecological study design
· Can look at trends between regions or over time
· Data collection fairly easy (tends to be routinely-collected)
· Fast
· Inexpensive
· Few ethical issues
Disadvantages of ecological study design
· Cannot determine individual-level associations (ecological fallacy)
· Cannot demonstrate cause and effect
· Lack of control over variables collected
Defining features of cross sectional study design
Outcome and exposure status measured simultaneously
Advantages of cross-sectional study design
· Can look at associations – hypothesis-generating
· Data collection fairly easy
· Can study multiple exposures and outcomes
· Fast
· Inexpensive
Disadvantages of cross sectional study design
· Cannot demonstrate cause and effect
· Prone to bias and confounding
· Not useful for rare exposures or outcomes
Defining features of case-control study design
Participants selected on the basis of outcome
Advantages of case control study design
· Can look at association between outcome and prior exposures
· Fast
· Inexpensive
· Good for studying rare outcomes
· Can study multiple exposures
Disadvantages of case-control study design
· Cannot determine incidence/risk of outcome
· Limited control over data quality – poor historic records or recall bias
· Retrospective nature limits ability to determine causality
· Not useful for rare exposures
Defining features of cohort study design
Participants selected on the basis of exposure
Advantages of cohort study design
· Can look at incident cases and associations with exposure
· Good for studying rare exposures
· Can study multiple outcomes
· Control over data collected
· Exposure determined before outcome occurs so can demonstrate temporality for potential cause and effect
Disadvantages of cohort study design
· Mostly prospective which can be time consuming
· Risk of loss to follow-up
· Expensive
· Not useful for rare outcomes
Defining features of randomised control trials
Participants randomly allocated to interventions then followed up to compare outcomes
Advantages of randomised control trials
· Can study intervention effects on outcome(s)
· Random allocation means confounding factors should be evenly distributed
· Control over variables collected
· Comparator group means ability to account for placebo/temporal effects
· Less prone to bias, particularly where blinding and objective outcome assessment used
· Gold standard for establishing causality
Disadvantages for randomised control trial
· Time consuming
· Expensive
· Require expertise to run
· Can only be used where ethics and participant willingness permit randomisation to intervention
· Overly strict eligibility criteria may render sample not fully representative of population
What does PICO stand for?
Population
Intervention
Comparator
Outcome
What is strength of association?
The stronger an association, the more likely it is to be causal
What is consistency?
Association shown across different studies in different locations, populations, using different methods, etc