Critical numbers - statistics Flashcards
What is the target population and sample population?
We can’t collect info from everyone so we take a sub set from the whole population this is known as the sample population.
What is sampling bias?
What is recall bias?
Social desirability bias?
Information bias?
Sampling bias = individuals in the study are more/less likely to be included than others
Recall bias = individual can not remember specifics of a question
Social-desirability bias = individuals tell us incorrect information because they feel a societal pressure
Information bias = measurement bias
What is a background/confounding factor?
Something that is responsible for the outcome and related to the exposure.
Screen use and poor vision…. Cofounder = lack of natural light.
Types of study design:
Experimental vs observational
Retrospective vs prospective
Individual vs population
Experimental = researcher changes something/ has intervened
Observational = researcher just collected data
Retrospective = look back to see if exposure caused outcome
Prospective = collect information to see if current exposure leads to outcome
Individual = info collected on an individual - usual study design
Population/ecological = whole populations looked at
Types of study:
Case control
Look at individuals with outcome and matched individuals without and look to see who had exposure and the outcome.
Good for investigating rare disease
Cross sectional study:
Look at what is happening now (snapshot of time)
Who currently has exposure and the outcome
Difficult to establish order of events
Cohort study:
Collect information on a sample, some have exposure some do not, no one has outcome yet. Then follow up and see if those with exposure leads to more outcomes.
Time consuming, expensive
Randomised control trial:
Have multiple groups(also known as arms)
Give a different exposure to each group
Compare the outcomes between the groups
Steps to avoid bias: Blinding - single and double Randomisation - flipping a coin Placebos Matching - identical with only difference is the exposure
Gold standard, but expensive and not always suitable exposures
Crossover trial:
Extension of a RCT where everyone in the study has all different exposures. Therefore you can compare their effects to themselves.
Randomised which treatment/exposure they receive first
Not always suitable as may be carry over effects
What is a variable?
A quantitive measure of something that varies.
What is a categoric variable and what are the subtypes?
Categoric variables fit into a particular category.
Binary = 2 categories - yes or no
Ordinal > 2 with a natural ordering e.g. low medium and high
Nominal > 2 with no ordering e.g hair colour, ethnicity
What is a numeric variable and what are the subtypes?
A variable that is a measured on a scale.
Can be discrete = where this a distinct number of values e.g age in years
Continuous = can take nay value within its limits e.g. weight
What is descriptive statistics?
Collection of statistical measures used to describe the data sample we have.
Definitions of:
Proportion
Probability
Odds
Rate
Portion = total number with outcome/total number
Probability = proportion x 100
Odds = number with outcome/number without
Rate = number of times something happens per a quantified e.g x per 100 people
What is the risk difference?
Risk ratio?
Odds ratio?
Risk difference = subtraction of one proportion from the other
“Risk with X …% higher than with Y”
Risk ratio = Group A/Group B proportion or percentage
The focus on the top compared to on bottom
If greater than 1 risk in group A larger than B if 1 then its the same and if less than 1 its smaller.
1.85 shows a increase risk of 85%
0.80 shows a decreased risk of 20%
Odds ratio = Group A odds/ Group B odds
Odds increased or decreased by X
Remember a score of 2 is only 100% increase in odds
Odds ratio and risk ratio can cause what?
Can cause unnecessary panic, 200% increase may sound larger but actual risk could still be very very small.
What does standard deviation show?
Shows the spread of dat about the mean
Sigma is the symbol for
Standard deviation
Variance =
SD squared
Mean =
Sum of numbers/total number
Median =
middle number of data set
If invested 2 numbers take the average
In a perfect symmetric distribution the mean and median are…
Equal
When the distribution is not symmetric you are said to have a…
Skewed distribution
It can be right or left skewed depending on the position of the outlier.
The outlier will skew it in that direction
What are the three measures of spread we have learnt about?
Range
SD
Interquartile range