3. Medical Statistics intro Flashcards
what is a STATISTIC
a numerical summary of a SAMPLE
what is a PARAMETER
a numerical summary of the POPULATION
what is CATEGORICAL data
QUALITATIVE
what is NUMERICAL data
QUANTITATIVE
CATAGORICAL data can be split into:
NOMINAL and ORDINAL
what is NOMINAL vs ORDINAL data with examples
(categorical)
- Nominal: categories are mutually exclusive and UNORDERED
eg. sex, blood group, ethnicity, survival after 10 years
- Ordinal: categories are mutually exclusive and ORDERED
eg. disease stage, education level, heart murmur grade
NUMERICAL data can be split into…
DISCRETE and CONTINUOUS
what is DISCRETE vs CONTINUOUS data (numerical)
- Discrete: take only INTEGER VALUES (COUNT 0,1,2..)
eg. NUMBER OF pregnancies, number of asthma exacerbations
- Continuous: take ANY VALUE in a given interval
eg. weight, blood pressure, cholesterol levels, age
PROS and CONS of CONVERTING NUMERICAL to CATEGORICAL
(eg systolic bp (mmHg) —> hypertensive (>140), normotensive (<140)
PROS:
- EASIER to DESCRIBE POPULATION by the % of people AFFECTED
- EASIER to make TREATMENT DECISIONS if population is GROUPED
CONS:
- LOSE INFORMATION
- how to DECIDE CUT OFF? what is abnormal?
how to DESCRIBE the DISTRIBUTION of CATEGORICAL variables (what to look at)
- the category with the LARGEST FREQUENCY (MODAL CATEGORY)
- how FREQUENTLY each category was OBSERVED (%)
how to DESCRIBE the DISTRIBUTION of NUMERICAL variables (what to look at)
- SHAPE (do observations cluster in certain intervals?)
- CENTRE (where does a typical observation fall?)
- VARIABILITY (how tightly are the observations clustering around a centre)
DESCRIBING CATEGORICAL DATA:
PROPORTION vs PERCENTAGE (how to calculate)
PROPORTION : the NUMBER OF OBSERVATIONS in that category DIVIDED by the TOTAL NUMBER of OBSERVATIONS
PERCENTAGE = PROPORTION X 100
DESCRIBING CATEGORICAL DATA:
PROPORTIONS and PERCENTAGES are also called … and serve as a way to..
RELATIVE FREQUENCIES
serve as a way to SUMMARIZE the DISTRIBUTION of a CATEGORICAL variable NUMERICALLY
DESCRIBING CATEGORICAL DATA:
what is the ABSOLUTE CHANGE (and how to calculate)
describes the ACTUAL INCREASE or DECREASE from a REFERENCE VALUE to a NEW VALUE
ABSOLUTE CHANGE = NEW VALUE - REFERENCE VALUE
DESCRIBING CATEGORICAL DATA:
what is the RELATIVE CHANGE (and how to calculate)
describes the size of the ABSOLUTE CHANGE in COMPARISON to the REFERENCE VALUE
expressed as %
RELATIVE CHANGE = NEW VALUE - REFERENCE VALUE /
REFERENCE VALUE
X100
DESCRIBING CATEGORICAL DATA:
Percentages are also commonly used to compare 2 numbers. there is REFERENCE VALUE and COMPARED VALUE (compared to reference)
how do you calculate ABSOLUTE DIFFERENCE
ABSOLUTE DIFFERENCE
= COMPARED VALUE - REFERENCE VALUE
DESCRIBING CATEGORICAL DATA:
Percentages are also commonly used to compare 2 numbers. there is REFERENCE VALUE and COMPARED VALUE (compared to reference)
how do you calculate RELATIVE DIFFERENCE (%)
RELATIVE DIFFERENCE
= COMPARED VALUE - REFERENCE VALUE /
REFERENCE VALUE
X 100
DESCRIBING CATEGORICAL DATA:
ABSOLUTE vs RELATIVE
ABSOLUTE = difference/change
RELATIVE = Percentage change
eg weight loss 200 kg —> 180 kg
absolute weight loss = 20 kg
relative weight loss = 10% (20/200 x 100)
DESCRIBING CATEGORICAL DATA:
if a value is 20% MORE than the reference value, it is ….% OF the reference value
120% OF the reference (100 + P)
DESCRIBING CATEGORICAL DATA:
is a value is 20% LESS than the reference value, it is …% OF the reference value
80% OF the reference (100 - P)
DESCRIBING NUMERICAL DATA:
what type of graph visualises the DISTRIBUTION of a QUANTITATIVE variable
HISTOGRAM
DESCRIBING NUMERICAL DATA:
three questions to ask:
- does the distribution have a SINGLE MOUND / PEAK (MODE)
- what is the SHAPE of the distribution
- do the data CLUSTER together, or is there a GAP such that one or more observations noticeably differ from the rest