Session 2 Flashcards
What are the types of variable?
Variables can be divided into discrete (or categorical) and continuous.
Categorical or discrete variables:
Nominal: categories into which individuals are classified, with no numerical relationship between them, e.g. sex (male or female)
Ordinal: ranked categories, e.g. mild, moderate or severe
Interval (categorical): the distance between measures on the scale has meaning, e.g. one, two or three people in a household
Continuous variables:
Interval (continuous): the distance between measures on the scale has meaning, e.g. temperature. However, the ratio between measurements does not have meaning, e.g. 10°C is not twice as hot as 5°C.
Ratio: both the distance and the ratio between measurements are defined, e.g. weight (1 kg is twice the weight of 500 g)
How can you express continuous vs categorical data?
Histogram: continuous
Bar chart: discrete
Define mode
Mode: the value (or group of values) which occurs most often, i.e. the highest peak of a histogram.
What is the median?
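Median: the middle value when the observations are arranged in order of size. If n is odd, it is the (n + 1) ÷ 2 th observation; if n is even, it is the mean of the n ÷ 2 th and (n ÷ 2 + 1) th observations.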
Define mean
Mean (average): the arithmetic average of the observations, mean = (x1 + x2 + … + xn) ÷ n
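The three measures of central tendency above can be checked with a minimal sketch using Python's standard `statistics` module. The observations are made-up illustration data, not from the session.

```python
from statistics import mean, median, multimode

# Hypothetical observations, n = 6 (even), already in order of size.
observations = [2, 3, 3, 5, 7, 8]

# Mode: the value(s) occurring most often; multimode returns a list
# because more than one value can tie for the highest frequency.
modes = multimode(observations)

# Median: n is even, so this is the mean of the 3rd and 4th values.
middle = median(observations)

# Mean: (2 + 3 + 3 + 5 + 7 + 8) / 6
average = mean(observations)

print(modes, middle, round(average, 2))
```

Here the mode is 3, the median is 4 (halfway between 3 and 5) and the mean is 28 ÷ 6, about 4.67.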
What does a symmetrical distribution tell you?
Symmetrical distribution: mean = median
What does a skewed distribution tell you?
Skewed distribution:
right (positive) skew: mean > median
left (negative) skew: mean < median
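A small sketch of why this happens: one extreme value in the tail pulls the mean towards it, while the median barely moves. Both data sets are hypothetical and chosen only to make the effect obvious.

```python
from statistics import mean, median

# Hypothetical data with a long tail to the right (one very large value).
right_skewed = [1, 2, 2, 3, 3, 4, 20]
# Hypothetical data with a long tail to the left (one very small value).
left_skewed = [1, 17, 18, 18, 19, 19, 20]

right_mean, right_median = mean(right_skewed), median(right_skewed)
left_mean, left_median = mean(left_skewed), median(left_skewed)

print(right_mean > right_median)  # right skew: mean above median
print(left_mean < left_median)    # left skew: mean below median
```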
What is Interquartile range (IQR)?
Interquartile range (IQR): to calculate the IQR, order all the values. Q1 (lower quartile) is the value that 25% of observations fall below and Q3 (upper quartile) is the value that 75% of observations fall below. IQR = Q3 – Q1
Median is the (n + 1) ÷ 2 th value.
Lower quartile is the (n + 1) ÷ 4 th value.
Upper quartile is the 3 (n + 1) ÷ 4 th value.
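The position rules above can be sketched in a few lines of Python. The data are hypothetical, with n = 7 chosen so that every position is a whole number. As a cross-check, the standard library's `statistics.quantiles` (with its default "exclusive" method) uses the same (n + 1) positions and interpolates when a position is not a whole number.

```python
from statistics import quantiles

# Hypothetical observations, ordered by size first (n = 7).
values = sorted([12, 5, 14, 3, 8, 13, 7])
n = len(values)

# Positions are counted from 1, so subtract 1 to index a Python list.
q1 = values[(n + 1) // 4 - 1]        # (n + 1) / 4 = 2nd value
med = values[(n + 1) // 2 - 1]       # (n + 1) / 2 = 4th value
q3 = values[3 * (n + 1) // 4 - 1]    # 3(n + 1) / 4 = 6th value
iqr = q3 - q1

# Cross-check against the standard library (quartiles = 4 equal groups).
lib_q1, lib_median, lib_q3 = quantiles(values, n=4)

print(q1, med, q3, iqr)
```

For these values Q1 = 5, the median is 8, Q3 = 13 and the IQR is 8.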
What is standard deviation?
Standard deviation (SD): measures the spread of the data around the mean
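As an illustration of "spread around the mean", the sketch below computes the SD by hand from the squared deviations. The card does not give a formula, so this uses the standard definitions as an assumption: divide by n for a whole population, or by n - 1 for a sample. The data are hypothetical.

```python
import math
from statistics import pstdev, stdev

# Hypothetical observations.
data = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(data)

mean_value = sum(data) / n
# Sum of squared distances of each observation from the mean.
sum_sq = sum((x - mean_value) ** 2 for x in data)

population_sd = math.sqrt(sum_sq / n)       # divide by n
sample_sd = math.sqrt(sum_sq / (n - 1))     # divide by n - 1

# The standard library gives the same answers.
print(population_sd, pstdev(data))
print(sample_sd, stdev(data))
```

The value under the square root in each case is the corresponding variance, so the SD is the square root of the variance.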
What is Variance?
What are the methods of presenting and summarizing data?
Define incidence
Incidence: the number of new cases per population at risk in a given time period
Define prevalence
Prevalence: the number of all (new + old) cases per population at a given time
How do you calculate incidence?
Incidence = (number of new cases in a defined population over a given period of time) ÷ (number in the defined at-risk population over the same period of time)
Expressed as a %
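A minimal worked example of the incidence calculation. The function name `incidence_percent` and the figures are hypothetical, used only for illustration.

```python
def incidence_percent(new_cases, population_at_risk):
    """Incidence as a percentage: new cases / at-risk population x 100.

    Both counts must refer to the same defined population and time period.
    """
    if population_at_risk <= 0:
        raise ValueError("population at risk must be positive")
    return 100 * new_cases / population_at_risk


# Hypothetical: 25 new cases among 1,000 people at risk during one year.
example = incidence_percent(25, 1000)
print(example)  # 2.5, i.e. an incidence of 2.5% over the year
```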
How do you calculate incidence density and when is it used?
Used if people are followed up for different amounts of time
Incidence density = (number of new cases of a disease) ÷ (total person-time at risk of developing the disease)
Expressed as cases per unit of person-time, e.g. per person-year
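A short sketch of the person-time calculation, assuming a hypothetical cohort of five people followed for different lengths of time. Each person contributes only the years during which they were at risk.

```python
# Hypothetical follow-up: years at risk contributed by each participant.
follow_up_years = [2.0, 5.0, 3.5, 1.5, 4.0]
# Hypothetical number of new cases observed during that follow-up.
new_cases = 2

# Denominator: total person-time at risk, in person-years.
total_person_years = sum(follow_up_years)

# Incidence density: new cases per person-year at risk.
incidence_density = new_cases / total_person_years

# Often scaled for readability, e.g. per 1,000 person-years.
per_1000_person_years = incidence_density * 1000

print(total_person_years, incidence_density, per_1000_person_years)
```

Here 2 cases over 16 person-years gives 0.125 cases per person-year, or 125 per 1,000 person-years.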