Week 9 Kuracloud: Measuring and Summarising Data Flashcards
Statistics
=
(Kirkwood & Sterne. Essential Medical Statistics, 2nd ed., 2010)
= “the science of collecting, summarising, presenting and interpreting data, and of using them to estimate the magnitude of associations and test hypotheses”
Descriptive Statistics
= describes the features of a data sample
“summarising, presenting and interpreting data”
Inferential Statistics
= generalises findings from the sample to the target population
“estimate the magnitude of associations and test hypotheses”
Data
=
= “a set of values of subjects with respect to qualitative or quantitative variables”
Raw Data
=
= observations
Data set
=
= collection of information regarding a group of people or other items
Variables
=, 2
= characteristics that you can measure or observe and may take any one of a specified set of values
- Numerical (quantitative) (or interval/ratio data)
- Categorical (qualitative)
Categorical Variables
2,1
- ordered/ordinal = rank in categories in an order
- unordered/nominal = place observations in named, unordered groups
- dichotomous/binary = only two possible categories
Numerical Variables
2
- continuous = on a continuous scale, can take any value in a range
- discrete = can take only distinct, countable values (usually whole numbers)
Derived variable
=,
= new variable created from existing variable
e.g. a variable measured as numerical, then recoded as categorical
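A minimal sketch of deriving a categorical variable from a numerical one. The function name `age_group` and the cut-points are hypothetical, chosen only for illustration:

```python
# Hypothetical cut-points, chosen only for illustration.
def age_group(age_years: float) -> str:
    """Derive an ordered categorical variable from a numerical one."""
    if age_years < 18:
        return "under 18"
    if age_years < 65:
        return "18-64"
    return "65+"

ages = [12, 18, 40, 64.5, 65, 90]          # existing numerical variable
age_groups = [age_group(a) for a in ages]  # new derived (categorical) variable
print(age_groups)
```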
Spreadsheets of datasets
3
- Columns: each represents 1 variable (first usually identifier)
- Rows: each represents data for 1 person (record)
- Cells: value of 1 variable for 1 person = observation
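The row/column/cell layout can be sketched in plain Python. The dataset and the variable names (`id`, `age`, `smoker`) are made up for illustration:

```python
# Hypothetical dataset: each dict is one row (record) for one person.
records = [
    {"id": 1, "age": 34, "smoker": "yes"},
    {"id": 2, "age": 51, "smoker": "no"},
    {"id": 3, "age": 27, "smoker": "no"},
]

# A column holds every person's value for one variable.
age_column = [row["age"] for row in records]

# A cell is the value of one variable for one person: a single observation.
cell = records[1]["smoker"]

print(age_column)
print(cell)
```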
Outcome variable
=, (3)
= focus of attention, we try to explain its variation
(dependent variable/response variable/y-variable)
Exposure Variable
=, (3)
= influences variation of outcome variable
(independent variable/predictor variable/x-variable)
Operationalising Variables
=,
= deciding which category designates an individual as having the outcome or as being exposed
dictates interpretation of results
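A rough illustration of why this choice dictates interpretation: the same data give different proportions "exposed" depending on which categories count as exposed. The data and the helper `proportion_exposed` are hypothetical:

```python
# Hypothetical smoking-status data for five people.
smoking = ["never", "former", "current", "never", "former"]

def proportion_exposed(values, exposed_categories):
    """Proportion of individuals whose category is counted as 'exposed'."""
    return sum(v in exposed_categories for v in values) / len(values)

# Operationalisation 1: ever-smokers (former or current) count as exposed.
ever_smoker = proportion_exposed(smoking, {"former", "current"})

# Operationalisation 2: only current smokers count as exposed.
current_smoker = proportion_exposed(smoking, {"current"})

print(ever_smoker, current_smoker)
```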
Nominal (unordered categorical) variable measurement
2
- frequencies (no. observations in each category)
- proportions (relative frequencies)
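As a small sketch with made-up blood-group data, frequencies and proportions for a nominal variable could be computed like this:

```python
from collections import Counter

# Hypothetical nominal data: blood group for eight people.
blood_groups = ["A", "O", "B", "O", "A", "O", "AB", "O"]

# Frequency = number of observations in each category.
frequencies = Counter(blood_groups)

# Proportion (relative frequency) = frequency / total number of observations.
n = len(blood_groups)
proportions = {category: count / n for category, count in frequencies.items()}

print(frequencies)
print(proportions)
```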
Ordinal (ordered categorical) measurement
2
- frequencies
- proportions
- sometimes means and medians
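One way a median can be taken for an ordinal variable is to map each category to its rank first. This is only a sketch: the severity scale and the data are hypothetical, and `median_low` is used so the result is always one of the observed categories:

```python
from statistics import median_low

# Hypothetical ordered categories, lowest to highest.
levels = ["mild", "moderate", "severe"]
rank = {level: i for i, level in enumerate(levels)}

# Hypothetical ordinal data for five people.
pain = ["mild", "severe", "moderate", "mild", "moderate"]

# Convert categories to ranks, take the median rank, map back to a category.
codes = [rank[p] for p in pain]
median_level = levels[median_low(codes)]

print(median_level)
```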