Lecture 7- Terminology Flashcards
What does the type of variable used effect?
The tools we use to analyze it
What are the three main types of variables?
- Continuous: expressed on a continuous scale in which every value is possible
- Discrete: can be put in a one to one correspondence with counting numbers (whole)
- Categorical: restricted to one of a set of distinct categories e.g. heads or tails
What is binary categorical data?
This is the simplest data type that can arise from a categorical variable
There is only two categories to choose from
e.g. you are either a smoker or non-smoker, athlete or not athlete
Why is binary categorical data sometimes called 0-1 data?
Because the two categories are given arbitrary numbers for representation e.g. 1 represents the patients with the outcome and 0 represents those without the outcome
Are there cases where we need more than two categories of categorical data? What are the further divisions that occur at this level?
Yes, often e.g. blood types
The term nominal is used if there is not relevant ordering of the data
Ordinal is used if there is an ordering/ rank
In what case where we are using a number scale is data not numerical?
For categorical ordinal categories we often use number as a scale/ ranking. However, this doesn’t mean the data is numerical e.g. pain levels. If it was numerical we could say someone who rated double someone else’s score experienced twice the pain which we can’t do.
How is discrete numerical data different from categorical data which uses a number scale?
Because numbers only take on discrete (whole) data values spaces between the numbers is always consistent so we can make claims like 2x as many (comparisons)
Continuous data arises from…..
taking a measurement (e.g. height), there is a full scale of numbers.
True or false continuous data is more often positive than negative…
T
What is an example of making continuous data discrete? (coarsening)
When we say our age in full years
Does continuous numerical data give rise to only 1 pattern of distribution?
No, because any measurement is possible often there is a lot of variation in the pattern that can be seen from a graph
What is an example of when multimodal data would arise?
Measuring height- would get a mode for each gender
What is a ratio?
Fraction given by one quantity over another. Both quantities have the same units
What is a fraction?
Fraction of one quantity when compared to the whole
What are rates?
Ratios for quantities of different units