3h- Presentation of data Flashcards
What are the 3 kinds of data that discrete and continuous variation give rise to?
- qualitative
- quantitative
- ranked
What are discrete variables?
Limited number of possible types of characteristics e.g. eye colour
What are continuous variables?
Range of values e.g. height
What is quantitative data?
Data which can be measured objectively, usually with a numerical value, e.g. time, height. Easy to plot in graphical form and can be analysed statistically
What is qualitative data?
Data which is subjective and descriptive, it cannot necessarily be measured and is difficult to analyse statistically e.g. colour change, patient symptoms
What is ranked data?
Refers to the data transformation in which numerical values are replaced by their rank when the data is sorted from lowest to highest e.g. observing animals and assigning dominance rank
What does the type of variable being measured have consequences for?
Any graphical design or statistical tests which may be used
What are the 3 main calculations that can be conducted on data?
- mean
- median
- mode
What is the mean and how do you find it?
The mean is the average number. It is the sum of all the numbers divided by the number of measurements taken
What is the median?
The median is the middle value when the data is placed in sequence
What is the mode?
The mode is the most frequent value in the data set
What can box plots be used to show?
Variation within and between data sets
What 4 things should box plots include?
- median
- lower quartile
- upper quartile
- inter-quartile range
What do error bars show?
They indicate the variability of data around the mean
Why are error bars added to graphs?
They can show a direct measure of variation (using standard deviation) or probabilities (using confidence intervals)
What does a smaller error bar mean?
That the data is less variable
What is a confidence interval?
The statistical estimate of the range of values within which a certain percentage of the total population would be found
What does a 95% confidence interval show?
That the range of values would include 95% of the whole population being studied
When graphing data, when does correlation exist?
If there is a relationship between two variables
When does a positive correlation exist?
When an increase in one variable is accompanied by an increase in the other variable
When does a negative variable exist?
When an increase in one variable is accompanied by a decrease in the other variable
What is the strength of correlation proportional to?
The spread of values from the line of best fit
When can the correlation be described as strong?
When the values align closely to the line of best fit
When can the correlation be described as weak?
When the values do not align closely to the line of best fit
When, and only when, can causation be shown?
When all confounding variables are adequately controlled and thus is is difficult to demonstrate causation in observational studies
When does causation exist?
If the changes in the values of the independent variable are known to cause changes to the value of the dependent variable