Data: Descriptive Statistics Flashcards
What are the various measures of central tendency
- Mode
- Median
- Mean
How do you decide which measure of central tendency to use
Based on the levels of measurement
which measure of central tendency you’d use if the data was nominal
Mode
which measure of central tendency you’d use if the data was ordinal
- the median
- then the mode if calculating the median isn’t possible
which measure of central tendency you’d use if the data was interval/ratio
- preferably the mean
- however, if the data is skewed…
- you need to calculate the median
- then the mode if calculating the median isn’t possible
What is the strength of using the mean
- takes all the scores into account so it is the most sensitive
What is the weaknesses of using the mean
- impacted by extreme values (i.e. outliers) - it will be skewed (artificially inflated or deflated)
- not useful when a decimal point is not an option for the data (e.g. 2.4 cola bottles)
What is the strength of using the mode
- Not impacted by extreme values (i.e. outliers) — it will not be skewed (artificially inflated or deflated)
- useful for nominal data as it is the only method
What is the weakness of using the mode
- Doesn’t take all the scores into account so it is not as sensitive
- there may be several or none
What is the strength of using the median
- not impacted by extreme values (i.e. outliers) - it will not be skewed (artificially inflated or deflated)
What is the weaknesses of using the median
- Doesn’t take all the scores into account so it is not as sensitive
EXAMPLE:
In an unethical experiment 3 groups of 8 lab rats were given a maze to complete and times were recorded in seconds.
Group 1) Rats given brain lesions - 35, 27, 26, 27, 28, 79, 27, 30
Group 2) Rats with tails cut off - 15, 10, 18, 22, 8, 49, 16, 22
Group 3) Rats with eyes damaged - 33, 33, 32, 28, 67, 45, 24, 29
Which measure of central tendency should be used + why?
- the data was ratio, however there were outliers in the groups
- therefore calculating the median is the most appropriate measure of central tendency
- because we can’t calculate the mean because there are outliers
Describe the characteristics of a Bar chart
used for nominal (category/not continuous data):
- frequency = Y-axis
- categories = X-axis
- gaps between each bar represents the lack of continuity
For experiments: IV = X-axis, DV = Y-axis
Describe the characteristics of a Histogram
Used for continuous data:
- frequency = Y-axis (it must start at 0)
- continuous data = X-axis
- no gap between each bar, represents continuity
Describe the characteristics of a Line graph
- frequency = Y-axis (must start at 0)
- used for continuous data = displayed on X-axis
- each dot is connected by a line