Descriptive Statistics Flashcards
Familiarity with basic descriptive stats
What type of variable are rating scales?
Ordinal - gaps between numbers are not meaningful e.g. gap between 1 and 2 may be very different from the gap between 2 and 3, may take a lot more to move between one pair than the other
How can rating scales such as Likert scales be used?
Numbers can be aggregated across many questions and resulting numbers treated as INTERVAL data - cannot say one person is “twice as…” using this data, but can say a person is “three points higher than…” for example
What is a key difference between interval and ratio data?
Ratio data has an absolute zero e.g. temperature in Celsius is interval data while weight is ratio data
What are the 2 main types of descriptive statistics used?
Measures of central tendency - describes central position of a frequency distribution; MEAN, MEDIAN or MODE
Measures of spread - Describes how spread out scores are; RANGE, QUARTILES, ABSOLUTE DEVIATION, VARIANCE, SD
Is it possible for the same data to be treated as both ordinal and ratio?
Yes - data can be transformed from one level to another to allow different types of testing but this can only happen “downwards” e.g. going from ratio down to ordinal. In simple terms, we can make detailed data more simple but we cannot do the opposite
What are 3 possible advantages of simplifying data?
Easier reading/understanding of data
Decisions/actions based on data become clearer e.g. pass or fail
Sweeping generalisations - classifying people into comparable groups
What are inferential statistics useful for?
When it isn’t practical to measure every member of a population - use a representative sample and make generalisations
Methods are an ESTIMATION of population parameters as sampling errors mean that a given sample will never be fully representative
What is Ordinal data
Variation along a continuum, difference between numbers NOT meaningful
Can only say bigger or smaller than, no direct ratio comparisons e.g. twice as big as
What is Interval data
Variation along a continuum where the difference between numbers is meaningful/equal/fixed
No true zero though so ratios between numbers are not meaningful
Compare and contrast descriptive and inferential statistics
Descriptive - measurements certain but cannot be generalised
Inferential - measurements can be generalised but purely estimation
When converting interval to ordinal data, how do you treat data which is the same e.g. if three participants have the same score?
Rather than saying, for example, that these three people are “equal second”, we would take the median of the second, third and fourth rank and that would be the designated rank
e.g. if three people score the second highest score, the median would be 3 and there would be no Rank 2 in this particular set of ordinal data
Why can numbers used as measures be misleading?
e. g. TEMPERATURE - a temperature scale will appear as an interval scale, but the EXPERIENCE of heat change is arguably better considered as ordinal as an increase of 3 degrees in a room at 14 will likely be more noticeable than the same change at 30 degrees i.e. the gaps between the numbers are not meaningful as it takes more to get between one pair of numbers than another
e. g. “PLASTIC INTERVALS”/QUASI-INTERVAL SCALES - attitude scales, for example, have meaningless intervals along the scale but misleading familiarity of the 0-10 number scale
What is a good rule of thumb for how to treat different measurement scales?
Using a published, standardised scale –> treat data as interval
Using an unstandardized invented scale –> safer to treat as ordinal
This means using different descriptive stats to summarise the data
How can we change interval data into nominal?
Median split method
What is meant by a quasi-interval scale?
Numerically appearing intervals on the scale do not measure equal amounts of construct
In practice, how do we use a truly interval scale?
e.g. when using a tape measure and someone is 175cm tall, we can only truthfully assert that they are 174.5-175.5cm tall (exact limits determined by tape measure or by convenience)
When can we create frequency histograms?
When data is ordinal/interval/ratio NOT nominal i.e. the data needs to be able to be meaningfully ordered
(For nominal we would use bar charts, leaving spaces between the bars)
How and why do we group data for a frequency histogram?
When we have a large range of scores and individual data points are time consuming and uninformative
1) Decide group size - usually between 5 and 10, we are aiming for a size that enables us to have less than 10 groups in total
2) Start the interval with a multiple of 5 or 10 - we can use our group size possibilities here e.g. if we have scores ranging from 55-99 we can try an interval of 5 and divide the difference between scores i.e. 45 by 5 –> gives us 9 groups