PAS 1 Flashcards
What do line graphs imply?
that it is reasonable to interpolate between any observed readings
What should you look at first when inspecting and plotting data, before calculating statistics?
RAW DATA
- look at the actual numbers
- check for obvious mistakes and impossible observations
- graph and results and look at them
Summary statistics
statistics that summarise your data
The disadvantage and advantage of using summary statistics
the exception to the disadvantage
You lose information by doing this, but you gain a greater understanding of the overall picture/trends in the data.
the exception to the disadvantage - dot plots retain all the data in its original form
Measures of Average/central tendency
Mean- common average
Median- middle value
Mode - occurs the most frequently (2 = bimodal)
Measures of Spread
Range- smallest to largest value
Standard Deviation- measure of spread of data around the mean
Variance- average of the squared difference from the mean, or the square of the standard deviation
Interquartile Range- covers middle two groups
The disadvantage and advantage of using interquartile range
it is not useful with small numbers of observations
help rule out extreme values or outliers in the data
the disadvantage of the mean and range
outliers/extreme values can affect the mean and range
Types of Data
Numerical (quantitative) Data
Categorical (qualitative) Data (e.g. male/female)
Continuous (numerical) data
Discrete (numerical) data
can take on any real-number value within some range
can only take specific value
Nominal, ordinal and dichotomous data
- NOMINAL - no natural order to the categories e.g. hair colour or degree of course
- ORDINAL- there is a natural order to the categories e.g. months of the year, satisfaction scale from 1-10
- DICHOTOMOUS - there are only 2 options e.g. yes/no vote, leave/remain vote
the disadvantage of the standard variation
large outliers will create a higher dispersion
What are the means of variance
variation - an average of how much each number in a group of numbers differs from the group mean.
mean deviation - a measure of how closely grouped your data values are.
standard deviation - the measure of the average distance our observations are from the mean.
how do you calculate mean deviation
- calculate the mean
- calculate the deviation of each value from the mean
- drop the negative sign
- find the mean of the deviations
how do you calculate standard deviation
- calculate the mean
- calculate the deviation of each value from the mean
- square each deviation
- sum of the squared deviation
- divide by n-1
- square root