Statistics Flashcards
What is random sampling?
Every item has an equal chance of being selected. Items are usually chosen using computer-generated random numbers or random number tables.
What is stratified sampling?
The area under study is divided up into homogeneous units and each unit is randomly or systematically sampled.
What is systematic sampling?
Items are picked at a regular interval, e.g. every 30 metres.
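The three sampling methods above can be sketched in Python. This is a minimal illustration, not a fieldwork tool: the function names, the transect positions and the strata names are all hypothetical, chosen only for the example.

```python
import random

def simple_random_sample(items, k, seed=None):
    """Random sampling: every item has an equal chance of being selected."""
    rng = random.Random(seed)
    return rng.sample(items, k)

def systematic_sample(items, interval, start=0):
    """Systematic sampling: pick every `interval`-th item from `start`."""
    return items[start::interval]

def stratified_sample(strata, k_per_stratum, seed=None):
    """Stratified sampling: randomly sample within each homogeneous unit."""
    rng = random.Random(seed)
    return {name: rng.sample(units, k_per_stratum)
            for name, units in strata.items()}

# Hypothetical transect: one possible sample point per metre, 0 m to 299 m.
positions = list(range(0, 300))

# Every 30 metres along the transect: 0, 30, 60, ... 270.
every_30_m = systematic_sample(positions, 30)

# Five points chosen at random from the whole transect.
random_points = simple_random_sample(positions, 5, seed=1)

# Hypothetical strata (assumed homogeneous units), two points from each.
strata = {"woodland": positions[:150], "grassland": positions[150:]}
stratified_points = stratified_sample(strata, 2, seed=1)
```

Stratified sampling can also sample each unit systematically; the sketch uses random sampling within each unit only to keep it short.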
Outline a bar graph:
Easy to understand
Histograms are a type of bar graph in which the bars show frequencies
Outline a line graph:
Used for continuous data
Can plot several lines on one graph
Can have different scales on each axis
Outline a pie chart:
The segments represent the share of the total value
Visual but can be difficult to read
How do you work out the mean?
Calculated by adding up all the values in a data set and dividing the total by the number of values in the data set.
What is the median?
This refers to the central value in the ranked data set.
What is the mode?
This is the most frequently occurring number in the data set.
What is the range?
Highest value take away the lowest value.
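As a quick worked example, the mean, median, mode and range can be computed with Python's standard `statistics` module. The data set here is made up purely for illustration.

```python
from statistics import mean, median, multimode

# Made-up data set, already in rank order for readability.
data = [2, 4, 4, 5, 7, 9, 11]

data_mean = mean(data)              # sum of values (42) / number of values (7)
data_median = median(data)          # central value of the ranked data
data_modes = multimode(data)        # list of the most frequent value(s)
data_range = max(data) - min(data)  # highest value minus lowest value

print(data_mean, data_median, data_modes, data_range)
```

`multimode` returns a list because a data set can have more than one mode.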
What happens when the median, mode and mean are the same value?
The data are symmetrically distributed, as in a normal distribution. However, most data sets are SKEWED, with differences between the 3 measures.
Outline how to work out interquartile range:
1) Put numbers in order.
2) Find the median.
3) Find the median of the values below the overall median (lower quartile) and the median of the values above it (upper quartile).
4) Take the lower quartile away from the upper quartile to get the interquartile range.
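The steps above can be sketched in Python as follows. The function name is hypothetical, and the sketch assumes the "median of each half" method described on this card, leaving the overall median out of both halves when the number of values is odd. Other quartile conventions exist and can give slightly different answers.

```python
from statistics import median

def interquartile_range(values):
    """Interquartile range using the median-of-each-half method."""
    if len(values) < 2:
        raise ValueError("need at least two values")
    ordered = sorted(values)                       # 1) put numbers in order
    half = len(ordered) // 2                       # 2) position of the median
    lower_half = ordered[:half]                    # values below the median
    upper_half = ordered[len(ordered) - half:]     # values above the median
    lower_quartile = median(lower_half)            # 3) median of each half
    upper_quartile = median(upper_half)
    return upper_quartile - lower_quartile         # 4) upper minus lower

# Made-up data: lower half [2, 4, 4], upper half [7, 9, 11].
iqr = interquartile_range([7, 2, 9, 4, 11, 4, 5])
print(iqr)  # 9 - 4 = 5
```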
What is standard deviation?
The standard deviation is the most commonly used measure of the dispersion of a set of data around its mean.
A low SD indicates that the data is clustered around the mean whereas a high value indicates dispersion.
Outline how to calculate standard deviation:
1) Difference of each value from the mean is worked out.
2) Each of these values is squared (to remove negative values)
3) All the squared values are added together
4) This summed value is then divided by the number of values in the data set.
5) The square root of this value is found.
What is the formula for standard deviation?
SD = \sqrt{\frac{\sum (x - \bar{x})^2}{n}}
where:
n = number of values in the data set
x = each value in the data set
\bar{x} = mean of all values in the data set
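The five calculation steps and the formula can be checked with a short Python sketch. The function name and the data set are made up for illustration. Because the card divides by n, this is the population standard deviation; a sample standard deviation would divide by n - 1 instead.

```python
from math import sqrt
from statistics import pstdev

def standard_deviation(values):
    """Population standard deviation, following the five steps on the card."""
    n = len(values)
    mean = sum(values) / n
    differences = [x - mean for x in values]   # 1) difference from the mean
    squared = [d ** 2 for d in differences]    # 2) square each difference
    total = sum(squared)                       # 3) add the squared values
    variance = total / n                       # 4) divide by number of values
    return sqrt(variance)                      # 5) take the square root

# Made-up data: mean is 5, squared differences sum to 32, and 32 / 8 = 4.
data = [2, 4, 4, 4, 5, 5, 7, 9]
sd = standard_deviation(data)
print(sd)            # 2.0
print(pstdev(data))  # standard-library equivalent, for comparison
```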