Statistics Flashcards
Pearson advantages
The result being between 1 and -1 it is easy to decide wether the correlation is positive or negative
Tells us the strength of a relationship
Uses exactly values and therefore is very accurate.
Pearson negatives
Highly affected by outliers
Difficult and time consuming to calculate
Assumes a linear relationship between variables.
Cannot explain trends
Spearmen positives
The result being between 1 and -1 it is easy to decide wether the correlation is positive or negative
Tells us the strength of a relationship
Is not affected by outliers as uses ranked data
Much easier to calculate than Pearson.
Spearmen negatives
The result does not explain the relationship
Less accurate as uses ranked data.
Linear regression positives
Very simple to understand
Linear regression negatives
Affected by outliers
Scatter graph needs to first be created to see if data is normally distributed
Can take very long to calculate as standard deviation and Pearson need to first be calculated.
Standard Deviation positives
Can always be calculated
Very accurate as takes into consideration all values
Standard deviation negatives
Difficult to compare to different sets with different means
Assumes normal distribution
Is affected by outliers.
Nearest neighbour advantages
It can confirm a theory or hypothesis
It allows comparisons to be made between different areas or see how an area changes over time
Easy to understand and interpret.
Nearest neighbour disadvantages
It does not take into consideration the size of regions
Cannot be used in irregular shaped areas where a geographic feature separates the nearest neighbour
It can be difficult to identify settlement boundaries.
Chi squared advantages
Tests associations between variables
Useful when data is grouped into classes
Can be compared to a significance table
Useful when measuring the difference between observed and expected data.
Chi squared disadvantages
Cannot use percentages
The number of observations must be more than 20
Complicated to calculate
The test becomes invalid if the expected result is less than 5.
Mean advantages
Easy to calculate and understand
Uses all the data to find the answer
Useful for ratio and interval data
Mean disadvantages
It is affected by outliers
Medien advantages
It is not affected by outliers
Very easy to calculate
Good with ordinal data (ordered).