Unit 4: descriptive statistics Flashcards
what are the 4 types of data sets
nominal
ordinal
interval
ratio
how do you know if a statistic is nominal?
has names, such as:
in land use - commercial, residential, agricultural etc
rock types - sedimentary, metamorphic
or just straight up names like locations
how do you know if a statistic is ordinal?
data can be placed in descending or ascending order such as settlement type: city, town, village, swamp
how do you know if a statistic is interval?
refers to real numbers with no true zero - only asked about temp
how do you know if a statistic is ratio?
numbers with no real zero - rainfall in mm etc
most number data will be in this category
how do you calculate the mean?
the average - all the numbers added together then divide by the values
how do you calculate the median?
middle value - all the numbers in a line and find the middle
how do you calculate the mode?
the most common number - can be two of them
how do you calculate the range?
difference between the max and min number - for example if the biggest number was 90 and the min was 10 then the range is 80
What is the methodology for spearmen’s rank correlation coefficient by hand
Not reliable as <10 pairs is too little and >30 is too much
1) construct table and write variables (x2)
2) draw scatter graph for this data and establish null hypothesis
3) rank data from lowest to highest (x2)
4) calc diff between numbers and put into colum
5) calc *2
6) choose between tied and non tied formula
7) interpret results
What is the methodology for persons product moment correlation coefficient
1) create scatter graph and null hypothesis for data
2) test strength with formula
Further technique used to test significance of relationship
3) work out degrees of freedom
What is the purpose of SRCC and PPMC
What are the positives and negatives for PPMC
Assumes linear relationship between variable even when not existing
Tedious to calc
Effected by extreme values - skews data
High degree of correlation doesn’t mean close to casual relationship between variables - can confuse viewer
What is the purpose of central tendency
Central tendency tells you the middle point in a set of data, done through mode, median and mean - used on already collected data
What are the pros and cons of central tendency