Lets have it lad Flashcards
Add together
=SUM()
What symbol in the edit bar means AutoSum?
Σ
Multiply
*
Divide
/
Subtract
-
Mean
=AVERAGE()
Median for an even number of values
Add the middle two values together and divide by two
Median
Use the Sort & Filter icon (smallest to largest) to put the values in order, then take the middle value
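A minimal Python sketch of the two median cards above, with made-up data: sort the values, then average the middle two when the count is even. The list `values` is a hypothetical example, not data from the lecture.

```python
import statistics

values = [7, 3, 9, 5]            # hypothetical data with an even count
ordered = sorted(values)         # smallest to largest: [3, 5, 7, 9]
mid = len(ordered) // 2

# Even count: add the middle two values together and divide by two
median_by_hand = (ordered[mid - 1] + ordered[mid]) / 2

print(median_by_hand)
print(statistics.median(values))  # standard-library check
```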
Mode
=MODE()
Range
=MAX() minus =MIN()
Interquartile range
Put in numeric order
Split into 4 quarters
Subtract the first value of the 2nd quarter from the last value of the 3rd quarter
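A rough Python sketch of the quarters method on the card above. It assumes the number of values divides evenly by 4, and the function name `iqr_by_quarters` and the data are made up for illustration. Other conventions (for example Excel's =QUARTILE) interpolate between values and can give slightly different answers.

```python
def iqr_by_quarters(values):
    """Interquartile range using the quarters method from the flashcard.

    Assumption: the number of values divides evenly by 4.
    """
    ordered = sorted(values)                  # put in numeric order
    if len(ordered) % 4 != 0:
        raise ValueError("this sketch needs a count divisible by 4")
    q = len(ordered) // 4                     # size of each quarter
    first_of_second_quarter = ordered[q]
    last_of_third_quarter = ordered[3 * q - 1]
    return last_of_third_quarter - first_of_second_quarter


data = [12, 3, 8, 5, 10, 1, 7, 15]            # hypothetical data
print(iqr_by_quarters(data))
```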
Standard deviation
=STDEV()
Variance
=VAR()
What is variance?
Standard deviation without the square root (the standard deviation squared)
How do you calculate standard deviation?
Calculate the mean (x with a line above it means the mean)
Take the difference between each x value and the mean, then square it
Take the sum of all the squares
Then divide by the number of values minus 1
Take the square root
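The steps above can be sketched in Python as follows. The data are made up, and the result is checked against `statistics.stdev`, which also divides by the number of values minus 1 (the sample standard deviation).

```python
import math
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]                   # hypothetical data

mean = sum(data) / len(data)                      # step 1: calculate the mean
squared_diffs = [(x - mean) ** 2 for x in data]   # step 2: difference from mean, squared
sum_of_squares = sum(squared_diffs)               # step 3: sum of all the squares
variance = sum_of_squares / (len(data) - 1)       # step 4: divide by n - 1 (this is the variance)
std_dev = math.sqrt(variance)                     # step 5: square root

print(variance, std_dev)
print(statistics.stdev(data))                     # standard-library check
```

Stopping after step 4 gives the variance, which is why the variance is in squared units.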
Square data
data^2
What are the two types of statistical analysis?
Descriptive stats
Inferential stats
What is descriptive stats?
Statistical analysis to summarise main points/characteristics of data
What is inferential stats?
Statistical analysis to infer something about a whole from a sample
Nominal data
Placed in labelled categories (options)
Ordinal data
Preferences shown and then ranked, e.g. on a scale of 1-5
Interval data
Values with a consistent interval between them, e.g. how hot
Ratio data
Has a defined zero point, e.g. distance travelled
What is the mean very sensitive to?
extreme values
What is the median compared to the mean?
Much less sensitive, more robust
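A small Python illustration of the two cards above, using made-up numbers: adding one extreme value moves the mean a long way but leaves the median where it was.

```python
import statistics

data = [10, 12, 11, 13, 12]          # hypothetical data, no extreme values
with_outlier = data + [100]          # same data plus one extreme value

print(statistics.mean(data), statistics.median(data))
print(statistics.mean(with_outlier), statistics.median(with_outlier))
```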
For nominal data, which descriptive statistic should be used?
Mode
What should you use for quantitative data?
Mean or median
When should you use the median over the mean?
When there are extreme values, because you do not want a distorted average
When should you use the mean over the median?
When there are no extreme values
What are the 4 measures of spread?
Range
Standard deviation (most common)
Interquartile range
Variance
What is the standard deviation used for?
Measure the variability in data
What does a high standard deviation mean?
High variability (high change)
What does a low standard deviation mean?
Low variability (low change)
Example of standard dev?
LAI values of rainfall
What is variance?
A measure of variation that is not in the same units as the data (squared units); less common
What are histograms?
Graphic depiction of the shape of the distribution of data; most common
What is the issue with too many intervals in a histogram?
Too complex
What is the issue with too few intervals in a histogram?
Detail is lost
What is the ideal number of intervals in a histogram?
10 to 20
4 measures to describe the similarities/differences of frequency distributions
Central tendency
Spread
Skewness
Kurtosis
Skewness
Measure of asymmetry in distribution
Skewness, low numbers
Positively skewed distribution
Skewness, high numbers
Negatively skewed distribution
No skew
0
Skew =
Mean - median
+ve value of skew
+ve skew
-ve value of skew
-ve skew
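A minimal Python sketch of the mean minus median rule on the cards above. The function name `skew_direction` and the three data sets are made up for illustration. This is only a quick indicator of the direction of skew, not the same calculation as Excel's =SKEW().

```python
import statistics


def skew_direction(values):
    """Return the direction of skew from the sign of mean minus median."""
    diff = statistics.mean(values) - statistics.median(values)
    if diff > 0:
        return "+ve skew"
    if diff < 0:
        return "-ve skew"
    return "no skew"


low_heavy = [1, 2, 2, 3, 3, 3, 4, 20]          # most values low, long tail to the right
high_heavy = [1, 17, 18, 18, 19, 19, 20, 20]   # most values high, long tail to the left
symmetric = [1, 2, 3, 4, 5]

print(skew_direction(low_heavy))
print(skew_direction(high_heavy))
print(skew_direction(symmetric))
```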
Kurtosis
Measure of how flat or peaked a distribution is
Kurtosis, Platykurtic
Relatively flat distribution w no obvious peak
Kurtosis, Leptokurtic
strongly pronounced peak in data
Cumulative frequency graphs, steep slope
Intervals with many data points
Cumulative frequency graphs, shallow slope
Intervals with few data points
What are cumulative frequency graphs used for (example)?
Grain size statistics
What is used to graph 2 variables?
Scatter plots
Nominal data is
Categorical data
What is nominal data frequently expressed as?
Pie charts, good at showing proportion
Can histograms still be used for nominal data?
Yes, intervals = categories and frequencies = number of x in each category
How is nominal data expressed?
Histograms and pie charts
What do frequency distribution histograms show?
Visually describe the distribution and identify skewness and kurtosis
How do you decide the size of intervals on a histogram?
Total (highest value) divided by the number of intervals you want
How do you calculate the number of individuals in each class interval?
=FREQUENCY(data cells, class interval cells); highlight the range of cells to put the answer into first
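A rough Python sketch of the two cards above: work out the interval size from the highest value, then count how many values fall in each class interval. The function `frequency` is a hand-written stand-in written for this example, not Excel's own code. It assumes each value is counted in the first interval whose upper bound it does not exceed, with one extra count at the end for anything above the last bound. The data are made up.

```python
def frequency(data, upper_bounds):
    """Count values per class interval (upper bounds in ascending order).

    The returned list has one more entry than upper_bounds: the last
    entry counts values larger than the final upper bound.
    """
    counts = [0] * (len(upper_bounds) + 1)
    for x in data:
        for i, bound in enumerate(upper_bounds):
            if x <= bound:
                counts[i] += 1
                break
        else:
            counts[-1] += 1          # above every upper bound
    return counts


data = [3, 7, 8, 12, 15, 18, 20, 4]  # hypothetical data
n_intervals = 4                      # number of intervals wanted
width = max(data) / n_intervals      # highest value divided by number of intervals
upper_bounds = [width * (i + 1) for i in range(n_intervals)]

counts = frequency(data, upper_bounds)
print(upper_bounds)
print(counts)
```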
How do you produce a frequency distribution histogram?
Insert tab > Charts > Column
Skew function
=SKEW()
What is the skew of a normal distribution?
0
Why are sampling and inferential stats important?
- Rare to be able to sample a whole population
- Use characteristics of the sample to infer characteristics of the whole population
What is random sampling
Selecting individuals with no bias
What is systematic sampling?
Individuals selected in a regular way
What is spatial sampling
individuals are selected at regular spatial intervals
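A minimal Python sketch of selecting individuals "in a regular way", assuming that means taking every k-th individual from an ordered list. The population, the starting position and the step are all made-up values.

```python
population = list(range(1, 21))   # hypothetical individuals numbered 1 to 20
start = 2                         # hypothetical starting position (0-based index)
step = 5                          # take every 5th individual after the start

systematic_sample = population[start::step]
print(systematic_sample)
```

The same idea applies to spatial sampling if the list positions stand for regularly spaced points along a transect.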
Criteria for truly random samples
- Every individual has an equal chance of inclusion throughout the procedure
- Selection of any individual should not affect the chance of selection for another
Positive/negatives of systematic sampling
+allows fair/even coverage of range of individuals
-Not fair and equal chance of being chosen, can produce bunching of sampled individuals
Two key assumptions which underpin most inferential stats
- random sampling
- population has a known distribution
What is a parameter
number that describes data from a population
What is a statistic?
A number that describes data from a sample
How to use the random number generator in Excel
=RANDBETWEEN(1,200), pull down to fill the cells below, then re-paste the answers using Paste Special (values) so the numbers stop recalculating
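For comparison, a small Python sketch of the same idea, assuming individuals are numbered 1 to 200 as in the card above. The seed value and the sample size of 10 are arbitrary choices for the example. `randint` can repeat numbers, as RANDBETWEEN can; `sample` picks without repeats.

```python
import random

rng = random.Random(42)   # fixed seed (arbitrary) so the numbers are reproducible

# Like =RANDBETWEEN(1,200) filled down 10 cells: repeats are possible
random_ids = [rng.randint(1, 200) for _ in range(10)]

# Alternative: 10 different individuals, no repeats
unique_ids = rng.sample(range(1, 201), 10)

print(random_ids)
print(unique_ids)
```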
Where to find look up feature
Lecture week 4
What is a hypothesis?
Proposed explanation for a narrow phenomenon, based on a range of things, e.g. background scientific knowledge, preliminary investigations, logic, etc.
What is a theory?
Structure conceived by human imagination to explain how/why patterns occur in observed data
- often broader and can integrate many hypotheses
- new or very well tested
- can be used to generate hypotheses
What must hypotheses be to be scientific?
testable
Why can descriptive statistics be used to make hypotheses?
We can make hypotheses based on observed patterns
Hypotheses can be formalised for what?
Statistical testing
Null hypothesis, symbol
H0