1. Introduction - Variability and Sampling Populations Flashcards
Why do we need statistics?
It describes a situation to us and gives us facts about a given topic.
Displaying data
From complex to simple.
Example:
List of data with various complex numbers, easier to understand in graph.
Find a way to represent his data that has meaning to us.
Histogram
different to a Bar chart.
On the x-axis (independent variable) - like a scaler - number at the front is larger and the number at the back is smaller. Align things in order - order the data that might not be ordered into something meaningful.
Y-axis (dependent variable) - increase of repeated numbers from x-axis. The more something occurs on the x-value, the lager the y-value is.
Independent Variable
Something the scientist controls and can be varied as I see fit.
Dependent Variable
Depends on independent Variable. Depends on the value obtained from the independent variable.
Bell-shaped Curve
Very important!!
A way of describing data as being normally distributed.
Significance
Ordered the data to look for something normal which would give us significant values.
Applied a statistical test which would give us a result of value and significance.
Ho to decide to trust your Data?
Correct labelling of axis.
Correct display of error bars.
Error Bar
The error is something I can’t account for.
Big error bar - Data is highly variable and cant be confident in it. Wouldn’t want to present due to lack of trust.
Small error bars - trustworthy and something presentable
Correlations
Looking for relationships between data.
Example:
Amount of x-axis correlating with the happiness on y-axis? -> power point sweets slide
Linear relationship - data displayed in a straight line
Positive correlation
value increases further along on x-axis.
Negative correlation
value decreases across the x-axis.
Descriptive Statistics
- ) measure of central tendency
- ) measure of variability
Brief, descriptive co-efficient which summarise a given data set. This can be a representation of the entire or a sample of a population.
They are broken down into messures of central tendency and measures of variability (spread).
1.) this is a summary statistic, representing the centre point or typical value of a data set.
The three most common measure of central tendency are the mean, median and the mode. Each calculates the location of the central point using a different method.
2.) the most common are the range, variance, and standard deviation.