Statistics + Probability (topic 4) Flashcards
Mean
Average: add up all values, divide by the number of terms.
Median
Middle value in an ordered data set. **Data needs to be in order first. With an even number of values, take the mean of the two middle values.
Mode
Most common number in the set.
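A minimal sketch of the three averages above, using Python's standard `statistics` module. The data set is made up for illustration.

```python
import statistics

data = [7, 3, 5, 3, 9]  # hypothetical data set

# Mean: add up all values, divide by the number of values.
mean = sum(data) / len(data)

# Median: middle value of the ordered data.
# statistics.median sorts internally, so unsorted input is fine.
median = statistics.median(data)

# Mode: most common value in the set.
mode = statistics.mode(data)

print(mean, median, mode)
```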
Percentile
The value below which X percent of the data falls (the Xth percentile).
Quartile
Q1 = first quartile = 25th percentile
Q2 = second quartile = 50th percentile
Q3 = third quartile = 75th percentile
Discrete data
Exact numbers (usually from counting)
Interquartile range
Measure of dispersion (spread) of the data: IQR = Q3 - Q1, the range of the middle 50% of the data.
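A rough sketch of finding the quartiles and the interquartile range (Q3 minus Q1) in Python. The data set is hypothetical, and quartile conventions differ between textbooks, calculators, and software, so other tools may give slightly different values for other data sets. This uses the default ("exclusive") method of `statistics.quantiles`.

```python
import statistics

data = [1, 2, 3, 4, 5, 6, 7]  # hypothetical data set

# n=4 splits the data into four equal groups and returns
# the three cut points: Q1, Q2 (the median), and Q3.
q1, q2, q3 = statistics.quantiles(data, n=4)

# Interquartile range: spread of the middle 50% of the data.
iqr = q3 - q1

print(q1, q2, q3, iqr)
```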
Continuous data
Any value in a certain range (can be decimal; usually from measuring)
Reliable data
Repeatable data
Missing data can affect reliability
Bias: results favour one outcome over another. **We try to minimize bias.
Sampling techniques
- Simple random
- Convenience
- Systematic
- Quota
- Stratified
Simple random sampling technique
Every member of the population has an equal chance of being chosen. Choose out of a hat, number generator, etc.
Ex. Poll students from school - # assigned to students, choose with a random # generator.
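The student poll example could be sketched like this. The school size (100), the sample size (10), and the function name `simple_random_sample` are assumptions for illustration only.

```python
import random

def simple_random_sample(population, k, seed=None):
    """Pick k members so that every member has an equal chance."""
    rng = random.Random(seed)  # seed only makes the run repeatable
    return rng.sample(population, k)  # sampling without replacement

# Hypothetical school: students are assigned numbers 1 to 100.
student_numbers = list(range(1, 101))
chosen = simple_random_sample(student_numbers, 10, seed=0)
print(chosen)
```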
Convenience sampling technique
Choose the easiest people to sample: ask your friends, etc. Problem: may not be representative of the population.
Systematic sampling technique
Choose random starting point, use fixed interval.
Ex. Make a list of all students in a class, choose every 3rd student.
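A possible sketch of the class-list example: pick a random starting point within the first interval, then take every 3rd student from there. The class of 30 and the name `systematic_sample` are made up for illustration.

```python
import random

def systematic_sample(items, interval, seed=None):
    """Random start within the first interval, then a fixed step."""
    rng = random.Random(seed)
    start = rng.randrange(interval)  # 0, 1, ..., interval - 1
    return items[start::interval]    # every interval-th item from start

# Hypothetical class list of 30 students.
class_list = [f"student_{i}" for i in range(1, 31)]
every_third = systematic_sample(class_list, 3, seed=1)
print(every_third)
```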
Quota sampling technique
Size each group in the sample to match its proportion in the population you're polling.
Ex. 55% girls, 45% boys in school, so sample should have those same %.
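One way to sketch the 55% / 45% example. Quota sampling is usually described as non-random: each quota is filled with whoever is available first. The list of people, the sample size of 20, and the name `quota_sample` are assumptions for illustration.

```python
def quota_sample(people, proportions, sample_size):
    """Take people in the order met until each group's quota is full."""
    # Quotas come from the population proportions. Rounding means they
    # may not always add up exactly to sample_size.
    quotas = {group: round(p * sample_size) for group, p in proportions.items()}
    chosen = []
    for name, group in people:
        if quotas.get(group, 0) > 0:
            chosen.append((name, group))
            quotas[group] -= 1
    return chosen

# Hypothetical list of students in the order they were met.
people = [(f"s{i}", "girl" if i % 2 == 0 else "boy") for i in range(100)]
quota_chosen = quota_sample(people, {"girl": 0.55, "boy": 0.45}, 20)
print(len(quota_chosen))
```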
Stratified sampling technique
Split the population into strata (smaller groups), then sample randomly from each stratum.
Ex. Choose half dp1, half dp2 students
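A small sketch of the dp1 / dp2 example, assuming students are chosen at random within each stratum and that each stratum contributes the same number. The group sizes and the name `stratified_sample` are hypothetical.

```python
import random

def stratified_sample(strata, per_stratum, seed=None):
    """Randomly pick per_stratum members from every stratum."""
    rng = random.Random(seed)
    return {
        name: rng.sample(members, per_stratum)
        for name, members in strata.items()
    }

# Hypothetical strata: 40 dp1 students and 35 dp2 students.
strata = {
    "dp1": [f"dp1_{i}" for i in range(40)],
    "dp2": [f"dp2_{i}" for i in range(35)],
}
by_stratum = stratified_sample(strata, 5, seed=2)  # half dp1, half dp2
print(by_stratum)
```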