week 2 - descriptive statistics part 2 Flashcards
What is the difference between sampling errors and non-sampling errors?
Sampling errors result from the fact that only a fraction of the population is being observed. They become less important when the sample size increases.
Non- sampling errors arise when the sample is not representative of the whole population and the do not neccesarily decrease with the sample size. Eg not including people in a study with no permanent home address because their existence is not recorded.
Name some different types of sampling.
Non-random sampling, simple random sampling (SRS) and Stratified random sampling.
What is non-random sampling?
The sample is chosen not by random. For example participants opt to be part of a study.
What is SRS (simple random sampling)
SRS means everyone in a population has equal chance of being involved in a study. For example everyone in a population is provided with a number and a group of numbers is then generated randomly to decide which participants are the be involved from the population.
How does stratified random sampling differ from SRS?
In stratified random sampling the population is divided into groups based on particular characteristics (eg sex, occupation etc) and then random sample is selected from each of these groups
Calculate the mean and median for the following numbers..
83, 90, 80, 105, 85, 74, 88.
Mean = 86.4 Medium = 85
Why is the mean most commonly used to describe the distribution of data?
Because it is the most amendable to analysis
What are the most commonly used measures of locations (also known as measure of central tendency or average).
Mean, Mode and Median.
When would you not use the mean to describe the measure of location and what might you use instead?
When the distribution is skewed the mean can be misleading so you would use the median instead
A positively skewed distribution is skewed to the ….and a negatively skewed distribution is skewed to the ….
Positive = right. Negative = left.
What is the range and what are its disadvantages?
The difference between the largest and the smallest values. The disadvantaged are that it wastes informations, extreme values (outliers) may make it unreliable, and it often increases as the sample increases.
What are these two symbols and what is the difference between them? µ and
‘x-bar’ and ‘mu’. X-bar represents the mean of a sample population where ‘mu’ represents the mean of a total population.
What is the below formula representing?
The formula for sample standard deviation.
What is the formula (algebraic) of population standard deviation?
How to do calculate the sample standard deviation for the sample variance?
The sample standard deviation is the square root of the sample variance.