Lecture 4 - Probability, Sampling and Distributions Flashcards
Define probability theory
The branch of mathematics concerned with the study of random phenomena, i.e. chance.
Using the Gaussian equation, we can predict the value of y for any value of x from just the…
Mean and standard deviation
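As a minimal sketch of the card above, the standard normal density formula can be written in Python. The function name `gaussian_pdf` and the example values (mean 100, s.d. 15) are illustrative assumptions, not part of the lecture.

```python
import math

def gaussian_pdf(x, mean, sd):
    """Height y of the normal curve at x, using only the mean and s.d."""
    coeff = 1.0 / (sd * math.sqrt(2.0 * math.pi))
    exponent = -((x - mean) ** 2) / (2.0 * sd ** 2)
    return coeff * math.exp(exponent)

# Hypothetical example: scores with mean 100 and s.d. 15
peak = gaussian_pdf(100, mean=100, sd=15)    # curve height at the mean
one_sd = gaussian_pdf(115, mean=100, sd=15)  # curve height 1 s.d. above the mean
```

The curve is highest at the mean and symmetric around it, so `gaussian_pdf(85, 100, 15)` gives the same height as `one_sd`.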
A positive skew moves the data peak of a normal distribution to the…
Left (the long tail extends to the right), and vice versa for a negative skew
Pearson’s coefficient of skew uses… to…
The difference between the mean and median (scaled by the standard deviation)… measure the skew in terms of both magnitude and direction (positive or negative)
When data is positively skewed, i.e. the tail is on the right of the mean, mean>…
Median>mode
When data is negatively skewed i.e. tail is on the left of the mean, mean<…
Median<mode
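The skew cards above can be illustrated with a short sketch. It assumes the median-based form of Pearson's coefficient, 3 × (mean − median) / s.d., and uses the sample standard deviation. The function name and the data are made up for illustration.

```python
import statistics

def pearson_median_skew(values):
    """Pearson's median-based skew coefficient: 3 * (mean - median) / s.d."""
    mean = statistics.mean(values)
    median = statistics.median(values)
    sd = statistics.stdev(values)  # sample s.d. (an assumption of this sketch)
    return 3 * (mean - median) / sd

right_tailed = [1, 2, 2, 3, 3, 3, 4, 4, 12]  # long tail on the right
left_tailed = [-v for v in right_tailed]     # mirror image, tail on the left

positive_skew = pearson_median_skew(right_tailed)  # greater than 0: mean > median
negative_skew = pearson_median_skew(left_tailed)   # less than 0: mean < median
```

The sign gives the direction of the skew and the size gives its magnitude.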
What percentage of the sample is within 1 s.d. of the mean?
About 68%
What percentage of the sample is within 2 s.d.s of the mean?
About 95%
What percentage of the sample is within 3 s.d.s of the mean?
About 99.7%
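These three percentages can be checked for a normal distribution. A minimal sketch, using the fact that the proportion within k standard deviations of the mean equals erf(k / √2); the function name `fraction_within` is an assumption for illustration.

```python
import math

def fraction_within(k):
    """Proportion of a normal distribution within k s.d.s of the mean."""
    return math.erf(k / math.sqrt(2.0))

# Roughly 0.683, 0.954 and 0.997 for k = 1, 2, 3
proportions = [fraction_within(k) for k in (1, 2, 3)]
```

These figures hold only when the data are normally distributed; skewed data can depart from them.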
Parametric tests assume that the mean and standard deviation…
Accurately represent the population distribution
Data can be transformed by…
Performing the same mathematical operation(s) on all the values recorded
Data transformation is useful for…
+ reducing the impact of outliers/skew
+ standardisation, e.g. z scores are a consistent, universal unit
+ removing non-linear effects
+ theoretical reasons, e.g. using different measures to better understand the data
+ making the data normally distributed so that parametric tests can be used.
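As a rough illustration of reducing skew, the sketch below applies a log transformation (one common choice, assumed here; the lecture does not name a specific transformation) to made-up, strongly right-skewed values.

```python
import math
import statistics

# Made-up, strongly right-skewed values
raw = [1, 10, 100, 1000, 10000]

# The same operation (log base 10) is applied to every value
logged = [math.log10(v) for v in raw]

# Gap between mean and median: large before, essentially zero after
gap_before = statistics.mean(raw) - statistics.median(raw)
gap_after = statistics.mean(logged) - statistics.median(logged)
```

Before the transformation the mean sits far above the median; afterwards the two coincide, which indicates the skew has been removed for this example.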
The z score is calculated by…
Subtracting the mean from each score and dividing the result by the standard deviation
The z score tells us…
How many standard deviations a score is above or below the mean (the mean itself has a z score of 0)
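The z score calculation can be sketched as follows. The function name and the scores are hypothetical, and the sample standard deviation is used here as an assumption; some courses use the population formula instead.

```python
import statistics

def z_scores(values):
    """Convert raw scores to z scores: (score - mean) / s.d."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample s.d. (an assumption of this sketch)
    return [(v - mean) / sd for v in values]

scores = [52, 60, 68, 75, 95]  # made-up raw scores
standardised = z_scores(scores)
```

After standardising, the scores have a mean of 0 and a standard deviation of 1, which is why z scores work as a consistent unit across different measures.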
Sampling error is the difference…
Between the mean of a sample and the true mean of the population (which is also why sample means differ from one another)
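Sampling error can be seen in a small simulation. This is a sketch under assumed values: a simulated population of 10,000 scores (mean 100, s.d. 15), five samples of 25, and a fixed seed so the run is repeatable.

```python
import random
import statistics

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical population of 10,000 scores
population = [rng.gauss(100, 15) for _ in range(10_000)]
true_mean = statistics.mean(population)

# Draw five samples of 25 and record each sample mean
sample_means = [statistics.mean(rng.sample(population, 25)) for _ in range(5)]

# Sampling error: each sample mean minus the true population mean
sampling_errors = [m - true_mean for m in sample_means]
```

Each sample mean misses the true mean by a different amount, and the sample means also differ from each other.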