Statistics Flashcards
What is a Sample Statistic?
Is a quantity that describes some characteristic of a sample with respect to a specific variable.
Pro and Con of Median
Pro: Is insensitive to extreme scores in the data set
Con: Doesn’t reflect the shape of the scores – i.e. doesn’t care how far away extreme scores are
Pro and Con of Mode
Pro: Easy to calculate from a histogram and easy to understand – the most common value.
Con: Data set might have more than 1 mode or no mode at all
2 Types of Non-Normally Distributed Data
Skewed
Bimodal
Define Conditional Probability
Probability of an event given that something else is known/assumed, i.e. when given/assuming some other additional information.
What does the z-score measure?
z measures how far away your sample is from the population mean in multiples of the standard deviation (how many standard deviations away is your sample from the mean)
Define sampling Error.
The error associated with examining statistics calculated from a sample rather than the population
Occurs because in our sample we do not have all the members of the population
Pop. Parameters and sample statistics differ due to sampling error.
How does sample size effect magnitude of sample error?
BIGGER SAMPLE = BIG SAMPLING ERROR LESS LIKELY
SMALLER SAMPLE = BIG SAMPLING ERROR MORE LIKELY
Define Sampling Distribution.
A distribution of a sample statistic (e.g. mean, s.d., median, etc…) obtained by repeatedly sampling from a population.
Tells us important information about how a statistic changes from sample to sample
Define Sampling Distribution of the Mean (SDM)
The sampling distribution of the mean describes a distribution ofsample means derived from samples of size N from a parent population.
The standard deviation of this distribution is commonly referred to as the standard error of the mean or standard error for short.
Define the Central Limit Theorem.
The sampling distribution of the mean approaches a normal distribution, as the sample size increases.
As a sample size increases, the sample mean and standard deviation will be closer in value to the population mean μ and standard deviation σ.
A sufficiently large sample can predict the parameters of a population such as the mean and standard deviation.
What is a Confidence interval?
A confidence interval (CI) describes an interval (i.e. a range) of values for our population parameter, together with a specified level of confidence that the parameter is in that range
It is simply a way to measure how well your sample represents the population you are studying.
What is Type 1 Error
When we reject H0 when it is in fact true. (H1 is False)
Type 1 errors will occur naturally (i.e. just due to random sampling error) with probability p = a (i.e. 0.05)
What is Type 2 Error?
We Fail to reject H0 when it is in fact incorrect. (H1 is True)
Why does Type 1 Error Occur?
Type I errors occur because even if your p-value is small there is still a (small) chance that your data was unusually extreme (and so you rejected the NULL) just due to sampling error.