Exam Review Units 1-6 Flashcards
What’re individuals?
individuals are objects described by a set of data
What’re variables?
variables are any characteristic of an individual
What’re categorical variables?
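categorical variables are qualitative: they place an individual into one of several groups or categories
What’re quantitative variables?
quantitative variables are numerical: they take number values for which arithmetic such as averaging makes sense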
What’s an outlier?
An outlier is an individual observation that falls outside the overall pattern of the data.
What’s nonresistance?
nonresistance is when extreme observations strongly affect the value of a measure.
Examples of nonresistance?
mean and standard deviation (the median and interquartile range, by contrast, are resistant)
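A minimal sketch of nonresistance, using a made-up data set (the scores and the outlier value 200 are assumptions for illustration only). Adding one extreme observation moves the mean and standard deviation a lot, while the median barely moves.

```python
from statistics import mean, median, stdev

# Hypothetical data, invented for this illustration.
scores = [70, 72, 75, 78, 80]
with_outlier = scores + [200]  # same data plus one extreme observation

# How much each measure changes when the outlier is added.
mean_shift = mean(with_outlier) - mean(scores)
median_shift = median(with_outlier) - median(scores)
sd_shift = stdev(with_outlier) - stdev(scores)

print("mean shift:  ", round(mean_shift, 2))
print("median shift:", round(median_shift, 2))
print("sd shift:    ", round(sd_shift, 2))
```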
How can you display distributions with graphs?
displaying distributions with graphs:
- dotplots
- histograms
- stemplots
- time plots
- bar graphs
How to describe distributions with numbers? (important)
describing distributions with numbers:
- measures of center: mean, median, and mode
- measures of spread: range, interquartile range, standard deviation
- measuring position: quartiles, percentiles
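As a rough sketch, the measures above can be computed with Python's standard `statistics` module. The data set is hypothetical, and the quartiles use the module's default "exclusive" method; textbooks and calculators use slightly different quartile conventions, so other tools may give somewhat different values.

```python
from statistics import mean, median, mode, stdev, quantiles

# Hypothetical data, invented for this illustration.
data = [2, 4, 4, 5, 7, 9, 11]

# Measures of center.
center = {"mean": mean(data), "median": median(data), "mode": mode(data)}

# Measuring position: quartiles (default method="exclusive").
q1, q2, q3 = quantiles(data, n=4)

# Measures of spread.
spread = {
    "range": max(data) - min(data),
    "iqr": q3 - q1,
    "sd": stdev(data),  # sample standard deviation
}

print(center)
print("quartiles:", q1, q2, q3)
print(spread)
```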
What’s a percentile?
the percentile of an observation is the percent of the distribution that is at or to the left of the observation.
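A small sketch of that definition: count the values at or below the observation and divide by the total. The function name `percentile_of` and the test scores are hypothetical, chosen only for illustration.

```python
def percentile_of(value, data):
    """Percent of the data that is at or to the left of `value`."""
    at_or_below = sum(1 for x in data if x <= value)
    return 100 * at_or_below / len(data)

# Hypothetical test scores, invented for this illustration.
test_scores = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]

# Six of the ten scores are at or below 80.
print(percentile_of(80, test_scores))
```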
What does the normal distribution look like? (4) (important)
normal distribution:
- symmetric
- one peak
- bell shaped
- described by its mean and standard deviation.
What does the 68-95-99.7 rule apply to?
68-95-99.7 rule applies to NORMAL DISTRIBUTIONS.
What’s the 68-95-99.7 rule?
68-95-99.7 rule:
- 68% of the observations fall within 1 standard deviation of the mean
- 95% of the observations fall within 2 standard deviations of the mean
- 99.7% of the observations fall within 3 standard deviations of the mean
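The three percentages can be checked against the standard normal distribution. This is a sketch using `statistics.NormalDist` from the Python standard library; the helper name `within` is an assumption for illustration. The results should come out at about 68.3%, 95.4%, and 99.7%.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal: mean 0, standard deviation 1

def within(k):
    """Proportion of a normal distribution within k standard deviations of the mean."""
    return z.cdf(k) - z.cdf(-k)

for k in (1, 2, 3):
    print(k, "sd:", round(100 * within(k), 1), "%")
```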
How to determine if data is normal?
- for histograms, stemplots, and box plots: look for a symmetric shape with no outliers (a single-peaked bell shape in histograms and stemplots)
- for normal probability plots: look for a straight line.
What’s a parameter?
a parameter is a number that describes the population.
Do we know the parameters?
NO! We do not.
What’s a statistic?
a statistic is a number that we can find using the sample data without using unknown parameters.
What do we use a statistic for?
we use a statistic to estimate the unknown parameter
What’s the sampling distribution?
the sampling distribution is the distribution of values taken by a statistic in all possible samples of the same size from a population.
When is a statistic unbiased?
a statistic is unbiased if the mean of its sampling distribution is equal to the true value of the parameter you’re estimating.
What’s the variability of a statistic?
the variability of a statistic is the SPREAD of its SAMPLING DISTRIBUTION.
What determines the spread of a sampling distribution?
These determine the spread of a sampling distribution:
- sampling design
- size of the sample
How does sample size affect the spread?
as the sample size increases, the spread decreases.
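A simulation sketch of the last few cards, under assumed conditions: a normal population with mean 50 and standard deviation 10 (both hypothetical), with the sample mean as the statistic. The sample means should center near 50 for both sample sizes (unbiased), and their spread should be clearly smaller for n = 100 than for n = 10, roughly σ/√n in each case.

```python
import random
from statistics import fmean, stdev

random.seed(1)  # fixed seed so the run is repeatable

POP_MEAN, POP_SD = 50, 10  # hypothetical population parameters

def sample_means(n, repeats=2000):
    """Draw `repeats` random samples of size n and return each sample's mean."""
    return [
        fmean([random.gauss(POP_MEAN, POP_SD) for _ in range(n)])
        for _ in range(repeats)
    ]

means_n10 = sample_means(10)
means_n100 = sample_means(100)

# Center of each simulated sampling distribution (should be near 50).
print("center, n=10: ", round(fmean(means_n10), 2))
print("center, n=100:", round(fmean(means_n100), 2))

# Spread of each simulated sampling distribution (smaller for larger n).
print("spread, n=10: ", round(stdev(means_n10), 2))
print("spread, n=100:", round(stdev(means_n100), 2))
```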
When does the margin of error of a confidence interval get smaller? (3)
the margin of error of a confidence interval gets smaller when:
- the confidence level “C” decreases
- the population standard deviation decreases
- the sample size “n” increases.
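The three cases can be checked numerically. This sketch assumes a z confidence interval for a population mean with known σ, where the margin of error is z* · σ / √n; the function name and all numbers are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def margin_of_error(confidence, sigma, n):
    """Margin of error z* * sigma / sqrt(n) for a z interval (sigma assumed known)."""
    z_star = NormalDist().inv_cdf((1 + confidence) / 2)  # critical value z*
    return z_star * sigma / sqrt(n)

# Hypothetical baseline: 95% confidence, sigma = 10, n = 25.
base = margin_of_error(0.95, sigma=10, n=25)

print("baseline:         ", round(base, 2))
print("lower confidence: ", round(margin_of_error(0.90, sigma=10, n=25), 2))
print("smaller sigma:    ", round(margin_of_error(0.95, sigma=5, n=25), 2))
print("larger sample:    ", round(margin_of_error(0.95, sigma=10, n=100), 2))
```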
Why do you do a test of hypothesis?
you do a test of hypothesis to assess the evidence provided by data against a null hypothesis H0 in favor of an alternative hypothesis Ha.
For hypothesis tests of significance, what’s H0?
H0 is the null hypothesis
For hypothesis tests of significance, what’s Ha?
Ha is the alternative hypothesis
What’s a test of significance based on?
a test of significance is based on a TEST STATISTIC.
What’s the p-value?
the p-value is the probability that the test statistic will take a value at least as extreme as the observed value if the NULL H0 is TRUE.
What do small p-values tell you?
small p-values indicate strong evidence against the null hypothesis H0.
What happens if the p-value is equal to or less than the alpha “a”?
if the p-value is equal to or less than the alpha “a,” the data are STATISTICALLY SIGNIFICANT at the alpha “a” significance level
When are the data STATISTICALLY SIGNIFICANT at the alpha “a” significance level?
the data are STATISTICALLY SIGNIFICANT at the alpha “a” significance level when the p-value is less than or equal to alpha “a.”
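A worked sketch of a p-value and the significance decision, assuming a one-sided z test for a mean with known σ. All numbers (H0: μ = 100, Ha: μ > 100, σ = 15, n = 100, sample mean 103) are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

# Hypothetical setup: H0: mu = 100 versus Ha: mu > 100, sigma assumed known.
mu_0 = 100
sigma = 15
n = 100
x_bar = 103  # observed sample mean

# Test statistic: how many standard deviations x_bar lies above mu_0.
z = (x_bar - mu_0) / (sigma / sqrt(n))

# p-value: probability of a test statistic at least this large if H0 is true.
p_value = 1 - NormalDist().cdf(z)

def is_significant(p, alpha):
    """Statistically significant at level alpha when p <= alpha."""
    return p <= alpha

print("z =", z, " p-value =", round(p_value, 4))
print("significant at 0.05:", is_significant(p_value, 0.05))
print("significant at 0.01:", is_significant(p_value, 0.01))
```

With these made-up numbers the same data are significant at one level but not at the stricter one, which is why the level should be stated along with the conclusion.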
What’s a type 1 error? (important)
a type 1 error is when we reject the null hypothesis H0 when it is TRUE.
What’s a type 2 error? (important)
a type 2 error is when we fail to reject the null hypothesis H0 when in fact the alternative hypothesis Ha is true.
What’s the power of a significance test?
The power of a significance test measures its ability to detect an alternative hypothesis
What’s the power against a specific alternative?
the power against a specific alternative is the probability that the test will reject the null H0 when the alternative is true.
In a fixed level alpha “a” test, what’s the level alpha “a”?
In a fixed level alpha “a” test, the level alpha “a” is the probability of a TYPE ONE (1) error
In a fixed level alpha “a” test, what’s the power against a specific alternative?
In a fixed level alpha “a” test, the power against a specific alternative is 1 - beta “b”
- beta is the probability of a Type TWO (2) error
In a fixed level alpha “a” test, what’s beta “b”?
In a fixed level alpha “a” test, beta “b” is the probability of a TYPE TWO (2) error
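A sketch tying alpha, beta, and power together, assuming a one-sided z test with known σ. The numbers (H0: μ = 100, specific alternative μ = 104, σ = 15, n = 100, alpha = 0.05) are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

# Hypothetical setup: H0: mu = 100 versus Ha: mu > 100, sigma assumed known.
mu_0, mu_alt = 100, 104       # mu_alt is the specific alternative
sigma, n, alpha = 15, 100, 0.05
se = sigma / sqrt(n)          # standard deviation of the sample mean

# Reject H0 when the sample mean exceeds this cutoff.
cutoff = NormalDist(mu_0, se).inv_cdf(1 - alpha)

# Type 1 error: reject H0 when H0 is true. Its probability is alpha.
type_1_prob = 1 - NormalDist(mu_0, se).cdf(cutoff)

# Type 2 error: fail to reject H0 when the alternative is true.
beta = NormalDist(mu_alt, se).cdf(cutoff)

# Power against the specific alternative.
power = 1 - beta

print("cutoff:", round(cutoff, 2))
print("P(type 1 error):", round(type_1_prob, 3))
print("beta:", round(beta, 3), " power:", round(power, 3))
```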
What’s the standard error of a statistic?
The standard error of a statistic is the standard deviation of the statistic, estimated from the data.
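For the sample mean, the standard error is commonly computed as s / √n, where s is the sample standard deviation. A minimal sketch with a made-up sample:

```python
from math import sqrt
from statistics import stdev

# Hypothetical sample, invented for this illustration.
sample = [12, 15, 11, 14, 13, 16, 10, 13]

s = stdev(sample)                        # sample standard deviation
standard_error = s / sqrt(len(sample))   # estimated sd of the sample mean

print("s =", s, " standard error =", round(standard_error, 4))
```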
When do you use a t distribution?
you use a t distribution when you don’t know the population’s standard deviation σ and estimate it with the sample standard deviation s.