mtehts Flashcards
Critical Region
A critical region is the range of values of the test statistic that would lead us to reject 𝐻0. The threshold value between the critical region and acceptance region is the critical value.
How will a distribution change when the intervals change?
We don’t know the distribution of the test statistic within the interval.
Type I Error
Concluding that there is evidence for rejecting the null hypothesis, when in reality the null hypothesis is true.
Probability is the probability of rejecting the null hypothesis. (Significance level/s)
Type II Error
Concluding that there is evidence to not reject the null hypothesis, when in reality the null hypothesis is false.
Probability of accepting the null hypothesis with the true probability (Probability of the acceptance region w/ true probability)
Sampling Types
Systematic Sampling
Selecting sampling units from a population at a fixed periodic interval.
- Bias-free
- Requires a sampling frame
- May lead to inaccuracies if population size is unknown
Simple Random Sampling
Selecting sampling units randomly, each unit has an equal chance of being selected.
- Bias-free
- Requires a sampling frame
- Is unspecific; can be used on any dataset with no other steps or special knowledge required.
Opportunity Sampling
Selecting sampling units based on what is accessible and convenient at the time and place the investigation is taking place.
- Very simple to carry out, inexpensive
- Subject to selection bias will make results not accurate
Sampling Frame
Indexed list of all sampling units in a population that is being tested.
Significance Level
The threshold value between the acceptance and critical regions (p-value of test statistic less than or equal to the significance level leads to the conclusion that there is evidence for rejecting the null hypothesis)
Probability Distributions & Conditions for each
Binomial:
Each event is independent and has constant probability.
Each event has only two outcomes, success or failure.
There is a fixed number of trials.
Poisson:
There is a constant average rate that the events occur at.
Events don’t occur simultaneously.
Each event is independent.
Discrete Uniform:
Each outcome has equal probability. (e.g dice roll or choosing numbered cards)
Skewness
Positive skew:
Bulk of test statistics are nearer 0, distribution tails away from 0.
(mean > median > mode)
Negative skew:
Bulk of test statistics are further from 0, distribution tails towards 0.
(mode > median > mean)
Correlation Terms
Strong positive
Weak positive
Strong negative
Weak negative
Zero correlation
Measures of Central Tendency
Mean
Median
Mode
Measures of Variability
Range
Interquartile Range
Standard Deviation
Variance
Standard Deviation Equations
For discrete test statistics:
σ² = Σ( (x-x̅)² ) / N
For frequency distributions:
σ² = ( (Σfx²) / (Σf) ) - x̅²
P-Value
Probability of obtaining a test result based on the test’s binomial model under the assumption of the null hypothesis being true.
Interpreting Regression Line Equation
Gradient: As X increases by 1, Y increases by m 𝐨𝐧 𝐚𝐯𝐞𝐫𝐚𝐠𝐞.
Intercept: When X is 0, Y is C