Hypothesis Testing: Types of Hypothesis Tests Flashcards
T-Test
- tests for a Student’s t-distribution
- In a normally distributed population where standard deviation is unknown and sample size is comparatively small
- Pared t-test compare two samples
One Sample T Hypothesis Test (Student’s T Test)
- Used to resolve hypothesis tests around comparing process means.
- The underlying chart makes use of the T distribution
- Student’s T distribution leverages the T distribution and is used for finding confidence intervals for the population mean when the same size is less than 30 and the population standard deviation is unknown
-
What is a T Distribution?
- The T distribution and T tests are used to determine the likelihood of various conditions occurring for small sample sizes (< = 30)
-
What is a T Distribution?
What is the T Distribution Table?
- The t distribution table values are critical values of the t distribution.
- The column header are the t distribution probabilities (alpha)
- The row names are the degrees of freedom (df)
- Student t tables gives the probability that the absolute t value with a given degree of freedom lies above the tabulate value
Student’s T Distribution Example
Student’s T distribution can be used for finding confidence intervals for the population mean when the sample size is less than 30 and the population standard deviation is unknown.
Confidence of a mean 1 Example
- At a soda bottling factor, the normal filling specification (goal) is 16 fluid ounces. A sample of 20 bottles is tested with the following results: x = 16.13 ox and s = 0.24 fl oz. What interval would allow you to say, with 95% confidence, that the interval contains the actual mean of the filling process?
What is a One Sample T Hypothesis Test?
- The One Sample T Hypothesis Test (Student’s T Test) allows to compare the (small) population mean to some hypothesized value or one sample mean if they are significantly different.
- For example, if we know the average weight of chickens in a farm is 3lbs, and compare the average weight of sample black hens to the population mean
When would you use a One Sample T Hypothesis Test?
- One sample t test is a type of parametric test because the assumption is samples are randomly distributed
- It tests whether the sample mean is significantly different than a population mean when the standard deviation of the population is unknown
- T test is used when the population standard deviation is unknown and the sample size is below 30, otherwise use Z-test (for known variance)
Assumptions of One Sample T Hypothesis Test
- Data is continuous and quantitative at the scale level (ratio or interval data)
- The sample should be randomly selected from the population
- Samples are independent to each other
- Data should follow normal probability distribution
- Assumes it don’t have extreme outliers in the dependent variable
One Sample T Hypothesis Test Formula
- Where
- x̅ is observed sample mean
- μ0 is population mean
- s is sample standard deviation
- n is the number of the observations in the sample
Steps to Calculate One Sample T Hypothesis Test
- State the claim for the test and determine the null hypothesis and alternative hypothesis
- Determine the level of significance
- Calculate the degrees of freedom (df = N - 1)
- Find the critical value
- Calculate the test statistics
- Make a decision, the null hypothesis will be rejected if the test statistic is in the rejection region
- Finally, interpret the decision in the context of the original claim
Chi Square Distribution
Best Method to test a population variance against a known or assumed value of the population variance.
Continuous distribution with degrees of freedom
Uses goodness of fit of a distribution of data, whether data series are independent, and for estimating confidences surrounding variance and standard deviation for a random variable from a normal distribution
ANOVA Analysis of Variation
- Parametric statistical technique used to compare the data sets
- ANOVA is best applied where more than 2 populations or samples are meant to be compared
- It is used to text statistical significance of the relationship between a dependent variable (Y) and a single or multiple independent variables (X’s)
Types of ANOVA
-
One-Way
- Measures single factor from multiple scores
- Uses only one technician / one measurer
-
Two-Way (without replicates)
- Measures 2 factors
- Uses only one technician (unless technicians are one of the factors)
-
Two-Way (with replicates)
- Measures 2 factors, but has multiple repetitions of each combination
- Uses only one technician (unless the technicians are one of the factors)
ANOVA Sum of Squares Correction Factor
- Grand total of all runs (G) = ΣX
- N= Total number of runs
- Correction factor (CF)= (ΣX)2 /N = (G)2 /N
Terms used in ANOVA
- Degrees of Freedom (df): The number of independent conclusions that can be drawn from the data
- SSFactor: It measures the variation of each group mean to the overall mean across all groups
- SSError: It measures the variation of each observation within each factor level to the mean of the level
- Main effect: A main effect is the effect where the performance of one variable considered in isolation by neglecting other variables in the study
- Interaction: An interaction effect occurs where the effect of one variable is different across levels of one or more other variables
- Mean Square Error (MSE): The mean square error (MSE) is divide the sum of squares of the residual error by the degrees of freedom
- F-test Statistic: The null hypothesis that the category means are equal in the population is tested by F statistic based on the ratio of mean square related to X and mean square related to error
- P-Value: It is the smallest level of significance that would lead to rejection of the null hypothesis (H0). If α = 0.05 and the p-value ≤ 0.05, then reject the null hypothesis, similarly if p-value > 0.05, then fail to reject the null hypothesis.
Analysis of Variance (ANOVA) has three types:
- One-Way Analysis
- Two-Way Analysis
- K-Way Analysis: L-Way ANOVA can be two-way ANOVA or three-way ANOVA or multiple ANOVA