Sampling Flashcards
Model
representation of a system, in some form other than that of the object itself
Mean and standard Dev.
Sample mean= sum of all over population
The sample mean will always be in the sample but can be outside the population mean. Use just N we will be underestimating
st dev= (observed - mean)^2/ N
parameter
Sample
= /n - 1
statistic
n−1 is used in the denominator of the sample standard deviation formula to provide an unbiased estimate of the population variance and to account for the loss of a degree of freedom when estimating the sample mean.
Variance is squared bc don’t want things above the mean to cancel out below the mean and then sq rooted to revert to the og units
Distribution
Divide the range of data into intervals
Label the x-axis according to chosen intervals
Find the frequency of occurrence in each interval
Label y-axis according to the frequency of occurrence
Plot!
Goodness of Fit
Compare the histogram of observed data to the shape of the candidate distribution
Chi-squared etc.
Find the distribution with the best fit
Chi Squared
Chi squared =
χ² = Σ [(O - E)² / E]
-
Formulate Hypotheses:
- Null Hypothesis (H0): There is no significant relationship between gender and preferred smartphone brand.
- Calculate Expected Frequencies: calculate the expected frequencies for each cell of the contingency table.
- Compute Chi-Squared Statistic: Use the formula to calculate the chi-squared statistic.
- Determine Degrees of Freedom: (r - 1)(c - 1), where r is the number of rows and c is the number of columns in the contingency table
- Find Critical Value or P-value: Look up the critical value for chi-squared distribution with 2 degrees of freedom at your chosen significance level (usually 0.05) or calculate the p-value using statistical software.
- Make a Decision: If the calculated chi-squared value is greater than the critical value or the p-value is less than 0.05, reject the null hypothesis.
Standard Error
σ / √n
Confidence interval +1.96 σ / √n
Hypothesis Testing
Assume initial hypothesis is true
Calculate a plausible range of values
If sample is not in range, reject the hypothesis
T-Stat Hypothesis Testing
How many standard errors the value is from the null hypothesis for the mean
test stat - mean / standard error
> 1.96 then reject
Comparison of Samples
H0: mean - mean = 0 (test stat)
Approach: – measure the difference between the means of each group – calculate st. error for this statistic – perform a hypothesis test
St. Error = (st. error^2 + st. error)
root
Parameter Vs Statistic
Parameter: fixed value calculated from every individual in the population
Stat: Calculated from only some of the populayion