Unit 6 section A/B Flashcards
Parameter
Number of a population
Population
Entire group of individuals
To estimate or predict a population parameter, we use a
confidence interval
To test a claim
we use a significance test
The process that scientists and data analysts use to make that conclusion comes from a process called
statistical inference
Statistic
The number that describe some characteristics of a sample
Confidence Intervals
an interval of numbers based on our sample proportion that gives us a range where we can expect to find the true population proportion.
A confidence interval will be based on three things:
sample proportion, sample size, and confidence level (usually 95%)
sample size must be
large enough that we can use a normal distribution to estimate our population proportion
As our confidence level increases, so does our __ , which in turn increases the range of our confidence interval
z*
When we are given a population parameter and we have some reason to believe that it is false, we can perform a _____ test to check if that value is correct.
significance
As we had with confidence intervals and sampling distributions, our significance test hinges on the fact that we must meet the three conditions of inference:
randomness, independence and normality.
To check if our sampling distribution is normal, we need to verify that the expected successes and expected failures of our study is at least 10. This is known as the
Large Counts Condition.
Large Counts Condition formula
np ≥ 10 and n(1-p) ≥ 10.
Confidence Interval “___”
“We are C% confident that the interval from ____ to ____ captures the [parameter in context]” (The word “all” must be included).
Confidence Level “__”
Gives the approximate % of confidence intervals that will capture the population parameter in repeated sampling in the same sample size.
“If we were to select many random samples of the same size from the same population and construct a c% confidence interval using each sample, about c% of the intervals would capture the [parameter in context]”
Conditions
- Random
- Independence - n ≤10%N
- Normal - using the Large Counts Condition, np and n(1-p)
Standard error
An estimate of the standard deviation of the sampling distribution of p-hat
Margin of error
Based on the critical value (z-score). The critical value is the number of standard deviations that the sample statistic is from the population parameter.
One Sample z-interval for Proportions
An appropriate procedure for constructing a confidence interval for a single population proportion based on a sample of categorical data. Uses a z-test to estimate the population proportion based on the sample proportion and the sample size.
The one-sample z-interval for a proportion is used when the following conditions are met:
- The sample is a random sample, or the sample size is large enough (usually n > 30) to use the normal approximation to the binomial distribution.
- The data are collected from a single categorical variable, such as a yes/no response or a dichotomous variable.
- The sample size is large enough to use the normal approximation to the binomial distribution.
- The population proportion is unknown and needs to be estimated from the sample data.
Calculating the Interval: Calculating a confidence interval is based on two things:
our point estimate and our margin of error.
Point Estimate
p-hat, The sample proportion is the estimate of the population proportion based on the sample data.