Module 7, Confidence Intervals Flashcards
Confidence Level:
how likely our parameter is to lie within the confidence interval
Margin of Error:
estimate of the maximum amount of difference we think is possible between our statistic and its corresponding parameter
Why is a confidence interval inferential?
It is an inferential statistic because sample statistics are used to estimate the location of population parameters (GENERALIZING)
Point vs. interval estimate
Point Estimate: single statistic that is used as our best estimate of a corresponding parameter (the value in a population from which you drew that estimate), GREATER PRECISION, LESS ACCURACY (very specific single stat)
High precision: a point estimate gives us one specific figure
DISADVANTAGE OF POINT ESTIMATE: low accuracy (accuracy = freedom from error, i.e., how little our estimate differs from the true value of the parameter of interest)
Interval Estimate: provides us with a RANGE of values within which the population parameter is likely to fall; GREATER ACCURACY, LESS PRECISION (the range is more likely to capture the parameter, but it is less precise because of how big the range can be)
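Worked example (a minimal sketch, not part of the module): the sample values below are made up, and the interval uses a normal (z) critical value for simplicity. With a sample this small, a t critical value would usually be used instead.

```python
import math
import statistics

# Hypothetical sample: number of friends reported by 10 respondents.
sample = [210, 225, 198, 240, 232, 215, 205, 228, 219, 221]
n = len(sample)

# Point estimate: one specific figure (precise, but almost surely a bit off).
point_estimate = statistics.mean(sample)

# Standard error of the mean, using the sample standard deviation.
std_error = statistics.stdev(sample) / math.sqrt(n)

# z critical value for 95% confidence (about 1.96). Assumption: normal
# approximation; a t value would be more appropriate for n = 10.
z = statistics.NormalDist().inv_cdf(0.975)
margin_of_error = z * std_error

# Interval estimate: a range (less precise, more likely to hold the parameter).
interval_estimate = (point_estimate - margin_of_error,
                     point_estimate + margin_of_error)

print("point estimate:", point_estimate)
print("interval estimate:", interval_estimate)
```

The point estimate is the center of the interval; the margin of error is how far the interval extends on each side.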
Sampling Theory and level of Confidence
Sampling Theory: a well-drawn probability sample, where every element has a known probability of being selected, normally involving a random mechanism, will yield results that closely resemble those we would get if we measured the ENTIRE population
Law of Large Numbers:
states that with a sufficiently large sample size, sample statistics (usually means) will tend to approximate population parameters very closely (THEY SHOULD BE VERY SIMILAR)
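Simulation sketch of the Law of Large Numbers (illustrative only): the population here is invented, with a deliberately skewed shape, and `sample_mean_error` is a hypothetical helper name. Any single run can bounce around, but the gap between the sample mean and the population mean tends to shrink as the sample gets larger.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

# Hypothetical population of 100,000 values with a skewed (exponential) shape.
population = [random.expovariate(1 / 50) for _ in range(100_000)]
population_mean = statistics.mean(population)

def sample_mean_error(n):
    """Absolute gap between one random sample's mean and the population mean."""
    sample = random.sample(population, n)  # simple random sample, no replacement
    return abs(statistics.mean(sample) - population_mean)

for n in (10, 100, 1_000, 10_000):
    print(f"n = {n:>6}: gap = {sample_mean_error(n):.2f}")
```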
Confidence Level:
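specifies the probability that an interval estimate built from a random sample will contain the population parameter (a statement about how often the method works across repeated samples, not a guarantee about any one interval)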
95% Confidence Level:
if a large number of samples is collected and a confidence interval is created for each sample, approximately 95% of these intervals will contain the population mean
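The repeated-sampling idea can be checked with a small simulation (a sketch under stated assumptions: a normal population with made-up mean 220 and SD 40, samples of 100, and z-based intervals). The share of intervals that capture the population mean should come out close to 95%.

```python
import math
import random
import statistics

random.seed(1)  # fixed seed so the run is repeatable

POP_MEAN, POP_SD = 220, 40   # hypothetical population values
N, TRIALS = 100, 2_000       # sample size and number of repeated samples
z = statistics.NormalDist().inv_cdf(0.975)  # about 1.96 for 95% confidence

def interval_covers_mean():
    """Draw one sample, build its 95% interval, report whether it holds POP_MEAN."""
    sample = [random.gauss(POP_MEAN, POP_SD) for _ in range(N)]
    mean = statistics.mean(sample)
    moe = z * statistics.stdev(sample) / math.sqrt(N)
    return mean - moe <= POP_MEAN <= mean + moe

coverage = sum(interval_covers_mean() for _ in range(TRIALS)) / TRIALS
print(f"{coverage:.1%} of intervals contained the population mean")
```

Each individual interval either contains the population mean or it does not; the 95% describes the long-run share of intervals that do.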
Confidence interval vs. confidence level
Confidence Interval: range of values within which we estimate that a population parameter will fall (LOWER BOUND AND UPPER BOUND OF AN INTERVAL WITHIN WHICH WE ESTIMATE A POPULATION PARAMETER WILL FALL, E.G., MEAN NUMBER OF FRIENDS IS BETWEEN 200 AND 240)
Confidence Level: specifies the PROBABILITY that the population parameter will lie within that specified range of values
What does a higher level of confidence mean?
HIGHER LEVEL OF CONFIDENCE = WIDER CONFIDENCE INTERVAL
LOWER LEVELS OF CONFIDENCE: HAVE A SMALLER INTERVAL BECAUSE THE TRADE OFF FOR HAVING A SMALLER AND MORE PRECISE RANGE OF VALUES IN WHICH WE THINK OUR PARAMETER LIES IS THAT WE CAN BE LESS SURE THAT WE WOULD CAPTURE IT IN SUBSEQUENT SAMPLES
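The trade-off can be seen by computing intervals at several confidence levels from the same data (a minimal sketch; the summary statistics are made up and `confidence_interval` is a hypothetical helper using the normal approximation).

```python
import math
import statistics

# Hypothetical summary statistics from one sample.
sample_mean, sample_sd, n = 220, 40, 100
std_error = sample_sd / math.sqrt(n)

def confidence_interval(level):
    """Return (lower, upper) bounds for the given confidence level, e.g. 0.95."""
    # Two-sided interval: put half of the leftover probability in each tail.
    z = statistics.NormalDist().inv_cdf(0.5 + level / 2)
    moe = z * std_error
    return sample_mean - moe, sample_mean + moe

for level in (0.90, 0.95, 0.99):
    low, high = confidence_interval(level)
    print(f"{level:.0%}: ({low:.1f}, {high:.1f}), width {high - low:.1f}")
```

Same sample, same mean: only the confidence level changes, and the interval widens as the level goes up.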
Sampling error vs. margin of error
Sampling Error: we can reasonably assume that there will always be at least some gap between our sample statistics and their respective population parameters
Margin of Error: estimate of the AMOUNT OF DIFFERENCE that we think is possible between our statistic and its corresponding parameter
Sampling theory and sample size
SAMPLING THEORY: the larger the sample size, the LESS sampling error we expect, which means a smaller margin of error (to know the actual sampling error, we would need to know the population parameter)
CLT
states that when the sample size is large enough, the sampling distribution of the mean (the distribution of sample means across repeated samples) will be approximately normal, and the mean of all those sample means will approximate the mean of the population
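Simulation sketch of the CLT (illustrative, with invented numbers): repeated samples of 100 are drawn from a strongly skewed exponential population with mean 50. The sample means should center on 50, have a spread near 50 / sqrt(100) = 5, and roughly 95% of them should land within 1.96 standard errors of the population mean, as a normal curve predicts.

```python
import math
import random
import statistics

random.seed(2)  # fixed seed so the run is repeatable

# Skewed hypothetical population: exponential with mean 50 (its SD is also 50).
POP_MEAN = 50
N, REPEATS = 100, 5_000

# One mean per repeated sample: together these form the sampling distribution.
sample_means = [
    statistics.mean([random.expovariate(1 / POP_MEAN) for _ in range(N)])
    for _ in range(REPEATS)
]

center = statistics.mean(sample_means)    # should be close to POP_MEAN
spread = statistics.stdev(sample_means)   # should be close to 50 / sqrt(N)
expected_spread = POP_MEAN / math.sqrt(N)

# Share of sample means within 1.96 standard errors of the population mean.
within = sum(abs(m - POP_MEAN) <= 1.96 * expected_spread
             for m in sample_means) / REPEATS

print(f"mean of sample means: {center:.2f}")
print(f"spread of sample means: {spread:.2f} (theory: {expected_spread:.2f})")
print(f"share within 1.96 standard errors: {within:.1%}")
```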
CLT and increasing sample size
INCREASING SAMPLE SIZE: HAS THE SAME EFFECT AS REPEATING SAMPLES OVER AND OVER AGAIN, IT REDUCES THE MARGIN OF ERROR
AS SAMPLE SIZE INCREASES, THE MARGIN OF ERROR VALUE BECOMES SMALLER AND THE CONFIDENCE INTERVAL BECOMES MORE NARROW
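A quick calculation shows how fast the margin of error shrinks (a minimal sketch assuming a made-up sample SD of 40, a 95% confidence level, and the usual formula z * SD / sqrt(n); `margin_of_error` is a hypothetical helper name).

```python
import math
import statistics

sample_sd = 40  # hypothetical sample standard deviation
z = statistics.NormalDist().inv_cdf(0.975)  # about 1.96 for 95% confidence

def margin_of_error(n):
    """Margin of error for the mean at 95% confidence with sample size n."""
    return z * sample_sd / math.sqrt(n)

for n in (25, 100, 400, 1_600):
    moe = margin_of_error(n)
    print(f"n = {n:>5}: margin of error = {moe:.2f}, interval width = {2 * moe:.2f}")
```

Because n sits under a square root, quadrupling the sample size only cuts the margin of error in half.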
Two assumptions when using confidence intervals
First assumption: you have used simple random sampling
Second assumption: we have a normal probability distribution; this is crucial because confidence intervals rely on the Central Limit Theorem in order to make an interval estimate