Module 8 Flashcards
What is a parameter?
A constant that describes a population characteristic; value may be unknown.
What is a statistic?
A random variable that describes a characteristic of a sample; depends on the chosen sample.
Describes a characteristic of a sample.
A random variable, as it depends on the chosen sample.
What is a population parameter?
Describes a characteristic of the population.
It is constant, but its value may be unknown.
Sample proportion vs sample mean
Sample Proportion: Proportion of observations with a specific characteristic.
Formula: p=x/n
Example: Proportion of left-handed people.
Sample Mean: Average value in a sample. Example: Average annual income.
Why use sample statistics?
Sample statistics are used to estimate unknown population parameters.
A single population can have many possible samples of a given size.
Confidence interval for population mean (standard deviation known)
Use sample data to estimate the population mean with a certain level of confidence. Example: 90% confidence interval.
Estimator vs estimate
Estimator: A rule/formula used to estimate a parameter (e.g., sample mean).
Estimate: A specific calculated value of the parameter from sample data.
What is alpha (a)?
The allowed probability of error in a statistical test.
Also known as the level of significance (introduced in Chapter 9).
Confidence coefficient vs confidence level
Confidence Coefficient: The proportion of confidence intervals that would contain the true parameter if repeated sampling occurred (e.g., 0.95).
Confidence Level: The percentage representation of the confidence coefficient (e.g., 95%).
Key features of t-distribution
Symmetric around a mean of zero but with broader tails than the normal (Z) distribution.
The shape depends on degrees of freedom (df).
Fewer df → broader tails.
As df increases, the t-distribution approaches the Z-distribution.
Also called “Student’s t” or t-scores.
When to use the t-distribution vs z-distribution
Use z-distribution when the population standard deviation (σ) is known.
Use t-distribution when σ is unknown and sample size is small.
What factors influence confidence interval width?
Sample Size: Larger samples → narrower confidence intervals.
Confidence Level: Higher confidence levels (e.g., 99%) → wider intervals.
Population Variability: Greater variability → wider intervals.
Confidence interval for population mean when standard deviation is unknown
Use the t-distribution instead of the z-distribution.
The formula for the confidence interval:
x±t×s/sqrt n
x bar = sample mean
t = critical t-value based on degrees of freedom (df)
s =. sample standard deviation
n = sample size
What is the formula for degrees of freedom (df)?
df = n – 1