statistics exam 2 Flashcards
association
values of one variable tend to occur with certain values of another variable; detected when the conditional distributions differ from the marginal distribution and from each other.
bias
a condition where the mean of the statistic values differs from the parameter that the statistic estimates.
bivariate data
data collected on two variables for each individual in a study.
central limit theorem
the name of the statement telling us that the sampling distribution of x bar is approximately normal whenever the sample is large and random.
conditional distribution
the distribution of the values in a single row (or a single column) of a two-way table.
control chart
a statistical tool for monitoring the input or output of a process
control limits
μ - 3σ/√n and μ + 3σ/√n; used to detect out-of-control signals in a control chart.
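A minimal sketch of the control-limit arithmetic, assuming the process mean, process standard deviation, and sample size are known. The function name `control_limits` and the numbers used are made up for illustration.

```python
import math

def control_limits(mu, sigma, n):
    """Return (lower, upper) three-sigma control limits for a chart of sample means."""
    # 3 standard deviations of x bar, where sd of x bar = sigma / sqrt(n)
    margin = 3 * sigma / math.sqrt(n)
    return mu - margin, mu + margin

# Hypothetical process: mean 10, standard deviation 2, samples of size 4
lower, upper = control_limits(mu=10, sigma=2, n=4)
print(lower, upper)  # 7.0 13.0
```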
correlation coefficient
a measure of the strength and direction of the linear relationship between two quantitative variables.
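As a rough illustration, the correlation coefficient r can be computed by hand from paired data. This sketch assumes the usual Pearson formula; the helper name `correlation` and the data are invented for the example. Values of r fall between -1 and 1.

```python
import math

def correlation(xs, ys):
    """Pearson correlation coefficient r for paired quantitative data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sums of squared deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Made-up data: a perfectly increasing line and a perfectly decreasing line
r_up = correlation([1, 2, 3, 4], [2, 4, 6, 8])
r_down = correlation([1, 2, 3, 4], [8, 6, 4, 2])
print(r_up, r_down)
```

With these points lying exactly on a line, r comes out at 1 for the increasing case and -1 for the decreasing case, up to floating-point rounding.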
disjoint events
events that cannot occur simultaneously
distribution of a variable
a list of the possible values of a variable together with the frequency of each value (probabilities can be given instead of frequencies)
event
a single outcome or a combination of outcomes from a random phenomenon
extrapolation
predicting a Y value using a value of X that is outside of the range of X values used to obtain the regression equation. This prediction could be very far off.
inference
using results from a sample statistic value to draw conclusions about the population parameter.
influential observation
an observation that substantially alters the values of slope and y intercept in the regression equation when it is included in the computations.
law of large numbers
the fact that the average (x bar) of observed values in a sample gets closer and closer to μ as the sample size increases.
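One way to see this is a small simulation. The sketch below assumes a fair six-sided die, whose population mean is 3.5; the function name `mean_of_rolls` and the sample sizes are arbitrary choices for illustration.

```python
import random

random.seed(0)  # fixed seed so the run is repeatable

def mean_of_rolls(n):
    """Average of n simulated rolls of a fair six-sided die (population mean 3.5)."""
    return sum(random.randint(1, 6) for _ in range(n)) / n

# Larger samples should tend to give averages closer to 3.5
for n in (10, 1_000, 100_000):
    print(n, mean_of_rolls(n))
```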
laws of probability
the basis for hypothesis testing and confidence interval estimation
least squares
a method for finding the equation of a line that minimizes the sum of squared residuals.
least squares regression line
the line with the smallest sum of squared residuals
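A minimal sketch of fitting a least squares line by hand, using the standard formulas slope = Sxy / Sxx and intercept = mean of y minus slope times mean of x. The function names and the data points are invented for the example.

```python
def least_squares_line(xs, ys):
    """Return (slope, intercept) of the least squares regression line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def sum_squared_residuals(xs, ys, slope, intercept):
    """Sum of squared vertical distances between the points and a line."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

# Made-up data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
slope, intercept = least_squares_line(xs, ys)
best = sum_squared_residuals(xs, ys, slope, intercept)
# A line with a slightly different slope has a larger sum of squared residuals
other = sum_squared_residuals(xs, ys, slope + 0.1, intercept)
print(slope, intercept, best, other)
```

For this data the fitted line is approximately y = 2.2 + 0.6x, and any other line gives a larger sum of squared residuals.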
lurking variable
a variable that is not measured but explains the association between two variables that are measured.
marginal distribution
the distribution of the values in the “total” row (or the “total” column) of a two-way table
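The marginal and conditional distributions of a two-way table can be computed side by side. This is only a sketch: the table of counts, the group labels, and the helper name `proportions` are hypothetical.

```python
# Hypothetical two-way table of counts: rows are groups, columns are outcomes
table = {
    "group A": {"yes": 30, "no": 20},
    "group B": {"yes": 10, "no": 40},
}

def proportions(counts):
    """Turn a dict of counts into a dict of proportions."""
    total = sum(counts.values())
    return {key: value / total for key, value in counts.items()}

# Marginal distribution of the outcome: built from the "total" row
total_row = {}
for row in table.values():
    for outcome, count in row.items():
        total_row[outcome] = total_row.get(outcome, 0) + count
marginal = proportions(total_row)

# Conditional distributions of the outcome: one per row of the table
conditionals = {group: proportions(row) for group, row in table.items()}

print(marginal)
print(conditionals)
```

Here the conditional distributions differ from the marginal distribution and from each other, which is the pattern that indicates an association.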
mean of the sampling distribution of x bar
the mean of all the sample means (x bars) from all possible samples of size n from a population; equals μ
μ
the mean of the population
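For a very small population, the claim that the mean of all the x bars equals the population mean can be checked by listing every sample. The sketch below assumes samples are drawn without replacement and ignores order; the population values are made up.

```python
from itertools import combinations
from statistics import mean

population = [2, 4, 6, 8]  # made-up population
n = 2                      # sample size

population_mean = mean(population)

# x bar for every possible sample of size n
sample_means = [mean(sample) for sample in combinations(population, n)]
mean_of_sample_means = mean(sample_means)

print(sample_means)
print(population_mean, mean_of_sample_means)
```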
no association
a condition where values of one variable occur independently of the values of another variable; detected when the conditional distributions of a two-way table equal the marginal distribution (and each other)
out-of-control process
a process showing either signal: one sample mean more than three standard deviations of x bar from the center line, or 9 sample means in a row on the same side of the center line (all above or all below).
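The two out-of-control signals can be checked mechanically. This is a sketch under stated assumptions: the center line is the process mean, the standard deviation of x bar is the process standard deviation divided by the square root of n, and the function name `out_of_control_signals` and its parameters are hypothetical.

```python
import math

def out_of_control_signals(sample_means, mu, sigma, n, run_length=9):
    """Return the indices of sample means at which a signal occurs."""
    margin = 3 * sigma / math.sqrt(n)  # three standard deviations of x bar
    signals = []
    run_side = 0   # +1 above the center line, -1 below, 0 exactly on it
    run_count = 0
    for i, xbar in enumerate(sample_means):
        # Signal 1: one sample mean beyond the control limits
        beyond_limits = abs(xbar - mu) > margin
        # Signal 2: count consecutive sample means on the same side
        side = (xbar > mu) - (xbar < mu)
        if side != 0 and side == run_side:
            run_count += 1
        else:
            run_side = side
            run_count = 1 if side != 0 else 0
        if beyond_limits or run_count >= run_length:
            signals.append(i)
    return signals

# Made-up sample means for a process with mean 10, sd 2, samples of size 4
print(out_of_control_signals([10.5, 9.8, 13.4, 10.1], mu=10, sigma=2, n=4))  # [2]
print(out_of_control_signals([10.2] * 9, mu=10, sigma=2, n=4))  # [8]
```

In the first call the third sample mean lies above the upper limit of 13; in the second, no point is beyond the limits but the ninth consecutive mean above the center line triggers the run signal.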
outlier
an observation that falls outside the overall pattern of the data set