1.5 Baseline Data Flashcards
What is SIPOC?
A: Suppliers, Inputs, Process, Outputs, Customers. Useful for identifying the key players and potential stakeholders.
What is the difference between a population and a sample?
A: Population: Often very large. Expensive and difficult or impossible to observe. Sample is a subset of the population. Observable and knowable. Subject to error and bias. Observational foundation for inference.
What is the difference between variable and attribute data?
A: Attribute is qualitative, categorical, or discrete. Count of whole things – colors, puppies, defects. Can’t be divided in a meaningful way.
Variable is quantitative, numerical, or continuous data. A measure on an infinite scale – time, distance, temperature. Can be meaningfully subdivided.
What is the difference between data, information, knowledge, and wisdom?
A: Data is facts without context (observations), Information is data with meaning and purpose, Knowledge – synthesis of information over time, Wisdom – integrated knowledge and understanding
What is normal distribution and why is it important?
A: The normal distribution forms the basis for statistical predictions about the future performance of a process. It helps determine the probability of a particular outcome.
What is the Central Limit Theorem?
A: If you have a population with a mean and standard deviation and take sufficiently large random samples from the population, then the distribution of the sample means will be approximately normally distributed. This means we can use the probability of the normal curve to estimate an outcome. It also means that systems with many random variables, tend to form a normal distribution.
There are two types of variation, what are they?
A: Common Cause and Special Cause (or Assignable Cause)
What is a Measurement Systems Analysis (MSA)?
A: Evaluates the quality of measurements – Methods, Tools, and People
What is a Gage R and R study?
A: Gage reliability and reliability study. An experiment to measure gauge error.
What is the 10:1 rule?
A: A measurement system should have a resolution 10x more precise than the tolerance it needs to measure.
What should be true about the metrics that you are measuring?
A: They need to be meaningful and measurable. We need to have the ability to affect change on the metric. We also need to be able to make decisions on the metrics.
What should be true about your data?
A: It must be stable, accurate, and measured correctly
What is the Mean of a data set?
A: The mathematical average. The value of all data points divided by the number of points
What is the Median of a data set?
A: The middle number when the data set is arranged in numerical order. If there are an even number of data points, the median is the average of the middle two points.
What is the Mode of a data set?
A: The most common value.