Collecting Data 1 Flashcards
Scales of Measurement
- In order of desirability
- Nominal
- Ordinal (Ranking)
- Interval
- Ratio
Nominal Scale of Measurement
- Data that consists of names or categories only
- Allows us to classify the object
- E.g. Is a famous beach or not
- Does not allow rank
- E.g. Doesn’t rank how famous the beach is
- Cannot determine the interval
- No ordering scheme is possible
- E.g. # of M&M colors in a bag
Ordinal (Ranking) Scale of Measurement
- Data arranged in order
- Difference between the values cannot be determined or are meaningless
- A ranking scale
- E.g. Likert Customer satisfaction scale
- The difference between a 2 rating and a 4 rating does not mean the customer is twice as satisfied when giving a 4.
- E.g. Software defect categories
- 3 UI, 4 data, 1 browser compatibility
- E.g. Likert Customer satisfaction scale
Interval Scale of Measurement
- Data type which is measured along a scale, in which each point is placed at equal distance from one another
- Always appears in the form of numbers or numerical values where the distance between the two points is standardized and equal
- Has an interval
- Data is arranged in order and differences can be found
- No starting point
- Cannot be multiplied or divided, can be added or subtracted
- Ratios are meaningless
- E.g. Temperature of 3 pizzas. If one pizza is 100 degrees, that doesn’t make a 300 degree object 3 time as hot
- Examples:
- Temperature (in Celsius or Fahrenheit)
- IQ test
- Grade level, 1st, 2nd, 3rd grade
- Dates
Ratio Scale of Measurement
- Extension of interval level that includes a zero starting point
- Data is high level variable data
- There is an inherent zero starting point
- Both differences and ratio are meaningful
- Classify objects
- Rank Objects
- Has equal intervals
- Has a true zero point
- E.g. Watches that cost $200 and $400. The 2nd one is 2 times as expensive as the first
Types of Data
The type of data you have will dictate what you can do and the tools you can use.
- Discrete Data
- Qualitative Data
- Attribute Data
- Continuous Data/Variable Data
- Location Data
Discrete Data
- Best at discerning whether or not we have a defective product or service
- “Pass/Fail: is better for failure analysis
- Counted data is discrete
- E.g. Number dimples on a golf ball
- Number of people in a stadium
- 80/100 to discrete - it is out of a finite set
- E.g. Number dimples on a golf ball
- Full numbers
Qualitative Data
- An example of qualitative data is color. It cannot be expressed as a number
Attribute Data
- Anything that can be classified as either/or
- Very binary
- Pass/Fail, go/no-go, good/bad
- Example:
- Paint chips per unit, percent of defective units in a lot, audit points
- Attribute charts
- A kind of control chart to display information about defects and defectives. Helps you visualize variation
Continuous Data/Variable Data
- Anything that can be measured on a continuous basis
- Can always be divided into smaller increments
- Exists on a continuum
- Preferred over Discrete
- Use continuous data where possible because it tells us the magnitude of the issue
- Helpful for controlling the process and providing enough discrimination
- Examples:
- Length (inches, half inch, hundredths of an inch…)
- Weight
- Temperature
- Time
- Anything you can measure: torque, tension, length, volume
Teaching Discrete and Continuous Data
Imagine you have a young child who says that he is sick. As a parent, the first thing you do is to touch their forehead to see if they feel warm – that is collecting discrete data.
If it feels like he has a fever, you’re likely to use a thermometer to take his temperature – Another type of data collection. You need to know magnitude of the fever because that will determine the course of action; 105 – ER, 101 – TYLENOL. That temperature reading is continuous data – data that exist on a continuum.
Location Data
- You could record on a measles diagram
- Example:
- Determining root cause of paint blemishes occurring on a car production line
-
Measles Diagram/Chart
- Use specifically to analyze the problem’s location and density, not just collecting the count of the problem.
- Helps determine where the common defects on parts are located
Converting Types of Data
- Difficult to translate after the fact attribute (go/no go) data to variable. But in most cases, you can find a way during measuring to convert attribute to variable
- Example: how far out of tolerance
- Always easy to convert variable data to attribute data if you have a standard.
- Example: Water is too cold to swim at less than 75 degrees. No go <75. Then put all of the data that is less than 75 to “no go” and all above “go”
Data Distribution
- Data distribution is a function that specified all possible values for a variable and also quantifies the relative frequency (probability of how often they occur)
- Distributions are considered any population that has a scattering of data.
- It’s important to determine the kind of distribution that population has so we can apply the correct statistical methods when analyzing it
Types of Continuous Distributions
- Normal Distribution
- Lognormal Distribution
- F Distribution
- Chi-Square Distribution
- Exponential Distribution
- T-Student Distribution
- Weibull Distribution
- Non-Normal Distributions
- Odd Distributions
- Bivariate Distribution
- Bi-Modal