Six Sigma Correlation, Regression, and Hypothesis Testing Flashcards
Summary of correlation
Investigates the relationship between x factors (inputs) and y (outputs)
Does a relationship exist?
What is it?
What factor has the biggest impact?
Famous correlation maxim
Correlation does not equal causation
When to use?
- Relating x input to y output
- Look at their relationship over time
- Identify key x inputs
- Measure outputs
Design of Experiments
A rigorous methodology for identifying x factors and their effects on y outputs
Examples of correlation
- Hours of worker experience correlated to incorrectly installed modules
- Visual acuity test scores correlated to output
- Age to blood pressure
- Sales success to level of education/years of experience
Scatter plots reintroduction
- Plot
- Is there a relationship?
- What type of relationship? (Positive/Negative)
- How strong?
Example of nonlinear correlation
Nonlinear relationships are much more complex
EXAMPLE: Oil changes on engine life
- Manufacturer recommends every 4k miles
- We know changing every 20k miles has a negative effect
- But what happens if we change every 1k, 2.5k miles?
Correlation coefficient
AKA Pearson correlation coefficient
An expression of the linear relationship in our data
Values fall between -1 and +1
Helps us gauge the strength or weakness of the relationship (useful for distinguishing between factors)
Interpreting correlation coefficient
1 = perfect positive line (e.g., fit of a piston in an engine)
.82 = closely related, but not super tight/close
0 = no correlation
-.82 = closely related, but not as tight
-1 = perfect negative line (e.g., noise in the environment's effect on concentration)
Tips on the correlation coefficient
- Only works for linear relationships
- Highly sensitive to outliers
Calculating the correlation coefficient
r = Σ(Xi − X̄)(Yi − Ȳ) / √[ Σ(Xi − X̄)² × Σ(Yi − Ȳ)² ]
Difficult to calculate by hand
GENERAL SUMMARY
The covariance of the two variables divided by the product of their standard deviations. We need:
- Xi- individual values of first variable
- Yi - individual values of the second variable
- n - the number of pairs of data in the dataset
There is a Pearson coefficient lookup table
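Lookup table aside, the coefficient is straightforward to compute directly. A minimal Python sketch of the covariance-over-standard-deviations formula, with hypothetical experience/error data invented for illustration:

```python
import math

def pearson_r(xs, ys):
    """Pearson r: the covariance of the two variables divided by the
    product of their standard deviations (the 1/n factors cancel,
    so they are omitted from both numerator and denominator)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    cov = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - x_bar) ** 2 for x in xs))
    sy = math.sqrt(sum((y - y_bar) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical data: years of experience vs. incorrectly installed modules
experience = [1, 2, 3, 4, 5]
errors = [9, 7, 6, 4, 2]
print(round(pearson_r(experience, errors), 3))  # → -0.995, strong negative correlation
```

More experience pairs with fewer errors, so r comes out close to -1, matching the "closely related, negative" band of the interpretation scale.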
Causation
The act of causing/agency that produces the effect.
Understanding/determining which x variables cause which y outputs of our processes.
Key components between causation and the x/y factors
- Asymmetrical (correlation is symmetrical but DOES NOT indicate causation; causation is asymmetrical, or one-directional)
- Causation is NOT reversible. (Hurricane causes the phone lines to go down, but not vice versa)
- Can be difficult to determine causation. (Is there a third, unknown variable?)
- Correlation CAN help POINT to causation. We rule out data that is unrelated.
Common mistakes when looking for causation
- Genuine causation - clear, uncomplicated data to support proposal of causation
- Common response - the common response to the unknown variable occurs when both x and y react the same way to an unseen variable
- Confounding - the effect of one variable, x, on y is mixed up with the effects of other explanatory values on the y output that we’re looking for in the process.
The statistical significance of correlation
First Ask: Are we focusing in on the right variables?
Then: Which of our correlation coefficients are subject to chance?
Next: What’s the significance of the correlations?
P-Value
Lets us measure the significance (not necessarily the importance) of a relationship between two variables.
It does provide statistical evidence of the relationship
We look for a p-value of less than 0.05 when the alpha factor is 0.05, because we're shooting for 95% confidence.
What effect illustrates the importance of asking ‘Is the correlation by chance?’?
Known as the Hawthorne Effect - paying attention to something will often increase performance.
Western Electric's Hawthorne Works, early 1920s: hypothesis that increasing lighting increases productivity
- Got a baseline on productivity
- Then upped lighting by 10%
- Kept doing until they couldn’t go any higher
- Then asked, what happens if we turn the lights right back where we started?
- When they changed it back, productivity increased AGAIN
- This blew up the lighting = productivity hypothesis
The takeaway: paying attention to people and their productivity makes them more productive.
What other question is important to ask regarding correlation?
What are the chances of finding a correlation value OTHER than what we estimated in our example?
EX: Someone’s height vs self-esteem
Regression analysis
Forecast the change in the dependent variable in our process.
Describe the relationship between predictor variables ( x ) and output y (response variable).
Simple linear regression
Gets us a best-fit line (the line running through the center of the plot). Only one x predictor per y response.
Vs. multiple linear regression: many x predictors per y.
EXAMPLE
If we’re only comparing height and weight, that’s simple linear.
If we want to run height, age, and gender against weight, that's three different factors in one multiple linear regression.
Simple linear regression formula
y = B0 + B1*x + e
The beta factors capture the effect of x on y. We run the formula for various candidate lines, looking for the best fit.
Testing for the best fit, determined by the lowest sum of the squared residuals.
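The "run through the formula for various lines" idea can be sketched as a brute-force search over candidate lines, keeping the pair of coefficients with the lowest sum of squared residuals. The data points and the search grid here are hypothetical:

```python
def ssr(xs, ys, b0, b1):
    """Sum of squared residuals for the candidate line y = b0 + b1*x."""
    return sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))

# Hypothetical data
xs = [1, 2, 3, 4]
ys = [2.1, 4.2, 5.9, 8.1]

# Try a grid of candidate intercepts (-2.0..2.0) and slopes (0..4.0)
# in steps of 0.1, and keep the pair with the lowest SSR
candidates = [(b0 / 10, b1 / 10)
              for b0 in range(-20, 21) for b1 in range(0, 41)]
best_b0, best_b1 = min(candidates, key=lambda c: ssr(xs, ys, *c))
print(best_b0, best_b1)
```

In practice nobody searches a grid like this; the least-squares formulas give the answer in one step (which is exactly why the next section matters). The sketch just makes the "lowest SSR wins" criterion concrete.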
Considerations for simple linear least-squares
- Nonlinear relationship between x factors and y outputs
- Importance of outlier data
- Watch for inconsistent variance in the residuals
How does the simple linear least-squares regression help?
It’s too cumbersome to use the simple linear regression formula for every possible line. There’s a simple way to find it:
Simple linear least-squares regression
- Gives us estimates of the true values of beta-zero and beta-one
- Beta-zero is the value of the y intercept; beta-one is the value of the slope
Predicting Outcomes with Regression Analysis/Models
Regression calculation that allows us to isolate sources of variation
Ex: Sales Forecasting
- Identifying controllable factors and their effects on sales is a valid exercise
- What factors come into play for sales success?
Key components
- Apply a linear equation to the data set (obtain a least-squares line)
- Helps us predict future values of y based on existing x factors
Regression coefficient
- There are various ways of getting b (the regression coefficient)
- Understand the y intercept value (expressed as little a)
Before we use the model, we have to know a and b
How to plot and develop data for regression model
1. Plot the scatter diagram (not the line, just the dots)
2. Get x-bar and y-bar (sum up totals, divide by the count)
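The steps above can be sketched in Python using the standard least-squares formulas, b = Σ(x − x̄)(y − ȳ) / Σ(x − x̄)² and a = ȳ − b·x̄. The data set (ad spend vs. sales) is hypothetical:

```python
# Hypothetical data: ad spend (x) vs. sales (y)
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

# Step 1 (after plotting the scatter): get x-bar and y-bar
n = len(xs)
x_bar = sum(xs) / n
y_bar = sum(ys) / n

# Step 2: regression coefficient b, then the y intercept a = y-bar - b * x-bar
b = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
     / sum((x - x_bar) ** 2 for x in xs))
a = y_bar - b * x_bar

# Step 3: the model y = a + b*x predicts future values of y
predicted = a + b * 6  # → 13.0 for this perfectly linear toy data
```

With real process data the fit will not be exact; the point is that once a and b are known, prediction is a single substitution into y = a + b*x.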
Real life examples for regression models
- Reducing handle/hold times in a contact center
  - x factor: time to get the CSR's computer turned on
- Understanding how processing temperature affects pipe wall material in production
  - Cost of material into the pipe is 50-60% of sales
  - Can we find an optimal process temperature that gives a better wall thickness?
Hypothesis Testing & Inferential Statistics revolve around what 4 things?
- Draw conclusions about population based on sample data
- Test a claim about population parameter
- Provide evidence to support opinion
- Check for statistical significance
What 6 Sigma phase does the hypothesis testing take place during?
Analyze (the A in DMAIC)
Example hypothesis: what would be the effect on customer satisfaction if we reduced the time to answer the phone, or the time it takes to provide a quality answer?
Another hypothesis: if we tightened our control over temperature in pipe production, could we minimize costs while maintaining customer satisfaction levels?
Describe the descriptive vs relational categories of hypothesis testing
Descriptive
What we can physically measure about something (size, form, distribution).
- Our ability to manipulate this
Relational
- What’s the relationship between the variables?
- Positive or negative?
- Greater or lesser than a given value?
- Ex: reducing handle times in customer contact center’s effect on satisfaction
Types of hypothesis tests
- 1-sample hypothesis test for means
- 2-sample hypothesis tests for the means
- Paired t-test
- Test for proportions
- Test for variances
- ANOVA - Analysis of Variances
Paired T-test
We use the means of two paired samples (the same units measured under two conditions) to prove/disprove a hypothesis about a shift between them.
- Do we see shifts based on the hypothesis
- Ex: is there a relationship between the handle time for a call and the CSR's experience?
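A minimal sketch of the paired t statistic, t = d̄ / (s_d / √n), where d̄ is the mean of the pairwise differences. The before/after handle times for the same five CSRs are hypothetical:

```python
import math

def paired_t(before, after):
    """Paired t statistic: mean of the pairwise differences divided
    by its standard error, t = d_bar / (s_d / sqrt(n))."""
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    d_bar = sum(diffs) / n
    s_d = math.sqrt(sum((d - d_bar) ** 2 for d in diffs) / (n - 1))
    return d_bar / (s_d / math.sqrt(n))

# Hypothetical handle times (minutes) for the same five CSRs,
# before and after gaining six months of experience
before = [8.2, 7.9, 9.1, 8.5, 8.8]
after  = [7.6, 7.7, 8.4, 8.1, 8.2]
t = paired_t(before, after)  # ≈ -5.59
# |t| exceeds the t-table critical value of about 2.776
# (df = 4, alpha = 0.05, two-tailed), so we would reject the null
```

Pairing removes the CSR-to-CSR variation from the comparison, which is why the same subjects are measured twice rather than sampling two independent groups.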
The 5 steps of hypothesis testing
- Establish our null and alternative hypotheses (Ho & Ha)
- Testing our considerations (what are the things we want to test for, and how will we manage the process)
- Calculate test statistics
- Apply the critical-value or p-value method, comparing our desired confidence level to the test results
- Interpret results
Null hypothesis
- “What they say” and expresses the status quo
- Assumes any observed differences are due to chance or random variation
- Often expressed as = , >= or <=
Alternative hypothesis
- “What we want to test/prove”
- Assumes the observed differences are real and NOT due to chance/random variation
- Often expressed as !=, > or <
The null hypothesis
Null hypothesis - assuming population parameters of interest are equal and there is no change or difference
Ex: Humidity will not have an effect on the weight of the parts we measure
Ex: The country you live in would not have an effect on your level of life satisfaction
The alternative hypothesis
Represented by H with subscript “a”.
Wants to look at parameters of interest that are not equal, assuming the difference is real.
Ex: Assume greater level of CSR experience directly correlates to quality of work output
Goals in hypothesis testing
- Reject the null in favor of the alternative hypothesis (Ha)
- Need to show the result is statistically significant
- We expect to find no effect, but the data may lead us to reject the null
- Fail to reject: we find insufficient evidence to reject the null hypothesis or to support the alternative
- Presenting the results - Even though it’s more natural sounding to state in view of the Ha, we actually express in terms of whether or not we’re rejecting the null hypothesis
- “We reject the null hypothesis.” OR
- “We fail to reject the null hypothesis.”
The types of error
Type I error (alpha risk) - rejecting a null hypothesis that is actually true
Type II error (beta risk) - failing to detect the effect we're looking for
Type I Error (alpha risk)
The risk we’re willing to take in rejecting the null hypothesis when it’s actually true (producer’s risk)
- A false alarm (a false positive)
- Tied to the alpha factor
Common alpha factor = 0.05
- Testing: what's the probability of making a Type I error at that confidence level?
Alpha significance level
Signifies the degree/risk of failure that’s acceptable to us in the study at hand.
Helps decide if null can be rejected
1-alpha Confidence level acceptance region
Signifies the level of assurance we expect in the results of the data being studied
- Describes the uncertainty of the sample method you’re using
Type II error (beta risk)
Most common beta risk value is 0.10
- Similar to failing to find the defective piece when producing a product
- AKA Consumer risk
- A false negative (failing to detect a real effect)
Are alpha and beta inversely proportional?
Yes
How do test tails work
If Ha: mu > mu0 (the hypothesized mean) -> one-tailed test to the right
If Ha: mu < mu0 -> one-tailed test to the left
If Ha: mu != mu0 -> two-tailed test (we look for deviations on both sides of the curve)
How do we use the concept of critical value?
Used to compute the margin of error
Margin of error = critical value × standard error (or standard deviation) of the statistic
How is the critical value test statistic derived?
Z = (x-bar − mu) / (sigma / √n)
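A quick sketch of the critical-value method using this Z statistic, with Python's standard-library normal distribution supplying the critical value. The fill-weight scenario and all its numbers are hypothetical:

```python
from statistics import NormalDist

def z_statistic(x_bar, mu, sigma, n):
    """One-sample z statistic: Z = (x_bar - mu) / (sigma / sqrt(n))."""
    return (x_bar - mu) / (sigma / n ** 0.5)

# Hypothetical: claimed mean fill weight mu = 500 g, known sigma = 4 g,
# and a sample of n = 25 bottles averaging 502.2 g
z = z_statistic(502.2, 500, 4, 25)      # ≈ 2.75

# Two-tailed critical value at alpha = 0.05: the 97.5th percentile
critical = NormalDist().inv_cdf(0.975)  # ≈ 1.96
reject = abs(z) > critical              # z falls outside the acceptance region
```

Because |Z| exceeds the critical value, the sample mean lands in the rejection region rather than the 1 − alpha acceptance region, so the null would be rejected.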
What is the acceptance region?
The region corresponding to the confidence level of the test, 1 − alpha
If alpha factor is 5%, resulting confidence factor would be 95%
Two-tailed test
If Ha mu != mu-o
Testing on both sides of the mean
The power of a test
The probability that the test leads to the correct decision.
The power (sensitivity) of a statistical test is its probability of rejecting a null hypothesis that is actually false.
Higher power means a greater likelihood of correctly rejecting the null.
Four factors on the power of a test
- Sample size
- Population differences
- Variability
- Alpha level
Sample size
Most important part of power of a test.
Population differences
Particularly important when planning the study.
Sample must be large enough to avoid Type II errors.
Variability
Less variability = more power
Ex. If looking at equivalency exams in schools, we know that sample size and variance matter
Alpha level
Most common: 0.05
Used to determine critical value
Plenty of instances where alpha factor of 0.05 results in rejecting the null hypothesis, but an alpha factor of 0.01 would not.
P-value, what is it, what do we use it for
Used to determine statistical significance.
Use it to evaluate how well the data supports the null hypothesis.
Key things in p-value
Effect size
Sample size
Variability of data
What does a low p-value mean?
Indicates that the sample data contains enough evidence to reject the null for the population.
Rhyming maxim for p-value interpretation
“If the p is high, null will fly.”
“If p is low, null will go.”
Examples of p value
If p value is less than the alpha factor (0.05 in this case), then we reject.
If p value is greater than alpha factor, then we do not reject.
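The p-value decision rule above can be sketched for a two-tailed z-test with the standard library's normal distribution. The z statistic of 2.75 is hypothetical, chosen to give a p-value well below alpha:

```python
from statistics import NormalDist

def two_tailed_p(z):
    """Two-tailed p-value: probability of a z statistic at least
    this extreme under the null hypothesis."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

alpha = 0.05
p = two_tailed_p(2.75)  # hypothetical test statistic; p ≈ 0.006

# "If p is low, the null will go"
decision = "reject the null" if p < alpha else "fail to reject the null"
print(decision)
```

Compare with a small statistic such as z = 1.0, where p ≈ 0.32 > 0.05 and the null "flies" (we fail to reject).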