Stats Flashcards

1
Q

The purpose of experimentation?

A

 Comparing Alternatives
 Identifying the Significant Inputs (Factors) affecting an Output (Response) - separating the
vital few from the trivial many
 Achieving an Optimal Process Output (Response)
 Reducing Variability
 Minimizing, Maximizing, or Targeting an Output
 Achieve product & process robustness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Standard Deviation = ?

properties?

A

σ = sqrt(sum(xi - x)^2/(n-1)

Always positive or 0
Affected by outliers
Same units as the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define Random Variable

A

A random variable is a variable whose possible values are numerical outcomes of a random phenomenon. There are two types of random variables, i.e., discrete and continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Probability of distribution

A

A probability of distribution is a list of possible values of a random variable together with their probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define Binomial Distribution

A

A binomial distribution is a frequency distribution of the possible number of successful outcomes in a given number of trials in each of which there is the same probability of success.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

State and define the two types of variables.

A

Discrete random variable - a variable which can only take a countable number of values.

Continuous random variable: a random variable takes on values within an interval or it has so many possible values that they might as well be considered continuous.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Properties of normal distribution?

A
  • Bell curve
  • Area = unity
  • Symmetrical
  • Middle = mean
  • Saddle point is when it turns from concave down to concave up and is 1 SD away from u
  • Almost all points are within 3 Sd.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the standard normal distriubution?

A
  • mean = 0
  • SD = 1
  • The normal random variable of a standard normal distribution is called a standard score or z-score.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is sampling distribution?

A

The sampling distribution of the sample means gives all the possible values of the sample mean and quantifies how often they occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to build the sampling distribution of the sample mean?

A

1) Take a sample of values from random variable X (population) 2) Calculate the mean of the sample
3) Repeat step 1) and 2) over and over again

All the sample means xbar result in a new population which is denoted using random variable Xbar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe the properties of mean of the sampling distribution

A
  • A sampling distribution represents averages that are based on samples and NOT individual values from a population
  • A sampling distribution is nothing bu a distribution, so that it has its own shape, centre and variability

The mean of sampling distribution Xbar is denoted a uxbar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do the standard errors pf a sampling distribution vary with SD and n

A

As n increases standard errors decrease

As SD increases standard errors increase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the shape of a sampling distribution if:

  • the distribution X is normal?
  • The distribution of X is Un-known or not-normal?
A
  • Xbar is normal

- Xbar can be approximated with a normal distribution according to the Central Limit theorem (CLM)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the Central limit theorem state?

A

If you have a population with mean u and SD and take sufficiently large random samples (usually n>30) from the population itself, then the sample means will be approx normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define confidence intervals.

A

A range of values so defined that there is a specified probability that the value of a parameter lies within it.

e.g.
confidence Interval = sample statistic ± margin of error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the margin of error affected by?

A

Confidence level - z*

Sample size - n

Variation in population - SD

z* * (SD/sqrt(n))

17
Q

Define confidence level.

A

The probability that the value of a parameter falls within a specified range of values.

Corresponds to the percentage of the time the result would be correct if numerous random samples were taken

18
Q

What is a hypothesis test?

What are the two types?

A

Its a procedure that uses data from a sample to confirm or deny a claim about the population.

1) Null hypothesis (Ho) - Ho is true unless data and statistics demonstrate otherwise.
2) Research (or alternative) hypothesis - Ha

19
Q

State and describe the two types of hypothesis

When is open chosen over the other?

A

1) Null hypothesis (Ho) - Ho is tryu unless data and statistics demonstrate otherwise.
2) Research (or alternative) hypothesis (Ha) - Population parameter is: Not equal to Ho, Larger than Ho or Smaller than Ho.

Ho is rejected in favour of Ha when we have found a statistically significant result.

20
Q

What does the correlation coefficient tell you about the bivariate data set?

A

r = -1 - perfect negative linear relationship
r approaching -1 - strong negative linear relationship
r approaching 0 - no linear relationship.
r approaching 1 - strong positive linear relationship
r = 1 - perfect positive linear relationship.