Midterm 2 Flashcards

Question 1

Q

What is probability?

Answer

A

the science of chance behaviour (proportion of times an outcome will occur)

Question 2

Q

What is Gallup Polling?

Answer

A

random sampling gives information about
the sample (people polled) which can be used to make an
estimate of the population.

Question 3

Q

Is chance predictable?

Answer

A

Chance behaviour is unpredictable over the short run but is
regular and predictable over the long run.E.g. coin tossing

Question 4

Q

What is randomness?

Answer

A

A kind of order that emerges only after a long run.

Question 5

Q

What factors affect randomness?

Answer

A

Must have a long series of independent trials.
– Outcome of one trial must not influence the outcome of the next.

Question 6

Q

What is a probability model?

Answer

A

mathematical description of a random
phenomenon consisting of two parts – a sample space and
a method of assigning probabilities to events.

Question 7

Q

What is a discrete sample space?

Answer

A

Discrete variables that can
take on only certain values (a
whole number or a
descriptor). E.g. blood types

Question 8

Q

What is Continuous sample space?

Answer

A

Continuous variables that can take on any one of an
infinite number of possible values over an interval. E.g. cholesterol levels

Question 9

Q

What do all the probable outcomes sum to?

Question 10

Q

What is the equation for the probability of something not happening?

Answer

A

1 - the probability of it occuring

Question 11

Q

If two independent outcomes exist, what is the probability of either event occurring?

Answer

A

Probability of event 1 + probability of event 2

Question 12

Q

What is the addition probability rule?

Answer

A

Probability of an event is the sum of the probabilities of the
outcomes making up the event

Question 13

Q

What is Benford’s law?

Answer

A

Legitimate documents have a preponderance of 1s and 2s which
usually do not occur with falsified documents.

Question 14

Q

What is a Normal curve statistically speaking?

Answer

A

A Normal probability model

Question 15

Q

What is a random variable?

Answer

A

a variable whose numerical outcome is
due to a random phenomenon

Question 16

Q

What is probability distribution?

Answer

A

of X describes the values X can
take and how to assign probabilities to those values.

Question 17

Q

What are disjoint/mutually exclusive events?

Answer

A

events that NEVER happen together

Question 18

Q

What is the General Addition Rule?

Answer

A

pA+pB-P(a+b)

Question 19

Q

What are independent events?

Answer

A

One event has no probability change on the other

Question 20

Q

What is the Multiplication Rule for Independent Events?

Question 21

Q

What is sampling with replacement?

Answer

A

Experimental units are replaced before each new
sampling event is started – samples are independent.

Question 22

Q

What is conditional probability?

Answer

A

Conditional probabilities reflect how the probability of an
event can be different if we know that some other event
has occurred or is true.

Question 23

Q

What is a discrete random variable?

Answer

A

– random variables that have a
finite list of possibilities.

Question 24

Q

What is a continuous random variable?

Answer

A

infinite number of outcomes.

Question 25

Q

What is a risk?

Answer

A

The risk of an undesirable outcome of a random
phenomenon is the probability of that undesirable
outcome.

Question 26

Q

What is an odd?

Answer

A

The odds of any outcome of a random phenomenon is the
ratio of the probability of that outcome divided by the
probability of that outcome not occurring.

Question 27

Q

What is a parameter?

Answer

A

a number describing a characteristic of the population

Question 28

Q

What is a sample?

Answer

A

part of the population examined and for which we have data

Question 29

Q

What is the difference between mew and x (with a line over top)?

Answer

A

Mew = mean of the population
x + line = mean of a sample

Question 30

Q

What is important to remember about random sampling?

Answer

A

A statistic computed from a random sample is a
random variable.

Question 31

Q

What is a sampling distribution?

Answer

A

probability distribution of that statistic for samples
of a given size n taken from a given population.

Question 32

Q

What is the Law of large numbers?

Answer

A

As the number of samples of randomly sampled data increases, the mean of the sample gets closer to the population mean + the sample proportion gets closers to the population proportion

Question 33

Q

What is the Central limit theory?

Answer

A

When randomly sampling from any population
with mean µ and standard deviation σ, when n is large enough, the
sampling distribution of is approximately Normal: N(µ,σ/√n).

Question 34

Q

What is statistical inference?

Answer

A

Drawing conclusions from a sample about the population. Uses probability to state how reliable conclusions really are

Question 35

Q

Are sample means usually the same as the population mean?

Question 36

Q

What is a confidence interval?

Answer

A

The confidence interval is a range of values with an associated
probability or confidence level C. The probability quantifies the chance
that the interval contains the true population parameter

Question 37

Q

What are the two parts of a confidence interval?

Answer

A

estimate ± margin of error. Represent corresponding area under a curve

Question 38

Q

What does the confidence interval tell us?

Answer

A

with 95% confidence, we can say the population mean is two standard deviations away from the sample mean

Question 39

Q

What does the confidence level depend on?

Question 40

Q

What does a large sample size mean?

Answer

A

Smaller standard deviation

Question 41

Q

What kind of error does the margin of error cover?

Answer

A

Random sampling error

Question 42

Q

For a legitimate experiment, what are some key rules for gathering samples?

Answer

A

The data must be a probability sample or come from a randomized
experiment

Question 43

Q

What is a confidence interval?

Answer

A

Confidence intervals are used to estimate a population
parameter, with a built-in estimate of the precision of that
estimate. Estimate ± Margin of Error. Relies on srs + central value theorem

Question 44

Q

What is statistical significance?

Answer

A

Statistical significance only says whether the effect
observed is likely to be due to chance alone because of
random sampling.

Question 45

Q

How does sample size affect statistical significance?

Answer

A

Because large random samples have small chance
variation, very small population effects can be highly
significant if the sample is large.
* Because small random samples have a lot of chance
variation, even large population effects can fail to be
significant if the sample is small.

Question 46

Q

What is the purpose of hypothesis testing?

Answer

A

Tests to see if sample data is valid with the hypothesis

Question 47

Q

What is a null hypothesis?

Answer

A

The null hypothesis is a very specific statement about a
parameter of the population(s). It is labeled H0
.

Question 48

Q

What is an alternate hypothesis?

Answer

A

The alternative hypothesis is a more general statement
about a parameter of the population(s) that is exclusive of
the null hypothesis. It is labeled Ha

Question 49

Q

Whats the difference between a one tail and two tail sided test?

Answer

A

Two sided has both null and alternative (one equals while other doesn’t equal). one sided has null and alternative (null and alternative is higher or lower)

Question 50

Q

How do you know which test to use?

Answer

A

If the question says higher or lower you only need to do a one sided test.

Question 51

Q

What is the p-value?

Answer

A

A way to confirm whether a null or alternative hypothesis is correct

Question 52

Q

What does a small p-value mean?

Answer

A

Small P-values are evidence AGAINST H0
. (less than 0.05)

Question 53

Q

What is a significance level?

Answer

A

Alpha. The largest p value tolerated for rejecting the null hypothesis. Decided before conducting the test

Question 54

Q

How can you find a confidence level in a two-sided test using alpha?

Question 55

Q

What things must you know for a significance test?

Answer

A

Data is SRS, Normal, and standard deviation must be known

Question 56

Q

What is statistical power?

Answer

A

The power of a test of hypothesis with fixed significance level
α is the probability that the test will reject the null hypothesis
when the alternative is true.
In other words, power is the probability that the data gathered
in an experiment will be sufficient to reject an incorrect null
hypothesis.

Question 57

Q

What is a type 1 error?

Answer

A

when we incorrectly reject the null hypothesis

Question 58

Q

What is a type 2 error?

Answer

A

when we fail to reject the null hypothesis and it is false

Question 59

Q

What are conditions for inference around the mean?

Answer

A

A SRS
Normal distribution
Both mean and standard deviation are unknown

Question 60

Q

What is the difference between standard deviation and standard error?

Answer

A

SD=n-1 degrees of freedom
SE=mean +/- SE