What are chances? Flashcards

1
Q

How to calculate for probability?

A

taking the number of ways the event can happen and dividing it by the total number of possible outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is sampling with replacement?

A

The sample is placed back into the selection and can be chosen again.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is independent probability?

A

Two events are independent if the probability of the second event does not changed based on the outcome of the first event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sampling without replacement is what?

A

Not replacing the value taken from the sample. Each pick is dependent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Dependent Events are what?

A

Probability of the second event is affected by the outcome of the first event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is conditional probability?

A

Used to calculate the probability of dependent events.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A venn diagram is used for what?

A

Visualizing conditional probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a venn diagram?

A

technique used to display the possible outcomes of multiple events and the overlap where both events can occur.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a probability distribution?

A

Describes the probability of each possible outcome in a scenario

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the expected value of a probability distribution

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Discrete probability distribution is what?

A

Describes probabilities for discrete outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the law of large numbers?

A

As the size of sample increases, the sample mean will approach the expected value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

We can use discrete distributions to model what?

A

situations that involve count or interval data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a binary outcome?

A

Two possible values can occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is binomial distribution?

A

Probability distribution of the number of successes in a sequence of independent events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Binomial distribution can be represented by two parameters:

A

n: total number of events
p: probability of successes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Expected value of binomial distribution can be calculated by:

A

expected value: n x p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Normal distribution shape is commonly referred to as a what?

A

a bell curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Bell curve properties:

A

Symmetrical
Area beneath the curve = 1
Curve never hits 0
Normal distribution is described by mean and standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Skewness is what?

A

Describes the direction that data tails off

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Positive skewed / Right skewed is defined as

A

the plot peaks on the left and tails off to the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

A Negative skewed / left sked distribution is defined as

A

peaks on the right and tails off to the left.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Kurtosis is what?

A

A way of describing the occurrence of extreme values in a distribution

24
Q

characterized by a large peak around the mean and smaller standard deviation

A

Positive kurtosis / leptokurtic

25
Q

mesokurtic distribution is the term used to describe the

A

normal distribution

26
Q

describes a distribution with a lower peak and larger standard deviation

A

negative kurtosis / platykurtic

27
Q

A distribution of a summary statistic such as the mean is called?

A

Sampling Distribution

28
Q

What is the central limit theorem?

A

a sampling distribution will approach a normal distribution as the size of the sample increases.

29
Q

a process where the average number of events in a given time period is known, but the time or space between events is random.

A

Poisson process

30
Q

describes the probability of some number of events happening over a fixed period of time.

A

Poisson distribution

31
Q

Poisson distribution is described by a value called?

A

Lambda

32
Q

Lambda represents what?

A

average number of events per time interval

33
Q

Lambda changes the shape of the distribution

A

Lambda is the distribution’s peak

34
Q

What is hypothesis testing?

A

Hypothesis testing is a group of theories, methods, and techniques to compare populations.

35
Q

What is null hypothesis?

A

Start with an assumption that no difference exists between the populations

36
Q

Hypothesis testing workflow:

A

Define the target populations
Develop null and alternative hypotheses
Collect sample data
Perform statistical tests on sample data
Draw conclusions about the population

37
Q

What are experiments?

A

Experiments are a subset of hypothesis testing that involves performing statistical tests on sample data to draw conclusions about a population.

38
Q

Experiments aim to answer:

A

What is the effect of the treatment on the response?

treatment = independent variable
response = dependent variable

39
Q

What are controlled experiments?

A

A common type of experiment is a controlled experiment, where participants are randomly assigned to either the treatment group or the control group.

40
Q

What is the gold standard of experiments?

A

Eliminate the bias as much as possible

41
Q

Methods to help eliminate bias in controlled experiments?

A

Randomization
Blinding
double-blind randomized controlled trial

42
Q

What is randomization?

A

participants are assigned to the treatment or control group randomly, not due to some characteristics

Randomization helps ensure that the groups are comparable.

43
Q

What is blinding?

A

In a blind trial, the participants don’t know if they’re in the treatment or control group. This ensures that the effect of the treatment is due to the treatment itself, not the idea of getting the treatment.

44
Q

What is a double-blind randomized controlled trial?

A

The person administering the treatment or running the experiment also doesn’t know whether they’re administering the actual treatment or a placebo.

45
Q

What is A/B testing?

A

A/B testing only splits participants into two groups - treatment and control.

46
Q

What is the Pearson correlation coefficient? (correlation coefficient).

A

It quantifies the strength of a relationship between two variables, producing a value between minus one and one.

This number corresponds to the strength of the relationship between the variables, and the sign, positive or negative, corresponds to the direction of the relationship

47
Q

The Pearson correlation coefficient can only be used for what type of relationships?

A

Linear Relationships

48
Q

What are linear relationships?

A

Proportionate changes between dependent and independent variables

49
Q

Values represent what?

A

The strength of the relationship

50
Q

What are the different correlation coefficients?

A

0.99 (very strong relationship)
0.75 (strong relationship)
0.56 (moderate relationship)
0.2 (weak relationship)

51
Q

When the correlation coefficient is close to zero, what does that mean?

A

x and y have no relationship and the scatterplot looks completely random.

This means that knowing the value of x doesn’t tell us anything about the value of y.

52
Q

What does the sign of the correlation coefficient represent?

A

The sign of the correlation coefficient corresponds to the direction of the relationship.

53
Q

What is a positive and a negative correlation?

A

A positive correlation coefficient indicates that as x increases, y also increases.

A negative correlation coefficient indicates that as x increases, y decreases.

54
Q

What is a confounding variable?

A

Something that affects the data we are analyzing, but was not accounted for when assessing the relationship between variables.

55
Q

What is p-value?

A

This is the probability of achieving a result at least as extreme as the one we have observed, assuming the null hypothesis is true.

56
Q

What is a type 1 error?

A

We can wrongly accept our null hypothesis when it’s false.

57
Q

What is a type 2 error?

A