week 1 Flashcards

Question

probability distribution.

Answer 1

Each random variable has a probability distribution. Discrete: probability mass function (PMF), which tells us the probability associated with each possible value of the random variable. Continuous: probability density function (PDF).

Answer 2

In the example of X being the indicator random variable representing getting a head after a fair coin flip, the PMF of X is P(X=0) = 0.5 and P(X=1) = 0.5.
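A minimal R sketch of this PMF, assuming we model the fair-coin indicator as a binomial random variable with a single trial (size = 1). The variable names p, pmf, and total are illustrative only.

```r
# PMF of the fair-coin indicator X: a binomial with one trial (size = 1).
p <- 0.5
pmf <- dbinom(c(0, 1), size = 1, prob = p)  # P(X = 0) and P(X = 1)
names(pmf) <- c("P(X=0)", "P(X=1)")
print(pmf)

# The probabilities in a PMF always sum to 1.
total <- sum(pmf)
print(total)
```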

Answer 3

If a random variable is a Bernoulli random variable, we can say that the random variable follows the Bernoulli distribution. By Bernoulli distribution, we mean the probability distribution associated with the Bernoulli random variable.

Answer 4

X ~ Ber(p), where the symbol “~” stands for “follows” and Ber stands for the Bernoulli distribution.

Answer 5

For X ~ Ber(p), p is the parameter that fully describes the Bernoulli distribution. Parameters are considered non-random, fixed quantities. This usage of the term “parameter” is a bit different from, but related to, the case where “parameter” means a quantity computed from population data.

Answer 6

A normal random variable (a.k.a., Gaussian random variable) is a continuous random variable that follows the famous “bell curve” distribution.

Answer 7

The “bell curve” is the probability density function (PDF) of the normal random variable.

Answer 8

1. expected value µ 2. variance σ^2

Answer 9

X ~ N(µ, σ^2)

Answer 10

When the normal random variable has a mean of 0 and a variance of 1, it is called the standard normal random variable, usually denoted as Z ~ N(0, 1).

Answer 11

For a continuous random variable, we cannot meaningfully talk about the probability of the random variable taking on any specific value: that probability is always zero. We can only talk about the probability of the random variable taking on a range of possible values.
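A short R sketch of this idea, assuming a standard normal random variable and an arbitrary point x0 = 1 (both chosen only for illustration). The probability of an interval around x0 shrinks toward zero as the interval gets narrower.

```r
# Probability that Z ~ N(0, 1) falls in an interval of shrinking width around x0.
x0 <- 1
widths <- 10^(-(0:6))  # 1, 0.1, 0.01, ..., 0.000001
probs <- pnorm(x0 + widths / 2) - pnorm(x0 - widths / 2)

# As the width goes to zero, so does the probability.
print(data.frame(width = widths, probability = probs))
```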

Answer 12

Tells us the probability of a random variable taking on a value that is equal to or less than a cutoff point a. P(X ≤ a), which equals P(X < a) for a continuous random variable, is the area under the PDF curve below a.

Answer 13

The 68–95–99.7 rule is a shorthand for remembering the percentage of values in a normal distribution that lie within one, two, and three standard deviations of the mean: about 68%, 95%, and 99.7%, respectively.
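The rule can be checked numerically in R. This is a small sketch using the standard normal distribution; the names k and coverage are illustrative.

```r
# Probability of falling within k standard deviations of the mean, k = 1, 2, 3.
k <- 1:3
coverage <- pnorm(k) - pnorm(-k)

# Expressed as percentages, rounded to one decimal place.
print(round(100 * coverage, 1))  # 68.3 95.4 99.7
```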

Answer 14

dnorm() pnorm() qnorm() rnorm()

Answer 15

The dnorm() function computes the PDF of the normal distribution. It outputs the probability density of a normal random variable at a specific value. It is not commonly used because, for continuous random variables, the probability of a range of values (i.e., the area under the PDF) is more important.
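A brief R sketch of dnorm(), assuming an illustrative distribution N(100, 400). It also shows that a density is not a probability: it can exceed 1 when the standard deviation is small.

```r
# Density of X ~ N(100, 400) at x = 100; sd is the square root of the variance.
dens_at_mean <- dnorm(100, mean = 100, sd = sqrt(400))

# At the mean, the normal density equals 1 / (sd * sqrt(2 * pi)).
by_hand <- 1 / (20 * sqrt(2 * pi))
print(c(dens_at_mean, by_hand))

# A density value can be larger than 1, so it is not itself a probability.
narrow_density <- dnorm(0, mean = 0, sd = 0.1)
print(narrow_density)
```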

Answer 16

The pnorm() function computes the CDF of the normal distribution. It outputs the probability of a normal random variable taking on values below the quantile value. Required inputs: q: the quantile value at which you want to compute the probability; mean: value for the parameter µ; sd: value for the parameter σ. Other input: lower.tail: logical; whether you want the lower tail (TRUE) or the upper tail (FALSE) probability. By default, lower.tail = TRUE.
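A minimal usage sketch of pnorm(), assuming an illustrative distribution N(100, 400) and a cutoff of 120 (one standard deviation above the mean).

```r
# Lower tail: P(X <= 120) for X ~ N(100, 400).
p_lower <- pnorm(q = 120, mean = 100, sd = sqrt(400))

# Upper tail: P(X > 120), requested with lower.tail = FALSE.
p_upper <- pnorm(q = 120, mean = 100, sd = sqrt(400), lower.tail = FALSE)

# The two tails are complementary and sum to 1.
print(c(lower = p_lower, upper = p_upper, total = p_lower + p_upper))
```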

Answer 17

The qnorm() function computes the quantile value given the probability below that quantile value. It outputs the quantile value. Required inputs: p: the probability below the quantile value; mean: value for the parameter µ; sd: value for the parameter σ. Other input: lower.tail: logical; whether the p you specified is a lower tail (TRUE) or an upper tail (FALSE) probability. By default, lower.tail = TRUE.
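A minimal usage sketch of qnorm(), again assuming an illustrative distribution N(100, 400). It shows that qnorm() is the inverse of pnorm().

```r
# 95th percentile of X ~ N(100, 400): the value with 95% of the area below it.
q95 <- qnorm(p = 0.95, mean = 100, sd = sqrt(400))

# The same cutoff, specified through the 5% upper-tail probability.
q95_upper <- qnorm(p = 0.05, mean = 100, sd = sqrt(400), lower.tail = FALSE)

# Round trip: feeding the quantile back into pnorm() recovers the probability.
p_back <- pnorm(q95, mean = 100, sd = sqrt(400))
print(c(q95 = q95, q95_upper = q95_upper, p_back = p_back))
```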

Answer 18

Note: Remember to take the square root of the variance to get the standard deviation for the argument sd.

Answer 19

Generates/simulates random numbers from the normal distribution. Suppose our population data follow a normal distribution N(100, 400), and we want to simulate randomly sampling 10 values from the population. Then we can use rnorm(n = 10, mean = 100, sd = sqrt(400)).
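A runnable sketch of the rnorm() call, with two assumptions added for illustration: a fixed seed so the draws are reproducible, and a second, larger sample to show the sample statistics settling near the population values.

```r
# Fix the seed so the simulated values are reproducible (the seed is arbitrary).
set.seed(1)

# Ten simulated draws from the population N(100, 400).
small_sample <- rnorm(n = 10, mean = 100, sd = sqrt(400))
print(small_sample)

# A larger sample: its mean and sd should be close to 100 and 20.
x <- rnorm(n = 1000, mean = 100, sd = sqrt(400))
print(c(mean = mean(x), sd = sd(x)))
```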

Answer 20

X ~ Bin(N,p) discrete random variable

Answer 21

X ~ χ^2(df) continuous random variable

Answer 22

X ~ t(df) continuous random variable

Answer 23

Associated with random events. Have a probability distribution. Can take on more than one possible value. Denoted using capital letters (e.g., X, Y).

Answer 24

Associated with non-random events. Do not have a probability distribution. Can only take on one possible value. Denoted using small letters (e.g., a, x).

Answer 25

a random procedure’s different outcomes.

Answer 26

the realized value of a random variable. The realized value of a random variable is treated as a constant.

Answer 27

We can also realize this random variable multiple times and then graph the empirical probability distribution. For example, we can realize the random variable 10 times by flipping a fair coin 10 times. The empirical probability distribution is an estimate of the theoretical probability distribution.
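A small R simulation of this idea, assuming the coin flips are simulated with rbinom() (size = 1) rather than physically flipped. The seed and the choice of 10,000 flips for the larger run are arbitrary.

```r
set.seed(42)  # arbitrary seed for reproducibility

# Realize the fair-coin indicator 10 times (1 = head, 0 = tail).
flips <- rbinom(n = 10, size = 1, prob = 0.5)

# Empirical distribution: the proportion of 0s and 1s observed.
empirical <- table(factor(flips, levels = c(0, 1))) / length(flips)
print(empirical)

# With many more realizations, the empirical proportions approach 0.5 each.
many_flips <- rbinom(n = 10000, size = 1, prob = 0.5)
prop_heads <- mean(many_flips)
print(prop_heads)

# barplot(empirical) would graph the empirical distribution.
```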

Answer 28

the normal distribution

Answer 29

random variables; they have a probability distribution

Answer 30

constants do not have a probability distribution

Answer 31

constants (or fixed values)

Answer 32

No, because they are constants.

Answer 33

numerical quantities that fully describe a distribution, e.g., µ and σ^2 in X ~ N(µ, σ^2)

Answer 34

numerical quantities characterizing the population data

Answer 35

random variables, because we are uncertain about their values. In Bayesian statistics, you can specify a probability distribution for each parameter, called the prior distribution.

Answer 36

that when we add or average a large number of random variables, the sum or the mean of the random variables is a random variable that approximately follows a normal distribution. The CLT implies that when you add or average many different random events together and use a random variable to quantify the result, the probability distribution of that random variable is approximately normal.

Answer 37

At a large n, Xbar approximately follows a normal distribution N(µ_Xbar = µ, σ^2_Xbar = σ^2/n).
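A simulation sketch of this result, under assumptions chosen only for illustration: the population is exponential with rate 1 (so µ = 1 and σ^2 = 1, clearly not normal), the sample size is n = 50, and the experiment is repeated 5,000 times.

```r
set.seed(123)  # arbitrary seed for reproducibility

n <- 50       # sample size per experiment
reps <- 5000  # number of repeated samples

# Draw `reps` samples of size n from a non-normal population and keep each sample mean.
xbar <- replicate(reps, mean(rexp(n, rate = 1)))

# The mean of the sample means should be close to mu = 1.
print(mean(xbar))

# The variance of the sample means should be close to sigma^2 / n = 1 / 50.
print(c(simulated = var(xbar), theoretical = 1 / n))

# hist(xbar) would show an approximately bell-shaped histogram.
```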

Answer 38

the mean of the sampling distribution of the sample mean Xbar

Answer 39

the standard deviation of the sampling distribution of the sample mean Xbar; also called the standard error of the mean (SEM)
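Since the variance of the sample mean is σ^2/n, the SEM is σ/sqrt(n). A short R sketch, assuming an illustrative population variance of 400 and a few arbitrary sample sizes:

```r
# Population standard deviation, from an assumed variance of 400.
sigma <- sqrt(400)

# SEM = sigma / sqrt(n) for several sample sizes.
n <- c(4, 25, 100)
sem <- sigma / sqrt(n)
print(data.frame(n = n, sem = sem))  # sem: 10, 4, 2
```

Quadrupling the sample size halves the SEM, because n enters through its square root.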

Answer 40

that when we add or average a large number of random variables, each with finite µ and σ^2, the sum or the mean of the random variables approximately follows a normal distribution. This implies that when you add different random events together and map the result onto a number line, it approximately follows the normal distribution.

Answer 41

the sampling distribution of the sample mean.

Answer 42

The sampling distribution of the sample mean is the distribution of the sample mean over repeated samples. “Over repeated samples” means “conducting the same experiment (with a fixed sample size n) infinitely many times.” This is related to the frequentist perspective.

Answer 43

normal distribution.

(70 cards)