STAT MOD 4: Chapter 6 Flashcards

1
Q

What are random variables?

A

assigns a number to each outcome of a random circumstance to each unit in a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a random variable describe?

A

Describe probabilities of outcomes of that variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is X? What is x?

A

X (refers to abstract random variable)
- height
x (refers to particular value of the variable)
- 7 foot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is discrete random variable? Can we find probabilities for a discrete random variable?

A

possible values are isolated points on the number line; can take one of a countable list of distinct values

  • can find probabilities for exact outcomes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is continuous random variable? Can we find probabilities for a continuous random variable?

A

possible values are all points in some interval

  • limited to finding probabilities for intervals of values
  • probabilities of exact outcomes are zero
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is probability distribution function (pdf)? Draw probability distribution function

A

table that assigns probabilities to possible values of x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do the following stand for in pdf?

  • X
  • k
  • P(X=k)
A
  • X = random variable
  • k = a number the discrete random variable could assume
  • P(X = k) = the probability that X equals k
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is cumulative distribution function (cdf)?

A

a table that gives P(X ≤ k) for any real number k

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are requirements that a discrete distribution (pdf) must satisfy?

A

All probabilities between 0 and 1
- impossible to have a negative outcome

Probability of all possible outcomes sums to 1
- All the values on P(X = k) side of the table must add up to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you compute mean/expected value of discrete random variable using a probability distribution table?

A

expected value: describes where the probability of distribution of x is centered

  • Make pdf table of k and P(X = k)
    then multiply each possible value (k) by its probability (P(X = k) → kP(X = k)
  • Add all multiplied values together to get mean/expected value of a discrete random variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is a probability of a continuous random variable expressed as?

A

Density curve: a curve whose area represents the probability of the continuous random variable taking values in that interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are characteristics of a density curve?

A
  • The curve always falls at or above zero (can never be negative)
  • The total area under density curve is equal to 1
  • The probability a random variable (x) falls in a given interval is equal to the area under the density curve in that interval
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is uniform distribution? How do you find probability?

A

a square or rectangular density curve (ex: trolley waiting time)

Know the height and length of the shaded to find probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you find areas under a given normal distribution (left tail, interval, or right tail)?
- normal problems

A

Use empirical rule - (if values perfectly fall on standard deviation values)

normalcdf (lower, upper, mean, SD)
- Use normalcdf when trying to find an interval
- use normalpdf when trying to find probability at an exact value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you find quartiles of any normal distribution?

How to find x value from the distribution given left or right area (inverse normal problems)?

A
  • given percent as the left or right tail area
  • find mean and standard deviation
  • use invnorm(left tail area, mean, standard deviation)
  • if given something lowest percent = left tail area, input value as is
  • if given something highest percent = right tail area, subtract this from 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you turn values from any normal distribution into z-scores from the standard normal?

A

Z-scores or standardizing: number of standard deviations that it falls from the mean

(x - μ/sigma) to get z-score then refer to standard normal distribution table for area of that z-score

17
Q

How do you check for normality in a data set?

A
  • Plot a histogram
  • See if empirical rule fits the data
  • Interpret a normal probability plot
  • you are looking for the dots to fall on a 45 degree line (must follow a linear pattern) i
    f the data set matches up to what you would expect a data set of that size to look like coming from a normal distribution.
18
Q

What is binomial random variable?

A

defined as X = number of successes in the n trials of a binomial experiment

  • x ~ bin (n,p)
19
Q

What are requirements for a binomial experiment?

A
  • Only two possible outcomes on each trial (success/failure)
  • Fixed number of trials (n)
  • Outcomes are independent from one trial to the next
  • Probability of a success (denoted by p) remains the same from one trial to the next
  • Probability of a failure is 1 - p
20
Q

What are the commands to find to find binomial probabilities and cumulative probabilities?

A

○ Binompdf(n, p, k)
○ Binomcdf(n, p, k)

21
Q

How do you compute mean of binomial distribution?

A

Mean (μ) = (trials)(probability success) = np

22
Q

How do you compute standard deviation of binomial distribution?

A

Standard deviation (sigma) = square root of np (1-p)

23
Q

Which conditions can the normal distribution be used to approximate binomial distribution?

A

If X has a binomial distribution and n is large, we can approximate the binomial probabilities

  • If a SRS of size (n) is drawn from a large population and (n) is sufficiently large, the sampling distribution of X is approximately normal

○ Conditions:
- x ~ AN (np, square root of np(1–p)
if np ≥ 10 and np(1–p) ≥ 10

24
Q

How do you use normal distribution to approximate the binomial distribution when appropriate?

Find approximate probabilities using the normal curve

A

First approximate the binomial distribution to normal distribution (check if np and np(1–p) ≥ 10)

Use either normalpdf or normalcdf (lower bound, upper bound, np, standard deviation or square root of np(1–p))