Statistics I Flashcards

Question

Equations for mean and variance of binomial distributions

Answer 1

The mean of X (number of favorable occurances) is u = np The variance of X is σ² = np(1-p)

Answer 2

The two inflection points represent where 1 standard deviation occurs

Answer 3

Go to the corresponding z value for the percentile, then use the z score / x value / std dev / mean equation to get the x value ( TI-84: DISTR \> invNorm() )

Answer 4

Approximate it with a normal distribution The following conditions must me true n \* p \>= 10 n \* (1-p) \>= 10 You will need to calculate the mean, std dev to get the z score, then find the percentile (TI 84: DISTR \> invNorm

Answer 5

CDF: cumulative density function (eventually rises up to 1) PDF: probability density function (doesn't rise up to 1, like the normal distribution) Basically CDF is good for a range of occurences. instead of a specific number of successes (i.e. “3 trials”) this function gives you the probability there will be 0 to x successes in n trials. In other words, if you put X=3 it will five you the probability for 0,1,2 and 3 trials (all together).

Answer 6

Shorter and fatter than the z distribution, gets taller and skinnier with more samples used when you only have a sample, and trying to determine facts about the population

Answer 7

used to describe t distributions Equal to sample size - 1 (n-1) notated as t₉ (9 degrees of freedom) t₃₀is desired (very close to normal)

Answer 8

They can be 1 sided, or 2 sided. Be careful

Answer 9

This is the standard deviation of the sampling distribution of the sample mean... σ_x

Answer 10

Margin of error = Critical value x Standard deviation of the statistic or Margin of error = Critical value x Standard error of the statistic Standard error is a function of sample size and standard deviation. Standard error is basically the same as standard deviation, except you can't use population parameters because you don't know them. If the confidence interval is 95%, then alpha is equal to .05. Critical probability is 1-(alpha / 2)= (0.975). Critical value is the z or t score associated with that probability. Then go back to the original equation

Answer 11

All distributions are somewhat normal, and that 30 is the good transition point for sample size

Answer 12

p hat is the proportion of individuals in the sample who have a particular characteristic

Answer 13

where p is the sample number

Answer 14

np and n(1-p) to be greater than or eqaul to 10

Answer 15

population mean vs. sample mean (you can have a u_x)

Answer 16

You have to use normalCDF() The range has to be that z score and an extreme z score (like -999 or 999)

Answer 17

Calculate margin of error for a sample proportion (sort of like binomial, approve disaprove of politicians) or Calculate the margin of error for a sample mean Margin of error = Critical value x Standard deviation of the statistic or Margin of error = Critical value x Standard error of the statistic

Answer 18

If you can assume it came from a normal distribution, use t-values Trick: if the population standard dev, σ, is not given, you can use the sample standard dev and use t values

Answer 19

The building blocks of confidence intervals. A conficence interval is a statistic plus or minus a margin of error, and the margin of error is the number of standard errors you need. The number of standard errors required is called the critical value (z^*) called the z star value

Answer 20

The significance level, or alpha level (typically 0.05)

Answer 21

Rejecting the null hypothesis when you shouldn't

Answer 22

Not rejecting the null hypothesis when you should have

Answer 23

d is for differences

Answer 24

0 because the theoretical difference between proportions is zero

Answer 25

s is sample std devs, bars are means of the samples how to calculate: for each (x,y) multiply the differences, then add up all of those results The rest of the formula is clear - 1: negative linear relationship 0: no relationship 1: positive linear relationship

Answer 26

The line that minimizes the sum of squares for error (SSE) Slope is the standard deviations and r is correlation, y int is calculated using the two means

Answer 27

Illustration of a simple confounding case: in this graphical model, given Z, there is no association between X and Y. However, not observing Z will create fake association between X and Y. In the latter case, Z is called a confounding factor.

Answer 28

Pick the row or column variable, and divide each subtotal by the grand total as shown:

Answer 29

Divide each cell by the grand total. Sum of all should be 1.

Answer 30

"Find the conditional distribution of gender by country" Say there's 3 countries... The result will be 3 totals all equal to 1, with each having a percentage of gender If it's find x by y... If x is a row, each row adds up to 1, if x is a column, each colum adds up to 1

Answer 31

Compare the reslts of two conditional distributions (check if they match) Compare the marginal and conditional distributions to check for independence ^ if greater than a 2 way table, go to the Chi-square test

Statistics I Flashcards

(60 cards)