vocab definitions Flashcards

Question

Probibility distribution

Answer 1

A prob distribution describes the true relative frequency of all possible values of a random vairable

Answer 2

if two events are mutually exclusive they cannot both be true Pr(A and B)= 0

Answer 3

The prob of an event is its true relative frequency, the proportion of times the event would occur if we repeat the same process over and over

Answer 4

the error that occurs when samples are not indepenent, but they are treated as though they are

Answer 5

estimate is the standard deviation of its sampling distribution. predicts the sampling error of the estimate

Answer 6

the standard deviation of its sampling distribution It predicts the sampling error of estimate

Answer 7

the conditional probability of an event is the probability of that event occurring given that a condition is met. Pr[X|Y]

Answer 8

the 95% confidence provides a plausible range for a parameter. All values for the parameter lying within the interval are plausible, given the data, whereas those outside are unlikely

Answer 9

the interval from Y-2SEy to Y+2SEy provides a rough estimate of the 95% confidence interval for the mean

Answer 10

compares count data to a model of the expected frequencies of a set of categories -it is an approximation (don't use when there's little amount of data) H0: the data come from a specified probability distribution x^2= sum of all classes (observed-expected)^2/ expected

Answer 11

the number of degrees of freedom of a test specifies which of a family of distributions to use for x^2 df= number of categories-number of parameters estimated from the data-1

Answer 12

the value of the test statistic where P= alpha

Answer 13

A test statistic is a number calculated from the data and the null hypothesis that can be compared to a standard distribution to find the P-value of the test

Answer 14

=that its a random sample - No more than 20% of categories have expected <5 - no category with expected = 1 when both these conditions are not met the approximations to make the x^2 test do not work

Answer 15

a porbobility disribution descibing a discrete numerical random variable example: number of heads from 10 flips of a coin number of flowers in a square meter number of disease outbreaks in a year

Answer 16

a mathematical probability distribution. - describes the probability that a certain number of events occur in a block of time or space, when those events happen independently of each other and occur with equal probability at every point in time or space

Answer 17

tests the independence of two of more categorical variables

Answer 18

for 2x2 contingency analysis does not make assumptions about the size of expectations use when you cant do x^2 contingency analysis *don't need to do by hand

Answer 19

the probability of success divided by the probability of failure

Answer 20

the odds of success in one group divided by the odds of success in another group OR< 1 means odds of bad thing happens lower OR>1 means odds of bad thing is higher

Answer 21

-independent selection of individuals -random selection of individuals -sufficiently large

Answer 22

The difference between the estimate and average value of the estimate

Answer 23

The cumulative frequency of a value is the proportion of individuals equal to or less than the value graphed this goes from 0-1 on y axis, never decreasing

Answer 24

contingency table, grouped bar graph, mosaic plot,

Answer 25

multiple histograms

Answer 26

scatter plot

Answer 27

Ybar= sum of Yi/n n=sample size

Answer 28

The median is the middle measurement in a set of ordered data

Answer 29

the mode is the most frequent measurment

Answer 30

the maximum minus the minimum

Answer 31

Small samples tend to give _lower__ estimates of the range than small samples. So sample range is a _biased estimator_ of the true range of the population

Answer 32

sigma^2= sum of (Yi- u)^2/N N is the number of individuals in population u= true mean of the population

Answer 33

s^2= sum of(Yi-Ybar)^2/n-1 n=sample size Ybar= sample mean

Answer 34

positive square root of the variance sigma is the true standard deviation s is the sample stand deciation s= sqare root of s^2= sqrt(sum(Y-Ybar)^2/n-1)

Answer 35

CV= 100% S/Ybar

Answer 36

a measurement of asymmetry refers to the pointy tail of a distribution

Answer 37

standard error of he mean: sigma ybar= sigma/ srt(n)

Answer 38

SEYbar= S/ srt(n) gives us some knowledge of the likely difference b/w our sample mean and the true population mean

Answer 39

Pr[x]= sum of all values of Y Pr[X|Y] Pr[Y]

Answer 40

P[positive result]= Pr(positive result| X)Pr(x) +Pr(positive result| Y) Pr(Y)

Answer 41

Pr[A|B]= Pr[B|A]Pr[A]/ Pr[B]

Answer 42

hypothesis testing asks how unusual it is to get data that differ from the null hypothesis If the data would be quite unlikely under H0 we reject H0

Answer 43

sapmples with assumption it is random

Answer 44

a specific statement about a population parameter made for the purposes of argument. usually the simplest statement

Answer 45

represent all other possible parameter values except that stated in the null hypothesis usually the statement of greatest interes

Answer 46

would be interesting if proven wrong

Answer 47

the probability of getting the data or something as or more unusual, if the hypothesis where true

Answer 48

Simulation Parametric tests Permutation

Answer 49

The significance level, alpha, is a probability used as a criterion for rejecting the null hypothesis If the P-value for a test is less than or equal to alpha then the null hypothesis is rejected often 0.05

Answer 50

A large sample will tend to give and estimate with a _smaller_ confidence interval A larger sample will give _more power to reject___ a false null hypothesis

Answer 51

Rejecting a true null hypothesis Probability of Type I error is alpha (the significance level)

Answer 52

Not rejecting a false null hypothesis The probability of a Type II error is beta The smaller beta the more power a test has

Answer 53

The ability of a test to reject a false null hypothesis Power = 1- beta

Answer 54

most tests are two-tailed tests and this means that a deviation in either direction would reject the null hypothesis normally alpha is divided into alpha/2 on one side and alpha/2 on the other

Answer 55

are only used when the other tail is nonsensical example: comparing grades on a multiple choice test to that expected by random guessing

Answer 56

The value of a test statistic beyond which the null hypothesis can be rejected

Answer 57

outside the 95% confidence interval

Answer 58

an unmeasured variable that may be the cause of both X and Y

Answer 59

a fraction of individuals having a particular attribute

Answer 60

describes the probability of a given number of successes from a fixed number of independent traits Pr[X]=(n given X)p^X(1-p)^n-X n trails; p probability of success

Answer 61

mean and variance of number of succusses u=np sigma^2= np(1-p)

Answer 62

phat= X/n the hat (^) shows that this is an estimate of p

Answer 63

mean: p variance: p(1-p)/n

Answer 64

(p'-1.96 sqrt(p'(1-p')/n+4) <= p<= (p'+1.96 sqrt(p'(1-p')/n+4) p'= X+2/n+4

Answer 65

uses data to test whether a population proportion p matches a null expectation for the proportion example: H0: dog good is chosen at best 20% of time Ho=0.2 N=18, p0= 0.2, X=2 P-value= 2(Pr[2]+Pr[1]+Pr[0]) example say = 0.543 > 0.05 therefor cannot reject the null hypothesis (It is plausible that people do not prefer pate over dog food)