lecture 19,20,21 Flashcards

1
Q

what is a multinomial experiment?

A

qualitative data falling into more than 2 categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

properties of a multinomial experiment?

A

n identical trials
k possible outcomes (categories or cells)
probabilities are p1, p2, p3 and remain the same from trail to trial
independent trials
random variable interests are cell counts, n1, n2, of the # of observations that fall in each of the k classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a one way table

A

consider a multinomial experiment with k outcomes that correspond to categories of a single qualitative variable. These results are summarized in a one way table

one way because only one variable is classified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what does a chi sqaured test measure?

A

the degree of disagreement between the data and null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

5 steps to solving a chi squared?

A

1 state null and alternative
2 test statistic (x^2= sum of (ni-npi,0)^2/npi,0)
3 Specify a
4 calculate test stat
5 determine whether to fail to reject or reject null and draw conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

rejection region for chi squared?

A

if x^2> x^2a, k-1; where k-1 is number of cells -1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

conditions for x^2

A

multinomial experiment; satisfied by taking a random sample
n is large; satisfied if for every cell, Ei>5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is the chi squared used to determine?

A

whether there is a significant difference between expected frequencies and observed frequencies in one or more categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is a two way contigency table?

A

multinomial experiment results summarized in 2-way table called a contigency table
probability totals are called marginal probabilites for each row and column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what happens in a contingency table analysis?

A

if the two classifications are independent, the probability that an item is classified in any particular cell of the table is the product of the corresponding marginal probabilities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

how to estimate probabilites?

A

phatci= Ri/n
phatci=Ci/n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

as the number of degrees of freedom increases,?

A

the chi squared distribution becomes more symmetrical; values of x^2 are always positive. The distributions are positively skewed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

finding expected cell counts for 2-way table

A

Ehat=RiCj/n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

test stat for two way table?

A

x^2= sum of (nij-Ehatij)^2/Ehatij

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

pvalue

A

P(x^2>x^2a) where x^2a has (r-1)(c-1) df and x^2c is computed value of test stat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

conditions for a contingency table?

A

1 n observed counts are random sample from population of interest. then consider it a multinomial experiment with rxc possible outcomes
2 n is large enough so that for every cell the estimated expected count, Ehatij is > 5; if not the approximation can become very poor

17
Q

what should happen if x^2 values does not exceed critical value of x^2

A

do not accept hypothesis of independence; avoid inferring a casual relationship between classifications.

18
Q

what can not be established by a contingency table?

A

The existence of causality cannot be established by a contingency table

19
Q

what is simple linear regression?

A

suppose we want to study the relationship between the dependent variable y and the predictor variable x. Suppose the equation has some restricted form such as y=Bo +B1x where Bo is the y intercept and B1 is the slope. This is known as a deterministic model

20
Q

if we use a model that accounts for random error…?

A

this is a probabilistic model which includes both a deterministic component and a random error component
y (variable of interest; we always assume that the mean value of random error =0 Therefore, E(y) is deterministic component)=deterministic component and random error

21
Q

what is the random error term?

A

E (y=Bo +B1x +E)

22
Q

what is x and y?

A

x= predictor, explanatory, or independent variable
y=response, outcome, or dependent variable

23
Q

what is the implication of linear regression?

A

expected value of Y is a linear function of x. Y differs from its expected value by a random amount
formally, let x, denote a particular value of the independent variable x, then our linear probabilistic model says
E(y|x1)=uy/x=mean value y when x is x1

24
Q

steps for linear regression?

A

1 hypothesize the deterministic component of model that relates the mean, E(y), to independent variable x
2 use sample data to estimate unknown parameters in model
3 specify the prob distribution of random error term and estimate the SD of this distribution
4 statistically evaluate the usefulness of the model
5 when satisfied that the model is useful, use it for prediction, estimation, and other purposes

25
Q

what is the least squares method?

A

determine the values for the parameters so that the overall discrepancy D= sume of (observed response-predicted response)^2 is minimized