Probability And Statistics Flashcards

1
Q

Find the probability of a joint distribution function

A

(1) draw the boundaries using the interval (box)
(2) rearrange the probability and draw this on the graph and find the area of the region.
(3) if the area is found simply then multiply the ans by the joint distribution value.
Otherwise construct a double integral for the area and this is the answer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

For a joint distribution function what is the expectation and how do you show independence?

A

E(x)=Integral x fX(x) dx
E(y)=Integral y fY(y) dx
Independence is fX,Y(x,y)=fX(x)fY(y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the steps for change of variables?

A

(1) rearrange the new variables to make x and y the subject
(2) construct the Jacobean matrix and find the determinant, (dx/dudy/dv-dx/dvdy/du) and take the modulus of this value.
(3) multiply this matrix by the joint distribution function where x and y have been replaced in terms of u and v.
(4) and simplify to obtain the joint density.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the formula for the conditional probability?
And the expectation for x|y
What is the equation for the COV(X,Y)?

A

For fX|Y(x|y)=fX,Y(x,y)/fY(y)
And visa versa for y|x
For the expectation: E(X|Y)=sum of xfX|Y(x|y)
Cov(X,Y)=E(XY)-E(X)E(Y) where
E(XY)=sum of x
y*fX,Y(x,y) if the covariance=0 then the marginals are independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Write down all of the summary statistics.

And how you can check whether the model is fit for linear regression analysis.

A
Check in book.
If the correlation |Sxx/sq(Sxx*Syy)|>0.8
Write the linear regression equation:
Y=ax+b where a=Sxy/Sxx and 
b=(y-bar) - (x-bar)*a
Another check is that the error variance is small compared to the y values. 
1/n(Syy-Sxy^2/Sxx)
Also residuals should be between +- 2 
standard deviations where residuals: 
e=y - y' where y is the original y values subtract the linear regression equation y values.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Write the equation for the form of the exponential family of distributions.
How do you find the normal form?

A

Check in book.
Normal form is obtained by parameterising the eta term by rearranging and making the other term the subject. Substitute this into the original form and sort into the other EFD form. Where the A(eta) can be used to find the expectation and variance, first and second derivative respectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you find the MLE?

A

(1) write L=product from i=1 to n of the function
(2) then write l= sum of the log of the function
(3) then apply the log function to the function
(4) where x is involved replace with the sum from i=1 to n of xi.
(5) then differentiate with respect to the parameter we want and set to 0.
(6) 1/n*sum of xi is the x-bar.
(7) make the parameter-bar the subject.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you find an unbiased estimator for the parameter?

A

Want to show that the expectation of the parameter-bar is equal to the parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you find the moment-generating function?

A

Mx(t)=E(exp(tx))=exp(tx)*function
For the kth moment we have that
E(X^k)=d^k/dt^k Mx(t) at t=0
So the expectation is the first derivative at t=0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to find C for the joint distribution function?

And the x and y marginals?

A

Do the double integral.

Check in book

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the formula for the Z-Test and

T-Test.

A

Check in book.
For t-test is when the variance is unknown.

For a sample variance s^2, it is calculated by 1/n-1*sum of (xi-(x-bar))^2.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

For a difference in means of populations, what are the three cases and equations that follow?

A

(1) when the variance is unknown but are known to be equal we use the pooled t-test ~ t(m+n-2). Write equations and check in book.
(2) when the population variances are not the same by the sample sizes are sufficiently large for it to work we use the z-test but with the unknown variances Sx and Sy.
(3) the two variances are known and are equal we use the z-test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the steps to test the hypothesis for a contingency table?

A

(1) state the hypothesis
(2) calculate the proportion p=x/n
(3) compute the expected frequencies p*n total =fe
(4) draw the table, so we obtain to total for (f0-fe)^2/fe
(5) degrees of freedom: (n-1)(m-1) n=rows
(6) we accept for

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the test for the value of a variance of a normal population?

A

V=(n-1)S^2/variance0 in the hypothesis~chi squared (n-1)

If |V|

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How to test for the ratio of variances of two normal populations.

A

We use the F-test:
F=SA^2/SB^2 for SA^2>SB^2

For the numerator the d.f. N-1
And the denominator d.f M-1
So we find F(1-alpha/2, n-1, m-2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

For a proportion test what do you do? How are p-values calculated?

A

Z=p-p0/(square root (p0(1-p0)/n)) where p is the proportion and p0 is the p stated in the hypothesis.
Then to test whether we reject or not follow as if this was the z test.

For a two tailed test:
P value: 2P(Z>z)
Left tailed:
P(Zz)
For a p-value