FM Qs Wrong Flashcards

1
Q

Explain why it may be appropriate to carry out a hypothesis test based on the PMCC

A

Because the scatter diagram shows a roughly elliptical shape, which suggests the underlying population has a bivariate normal distribution.

  • also if there are two big clusters
2
Q

Spearman’s rank correlation: hypotheses

A

H0 - there is no association between x and y in the population
H1 - there is an association between x and y in the population
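
A minimal sketch of how the test statistic for these hypotheses could be computed, assuming there are no tied ranks so that r_s = 1 - 6Σd²/(n(n² - 1)) applies. The function name and the data are made up for illustration.

```python
def spearman_rank(xs, ys):
    """Spearman's rank correlation coefficient, assuming no tied values."""
    n = len(xs)

    def ranks(values):
        # Rank 1 goes to the smallest value; ties are not handled here.
        order = sorted(values)
        return [order.index(v) + 1 for v in values]

    rank_x, rank_y = ranks(xs), ranks(ys)
    d_squared = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Hypothetical data: 5 paired observations.
x = [10, 20, 30, 40, 50]
y = [1.2, 2.9, 2.1, 4.8, 4.0]
r_s = spearman_rank(x, y)
print(round(r_s, 3))
```

The value of r_s is then compared with the critical value from tables for the given n and significance level.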

3
Q

Define significance level

A

The probability of rejecting the null hypothesis when it is in fact true

4
Q

Discuss briefly what the contingency excel table suggests

A
  • go with each type, e.g:
    Type 1 - large contributions to test statistic of __ and __ show that fewer than expected are _ and more than expected are __
    Type 2 - for type 2, small contributions of ___, so as expected
    Type 3 - ….
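
The contributions mentioned above are the values (O - E)²/E for each cell. A rough sketch of where they come from, using a made-up 2x2 table (the function name and the numbers are assumptions, not from any exam question):

```python
def chi_squared_contributions(observed):
    """Return (expected, contributions) for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    # Expected frequency = row total * column total / grand total.
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]
    # Contribution of each cell to the test statistic = (O - E)^2 / E.
    contributions = [
        [(o - e) ** 2 / e for o, e in zip(obs_row, exp_row)]
        for obs_row, exp_row in zip(observed, expected)
    ]
    return expected, contributions


# Hypothetical observed frequencies.
observed = [[30, 10], [20, 40]]
expected, contributions = chi_squared_contributions(observed)
test_statistic = sum(sum(row) for row in contributions)

for obs_row, exp_row, con_row in zip(observed, expected, contributions):
    for o, e, c in zip(obs_row, exp_row, con_row):
        direction = "more than expected" if o > e else "fewer than expected"
        print(o, e, round(c, 2), direction)
```

A large contribution together with "more/fewer than expected" is what gets quoted in the comment for each type.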
5
Q

Interpolation vs extrapolation

A

Interpolation (estimating within the range of the data) gives a reliable estimate; extrapolation (estimating outside the range of the data) is probably unreliable

6
Q

Comment on the fit of the regression line

A

Because r^2 = ____, fit is bad/moderate/ good

And the points lie fairly far from / close to the line of best fit
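
A small sketch of how r² could be obtained from raw data, assuming r is the PMCC. The data set is hypothetical and `pmcc` is just an illustrative helper name.

```python
from math import sqrt


def pmcc(xs, ys):
    """Product moment correlation coefficient r = Sxy / sqrt(Sxx * Syy)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    return s_xy / sqrt(s_xx * s_yy)


# Hypothetical data.
x = [1, 2, 3, 4]
y = [2, 4, 5, 9]
r = pmcc(x, y)
r_squared = r ** 2
print(round(r, 3), round(r_squared, 3))
```

The closer r² is to 1, the better the fit of the line to the data.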

7
Q

Discrete uniform distribution

A

Uniform on the values {1, 2, 3, …, n}: each value has probability 1/n
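
As a quick check of the standard results mean = (n + 1)/2 and variance = (n² - 1)/12 for this distribution, here is a sketch that computes both directly from the definition, using exact fractions. The function name is an assumption for illustration.

```python
from fractions import Fraction


def discrete_uniform_mean_var(n):
    """Mean and variance of the uniform distribution on {1, 2, ..., n}."""
    values = range(1, n + 1)
    p = Fraction(1, n)  # every value is equally likely
    mean = sum(p * v for v in values)
    variance = sum(p * v * v for v in values) - mean ** 2
    return mean, variance


# Example: a fair six-sided die, n = 6.
mean, variance = discrete_uniform_mean_var(6)
print(mean, variance)
```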

8
Q

Much larger sample size =

A

Smaller critical value

9
Q

Residual =

A

Observed value - value predicted by the regression line
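
A minimal sketch, assuming a least-squares regression line of y on x, showing each residual as the observed y minus the y given by the line. The data and function names are made up for illustration.

```python
def least_squares_line(xs, ys):
    """Return (gradient, intercept) of the regression line of y on x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    gradient = s_xy / s_xx
    intercept = y_bar - gradient * x_bar
    return gradient, intercept


def residuals(xs, ys):
    """Residual for each point = observed y - predicted y."""
    gradient, intercept = least_squares_line(xs, ys)
    return [y - (intercept + gradient * x) for x, y in zip(xs, ys)]


# Hypothetical data.
x = [1, 2, 3, 4]
y = [2, 4, 5, 9]
res = residuals(x, y)
print([round(e, 2) for e in res])
```

A positive residual means the point lies above the line, a negative one means it lies below. For a least-squares line the residuals sum to zero.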

10
Q

Conditions for binomial (4)

A
  • fixed number of trials
  • each trial is independent of the others
  • each trial has two outcomes: success or failure
  • probability of success is constant
11
Q

Conditions for a Poisson distribution

A
  • events occur independently of each other
  • events occur at a constant average rate through time
  • mean = variance, roughly
12
Q

Explain why for proper inference, PMCC sample should be randomly selected

A

Because then the probability basis on which the sample has been selected is known

13
Q

Conditions for geometric distribution

A
  • independent trials
  • binary - success or fail
  • chance of success is the same each trial
14
Q

Degrees of freedom tips and tricks

A

1 - work from the final table, after any merging of columns due to small expected frequencies
2 - if testing for a Poisson or binomial fit with a parameter estimated from the data, take (number of cells after merging) - 1, then subtract a further 1 for each estimated parameter
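
A sketch of the counting described above, assuming the common rule that cells are merged when the expected frequency is below 5, and assuming only cells at the upper end need merging. The function names and frequencies are hypothetical.

```python
def merge_small_tail(observed, expected, minimum=5):
    """Merge cells from the upper end until the last expected frequency is at least `minimum`."""
    obs, exp = list(observed), list(expected)
    while len(exp) > 1 and exp[-1] < minimum:
        last_exp = exp.pop()
        last_obs = obs.pop()
        exp[-1] += last_exp
        obs[-1] += last_obs
    return obs, exp


def goodness_of_fit_dof(cells_after_merging, estimated_parameters=0):
    """Degrees of freedom = cells - 1 - number of parameters estimated from the data."""
    return cells_after_merging - 1 - estimated_parameters


# Hypothetical frequencies for a Poisson fit where the mean was estimated from the data.
observed = [10, 20, 12, 5, 3]
expected = [12.1, 18.3, 11.2, 5.9, 2.5]
obs, exp = merge_small_tail(observed, expected)
dof = goodness_of_fit_dof(len(exp), estimated_parameters=1)
print(obs, [round(e, 1) for e in exp], dof)
```

Here the last two cells are merged, leaving 4 cells, so there are 4 - 1 - 1 = 2 degrees of freedom.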

15
Q

A student concludes that there is no correlation between the variables in the summer months. Comment on the student’s conclusion

A
  • the result of a hypothesis test always carries uncertainty (not rejecting H0 does not prove there is no correlation); then expand on that in the context of the question
16
Q

Give two desirable features that the sample should have

A
  • randomness
  • unbiased
  • representative of the population
  • components selected independently
17
Q

Explain why random sample was chosen for PMCC

A
  • random sample enables proper inference about population to be undertaken
  • unbiased
  • for hypothesis test to be valid, necessary to assume sample is random
18
Q

Explain why binomial and Poisson can be used

A
  • occurrences are random and independent with constant probability
  • the number of successes is counted
    Therefore binomial is appropriate

For Poisson, the sample size n is large and p is small
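
To illustrate why large n and small p make Poisson usable, here is a rough comparison of binomial probabilities with Poisson probabilities using mean np. The values n = 200 and p = 0.01 are made up for illustration.

```python
from math import comb, exp, factorial


def binomial_pmf(k, n, p):
    """P(X = k) for X ~ B(n, p)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)


def poisson_pmf(k, lam):
    """P(X = k) for X ~ Po(lam)."""
    return exp(-lam) * lam ** k / factorial(k)


# Hypothetical values: n large, p small.
n, p = 200, 0.01
lam = n * p  # Poisson mean matched to the binomial mean np

for k in range(5):
    print(k, round(binomial_pmf(k, n, p), 4), round(poisson_pmf(k, lam), 4))
```

The two columns of probabilities should agree closely, which is the justification for using Poisson in place of binomial here.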

19
Q

Sometimes when commenting on a roughly elliptical scatter diagram

A
  • point out any anomalies too
  • also mention POPULATION
20
Q

Reliability of regression estimate usually related to

A

EXTRAPOLATION OR INTERPOLATION
- sometimes also mention the value of r and how close the points are to the line

21
Q

State the distributional assumption which is necessary for this test to be valid

A

The POPULATION must have a bivariate normal distribution

22
Q

Comment on the outcome of this test with a larger sample but with 0.076 being considered as an effect size

A
  • the test shows that there almost certainly is / is not some real correlation in the population
  • however this is of little practical consequence since effect size is so small
23
Q

Would it be appropriate to use the regression line of y on x to find a value of x given a value of y

A

Not appropriate; the regression line of x on y is needed (d on f in the original question)

24
Q

Based on the effect size table + data, give two reasons why it would not be appropriate to carry out a hypothesis test based on this data

A
  • sample not random
  • distribution of the bivariate population is unknown
25
Q

What does the table + effect size allow you to conclude about the relationship between x and y

A
  • point out largest effect size
  • is it large or medium? Medium = less useful for practical purposes
  • all other factors have a low effect size, suggesting very little relationship for them