prac q's 2 Flashcards

1
Q

a) What is the difference between a statistic and a parameter?

A

A parameter is a numerical characteristic of a population while a statistic is a numerical characteristic of a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

b) What is meant by sampling variability in a statistical study?

A

Sampling variability refers to the process whereby statistics, such as the sample mean, would give different results if the random sampling process was repeated. We thus need to account for sampling variability when making any conclusions from our data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

c) What is meant by the term double-blind in an experiment?

A

In a blind experiment the subjects do not know what treatment they are receiving. In a double-blind experiment the experimenter is also blind as to what treatment the subjects are receiving

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

State the assumptions that have to be made when performing a linear regression.

A

Linear relationship;
Constant variability;
Normal variability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Discuss the assumption of constant error variance with reference to the residuals plot shown above.

A

the scatterplot is grouped in a clunk
- The variance seems fairly constant for the 26 participants in the main group. It is difficult to assess for the outlying values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

null hypothesis for regression analysis

A

slope of linear is 0 !!!!

Null hypothesis: The slope of the linear association is 0
P-value: p < .001
Conclusion: Very strong evidence against the null hypothesis, suggesting that there is an association between 6MWD and Age.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

e) What is the R-squared value for this linear model and how would you interpret it?

eg r^2 is 0.485

A

how much variability is explained by the model relative to the total variability = what is the proportion of the explained variance …. BASICALLY TELLS YOU HOW GOOD IS MODEL

R-squared is 0.485, suggesting that 49% of variability in 6MWD values is explained by knowing the age of the participant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Overall, do you think this is a good model for predicting expected 6MWD? If not, then how would you suggest the predictive capabilities might be improved?

  • the bloody thing is all over the spot
A

No, the linear relationship seems spurious, likely just a result of trying to fit a line through the outlying points in the scatterplot. If the researchers are interested in the full range of ages 25-75 then they should include more subjects in the 20-40 and 60-75 age ranges to determine whether there is a genuine linear relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

levene’s test

A

Null hypothesis: The variances within the three groups are all equal
P-value: p = .093
Conclusion: No evidence to reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What conclusion would you draw from the results of Levene’s test in relation to the assumptions of the ANOVA (p<0.001) and why?

A

It is reasonable to assume that the variances within the three groups are all equal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

c) What other assumptions should we make before performing ANOVA?

A

The groups are independent of each other and the variability within each group is Normal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly