Statistics VIII - Exam Questions II Flashcards
What’s the linear regression model?
What assumptions are made about the errors or residuals?
What are the consequences if those assumptions are not true and what can one do about it?
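A model of the linear effect of x on y: y = β₀ + β₁x + ε.
The errors are assumed to be independent and normally distributed, with mean zero and constant variance.
If the assumptions are violated (e.g. a histogram of the residuals does not look normal), p-values and confidence intervals become unreliable. Possible remedies: transform the variables or use a non-parametric or robust method.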
What’s the difference between multiple and multivariate regression?
Multiple regression takes several independent variables and only one dependent variable.
Multivariate regression takes several dependent variables (with one or more independent variables).
How is the coefficient of determination calculated and how can it be interpreted?
In simple linear regression, the coefficient of determination is the squared correlation coefficient: R² = r².
R² indicates the proportion of the variance of y that can be explained by the linear relation to x.
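The identity R² = r² can be checked numerically. The sketch below uses made-up data (not from the flashcards) and two hypothetical helper functions: one computes the correlation coefficient, the other computes R² from the residual and total sums of squares of the fitted line.

```python
from statistics import mean

def pearson_r(x, y):
    """Pearson correlation coefficient of two equally long samples."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def r_squared(x, y):
    """R² of the least-squares line, computed as 1 - SS_res / SS_tot."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    intercept = my - slope * mx
    ss_res = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    ss_tot = sum((b - my) ** 2 for b in y)
    return 1 - ss_res / ss_tot

# Made-up illustrative data (assumption, not from the flashcards)
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
r2 = r_squared(x, y)
print(r ** 2, r2)  # the two values agree up to floating-point rounding
```

The equality holds only for a regression with a single predictor and an intercept; with several predictors, R² is the squared multiple correlation instead.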
What’s a partial coefficient of regression?
y = β₀ + β₁z₁ + β₂z₂ + β₃z₃ + ε
βᵢ … partial coefficient of regression
Interpretation: β₂ is the expected change in y per unit change in z₂ if z₁ and z₃ are held constant.
The partial coefficient of regression is NOT equal to the bivariate coefficient of regression.
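A small numerical sketch can make the difference visible. It assumes made-up data with only two correlated predictors (instead of three, to keep the closed-form solution short) and a noise-free y = 1 + 2·z1 + 3·z2; the helper `centered_sum` is hypothetical.

```python
from statistics import mean

def centered_sum(u, v):
    """Sum of products of deviations from the means (n times the covariance)."""
    mu, mv = mean(u), mean(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v))

# Made-up, correlated predictors (assumption, not from the flashcards)
z1 = [1.0, 2.0, 3.0, 4.0, 5.0]
z2 = [2.0, 1.0, 4.0, 3.0, 6.0]
# Noise-free response with known partial coefficients 2 and 3
y = [1 + 2 * a + 3 * b for a, b in zip(z1, z2)]

s11, s22, s12 = centered_sum(z1, z1), centered_sum(z2, z2), centered_sum(z1, z2)
s1y, s2y = centered_sum(z1, y), centered_sum(z2, y)

# Bivariate coefficient: regress y on z1 alone
bivariate_b1 = s1y / s11

# Partial coefficients: solve the two normal equations of y on z1 and z2
det = s11 * s22 - s12 ** 2
partial_b1 = (s22 * s1y - s12 * s2y) / det
partial_b2 = (s11 * s2y - s12 * s1y) / det

print(bivariate_b1, partial_b1, partial_b2)
```

Here the partial coefficient of z1 recovers the value 2 used to build y, while the bivariate coefficient is about 5, because z1 also picks up the effect of the correlated z2.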
What are the differences between a factor analysis and a PCA (in their calculation and interpretation)?
When an investigator has a set of hypotheses that form the conceptual basis for her/his factor analysis, the investigator performs a confirmatory, or hypothesis-testing, factor analysis. In contrast, when there are no guiding hypotheses and the question is simply what the underlying factors are, the investigator conducts an exploratory factor analysis. The factors in factor analysis are conceptualized as “real world” entities such as depression, anxiety, and disturbed thought. This is in contrast to principal components analysis (PCA), where the components are simply geometrical abstractions that may not map easily onto real world phenomena. In calculation, PCA decomposes the total variance of the variables, whereas factor analysis models only the variance shared among the variables (the communalities) and treats the rest as unique variance.
You designed a questionnaire for a study. After a test run you notice that questions 1 and 3 are answered in exactly the same way by all participants. Why should you reconsider your questions?
There is no variance in the answers to these questions, so they carry no information that distinguishes the participants.
You use three variables to measure fish in two ponds and you are interested in the differences between the two populations. The uncorrected p-values for the group differences (e.g. t-test) are:
body weight: 0.04
body length: 0.01
length of tail fin: 0.005
If you want to accept a collective alpha-error of 5% after a Bonferroni correction, for which of the three variables do you assume a significant group difference?
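0.05 (5%) / 3 (number of tests) = 0.0167 as the corrected significance threshold.
Body length (0.01) and length of tail fin (0.005) are below the threshold and remain significant; body weight (0.04) does not.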
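The Bonferroni arithmetic for this card can be written out as a short sketch. The p-values are the ones given in the question; the variable names are chosen here for illustration.

```python
# Uncorrected p-values from the question
p_values = {
    "body weight": 0.04,
    "body length": 0.01,
    "length of tail fin": 0.005,
}

alpha = 0.05                          # collective (family-wise) alpha error
threshold = alpha / len(p_values)     # Bonferroni: divide alpha by the number of tests

# A variable counts as significant only if its p-value is below the corrected threshold
significant = [name for name, p in p_values.items() if p < threshold]

print(round(threshold, 4))
print(significant)
```

An equivalent approach is to multiply each p-value by the number of tests and compare the result with the original 5%.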
Plants and fertilizer: height of plant (in cm) = Y and amount of fertilizer (in g) = X.
Interpret the following result (describe equation in words): Y = 10.3 + 1.7 X
What should you control for in this experiment?
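Fertilizer (X) has a positive linear effect on plant height (Y).
The intercept is 10.3: a plant without fertilizer is predicted to be 10.3 cm high.
The slope is 1.7: plants are predicted to be 1.7 cm higher per additional g of fertilizer.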
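The equation can be read as a small prediction function. This is only a sketch of the arithmetic; the function name is hypothetical and the fertilizer amounts are made-up example inputs.

```python
def predicted_height_cm(fertilizer_g):
    """Predicted plant height from the fitted line Y = 10.3 + 1.7 X."""
    return 10.3 + 1.7 * fertilizer_g

# Made-up fertilizer amounts in g (assumption, not from the flashcards)
for grams in (0, 1, 2):
    print(grams, predicted_height_cm(grams))

# The difference between two predictions one gram apart is the slope
slope_check = predicted_height_cm(3) - predicted_height_cm(2)
```

Predictions are only meaningful within the range of fertilizer amounts that were actually used in the experiment.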
What does R² usually stand for?
Coefficient of Determination
What does MANCOVA mean?
Multivariate Analysis of Covariance
What is a partial correlation coefficient?
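The partial correlation coefficient is the correlation between x and y after the linear effect of a third variable z has been removed from both, i.e. the correlation between the residuals of the regression of x on z and the residuals of the regression of y on z.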
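The residual-based definition can be sketched in a few lines. The data are made up, the helper functions are hypothetical, and the result is compared with the standard closed-form expression for a first-order partial correlation.

```python
from statistics import mean

def pearson_r(u, v):
    """Pearson correlation coefficient of two equally long samples."""
    mu, mv = mean(u), mean(v)
    suv = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    suu = sum((a - mu) ** 2 for a in u)
    svv = sum((b - mv) ** 2 for b in v)
    return suv / (suu * svv) ** 0.5

def residuals(u, z):
    """Residuals of the least-squares regression of u on z."""
    mu, mz = mean(u), mean(z)
    slope = sum((a - mz) * (b - mu) for a, b in zip(z, u)) / sum((a - mz) ** 2 for a in z)
    return [b - (mu + slope * (a - mz)) for a, b in zip(z, u)]

# Made-up data: z is the variable whose influence is removed (assumption)
z = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x = [2.0, 1.5, 4.0, 3.0, 6.5, 5.0]
y = [1.0, 3.5, 2.5, 6.0, 4.5, 7.5]

# Partial correlation of x and y given z, via the residuals
partial_r = pearson_r(residuals(x, z), residuals(y, z))

# Same quantity from the pairwise correlations
r_xy, r_xz, r_yz = pearson_r(x, y), pearson_r(x, z), pearson_r(y, z)
formula_r = (r_xy - r_xz * r_yz) / ((1 - r_xz ** 2) * (1 - r_yz ** 2)) ** 0.5

print(partial_r, formula_r)  # both routes give the same value up to rounding
```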
What’s the difference between 1-way and 2-way ANOVA?
The 2-way ANOVA extends the 1-way ANOVA to two categorical independent variables (factors) acting on one dependent variable, and it can also test their interaction. The 1-way ANOVA uses just one independent variable.
You recorded 5 physiological measurements in one group of subjects before and after the treatment with a certain drug. How will you proceed statistically to investigate the effect of the drug?
Use a paired (before/after) comparison: every subject serves as their own control, so smaller effects can be detected that would be masked by between-subject variance in a cross-sectional study. Because five measurements are tested, the p-values need a correction for multiple testing (e.g. Bonferroni).
You are interested in whether students, who do have plants in their classroom, get better grades. You record grades in the subjects Maths, German and Biology and conduct your study at 5 different schools.
How many factors are there in your study and which are fixed and which are random?
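There are two factors.
Fixed factor: plants or no plants.
Random factor: school (the 5 schools are a sample of all possible schools).
The grades in Maths, German and Biology are the dependent variables, not factors. Ethnicity or the exact age of the students would be possible covariates.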
You are interested in whether students, who do have plants in their classroom, get better grades. You record grades in the subjects Maths, German and Biology and conduct your study at 5 different schools.
Which statistical analysis would you use?
MANOVA