QDA2 Flashcards

Question 1

Q

Variance

Answer

A

statistical measurement (1) of the spread (2) between numbers in a data set (3)
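
A minimal sketch of how the spread is computed, assuming a small made-up data set (the numbers are hypothetical). It shows the sample variance by hand and checks it against Python's standard `statistics` module.

```python
from statistics import mean, pvariance, variance

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data set

m = mean(data)
# Sample variance: sum of squared deviations from the mean, divided by n - 1
manual_sample_var = sum((x - m) ** 2 for x in data) / (len(data) - 1)

sample_var = variance(data)       # same formula, from the standard library
population_var = pvariance(data)  # divides by n instead of n - 1

print(manual_sample_var, sample_var, population_var)
```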

Question 2

Q

Why do we use PCA? (2 reasons)

Answer

A

1) to go from a large number of variables to a smaller number of variables
2) it helps to show which variables are strongly related and which are separate

Question 3

Q

Data requirements for PCA

Answer

A

1) quantitative variables (normally distributed, based on correlation)
2) large numbers of observations
3) strong correlation amongst variables (>0.3)

Question 4

Q

What does the b stand for in the equation for PCA?

Answer

A

component loadings (= how strongly each variable relates to the component)

Question 5

Q

difference between common and unique variance

Answer

A

Common variance is variance that a variable shares with other variables in the data set; unique variance is variance that is specific to one variable.

Question 6

Q

communality

Answer

A

the proportion of common variance that is present in a variable; the higher the communality, the more common variance

Question 7

Q

What does EV > 1 (eigenvalue greater than 1) mean?

Answer

A

the component explains more variance than an individual variable
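
A small sketch of why 1 is the cut-off, assuming PCA on standardized variables: each variable then contributes a variance of 1, so the eigenvalues add up to the number of variables. The two-variable case and the value of `r` are hypothetical; for a 2 x 2 correlation matrix the eigenvalues are simply 1 + r and 1 - r.

```python
def kaiser_criterion(eigenvalues):
    """Keep only the eigenvalues of components that pass EV > 1."""
    return [ev for ev in eigenvalues if ev > 1]

r = 0.6  # hypothetical correlation between two standardized variables

# Eigenvalues of the correlation matrix [[1, r], [r, 1]]
eigenvalues = [1 + r, 1 - r]

total = sum(eigenvalues)                    # equals the number of variables
retained = kaiser_criterion(eigenvalues)    # components worth more than one variable
share = [ev / total for ev in eigenvalues]  # proportion of variance per component

print(retained, share)
```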

Question 8

Q

What does the Kaiser-Meyer-Olkin (KMO) test measure?

Answer

A

compares observed correlations with partial correlations.

Question 9

Q

What is the conclusion for the Kaiser-Meyer-Olkin test?

Answer

A

KMO = x and x > 0.5, so there is more shared variance than unique variance.

Question 10

Q

What is the conclusion for Bartlett's test?

Answer

A

p = x, so the test is significant. This means that the correlations between items differ from zero. H0 = all variables are uncorrelated, so we can reject H0.
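
A rough sketch of where that p-value comes from, restricted to the two-variable case so it runs with the standard library only. It uses the usual formula chi2 = -(n - 1 - (2p + 5) / 6) * ln|R| with df = p(p - 1) / 2. The function name and the example values of `r` and `n` are hypothetical; SPSS reports these numbers for you.

```python
import math

def bartlett_sphericity_2vars(r, n):
    """Bartlett's test of sphericity for two variables with correlation r."""
    p = 2                      # number of variables
    det_R = 1 - r ** 2         # determinant of [[1, r], [r, 1]]
    chi2 = -(n - 1 - (2 * p + 5) / 6) * math.log(det_R)
    df = p * (p - 1) // 2      # = 1 for two variables
    # Upper-tail probability of a chi-square with 1 df
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, df, p_value

chi2, df, p_value = bartlett_sphericity_2vars(r=0.5, n=100)
print(chi2, df, p_value)  # small p-value: reject H0 of uncorrelated variables
```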

Question 11

Q

What is component rotation?

Answer

A

It redistributes the explained variance over the components and makes the component loadings more extreme. This makes it easier to assign variables to components and to interpret the components.

Question 12

Q

What is the conclusion for R²?

Answer

A

the model explains x% of the variance in the OV
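
A minimal sketch of how that percentage is obtained, assuming you have observed OV scores and the model's predicted scores (both lists here are made up): R² = 1 - SS_residual / SS_total.

```python
def r_squared(observed, predicted):
    """Proportion of variance in the OV explained by the model."""
    mean_obs = sum(observed) / len(observed)
    ss_total = sum((y - mean_obs) ** 2 for y in observed)
    ss_residual = sum((y - y_hat) ** 2 for y, y_hat in zip(observed, predicted))
    return 1 - ss_residual / ss_total

observed = [3, 5, 7, 9]           # hypothetical OV scores
predicted = [2.5, 5.5, 6.5, 9.5]  # hypothetical model predictions

r2 = r_squared(observed, predicted)
print(f"The model explains {r2:.0%} of the variance in the OV")
```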

Question 13

Q

When it comes to the follow-up tests, what do we use if the variances are not homogeneous?

Answer

A

Welch’s test

Question 14

Q

What is the disadvantage of post hoc tests?

Answer

A

artificial inflation of alpha
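
A quick sketch of how fast alpha inflates, assuming the pairwise tests are independent (a simplification) and each is run at alpha = 0.05. The number of groups is hypothetical.

```python
from math import comb

def familywise_error(alpha, n_tests):
    """Chance of at least one false positive across n independent tests."""
    return 1 - (1 - alpha) ** n_tests

groups = 4                 # hypothetical number of groups
n_tests = comb(groups, 2)  # every pair of groups compared once
fw_alpha = familywise_error(0.05, n_tests)

print(n_tests, round(fw_alpha, 3))
```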

Question 15

Q

What do you do if an item loads on an unexpected component?

Answer

A

You do nothing. Apparently the items correlate more strongly with a latent factor that is not the one they were expected to load on. This doesn't make the analysis unreliable; it basically indicates that it is good that you did the analysis.

Question 16

Q

What does Tom have to do after component rotation, before he goes further with his analysis?

Answer

A

Recode the items with negative loadings

Question 17

Q

5 criteria for ANOVA

Answer

A

1) categorical PV, quantitative OV
2) residuals are normally distributed
3) homogeneity of variances
4) groups are mutually exclusive (no overlap)
5) equal sample sizes

Question 18

Q

What is the H0 of ANOVA?

Answer

A

H0: µ1 = µ2 = µ3 (all group means are equal)
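
A minimal sketch of the F statistic that tests this H0, assuming three small made-up groups: F is the variance between the group means divided by the variance within the groups.

```python
def one_way_anova_f(groups):
    """F = mean square between groups / mean square within groups."""
    all_values = [x for g in groups for x in g]
    grand_mean = sum(all_values) / len(all_values)
    # Between: how far each group mean lies from the grand mean
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within: how far each score lies from its own group mean
    ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)

groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]  # hypothetical scores per group
f_value = one_way_anova_f(groups)
print(f_value)
```

A large F suggests that at least one group mean differs; the p-value that belongs to it comes from the F distribution with (df_between, df_within) degrees of freedom.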

Question 19

Q

categorical vs quantitative

Answer

A

Categorical: nominal, ordinal
Quantitative: ratio, discrete, interval (temperature), continuous (weight, age)

Question 20

Q

Make a contrast table and show the criteria

Answer

A

Do the weights sum to zero horizontally? Do the products of the weights sum to zero vertically (orthogonality)? Why is orthogonality needed: groups that are used together in one part of a contrast cannot be used again, otherwise alpha is inflated.
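
The two checks can be sketched in a few lines, assuming three groups and two made-up contrasts (the weights are hypothetical): each contrast's weights must sum to zero, and the products of the weights of two contrasts must sum to zero.

```python
def sums_to_zero(weights):
    """Horizontal check: the weights within one contrast add up to zero."""
    return sum(weights) == 0

def are_orthogonal(contrast_a, contrast_b):
    """Vertical check: the products of paired weights add up to zero."""
    return sum(a * b for a, b in zip(contrast_a, contrast_b)) == 0

# Hypothetical weights for groups 1, 2 and 3
contrast_1 = [-2, 1, 1]  # group 1 versus groups 2 and 3
contrast_2 = [0, -1, 1]  # group 2 versus group 3

print(sums_to_zero(contrast_1), sums_to_zero(contrast_2))
print(are_orthogonal(contrast_1, contrast_2))
```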

Question 21

Q

If they ask which PV has the strongest effect on the OV, we use

Answer

A

partial eta squared

Question 22

Q

main effect on direction and position

Answer

A

direction: core PV, position: legend

Question 23

Q

When you have neither the p-value for the t statistic nor the CI, and you need to check whether a PV will have an impact on the OV, what do you do?

Answer

A

If |t| = 2 or |t| > 2, the PV has a significant impact; otherwise t is just too low to have an impact.
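
A small sketch of this rule of thumb, assuming the output gives the coefficient b and its standard error (t = b / SE). The function names and example values are hypothetical.

```python
def t_statistic(b, se):
    """t value of a regression coefficient: estimate divided by its standard error."""
    return b / se

def roughly_significant(b, se, cutoff=2.0):
    """Rule of thumb: |t| of about 2 or more counts as significant."""
    return abs(t_statistic(b, se)) >= cutoff

print(roughly_significant(0.50, 0.20))   # t = 2.5
print(roughly_significant(-0.90, 0.30))  # t is about -3, the sign does not matter
print(roughly_significant(0.10, 0.20))   # t = 0.5
```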

Question 24

Q

use two statistics to conclude if the second model is better than the first

Answer

A

F change (with its p-value) and the R² / adjusted R² value

Question 25

Q

What is multicollinearity and what is the problem with it?

Answer

A

Two or more PVs in a multiple regression model are highly correlated, which affects the significance testing of the coefficients.
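
A rough sketch of how the overlap between two PVs can be quantified, assuming two made-up predictors. It computes their Pearson correlation and, as one common diagnostic that is not named on this card, the variance inflation factor, which for two predictors is 1 / (1 - r²).

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equally long lists of scores."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ss_x = sum((a - mx) ** 2 for a in x)
    ss_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

def vif_two_predictors(r):
    """Variance inflation factor when the model has exactly two PVs."""
    return 1 / (1 - r ** 2)

pv_1 = [1, 2, 3, 4, 5]  # hypothetical predictor scores
pv_2 = [2, 4, 5, 4, 5]

r = pearson_r(pv_1, pv_2)
print(round(r, 3), round(vif_two_predictors(r), 2))
```

The higher the correlation between the PVs, the larger the standard errors of their coefficients become, which is why the t tests of the individual coefficients suffer.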

Question 26

Q

Why do we use dummy variables?

Answer

A

If you enter Education as a PV, SPSS will treat the nominal categories as numeric, meaning it will estimate a linear effect for this PV, which makes no sense for nominal PVs.
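
A minimal sketch of dummy coding, assuming a nominal Education variable with three made-up categories and "low" as the reference category. Each remaining category gets its own 0/1 variable, so no linear order is imposed.

```python
def dummy_code(values, reference):
    """Turn a nominal variable into 0/1 dummies, leaving out the reference category."""
    categories = sorted(set(values) - {reference})
    return [{f"is_{c}": int(v == c) for c in categories} for v in values]

education = ["low", "middle", "high", "middle"]  # hypothetical respondents
dummies = dummy_code(education, reference="low")

for level, row in zip(education, dummies):
    print(level, row)
```

With k categories you get k - 1 dummies; a respondent in the reference category scores 0 on all of them.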

Question 27

Q

Why is the F test used?

Answer

A

The additional explanatory power of model 2 compared to model 1 is formally tested in the F-change test.
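
A sketch of the F-change statistic, assuming the standard formula for nested regression models: the gain in R² per added PV, divided by the unexplained variance of model 2 per residual degree of freedom. The R² values, numbers of PVs and sample size are hypothetical.

```python
def f_change(r2_model1, r2_model2, k1, k2, n):
    """F change for nested models with k1 < k2 predictors and n observations."""
    gain_per_added_pv = (r2_model2 - r2_model1) / (k2 - k1)
    unexplained_per_df = (1 - r2_model2) / (n - k2 - 1)
    return gain_per_added_pv / unexplained_per_df

# Hypothetical output: model 1 has 2 PVs, model 2 adds a third
f_value = f_change(r2_model1=0.30, r2_model2=0.40, k1=2, k2=3, n=104)
print(round(f_value, 2))
```

The p-value for this F comes from the F distribution with (k2 - k1, n - k2 - 1) degrees of freedom; SPSS reports it as Sig. F Change.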

Question 28

Q