2nd Stats Exam MCQ Flashcards

1
Q

What is the violation of the assumption of independence? (Multi Level Modelling)

A

When one data point is dependent on another data point (one data point gives info to another data point)
Linear R’s assume independence, so when this is violated you use a multi-level model or logistic R

2
Q

What does the General Linear Model change to once you start using multi-level modelling?

A

It becomes part of the GENERALIZED linear Model

3
Q

What is the difference in calculation between independent and paired samples t test?

A

The way SE/variance is calculated

Independent samples - when the SE is calculated > the variance of both groups is incorporated into the calculation = pooled SE

Paired samples > whatever variance occurs at time 1 also occurs at time 2, e.g. individual differences (hunger, mood, time…)

Therefore you do not need to count the variance at both time points. If you did, it would be double counting, and you would get drastically incorrect p values

4
Q

Just like with Binary logistic regression, we assess mixed effects models using what?

Hint: similar to SSE reduced

A

-2 Log Likelihood

5
Q

What do we find out from the Estimates of Covariance Parameter box? Specifically, the intercept variance box?

A

A p value either under .05 (signif) or over .05 (non-signif)

If the p value is signif, it tells us there is SIGNIF VARIANCE in the intercepts, and hence we were right to conduct an MLM

6
Q

What is the new value introduced in Binary Logistic R, used in MLM? and what is it equivalent to?

A

Wald stat - equivalent to the t score

Calculation:

1) estimate (b value) / SE (found in the estimates of covariance parameters box)
2) then SQUARE
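A quick arithmetic sketch of the two steps above — the b value and SE here are invented, not from any real output:

```python
# Hypothetical estimate (b value) and its standard error.
b_value = 0.84
se = 0.30

z = b_value / se      # step 1: estimate divided by its SE
wald = z ** 2         # step 2: square the result
print(round(wald, 2))
```

With these made-up numbers the Wald statistic comes out at 7.84.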

7
Q

What does the ‘Empirical Best Linear Unbiased Predictions’ box show us?

A

Each ppt’s u0 variable estimate - the difference between their most ideal ‘intercept’ and b0 (the deviation of the ideal intercept from b0)

Later on, a model with random intercepts AND random slopes will include each ppt’s u1 variable estimate too

8
Q

Dependency between data points can also come in the form of what?

A

Clustering (a general problem) - when a subset of ppts are more connected with each other

Whether clustering exaggerates in favour of our hypothesis, or against it, dependency will always distort our analysis from reality

9
Q

If 2 related ppts are both in the post-experiment group, what happens? (CLUSTERING)

A

An increased score for the 1st ppt after the experiment causes the 2nd ppt’s score to increase too, through talking to the 1st ppt

So, the experimental group scores are exaggerated

10
Q

If there are 2 related ppts in study and one is in the control group and one is in the experimental group, what happens? (CLUSTERING)

A

The 1st ppt’s scores will increase after the experiment > this leads to an increase in the 2nd ppt’s score in the control group, through talking to the 1st ppt

This makes the exp group look less effective than it is, because scores in both groups look similar (not much to compare)

11
Q

If 2 related ppts are both in the control group, what happens? (CLUSTERING)

A

An event outside the study could bring the 1st ppt’s mood up or down, thus affecting the 2nd ppt’s mood too

This brings both control scores up (exp looks worse) or down (exp looks better)

The scores are not based on the exp itself!

12
Q

What is hierarchical clustering? An example?

A

When data is naturally grouped at multiple levels

- In schools, for example, children within classes talk to each other, and children within schools do too (but less so)

13
Q

How do we deal with (hierarchical) clustering?

A

Modelling the dependencies - include dependencies (both class and school) in our model as variables

14
Q

When looking at if there is an effect of cosmetic surgery on quality of life, where patients are within clinics, how does clustering occur? (L7)

A
  1. having different clinics - one clinic in one area better than others = ppts from this clinic starting with higher baseline quality of life
  2. Different surgeons within clinics - good surgeons boost quality of life a lot more post surgery in exp group, compared to bad surgeons. Bad surgeon could do the surgery wrong and worsen quality of life!
15
Q

Based on the cosmetic surgery example, what are the variables we need to consider? (L7)

A
  1. Quality of life
  2. Surgery
  3. Clinic
16
Q

For the cosmetic surgery example, what is the formula for the first model where we do not specify random effects yet? (MLM L7)

NOTE: can run a MLM same as a linear regression

A

Quality of life = b0 +b1*Surgery

one way of getting -2LL for Linear R

Also called a No random effects / fixed effects model

17
Q

When does the MLM depart from Linear R?

A

When we start adding random intercepts

18
Q

What variable is added to the original GLM formula when we add random intercepts?

A

The u0 variable

The formula becomes: (b0 + u0[variable the varying intercept is based on]) + b1*Time

19
Q

Based on the cosmetic surgery example, what variable is the varying intercept based on?

A

Clinic > each clinic now has its own intercept (u0) for quality of life

20
Q

Why do we vary the intercepts based on a particular variable in MLM?

A

By changing the intercept, we are effectively allowing DV (quality of life) to ‘start’ at a different point in each variable (clinic)

21
Q

Aside from the p value given in the Estimates of Covariance Parameters box, how can we find whether DV (quality of life) varies between u0 variables (clinics)? (this can be a legitimate research q)

Once the variance is found what is the difference called?

A

Compare how much the DV varies with the random intercept to how much it varies without the random intercept (u0 variable)

-2LL of the current model with random intercepts, minus the -2LL of the previous model with no random intercepts (the linear R model)

Difference = Likelihood ratio
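A minimal sketch of this comparison, with invented -2LL figures (the real values would come from the output for each model):

```python
# Made-up -2LL values for the two models, purely for illustration.
neg2ll_no_random = 1911.47    # linear R model, no random intercepts (hypothetical)
neg2ll_random_int = 1837.49   # model with random intercepts (hypothetical)

# The likelihood ratio is simply the drop in -2LL.
likelihood_ratio = neg2ll_no_random - neg2ll_random_int
print(round(likelihood_ratio, 2))
```

This difference is then checked against a chi-square distribution with DF = 1.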

22
Q

How do you get a chi square p value in the chi square calculator?

A

Enter the likelihood ratio and a DF of 1

23
Q

Andy Field says that for testing random effects, the __1__ is much more accurate, so you should rely on this rather than __2__

A
  1. Likelihood ratio (diff in -2LL between random intercepts model and no random intercepts model(Lin R))
  2. Wald Z (in the covariance parameters box)
24
Q

What in SPSS tells us whether there is in fact significant variance in intercepts between participants?

i.e. whether the ‘best’ intercepts for each participant are significantly different from each other (i.e. whether they needed to be ‘random’)?

A

The ‘Estimates of Covariance Parameters’ box

look at the p value

25
Q

How do we get a ppt’s individual intercept?

A

Add the individual u0 variable (the quality of life estimate for each clinic) to the fixed b0 value

26
Q

If we have random intercepts and slopes for a variable (clinic) what are we effectively running?

A

Separate regressions for each level of the variable (clinic - so 10 regressions)

- which can be combined manually - you just have to split the clinic variable

27
Q

The b values across the 10 regressions are the best slopes for each clinic. So, rather than ___1___ being our data points, we are now making _2___ our data points

A
  1. Patients
  2. Clinics

28
Q

Which t test’s t and p values are equivalent to the MLM’s t and p values?

This compares the manual MLM - 10 data points (the b values from the manual regressions) - to zero (the null) (L7)

A

One sample T Test

  • t = mean of the b values / SE
  • the t given is close to the t in the MLM intercepts row
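A sketch of that manual check, with ten invented b values standing in for the ten per-clinic slopes:

```python
import math
import statistics

# Ten hypothetical b values, one per clinic (invented for illustration).
b_values = [-0.62, -0.45, -0.80, -0.31, -0.55,
            -0.70, -0.48, -0.59, -0.66, -0.40]

mean_b = statistics.mean(b_values)
se = statistics.stdev(b_values) / math.sqrt(len(b_values))  # SE of the mean

t = mean_b / se   # one-sample t against zero, df = 9
print(round(t, 2))
```

A t this far from zero says the clinic slopes are, on average, reliably different from zero — roughly what the MLM output reports.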
29
Q

Why do many researchers think psychology is in trouble? e.g. Brian Nosek, Stephen and others (L8)

A

The Replication Crisis - when studies are replicated, different p results are found (actually non-signif…)

30
Q

How did Nosek demonstrate the replication crisis? (L8)

A

He replicated a random 100 studies, all with signif p values, and found only about a third maintained the signif result

31
Q

What does this replication crisis mean for us? (L8)

A

Belief in Null Hypothesis Significance Testing (p values) drastically declines

32
Q

What is good about human reasoning?

A

We can draw on our knowledge of the world (our current model - our view of someone and the factors affecting their decision), which affects how we interpret ambiguous data

We use our current model of knowledge because we cannot measure the direct info (e.g. someone’s thoughts) that would give us the correct answer

33
Q

What is bad about human reasoning?

A

Confirmation Bias - seeing what we expect to see, followed by biased updating of our beliefs

(a) interpreting ambiguous data in light of your prior beliefs and (b) updating those very same beliefs based on your interpretation of the ambiguous data

34
Q

What scientific method is used in psychology to try and prevent confirmation bias? and who are the famous figures who helped to establish this?

A

Null Hypothesis Significance Testing (NHST)

Francis Bacon, Pearson, Neyman, Popper, Fisher,

35
Q

What is the basic idea behind an experiment? (Bacon)

A

Create a hypothesis - if what we expect occurs under certain circumstances, then we take our theory to be true

i.e. scientists are trying to contrive a situation which will produce unambiguous evidence either for or against the theory

36
Q

We can conduct many studies to support our theories, but according to Karl Popper, what do we also need to do?

A

Conduct studies FALSIFYING our theories too - identifying what would count as evidence against our theory (the idea behind the crucial experiment)

37
Q

The idea of a crucial experiment (Bacon) in science was created for physics, where we can say ‘If my theory is true, this will definitely happen – if it is false, this other thing will happen, or nothing at all’

but why is there an issue using it in psychology (and medicine)?

A

Psychology and medicine deal with PROBABILISTIC THEORIES - if my theory is true, then on average this will happen / on average these people will do better

Therefore, we cannot really follow Bacon’s idea of the crucial experiment - our experiments do not definitively prove a theory wrong > they only provide PROBABILISTIC EVIDENCE

38
Q

The issues with using scientific crucial experiments in psychology allow what to creep back in - the thing we tried to prevent before with the scientific method?

A

Probabilistic theories, and the probabilistic evidence we gather, allow confirmation bias to creep back in

The data is open to interpretation by the researcher - biased prior beliefs in favour of the theory > interpreting the data in a biased way

39
Q

What is the data we get based on Fisher?

A

P values and effect sizes

p value is the probability of getting the current data if the null hypothesis is true

A CONTINUOUS SPECTRUM - the smaller the p, the more significant our data, and the more confidently we can reject the null

40
Q

How do the ideas of analysing our data differ between Fisher and Pearson&Neyman?

A

Fisher - a continuous spectrum for the p value = the smaller the p, the more evidence for our theory

P&N came up with a DECISION RULE in NHST - ‘reject’ the null hypothesis if the p value is less than .05 (a signif result), conveying evidence for our theory

Intended to stop over-interpretation of p values > but it has had the opposite effect

41
Q

How has the replication crisis arisen?

hint: confusion between ideas

A

A mixing of Fisher’s idea and Pearson/Neyman’s idea of analysing data - they should be mutually exclusive (use one or the other)

42
Q

What are the problems in science, then, that allow for the replication crisis? This shows NHST cannot prevent these problems from corrupting science.

HINT: the 4 Horsemen

A

Publication Bias > P-Hacking and bottom drawer effect

HARKing

43
Q

What is publication bias? (and so the bottom drawer effect)

A

The majority of published papers show significant results, and non-signif ones are discarded or not written up

Signif papers seem more interesting and generate more money

Many studies suspiciously report p values just below .05 (reported by Masicampo)

44
Q

What is the bottom drawer effect?

A

The incentive structure is to write and publish papers with signif results and discard any with non-signif results, as they are usually rejected by journals (chucking non-signif findings in the bottom drawer)
= a SKEWED REPRESENTATION

45
Q

What is one technique to use if publication bias is occurring in research?

A

Conducting a META-ANALYSIS, i.e. a statistical review - these try to come to a single statistical overview of the field

46
Q

Instead of throwing non-signif results in “bottom drawer” what is a more nuanced form of publication bias?

A

P-HACKING - nudging our p value towards a signif result - e.g. researchers try out all the different ‘acceptable’ methods and analyses, and pick the one that is significant (CONFIRMATION BIAS)
This is possible because researchers have a certain degree of freedom when running an experiment

47
Q

How can we prevent the bottom drawer effect and P-hacking?

A
  1. Following the Neyman-Pearson approach more strictly - pre-registration (registering experimental and analysis plans before collecting data)
  2. Give up on NHST / the .05 p value cutoff entirely - make p continuous again, as Fisher proposed

The arbitrary significant vs non-significant distinction is removed

48
Q

What is a statistically-legitimate opposition to NHST?

A

Bayesian approach

49
Q

Why is it hard in psychology to produce powerful tests of theories?

A

Because we have weak theories to begin with (and thus vague predictions) - e.g. we say meditating makes us happier - well, by how much? No idea

And weak experiments (unlike in physics)

  • studying complex humans - a messy situation
  • confounds - whether a confound variable explains the signif result, rather than the theory (and whether it does is very subjective)
50
Q

Why do we come to rely on a priori hypothesising?

A

A priori hypothesising suggests that the researcher is on to something impressive, and that the result is not due to confound variables (… many researchers want this, so they may HARK)

51
Q

What is HARKing?

A

Hypothesising After the Results are Known, and pretending the hypotheses were made a priori

It occurs because of the incentive framework - work with impressive a priori hypotheses is more likely to be published, but results are difficult to predict before seeing them = HARKing can help get published

Pre-registering has been proposed to prevent HARKing… OR, if we give up the sanctification of a priori hypothesising, there will be no incentive to HARK (getting rid of NHST)

52
Q

Why do some want to get rid of NHST?

A

We don’t know what our theory predicts (so it is difficult to make a priori hyps), and we don’t know whether our experiment is a good test of our theory before we run it

We don’t have any theories mature enough to use NHST and make hypotheses - we should just do exploratory studies first, with no expectations in mind

53
Q

What is the problem with a-priori hypothesizing?

A

They are based on vague, weak theories to begin with

Confounding variables tend to occur during experiments, causing a priori hyps to weaken - we are not able to make predictions when everything is this complex

We are still in the exploration stage, not the NHST stage

54
Q

What do we mean by removing ‘significance’ thinking?

A

Not saying results are ‘significant at .05’ in an experiment - instead saying the experiment provides a bit of evidence for or against a theory

No a priori hyps needed - it doesn’t matter if you predicted it beforehand or not - it’s just one piece in the puzzle - in a decade or so we’ll have some idea if the theory is true or not

55
Q

Why will it prove hard to remove ‘significance’ thinking and a priori hypothesising?

A

The privatised, capitalist journal system, which prioritises ‘interestingness’

56
Q

(L7) Instead of getting a whole-model R and F value, what do we get in MLM?

A

a whole model -2LL

we do get individual predictor r and f values in the fixed effects box

57
Q

(L6) Give an example of something you measure indirectly?

A

Cognitive ability (what we actually want to measure) through scores on an IQ test (tools we use to measure this indirectly)

58
Q

What Q should we keep in mind always regarding measurement theory?

A

What is the relationship between my measurement tool and the thing I actually want to measure? (relationship being distance)

59
Q

What is the relationship between the external (indirect measure) tool and the internal measure (that is not directly observable)?

A

The validity of that measurement tool, i.e. to what extent are we measuring what we intend to measure? The extent is far from perfect… (far from an r2 of 1)

60
Q

Why is there never a perfect relationship between what a tool is measuring and what we really want to measure?

A

NUISANCE VARIABLES - there are many other factors that we do not want to measure that affect responses on the measurement tool.

This prevents us from actually measuring what we want to measure = adds noise to your measurement tool

e.g. daily mood, hunger, time

61
Q

When we collect data with our measurement tools, what do we get back?

A

different DATA TYPES - NOMINAL, ORDINAL, INTERVAL or RATIO

data type might not always match the data type of the thing we actually want to measure

62
Q

What is nominal data?

A

Data with no particular order e.g. gender, eye colour

63
Q

What is ordinal data?

A

Has a particular order e.g. ranks in the army, race positions or a likert scale with non-symmetrical points (lopsided)

64
Q

What is interval data?

A

Has a particular order and equal distances between points e.g. likert scale with symmetrical points (very bad, bad, neutral, good, very good)

65
Q

What is ratio data?

A

Has a particular order, equal distances between points and true zero

if someone scores a ‘zero’ on your measurement tool, they also have ‘zero’ on the thing you’re trying to measure e.g. reaction time (height and weight)

66
Q

When does data become less crude?

A

When it moves from nominal > ordinal > interval > ratio

- you can conduct more sophisticated calculations on data further to the right

67
Q

How many points is recommended on a likert scale?

A

5 or 7 points to balance sufficient precision while not overloading your responder

68
Q

What is another thing to think about when deciding the data type of your measurement tool?

A

How your responders INTERPRET your scale - they may interact with the scale in an ordinal manner, rather than an interval manner

e.g. a five-star rating system - the data points are not equally distributed > people are overly generous with 5-star ratings, and 4 stars doesn’t mean ‘good’ anymore…

69
Q

Why is it important to think about whether your data is ordinal or interval?

A

If you get them confused - if the data is actually ordinal, the mean will NOT be the central point, as intended with interval data

70
Q

Give an example where the same measurement tool used to measure one thing might be interval and used to measure another thing that might be ratio

A

Measurement tool = number of incidents

The data gathered is ratio when we want to measure the reduction in challenging behaviour (external - directly observable) - 0 incidents = 0 challenging behaviours (a true zero)

The data gathered is interval when we want to measure happiness (internal - not directly measured) - the no. of incidents then becomes an indirect measure, because a reduction in incidents might not indicate an increase in happiness…

71
Q

What do we call data that have only two ‘points’ on the scale, for example questions with only ‘Yes / No’ responses, or ‘True / false’, responses.

Why is this type of data dealt with in a special way?

A

Binary data

Special - we are unsure whether two points have a particular order, and to have equal distances you need at least 3 points

72
Q

When is logistic regression used?

A

When a criterion variable is binary

Lin R can only handle criterion variables that are at least interval data

73
Q

Give an example of a phenomenon described as binary?

A

Down’s syndrome - a condition where you either have it or you don’t (genetic mechanism)

74
Q

For many disorders the way they are measured and the way they are diagnosed varies, how so?

A

A disorder (e.g. autism) may be measured on a spectrum (not directly measurable), but diagnosed in a binary way (either have condition or you don’t based on cut off point)

75
Q

(L6) what are standard ways of coding criterion variables for logistic regression e.g. having a disease and not having a disease?

A

Only dummy coding is used:
Having the disease = 1
Not having the disease = 0

76
Q

What happens if you put a binary criterion variable into a Lin R model?
Why is this a problem?

A

It produces a line with NO BOUNDS - it goes to infinity in both directions

It needs to make predictions that stay within its max 1 and min 0 bounds

78
Q

What does a graph of predictions show when the binary criterion model is run in Log R?

A

A SIGMOIDAL CURVE, not a straight line

79
Q

What causes the difference between graphs in Lin and Log R?

from a linear line > a sigmoidal curve

A

The right side of the normal linear regression formula is transformed

the Lin R formula is multiplied by -1, e is raised to the power of that, 1 is added, and then 1 is divided by the whole thing!
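The steps read as one formula: P(Dis) = 1 / (1 + e^(-(b0 + b1*X))). A small sketch with invented b values:

```python
import math

def predicted_probability(b0, b1, x):
    model = b0 + b1 * x          # the ordinary linear regression formula
    flipped = -1 * model         # step 1: multiply by -1
    decayed = math.exp(flipped)  # step 2: raise e to the power of that
    return 1 / (1 + decayed)     # step 3: add 1; step 4: divide 1 by the result

# Hypothetical coefficients and predictor score, purely for illustration.
print(round(predicted_probability(-4.0, 0.08, 60), 2))
```

Whatever x you feed in, the output stays strictly between 0 and 1 — the sigmoidal curve described below.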

80
Q

How is the left side of the normal linear regression formula transformed in Log R?

A

We are now predicting the PROBABILITY of having the disease = P(Dis) (rather than just predicting the disease = Dis)

In logistic regression we are looking at predictions of the probability that an individual with the given cause levels has the disease (or whatever you coded as ‘1’)

81
Q

What values change when we transform the normal linear regression formula?

A

b0 and b1 values

82
Q

What happens to the graph when we multiply the normal linear regression formula by -1?

A

It FLIPS the direction of the line

83
Q

What is the second step for the Log R transformation?

A

Raising e to the power of the model so far
(e is 2.71828)

we raise a number (any number) to the power of the linear regression formula which has dramatic effect on the line

84
Q

What happens to the line on the graph when we complete step 2 of the Log R transformation?

(step 2 - raising e to the power of the model)

A

An IDEAL SLIDE shape - the steepness of the slope of the line is reduced increasingly as it approaches zero, so it never quite reaches zero (i.e. it is bounded at zero).

85
Q

What is the third step for the Log R transformation?

A

Adding 1 to the model so far, giving 1 + e^(-model) (so that the final division in step 4 stays bounded)

86
Q

What step in the Log R transformation do you divide 1 by the model so far? and what happens to the line on the graph?

A

Step 4 - as the line approaches 1, the slope of the line becomes less steep, so it never quite reaches 1

We end up with a SIGMOIDAL CURVE bounded at 0 and 1, like our criterion variable (producing probabilities of 1 - having the disease)

87
Q

Why can we not use SSE to assess our model in Log R?

A

because this model produces predictions of probability rather than just predictions of a continuous variable e.g happiness

88
Q

When do we use the left vs right side of the LL stat formula for assessing the model?

A

LEFT - If a ppt is coded 1, so has the disease

RIGHT - if a ppt is coded 0, so doesn’t have the disease

89
Q

why do we have ln in our LL formula? (L6 assessing model)

A

To undo the fact that we raised e to the power of our model - ln(x) is the ‘inverse’ of e^x

90
Q

for individual scores, because we are dealing with values between 0 and 1, they will always come out ____ after we apply ln (L6)

A

NEGATIVE - the ln of any value between 0 and 1 is negative

91
Q

when working out individual scores of LL, ____ values are better, i.e. they mean the probability prediction was closer to the correct answer.

A

SMALLER (closer to zero, i.e. less negative)

92
Q

If we get a larger number for the LL of an individual score, what does this mean?

A

the prediction is less good - say the model predicted a small chance of having the disease [P(Dis) of .19], but they actually had the disease (coded 1) = a larger number (further from zero)

93
Q

What do we do once we work out the LL of the individual scores?

A

We sum them up - equivalent to how we sum all the individual squared errors in linear regression

This gives us a FINAL LL value (a larger negative value = a worse model)
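A toy version of that summing, with four invented ppts (the outcomes and predicted probabilities are made up):

```python
import math

# 1 = has the disease, 0 = doesn't; predicted P(Dis) from some hypothetical model.
outcomes = [1, 1, 0, 0]
predicted = [0.90, 0.60, 0.20, 0.40]

# The left term of the LL formula applies when coded 1, the right term when coded 0.
final_ll = sum(y * math.log(p) + (1 - y) * math.log(1 - p)
               for y, p in zip(outcomes, predicted))
print(round(final_ll, 3))
```

Each ppt’s contribution is negative (ln of a value between 0 and 1), so the worse the predictions, the further the final LL sinks below zero.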

94
Q

What do we use when calculating the LL stat formula?

A

DISi (1 = has the disease, 0 = doesn’t)

P(Dis) - the predicted probability that they have the disease

95
Q

What do we do once we have the final LL value? (the sum of the individual LL scores)

Why do we do this?

A

Multiply the final LL value by -2 = -2LL (now positive, since we multiplied by -2)
-2LL is known as the deviance

because the final value then has a CHI2 DISTRIBUTION - the p value can be calculated easily on a computer

96
Q

When we have the -2LL, how do we know if it is a worse or better model?

A

higher positive values now mean =worse model

smaller values = better model.

97
Q

What does the chi squared output in the SPSS omnibus test box mean?

A

LIKELIHOOD RATIO - The difference in -2LL between our current model (model with predictor e.g. TauC) and model with no predictors

98
Q

What does the p value for the chi squared output in the SPSS omnibus test box mean?

A

The probability of getting the LR, if the NULL HYP is true

i.e. if the new model is not any better than the model with no predictors.

99
Q

What is -2LL equivalent to?

A

SSE Left (the error left after including the current model with the predictor)

100
Q

How do we calculate the -2LL Total / SSE Total?

the total answer is the -2LL for the model with no predictors

A

-2LL for current model + CHI2 output (LR, equivalent to the SSE Reduced)

101
Q

What is the Wald stat equivalent to in Lin R?

A

The individual predictors’ coefficients box in Lin R

102
Q

How do we manually calculate the Wald stat for the predictor?

A
  1. b value for the predictor / SE for the predictor
  2. square the answer

103
Q

What does Exp(B) tell us?

A

The change in the odds of having the disease for every one-unit increase in the predictor

104
Q

What’s the formula for converting probability into odds?

A

P/(1-P)

105
Q

What’s the formula for converting odds into probability?

A

Odds/(1+Odds)

106
Q

How do we get the odds of getting the disease for every 1 increase in predictor?

A

The odds of getting the disease (or not getting it) at that predictor score, MULTIPLIED by the Exp(B) value for that predictor
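A sketch of the whole odds pipeline with invented figures (the baseline probability and Exp(B) are made up):

```python
def prob_to_odds(p):
    return p / (1 - p)

def odds_to_prob(odds):
    return odds / (1 + odds)

exp_b = 1.5                  # hypothetical Exp(B) for the predictor
odds = prob_to_odds(0.20)    # odds at the current predictor score: 0.25
new_odds = odds * exp_b      # odds after a one-unit increase: 0.375

# Convert back to a probability if needed.
print(round(odds_to_prob(new_odds), 3))
```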

107
Q

(L9) What is a bayes question?

A

a backwards inference question (posterior)

Bayes wanted to update people’s prior beliefs based on new evidence

108
Q

What is forward inferencing?

What is backwards inferencing? (L9)

A

FORWARDS - KNOWN CAUSES > KNOWN EFFECTS

BACKWARDS - KNOWN EFFECTS > BACK TO KNOWN CAUSES (more tricky)

109
Q

what is the order for answering the backwards problem? (L9)

A
  1. start with initial prior belief for the probability
  2. update this prior belief based on probability of getting new data
  3. arrive at a new posterior belief level

So we want to update the probability of a hypothesis as more evidence becomes available

110
Q

What is the difference between the false positive rate and the total false positives? (L9)

A

The total number of false positives depends not only upon the rate, but also upon how many people there are WITHOUT disease

111
Q

Why do we end up with more FPs in real life? (L9)

A

Most diseases are rare so we have a large number of individuals without the disease and a small FP rate.

So when we multiply the large no. of people without disease by the FP rate = large FP total

112
Q

How do we find out the chances of positive disease result being true positive? (L9) TREE METHOD

I.e If an individual gets a positive result, what is the chance that they actually have the disease?

A

true positive number / total positive results

total positive results = TP number +FP number
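The tree method with invented figures — a disease affecting 1% of 10,000 people, a 90% TP rate and a 5% FP rate:

```python
population = 10_000
prevalence, tp_rate, fp_rate = 0.01, 0.90, 0.05  # all hypothetical

with_disease = population * prevalence        # 100 people
without_disease = population - with_disease   # 9,900 people

true_positives = with_disease * tp_rate       # 90 true positives
false_positives = without_disease * fp_rate   # 495 false positives

# Chance that a positive result is a true positive:
print(round(true_positives / (true_positives + false_positives), 3))
```

So even with a 90% TP rate, a positive result here means only about a 15% chance of actually having the disease — the backwards-inference answer, not the forward TP rate.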

113
Q

When people are figuring out - if you get a pos result, what is the probability of the result actually being a true positive (actually having the disease) rather than a FP?
(a backwards inference problem) - what is the common mistake they make? (L9)

A

they give the TP RATE % or the FP RATE % as the answer
thus, confusing forward and backwards inference

Should have done = true positive number / total positive results

114
Q

We can use the tree method, or what else?

A

Bayes-LaPlace formula

115
Q

Both the Bayes-LaPlace formula, and the tree method, can be applied to scientific inference to help us understand how it works, and what is wrong with _____ focused statistics

A

p value (focused statistics)

116
Q

How can we lay out the scientific process with respect to hypotheses? (bayesian approach)

A

Prior beliefs about hypothesis + new data = posterior belief about hypothesis

117
Q

What figure in Bayes-LaPlace formula sounds quite a lot like the usual definition of the p value?

But how do the two differ?

A

P(Da|¬Hy) - probability of getting the current data if our hypothesis is not true i.e. if the null hypothesis is true

the only difference - the p value is the probability of getting the current data OR MORE EXTREME if the null hypothesis is true

118
Q

So, if the p value is part of the bayes la place formula as P(Da|¬Hy), what does this show? (highlights issue with NHST)

A

the p value is only a small part of the whole picture, despite most papers reporting only the p value

to find out the probability that our hypothesis is true, given our data (i.e. the posterior), the p value has to be combined with all these other figures – it absolutely cannot tell us that all by itself.
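
A minimal sketch of this point, with all three input figures assumed purely for illustration: the posterior P(Hy|Da) from the Bayes-LaPlace formula needs the prior and P(Da|Hy) as well as the p-value-like term P(Da|¬Hy):

```python
# All three inputs are assumed for illustration.
prior = 0.5                  # P(Hy): prior belief in the hypothesis
p_data_given_hy = 0.30       # P(Da|Hy)
p_data_given_not_hy = 0.04   # P(Da|¬Hy): the p-value-like term

# Bayes-LaPlace formula: posterior P(Hy|Da)
posterior = (prior * p_data_given_hy) / (
    prior * p_data_given_hy + (1 - prior) * p_data_given_not_hy
)
print(round(posterior, 3))   # → 0.882
```

Change the prior or P(Da|Hy) and the posterior moves even though P(Da|¬Hy) stays fixed - the p-value-like term alone cannot pin down the posterior.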

119
Q

What is a common version of the transposed conditional fallacy?

A

Confusing P(Da|¬Hy) (the p value) with P(¬Hy|Da), the probability of the null hypothesis being true given the data (a posterior)

This mistake confuses ‘forward’ and ‘backwards’ inference - in natural language the difference is hard to detect

120
Q

What’s problem 1 with the Bayesian approach?

Why is it perhaps not a big problem?

A

The prior - who decides what the prior belief in the hypothesis is? It is SUBJECTIVE

but it becomes less of a problem as you gather more evidence - the data speak for themselves and you converge on a true posterior belief
thus, your belief will change according to the data in the Bayes formula!

121
Q

What is problem 2 with the bayesian approach?

A

Don’t know our ALT HYP / P(Da|Hy) - we don’t have a specific value in mind for ‘Hy’, unlike when it comes to the ¬Hy (0).
Thus, if we don’t have a value for ‘Hy’ we can’t calculate a probability for P(Da|Hy).

due to SOFT SCIENCE = weak theories and predictions (just predicting a therapy outperforms control by more than 0)

122
Q

So, why is there a legitimate reason for focusing our statistics on P(Da|¬Hy)? And why did Pearson and Fisher base their statistical system around it?

A

only figure we can objectively calculate - so the only figure all scientists can agree on
unlike P(Da|Hy), which is based on weak theories and vague predictions

123
Q

When is bayesian approach usually used?

A

META-ANALYSES of several experimental papers

Meta-analyses calculate P(Da|Hy) using the figures in papers they’re examining (sample mean and SE).

124
Q

Why does NHST go too far with looking at the p value for the null hypothesis?

A

it assigns a cut-off point to the p value (alpha, .05), turning the evidence into a binary signif/non-signif decision

125
Q

What 2 values in the Bayes-LaPlace formula combined constitute the ‘evidential value’ of the data?

A

P(Da|Hy) and P(Da|¬Hy)

126
Q

How do we get the likelihood ratio from P(Da|Hy) and P(Da|¬Hy) values?

A

LR = P(Da|Hy) / P(Da|¬Hy)

(equivalent to SSE-Reduced) = support for alt hyp if it is more than 1 (i.e. the data are more likely under Hy than under ¬Hy)

127
Q

What happens if our calculated posterior belief is higher than our prior belief?

A

Our belief that the hypothesis is true has risen from the prior belief value to the posterior belief level (e.g. from .01 to .23)
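
The odds form of Bayes’ rule gives a compact way to sketch this kind of update; the LR of ~29.6 here is back-calculated just to reproduce the card’s example figures (.01 rising to roughly .23) and is purely illustrative:

```python
prior = 0.01                 # prior belief, from the card's example
lr = 29.6                    # P(Da|Hy) / P(Da|¬Hy) - assumed for illustration

prior_odds = prior / (1 - prior)   # ~0.0101
posterior_odds = prior_odds * lr   # posterior odds = prior odds x LR
posterior = posterior_odds / (1 + posterior_odds)
print(round(posterior, 2))         # → 0.23
```

An LR well above 1 (data much more likely under the hypothesis than under the null) is what pushes the posterior above the prior.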

128
Q

The whole original point of NHST is to limit our _____ rate to a fixed value, and ___ has become the standard

129
Q

What does the true positive number calculated show? (L9)

A

if the NULL is FALSE, the probability of CORRECTLY getting a SIGNIF result

130
Q

What is the true positive RATE also known as?

A

The amount of POWER

131
Q

What is the typical required power / TP rate for most experiments in psych?

A

0.8
the power should be greater than the alpha (FP rate) .05
(but most psych experiments have way less power)

132
Q

What type of error is FP rate and FN rate?

A

FP = a TYPE 1 error rate (alpha)
FN = a TYPE 2 error rate (beta)

133
Q

Which rate is the opposite of power?

and so opposite of true positive rate

A

False negative rate - when the hypothesis really was true, but you got a non-significant result i.e. incorrectly accepted the null hypothesis.

134
Q

We can set type 1 error rate (FP rate) as .05, but setting type 2 error (FN) rate is dependent on what 4 things?

A

Alpha (FP rate) - the smaller this is, the smaller the type 1 error but the larger the type 2 error (lower power)

Sample size - bigger sample = greater power and smaller type 2 error

True effect size - the bigger the effect, the easier it is to detect and the lower the type 2 error
(smaller effect = lower power/TP rate)

Sample variance - we are looking for the true effect in the noise/variance - an effect size vs noise ratio (more variance = less power)
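
These four influences can be sketched with a one-sided one-sample z test; the choice of test and all the numbers are illustrative assumptions, not from the lecture:

```python
from statistics import NormalDist

def power(effect, sd, n, alpha=0.05):
    """Probability of correctly getting a significant result when the null is false."""
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha)           # cut-off implied by alpha (the FP rate)
    se = sd / n ** 0.5                      # noise shrinks as the sample grows
    return 1 - z.cdf(z_crit - effect / se)  # 1 - type 2 (FN) error rate

base = power(effect=0.5, sd=1.0, n=25)              # ~.80, the usual target
assert power(effect=0.5, sd=1.0, n=50) > base       # bigger sample -> more power
assert power(effect=0.8, sd=1.0, n=25) > base       # bigger true effect -> more power
assert power(effect=0.5, sd=2.0, n=25) < base       # more variance -> less power
assert power(effect=0.5, sd=1.0, n=25, alpha=0.01) < base  # smaller alpha -> less power
```

Each assertion checks one of the four factors from the card, holding the other three fixed.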

135
Q

If you set a high alpha level, power will ___

A

increase

e.g. if alpha is instead set at .01, the effect is harder to detect, thus lower power

136
Q

In MLM, if you’re asked what is the slope for variable 5 (out of 10 since variable has been split to allow for random slopes), how do you calculate the slope?

A

b1 (in the fixed effects box) + u1

(do not just look at the u1 slope variable in the unbiased predictions box! or the u0 intercept variable if asked about intercepts!)

137
Q

How do we work out the H&L r2 in Log R?

A

LR / -2LL of the No Predictor Model