Week 3:Probability Flashcards

1
Q

define inferential statistics

A

uses a random sample of data (our participants) from a population to help us make inferences about a population to help us reach conclusions that reach beyond our data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

define frequentist statistics/what do frequentists do?

A

assign probabilities to datasets (probability of recurrence in a sequence of experiments)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

define null hypothesis significance testing

A

-a method of statistical inferences
-it turns a research question into two hypotheses:alternative and null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

define what a null hypothesis is

A

a statement of no difference (considered to be true until evidence against it)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

define what an alternative hypothesis/experimental is

A

a statement of ‘difference’ ‘association’ or ‘treatment effect’ (it’s what we’re attempting to find evidence for)
you would only ever report this hypothesis, not the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

define what a one-tailed hypothesis is

A

states the ‘direction’ the effect/difference/association will be in (“dogs are more likely to live longer than cats”) used for directional hypothesis 5% sig. level conc. on one tail/end

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

define what a two-tailed hypothesis is

A

doesn’t state a direction (“there’s a difference in the life period of dogs and cats”) used for non-directional (sometimes used regardless of hypothesis) hypothesis 5% sig level split on both tails/ends

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what type of frequency is collected by nominal data?

A

event frequencies e.g. frequencies of males/females in a room, smokers/non-smokers etc.
-descriptive statistics presented as frequencies/% of the total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

why are frequencies used as descriptive?

A

no other descriptive statistic makes sense

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is chi-square?

A

an inferential statistical test that’s used for nominal (categorical) data and assesses if there’s an association between the categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

how to calculate estimated frequencies

A

(row total) x (column total)/N

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what does a chi-squared test do?

A

-measures differences between observed and expected frequencies
-if it differs we can suspect an association between the IV and DV AND between the two categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain how there are different types of chi-squares

A

named by the number of ‘rows’ and the ‘number of columns’.
-e.g., two ‘rows’ (subscribes/does not subscribe) and two ‘columns’ (chocolate/apple)=2x2 chi square
-extra choice/column e.g., apple =3x2 chi square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain what assumptions are in stats tests

A

Every stats test has ‘assumptions-rules that our data must
adhere to in order to be analysed using that test so is about looking at your data and making decisions based on certain criteria.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

State the assumptions of a chi-square test

A
  1. INDEPENDENCE- all frequencies must be unique so can’t be used for within-subjects/repeated measures designs. (as association BETWEEN)
  2. All Expected values should be above 5.
    -lots of conditions=all expected values should be greater/equal to 1 AND no more than 20% of expected counts should be less than 5.
    -we need to have an appropriate number of data points but if the sample is too large this can lead to misleading results.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

define P values

A

evaluates how well the data from the sample supports the argument that the null hypothesis is true
high p value=data likely supports true null
low p value=data likely rejects true null for the population rather in line with alt. hypothesis (less than 0.05)

17
Q

State the rules of P values

A

If P value below .05=result is ‘statistically significant’.
-The level at which we accept a result to be significant is known as the alpha level (α)
-however there are current arguments about the usefulness of p values and the problems with them.

18
Q

what does the test statistic let the reader know?

A

what stats test was conducted (every stats test has one)
e.g., chi square=Chi symbol/X^2

19
Q

why do we use effect sizes?

A

to show how strong the association is
-2x2 chi square effect size=‘Phi’ φ
-other chi-square designs we can report ‘Cramer’s V’

20
Q

In Cramer’s V, what does the effect size depend on?

A

the degrees of freedom

21
Q

what’s the main problem with p values?

A

type 1/2 errors

22
Q

define a type 1 error

A

the rejection of a true null hypothesis finding (a ‘false positive’)

23
Q

define a type 2 error

A

failing to reject a null hypothesis (known as a ‘false negative’).

24
Q

how can we reduce a type 1 error occurring? aka Bonferroni correction

A

change your alpha level accordingly so you no longer accept results to be significant as p<.05.
-divide your alpha level (.05) by the number of tests you are going
to conduct.
■ So imagine I am going to conduct 4 tests on a set of data. I would take the standard alpha level (.05) and divide it by the number of tests I am going to conduct (4).
■ .05/4= .0125 or .013.
■ So I will no longer accept a finding to be significant unless it is
below .013.
■ This is known as a Bonferroni correction.

25
Q

Explain what P hacking is

A

A method of manipulating data to achieve significant results.
■ Multiple analyses involved
■ Omitting other information (e.g., the existence of other variables).
■ ‘Controlling’ for variables
■ Analyse part way through then gather more data - stop when
you get a significant result
■ Changing the DV (helped via ill-defined / poorly operationalised DV)

26
Q

what’s a solution to p hacking?

A

could change the alpha level from 0.05 to 0.005 BUT solve with preregistration

27
Q

define preregistration

A

when researchers predefine what analysis they are going to do before they conduct the study.
■ Outline the method and provide some background research.
■ They should then adhere to this in the paper they publish.

28
Q

what can be the issues with p values?

A

p values are prone to misuse and questionable research practices.