Week 1: Types of Outcomes + NHST Flashcards

1
Q

Process of hypothesis testing (6)

A
  1. We have a question or hypothesis about a population
  2. Propose a study to gather data
  3. The design of the study aims to optimise it to gain the most valuable information about the hypothesis
  4. Collect data
  5. Use statistics to test the hypothesis base on a model of the data (informed by hypothesis)
  6. Examine and interpret the results.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

There are constraints of getting valueable info for hypothesis from study’s design such as - (2)

A

duration of the study
how many people you can recruit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a sample?

A

A sample is the specific group that you will collect data from.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a population?

A

A population is the entire group that you want to draw conclusions about.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Example of population vs sample (2)

A

Population : Advertisements for IT jobs in the UK
Sample: The top 50 search results for advertisements for IT jobs in the UK on 1 May 2020

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the arrows show in process of hypothesis testing? (2)

A

show the iterative nature of hypothesis testing,

the question or design to answer the question can be updated on the basis of the statistical analysis from only study so a future study can be devised.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The decision tree above guides you to the appropriate

A

inferential statistics (statistical approach) to use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is inferential statistics?

A

Inferential statistics allow you to test a hypothesis or assess whether your data is generalisable to the broader population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What we have measured is called the (2)

A

outcome variable/
DV variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The outcome variables influences what

A

statistical test to use on data you have gathered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is there a focus to do parametric tests than others in research? - (3)

A
  • they are more rigorous, powerful and sensitive than non-parametric tests to answer your question
  • This means that they have a higher chance of detecting a true effect or difference if it exists.
  • They also allow you to make generalizations and predictions about the population based on the sample data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What question has been covered?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

We measure the answers to our question (hypothesis) which

A

informs on our question (hypothesis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

We can obtain multiple outcomes from the

A

same people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

We can obtain outcomes under

A

different conditions, groups or both

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

We specificy what we measure and under what condition we measure them in the

A

design of the experiment or study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the 4 types of outcomes we measure? (4)

A
  1. Ratio
  2. Interval
  3. Ordinal
  4. Nominal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a continous variables? - (2)

A

: there is an infinite number of possible values these variables can take on-

entities get a distinct score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

2 examples of continous variables (2)

A
  • Interval
  • Ratio
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is an interval variable?

A

: Equal intervals on the variable represent equal differences in the property being measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Examples of interval variables - (2)

A

e.g. the difference between 600ms and 800ms is equivalent to the difference between 1300ms and 1500ms. (reaction time)

temperature (Farenheit), temperature (Celcius), pH, SAT score (200-800), credit score (300-850)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is ratio variable?

A

The same as an interval variable and also has a clear definition of 0.0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Examples of ratio variable - (3)

A

E.g. Participant height or weight
(can have 0 height or weight)

temp in Kelvin (0.0 Kelvin really does mean “no heat”)

dose amount, reaction rate, flow rate, concentration,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a categorical variable? (2)

A

A variable that cannot take on all values within the limits of the variable

    • entities are divided into distinct categories
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are 2 examples of categorical variables? (2)

A
  • Nominal
  • Ordinal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is nominal variable? - (2)

A

a variable with categories that do not have a natural order or ranking

Has two or more categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Examples of nominal variable - (2)

A

genotype, blood type, zip code, gender, race, eye color, political party

e.g. whether someone is an omnivore, vegetarian, vegan, or fruitarian.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is ordinal variables?

A

categories have a logical, incremental order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Examples of ordinal variables - (3)

A

e.g. whether people got a fail, a pass, a merit or a distinction in their exam

socio economic status (“low income”,”middle income”,”high income”),

satisfaction rating [Likert Scale] (“extremely dislike”, “dislike”, “neutral”, “like”, “extremely like”).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Using the term ‘variables’ for continous and categorical variables as - (2)

A

both outcome and predictor are variables

We will see later on that not only the type of outcome but also type of predictor influences our choice of stats test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Likert scale is ordinal variable but sometimes outcomes measured on likert scale are treated as - (3)

A

continuous after inspection of the distribution of the data and may argue the divisons on scale are equal

(i.e., treated as interval if distribution is normal)

gives greater sensitivity in parametric tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is measurement error?

A

The discrepancy between the actual value we’re trying to measure, and the number we use to represent that value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Example of measurement error in psych experiments - (2)

A

Imprecise measurement: Not accurate to use a stopwatch to measure reaction times that are about 1/2 second

Systematic problem: broken ruler (affects validity)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

In reducing measurement error in outcomes, the

A

values have to have the same meaning over time and across situations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Validity means that the (2)

A

instrument measures what it set out to measure

refers to the accuracy of a measure (whether the results really do represent what they are supposed to measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Reliability means the

A

ability of the measure to produce the same results under the same conditions

37
Q

Test-retest reliability is the ability of a

A

measure to produce consistent results when the same entities are tested at two different points in time

38
Q

3 types of variation (3)

A
  • Systematic variation
  • Unsystematic variation
  • Rnadomisation
39
Q

What is systematic variation - (2)

A

Differences in performance created by a specific experimental manipulation.

This is what we want

40
Q

What is Unsystematic variation (3)

A

Differences in performance created by unknown factors
.
Age, Gender,** IQ, **Time of day, Measurement error etc.

These differences can be controleld of course (e.g., inclusion/exclusion of pps setting age range of 18-25)

41
Q

Randomisation (other approaches) minimises - (2)

A

effects of unsystematic variation

does not remove unsystematic variation

42
Q

What is the independent variable (Factors)? ( 3)

A
  • The hypothesised cause
  • A predictor variable
  • A manipulated variable (in experiments)
43
Q

What is depenedent variable? (measures)- (3)

A
  • The proposed effect , change in DV
  • An outcome variable
  • Measured not manipulated (in experiments)
44
Q

In all experiments we have two hypotheses which is (2)

A
  • Null hypothesis
  • Alternative hypothesis
45
Q

What is null hypothesis?

A

that there is no effect of the predictor variable on the outcome variable

46
Q

What is alternative hypothesis?

A

is that this is an effect of the predictor variable on the outcome variable

47
Q

Null Hypothesis Signifiance Testing computes the probability the (4)

A

the probability of the null hypothesis being true (referred as p-value) by computing a statistic and how likely it is that the statistic has that value by chance alone

48
Q

The NHST does not compute the probability of the

A

null hypothesis

49
Q

Null Hypothesis Signifiance Testing Example Z Curve - (9)

A
  • From experiment we have z statistic that is calculated from 2 groups
  • Shows normal distribution in this case of z statistic
  • The horizontal axis measures how many standard deviations from the mean
  • the vertical axis measures the probability of density of z
  • On left, mean of grp1 < mean of grp 2
  • On right, mean of grp > mean of grp 2
  • If two-tailed test/non-directional then spilt probability on two ends of the tail –> more extreme value of statistic for sig
  • If directional hypothesis then not spilt alpha value –> leway along x axis and get lower value and still p 0.05
50
Q

There can be directional and non-directional hypothesis of

A

an alternate hypothesis

51
Q

non-directional alternate hypothesis is..

A

The alternative hypothesis is that this is an effect of the group on the outcome variable

52
Q

Directional alternate hypothesis is…

A

The alternative hypothesis is that this the mean of the outcome variable for group 1 is larger than the mean of group 2

53
Q

Example of directional alternate hypothesis

A

There would be far greater engagment in stats lecture if they were held at 4 PM and not 9AM

54
Q

For a non-directional hypothesis you will need to divide your alpha value at

A

two ends of the tail of normal distirbution

55
Q

The 3 misconceptions of Null Hypothesis Signifiance Testing (NHST) - (3)

A
  1. A significant result means the effect is important
  2. A non-significant result means the null hypothesis is true
  3. A significant result means the null hypothesis is false (just give probability that data occured given null hypothesis, doesn’t say huge evidence that null hypothesis is categorically false)
56
Q

P-Hacking and HARKING is another issue with

A

NHST

57
Q

p-Hacking and HARKINGS are the - (2)

A

researchers degrees of freedom

cchange after results are in and some analysis has been done

58
Q

P-hacking refers to a

A

selective reporting of significant results

59
Q

Harking is

A

Hypothesising After the Results are Known

60
Q

P-hacking and HARKING are often used in

A

combination

61
Q

What does EMBERS stand for? (5)

A
  1. Effect Sizes
  2. Meta-analysis
  3. Bayesian Estimation
  4. Registration
  5. Sense
62
Q

EMBERS can reduce issues of

A

NHST

63
Q

Uses of Effect sizes and Types of Effect Size (3)

A
  • There a quite a few measures of effect size
  • Get used to using them and understanding how studies can be compared on the basis of effect size
  • A brief example: Cohen’s d
64
Q

Meaning of Effect Size (2)

A

Effect size is a quantitative measure of the magnitude of the experimental effect.

The larger the effect size the stronger the relationship between two variables.

65
Q

Formula of Cohen’s d

A
66
Q

What is meta-analysis?

A

Meta-analysis is a study design used to systematically assess previous research studies to derive conclusions about that body of research

67
Q

Meta-analysis brings together.. and assesses (2)

A
  • Bringing together multiple studies to get a more realistic idea of the effect
  • Can assess effect siz that are averaged across studies
68
Q

Funnel plots in meta-analysis can be made to….. values stuides…. (2)

A

investigating publication bias and other bias in meta-analysis
values studies by their sample size and observe bias

69
Q

Bayesian approaches capture

A

probabilities of the data given the hypothesis and null hypothesis

70
Q

Bayes factor is now often computed and stated alongside

A

conventional NHST analysis (and effect sizes)

71
Q

Registration is where (5)

A
  • Telling people what you are doing before you do it
  • Tell people how you intend to analyze the data
  • Largely limits researcher degrees of freedom (HARKING p-hacking)
  • A peer reviewed registered study can be published whatever the outcome
  • The scientific record is therefore less biased to positive findings
72
Q

Sense is where (4)

A
  • Knowing what you have done in the context of NHST
  • Knowing misconceptions of NHST
  • Understanding the outcomes
  • Adopting measures to reduce researcher degrees of freedom (like preregistration etc..)
73
Q

most of the statistical tests in this book rely on
having data measured

A

at interval level

74
Q

To say that data are interval, we must be certain that
equal intervals on the scale represent

A

equal differences in the property being measured.

75
Q

To say that data are interval, we must be certain that
equal intervals on the scale represent equal differences in the property being measured. For
example, , on www.ratemyprofessors.com students are encouraged to rate their lecturers on
several dimensions (some of the lecturers’ rebuttals of their negative evaluations are worth
a look). Each dimension (i.e. helpfulness, clarity, etc.) is evaluated using a 5-point scale.

For this scale to be interval it must be the case that the - (2)

A

difference between helpfulness ratings of 1 and 2 is the same as the difference between say 3 and 4, or 4 and 5.

Similarly, the
difference in helpfulness between ratings of 1 and 3 should be identical to the difference
between ratings of 3 and 5. V

76
Q

The distinction between continous and discrete variables can often be blurred - 2 examples- (2)

A

continuous variables can be measured in discrete terms; we measure age we rarely use nanoseconds but use years (or possibly years and months).
In doing so we turn a continuous variable into a discrete
one

treat discrete variables as if they were continuous, e.g., the number of boyfriends/girlfriends
that you have had is a discrete variable. However, you might read a magazine that says ‘the average number of boyfriends that women in their 20s have has increased from 4.6 to 8.9’

77
Q

a device for measuring sperm motility that actually measures sperm count is not

A

valid

78
Q

Criterion validity is whether the

A

instrument is measuring what it claims to measure (does
your lecturers’ helpfulness rating scale actually measure lecturers’ helpfulness?).

79
Q

The two sources of variation that is always present in independent and repeated measures design is

A

unsystematic variation and systematic variation

80
Q

effect of our experimental manipulation
is likely to be more apparent in a repeated-measures design than in a

A

between-group design,

81
Q

effect of experimental manipulation is more apparent in repeated-design than independent since in independent design,

A

differences between the characteristics of the people allocated to each of the groups is likely to create considerable random variation both within
each condition and between them

82
Q

This means that, other things being equal, repeated-measures designs have
more power to

A

, repeated-measures designs have
more power to d

83
Q

We can use randomization in two different ways depending on
whether we have an

A

independent and repeated design measure

84
Q

Two sources of systematic variation in repeated design measure - (2)

A
  • Practice effects
  • Boredom effects
85
Q

What is practice effects?

A

Participants may perform differently in the second condition because
of familiarity with the experimental situation and/or the measures being used.

86
Q

What is boredom effects?

A

: Participants may perform differently in the second condition because
they are tired or bored from having completed the first condition.

87
Q

We can ensure no systematic variation between conditions in repeated measure is produced by practice and boredom effects by

A

counterbalancing the order in which a person participates in a condition

88
Q

Example of counterbalancing

A

we randomly determine whether a participant
completes condition 1 before condition 2, or condition 2 before condition 1

89
Q

To reduce unsystematic variation in independent design, we can ensure - (2)

A

that confounding variables are unlikely to
contribute systematically to the variation between experimental conditions is to randomly allocate participants to a particular experimental condition.

This should ensure that these
confounding variables are evenly distributed across conditions.