Research Method + Stats Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is a pseudoscience?

A

a claim/belief that is presented as scientific but doesn’t use the scientific method, lacks empirical evidence and cannot be reliably tested
it is more like advice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is psychology the science of?

A

the scientific study of the human mind and behaviour

how we think, feel and act

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what challenges does psychology have as a science?

A

much of what we are interested in is unobservable

there is a question of whether we can really study human behaviour without being subjective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

give a brief history of psychology as a science

A
  • Freud - claimed to be scientific but relied heavily on introspection
  • Behaviourists - only study things that are directly observable, only focussed on the S –> R
  • Cognitive - makes predictions then subjects them to empirical testing, no need for direct observation
  • Neuroscience - look inside the human mind and directly observe what’s happening
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is induction?

A

evidence gathered from multiple observations

however this cannot guarantee that it will happen again in the future

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is falsifiability?

Who came up with the concept?

A

the need to have the possibility to disprove a theory
if we find supporting evidence, then the theory is upheld as undefeated
Karl Popper

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a scientific conjuncture?

A

a scientific question that must be falsifiable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Bayesianism?

A

The idea that beliefs come in degrees
Can express the likelihood of future events based on past knowledge
Provides a measure of a state of knowledge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the characteristics of a good scientist?

A
  • Uncertain
  • Sceptical
  • Open-minded
  • Cautious
  • Ethical
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some limitations of science?

A

You can’t answer questions about:

  • Value
  • Morality
  • The supernatural
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the hypothetico-deductive method?

A
  • Identify behaviour of interest and generate theory
  • Theory is generally derived from inductive reasoning
  • Generate a hypothesis from theory - needs to be objective and falsifiable
  • Subject hypothesis to empirical testing to gather evidence
  • Hypothesis isn’t supported = refine or abandon
  • Hypothesis is supported = uphold theory as undefeated with an estimate of confidence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What was the replication crisis?

Who was a key person here?

A

Methodological crisis where we couldn’t replicate the results of studies
Bem showed that people were being influenced by the prospect of being published in a journal - they were finding significant results when there weren’t any to be found (false positive)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a random error?

A
  • Random error – obscures the results (normally averages out)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Longitudinal design: problems and solution

A

o Problem – its not possible to counterbalance order

o Solution – control group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Cross-sectional design: problems and solutions

A

o Problem – its not possible to randomly assign ppts

o Solution – matching

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the types of developmental design?

A
  • Between subjects: “cross-sectional”

- Within subjects: “longitudinal”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is pre-test post-test control group design?

A
  • split ppts into 2 groups and manipulate the IV in one group only
  • The inclusion of a CG allows us to account for any order effects that might be present
  • We can then statistically control for the difference in the treatment group accounted for by the order effects
  • This is a mixed design – got within and between subjects components
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is Within subjects design without counterbalancing?

A
  • Counterbalancing order that ppts are exposed to levels of IV is not always possible
  • E.g., examining the effectiveness of mnemonic training on memory performance
  • The order in which ppts are exposed to levels of IV is fixed
    o Likely to be differences between time 1 and time 2 other than the variable of interest
  • Have to be cautious about inferring causality
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is matched pairs?

A
  • Even better than matching the groups on the basis of potentially moderating variables
  • Compare individuals from the same background but who are in different levels
  • However, this frequently isn’t possible. Also, always the possibility of other variables influencing that haven’t been considered
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is matching?

A
  • Identify potentially moderating variables and match the groups on this basis
  • E.g., similar age, similar weight, similar experience etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is Between subjects design without random allocation (quasi-experimental)?

A
  • The assignment of ppts to the experimental conditions is pre-determined
  • E.g., comparing pre-existing alcohol consumption groups: alcoholics vs non-alcoholics
  • This poses a serious problem as there are likely to be differences between the groups other that the variable of interest
  • Have to be cautious about inferring causality
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Your choice of subject design depends on?

A

Concerns –
o Between subjects – eliminates order effects
o Within subjects – eliminates individual differences
Number of ppts available -
o Within subjects designs require fewer ppts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is a factorial mixed design?

A
  • One IV used between measures
  • The other IV uses within measures
  • E.g., night shift and then with and without alcohol, day shift and then with and without alcohol
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a Fully repeated measures factorial design (within subjects)?

A
  • Ppts would complete all levels in the IVs

- Ppts would complete 4 different conditions if two IVs were used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is a Fully independent factorial design (between subjects)?

A
  • For first IV you assign ppts to one or the other condition
  • For the second IV you assign ppts to one or the other condition
  • Effectively end up with 4 different groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is a factorial design?

A
  • Experimental designs with 2 or more IVs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is counterbalancing?

A
  • Split the group of ppts in half
  • Level 1 then level 2, and level 2 then level 1
  • Order effects will still influence the ppts performance, but the effects of that influence will be evenly spread across each level of the IV
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is random allocation?

A
  • Ensure each ppt is equally likely to be assigned to any IV levels
  • Distributes the occurrence of potential moderating variables equally among experimental conditions
  • Prevents experimenter (un)intentionally biasing their results
  • Enables the use of powerful stats tests that can help determine causal relationships between variables
  • Could achieve by tossing a coin, pulling straws, pulling names from a hat etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is a within subject design (repeated measures)?

A
  • Ppts exposed to both levels of the IV
  • Potentially moderating characteristics are kept equal across levels
  • Requires fewer ppts
  • Problem of order effects – effects that occur as a result of taking a test more than once (could do better on test because they’ve done it before and know what’s coming or could do worse because they’ve done the test before and are tired of it)
  • Can use counterbalancing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is a between subjects design (independent groups)?

A
  • Ppts are assigned to one level of the IV
  • All subjects are inherently different and these differences have the ability to affect the outcome (individual differences)
  • Can’t eliminate the effects of these other variables
  • But we can minimise the effects by spreading their influences across the different levels of the IV
  • Can use random allocation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is the subjects design?

A

– the assignment of participants to experimental conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the dependent variable?

A
  • The variable that is measured

- We compare differences in the DV under the different levels of the IV

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is an independent variable?

A
  • The variable we manipulate

- Each have at least two levels e.g., conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is the experimental method?

A
  • A research design which allows us to make causal inferences about one or more variable of interest
  • The researcher manipulates one or more variables and measures the effect on other variables
  • All other variables are to be kept constant
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is ratio data?

Give an example

A

Ratio –

  • Highest level of data
  • Equal intervals and a true zero point
  • E.g., height, distance, time
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is interval data?

Give an example

A

Interval –

  • Intervals between successive values are equal
  • But no “true” zero point (no absence of something)
  • E.g., temperature, shoe size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is ordinal data?

Give an example

A

Ordinal –

  • Data can be ranked along a continuum
  • Intervals between ranks are not equal
  • E.g., race positions, attractiveness
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is nominal data?

Give an example

A
  • Lowest level of measurements
  • Category membership
  • Numbers assigned serve as labels but do not indicate numerical membership
  • E.g., gender, political party, religion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What are the scales of measurement?

A
  • Nominal (category membership)
  • Ordinal (ranked/ordered)
  • Interval (equal increments but not real 0 point)
  • Ratio (equal increments and real 0 point)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

True or False

If you operationalize a construct it becomes a variable?

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What type of definition defines a variable?

A

An operational definition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What type of definition defines a construct?

A

A theoretical definition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is a variable?

A
  • anything that can assume multiple values (can vary)
  • An event/condition the researchers observes/measures
  • Variables must be operational – explicitly stated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What is a construct?

A
  • A construct is the building block of a theory
  • Theoretical concepts formulated to serve as causal/ descriptive explanations
  • Don’t directly indicate a means by which they can be measured
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Hypotheses make specific predictions and must be:

A
  • Falsifiable
  • Testable
  • Precisely stated (clearly defined and unambiguous)
  • Rational (needs to be consistent with known knowledge)
  • Parsimonious (the explanation needs to be the simplest possible)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Theories are a collection of proposals that:

A
  • Define
  • Explain
  • Organise
  • Interrelate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

What is a hypothesis?

A

– a clear but tentative explanation for an observed phenomenon
- it is testable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What is a theory?

A

– a broad encompassing framework.

Puts a collection of propositions/hypotheses together to explain a set of observed phenomena

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

What is a fact?

A

statements that are under no dispute.

Direct observations of phenomena that occur consistently over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is methodological triangulation?

A

The convergence of findings of methodologically varying studies which can lend acceptance to a theory pattern.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is methodological pluralism?

A

The use of multiple methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What are the quantitative research approaches?

A

Descriptive – describe a behaviour
Relational – predict a behaviour based on its relationship with another behaviour
Experimental – allows us to determine the cause of a behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

What are some criticisms of qualitative research?

A
  • can’t apply notions of reliability or validity,

- can’t generalise to other members of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is qualitative research?

A
  • descriptive, often text-based,
  • not experimental (inductive approach) but still empirical,
  • focuses on underlying meaning of behaviours,
  • typically small samples,
  • asks more open-ended questions and so allows for unexpected answers.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

What are some criticisms of quantitative research?

A
  • humans are complex so this would be oversimplifying things,
  • fails to recognise the subjective nature of research with humans,
  • doesn’t look in depth at differences between individuals (focuses on averages and views population as homogenous)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
56
Q

What is quantitative research?

A
  • collection of numerical data,
  • subscribes to hypothetico-deductive approach,
  • takes place in controlled settings,
  • focuses on describing, predicting and identifying causes of behaviours,
  • typically uses large samples,
  • focus on structured methods
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
57
Q

What are the goals of Psychology?

A
  • describe behaviour
  • predict behaviour
  • explain behaviour
  • control behaviour
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
58
Q

What is the role of teaching in Open Science?

A

Teaching open science – need to teach its principles and promoting the methodological practices.
Advantages – promotes best practice, increases critical evaluation research

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
59
Q

What are the types of replication research?

A

Direct replication – reproduce the elements that produced the original findings (are similar findings produced in subsequent attempts?)
Conceptual replications – change at least one aspect of the original procedure e.g., changing the sample (assess whether similar findings are produced under different conditions?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
60
Q

What role does replication research have in Open Science?

A

Replication research – process of repeating research to verify findings
Enables confidence in results and helps build theories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
61
Q

What is exploratory research?

A

Exploratory research – focuses on generating hypotheses/research questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
62
Q

What is confirmatory research?

A

Confirmatory research – focuses on confirming hypotheses/research question

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
63
Q

What role does preregistration have in Open science?

A

Preregistration – researchers encouraged to submit plans for specific research questions and analyses they will conduct prior to data collection (transparency), reduces risk of false positives.
Promotes confirmatory research

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
64
Q

What role does reproducible analyses have in Open Science?

A
  • need to provide materials to enable others to generate the exact same results as those reported.
  • need adequate documentation to be kept in order to do this
  • Good principles: clear annotations of what documents are, store original data files separately, record all steps of data processing, use open-source software
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
65
Q

What role does APA have in Open Science?

A
  • APA requires researches to make data available with editors for 5 years after publication - verification, analytic reproducibility
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
66
Q

What is the benefit of research needing public access?

A
  • Research needs public access - allows for accumulation of knowledge, increased citation of work, more media coverage, supports meta research practice
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
67
Q

What is Open Science?

A

A set of practices used to overcome limitations of previous scientific research.
Concerned with reproducibility and replicability
Research practices and data needs to be transparent and accessible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
68
Q

What is exploratory research?

A

Exploratory research – focuses on generating hypotheses/research questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
69
Q

What are the sources of confounding variables?

A
  • Selection – bias resulting from the selection/assignment of ppts to the different levels of the IV
    - Results in ppts who are assigned to different levels of the IV differ systematically in some way that could influence the measurement of the DV
    • Particular problem for quasi-experimental designs
  • History – uncontrolled events that take place between testing conditions
  • Maturation – intrinsic changes in the characteristics of ppts between different test occasions
  • Instrumentation – changes in the sensitivity/ reliability of measurement instruments during the course of the study
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
70
Q

Confounds can result in us measuring:

A
  • An effect of the IV on the DV when it is not present

- No effect of the IV on the DV when it is present

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
71
Q

How can we eliminate confounding variables?

A

o Random allocation/counterbalancing spreads the influence of extraneous variables (so that they do not become confounding variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
72
Q

What type of validity does confounding variables threaten?

A

Internal validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
73
Q

What is a confounding variable?

A
  • Extraneous variables that disproportionately affect one level of the IV more than other levels
  • Add constant/systematic error at the level of the IV
74
Q

How can we eliminate extraneous variables?

A

o Use random allocation or counterbalancing

o Results in an even addition of error variance across levels of the IV

75
Q

What is an extraneous variable?

A
  • Undesirable variables that add error to our experiments – add error to the measurement of the DV
76
Q

What is a constant error?

A
  • Constant errors – bias the results (more problematic)
77
Q

What are the sources of confounding variables?

A
  • Selection – bias resulting from the selection/assignment of ppts to the different levels of the IV
    o Results in ppts who are assigned to different levels of the IV differ systematically in some way that could influence the measurement of the DV
    o Particular problem for quasi-experimental designs
  • History – uncontrolled events that take place between testing conditions
  • Maturation – intrinsic changes in the characteristics of ppts between different test occasions
  • Instrumentation – changes in the sensitivity/ reliability of measurement instruments during the course of the study
78
Q

What are the sources of confounding variables?

A
  • Selection – bias resulting from the selection/assignment of ppts to the different levels of the IV
    o Results in ppts who are assigned to different levels of the IV differ systematically in some way that could influence the measurement of the DV
    o Particular problem for quasi-experimental designs
  • History – uncontrolled events that take place between testing conditions
  • Maturation – intrinsic changes in the characteristics of ppts between different test occasions
  • Instrumentation – changes in the sensitivity/ reliability of measurement instruments during the course of the study
79
Q

What are the sources of confounding variables?

A
  • Selection – bias resulting from the selection/assignment of ppts to the different levels of the IV
    o Results in ppts who are assigned to different levels of the IV differ systematically in some way that could influence the measurement of the DV
    o Particular problem for quasi-experimental designs
  • History – uncontrolled events that take place between testing conditions
  • Maturation – intrinsic changes in the characteristics of ppts between different test occasions
  • Instrumentation – changes in the sensitivity/ reliability of measurement instruments during the course of the study
80
Q

What is external validity?

A

The ability to generalise our results

81
Q

Why do we gather data from samples?

A

To infer population parameters

82
Q

What is snowball sampling?

A
  • Recruit small number of ppts and then use those initial contacts to recruit further ppts
  • Biases the sample, but useful if you want to recruit very specific populations
83
Q

What is an opportunity/convenience sample?

A
  • People who are easily available

- But can lead to a biased sample

84
Q

What is a cluster sample?

A
  • Researcher samples an entire group/cluster from the population of interest
  • Need to be cautious about generalising results – cluster unlikely to be representative of the entire population
85
Q

What is a stratified sample?

A
  • Proportional: specified groups appear in numbers proportional to their size in the populations
  • Disproportional: specified groups which are not equally represented in the population, are selected in equal proportions
86
Q

What is a systematic sample?

A
  • Draw from the population at fixed intervals

- Problematic in populations with periodic function

87
Q

What is a random sample?

A
  • gold standard
  • Each member of the population has an equal chance of being selected
  • Usually quasi-random e.g., because we don’t have access to everyone in a population
88
Q

Why do we sample?

A
  • Time – difficult to collect data from everyone quickly
  • Money – expensive to collect data from everyone
  • Access – not possible to reach all members of a population
  • Sufficiency – pattern of results doesn’t change much even if we have data from everyone
89
Q

What is a sample?

A

– a subset of individuals from the larger population

  • For any population, there are many possible samples
  • Vary in size
  • Defined by sample statistics e.g., measurements which describe the sample
  • Sample statistics are used the infer population parameters
90
Q

What is a population?

A

– the entire collections of things that we are interested in

  • Defined by population parameters e.g., measurements which describe the population
  • Vary in size
91
Q

What is multifactorial causation?

A

– phenomenon is determined by many interacting factors

92
Q

Why is it hard to determine true causation?

A

human behaviour is very complex and it is usually impossible to control for all other factors – because its usually impossible to identify them all

93
Q

What is true causation?

A

Need to satisfy necessary and sufficient criteria in order to make true claims about causality:
- Sufficient – y is adequate to cause x
- Necessary – y must be present to cause x
True causation can only be established when necessity and sufficiency criteria are satisfied

94
Q

What can construct validity be assessed in terms of?

A
  • Convergent validity – correlates with tests of the same and related constructs
  • Discriminant validity – doesn’t correlate with tests of difference or unrelated constructs
95
Q

What is construct validity?

A

– is the construct we are trying to measure valid?
o E.g., does the construct itself exist?
o The validity of a construct is supported by cumulative research evidence that is collected over time – together, supporting the existence of the construct itself

96
Q

What is criterion validity?

A

– does the measure give results which are in agreement with other measures of the same thing?
o Concurrent: comparison of new test with established test
o Predictive: does the test predict outcome on another variable

97
Q

What is face validity?

A

– does it looked like a good test?

98
Q

What is content validity?

A

does our test measure the construct fully?

99
Q

What is internal consistency?

A
  • determines whether all items are measuring the same construct
    o E.g., split-half reliability: questionnaire items split into two groups and the halves are administered to ppts on separate occasions
    o Beware of order effects – e.g., if they have answered similar questions before
100
Q

What is parallel form of reliability?

A

– if we administer different versions of our measures to the same ppts, would we obtain the same results
o Different versions can be useful to help eliminate memory effects as the questions are different
o Beware of order effects – might still improve because they’ve had experience of a similar test
o Fatigue effects – might be fatigued the second time around if same ppts are used

101
Q

What is inter-rater reliability?

A

– measures fluctuations between observers

102
Q

What is test-retest reliability?

A

– measure fluctuations from one time to another
o Important for constructs which we expect to be stable (e.g., personality type)
o Beware of order effects

103
Q

What is validity?

A

= accuracy (truthfulness)

- The extent to which it is measuring the construct we are interested in

104
Q

What is reliability?

A

= consistency (precision)

- The extent to which our measure would provide the same results under the same conditions

105
Q

What is reactivity?

A
  • Awareness that they are being observed may alter behaviour
  • Can threaten internal validity if ppts are more influenced by reactivity at one level of the IV than the other
  • Resulting artefacts can be: subject based (demand characteristics), experimenter related (experimenter bias)
  • Counter reactivity by using blind procedures (either single or double)
106
Q

What is population validity?

A

is our sample representative?

107
Q

What is bimodal data?

A

Data that has two modes

108
Q

What is the danger of non-normal data?

A
  • Danger – mean is distorted by the tails which are the more extreme values
109
Q

What is non-normal data?

A
  • Has a tail either to the right or left – skewed data
  • Positively skewed = long tail to the right, peaks at the left
  • Negatively skewed = long tail to left, peaks at the right
  • E.g., reaction time – tends to be positively skewed
110
Q

What is normally distributed data?

A
  • Many naturally occurring variables are Normal
  • E.g., height, IQ (not naturally occurring but has been defined as this)
  • If we don’t have much data then the normality can be difficult to see in a histogram
  • As sample size increases, the normality will emerge
111
Q

What is distribution of data?

A

the manner in which data for a particular variable is spread over its range is commonly referred to a its distribution

112
Q

What is a data summary plot?

A
  • Plot bar showing mean (categorical data) or line graph (numerical data)
  • Plot error bars showing +/- 1s.d.
113
Q

What is a scatter plot?

A

shows the relationship between variables

114
Q

What is a box plot?

A

Seems to be plotted vertically instead of horizontal?

115
Q

What is a histogram?

A
-	Good way to inspect data
o	Can see if there’s any odd-looking scores
o	Can see the mode
o	Can see how spread out the scores are
o	Can see how the data is distributed
116
Q

How to calculate sample standard deviation?

A
o	Calc mean
o	Calc deviations
o	Square deviations
o	Calc sample variance
o	Take square root of sample variance – now back in comprehendible units
117
Q

What is the issue with simple variance?

A

potential issue is the units used, so if deviations are in hours, when squared the units would become hours squared which isn’t comprehendible

118
Q

How to calculate simple variance?

A
o	Calc mean
o	Calc deviations
o	Square deviations
o	Calc a slightly adjusted average squared deviation
        - You divide by n-1
119
Q

What is a deviation?

A

o The signed distance of a score from the mean

120
Q

What is the range?

A

o Difference between min and max scores

o Range doesn’t always change for distributions with different shapes

121
Q

What is the mode?

A

o Easy to calculate from a histogram and easy to understand – the most common value
o Data might have more than 1 mode or no mode at all

122
Q

What is the median?

A

o Is insensitive to extreme scores in the data set

o Doesn’t reflect the shape of the scores e.g., doesn’t care how far away the extreme scores are

123
Q

What is the mean?

A

o Provides and estimate of the average score in the data set

o Is affected by extreme data points

124
Q

What are measures of central tendency?

A

They provide an indication of a “typical” score in the data set

125
Q

Why is it important to summarise data?

A
  • Data can be very complex and therefore it is useful to summarise it
  • Allows for interpretation
126
Q

What is a sample statistic?

A

– a quantity that describes some characteristic of a sample with respect to a specific variable

  • E.g., sample mean, sample range etc.
  • We can always calculate these from a sample
  • Sample statistics provide an estimate of population parameters
127
Q

What is a population parameter?

A
  • a quantity that describes some characteristic of a population with respect to a specific variable
  • E.g., population mean, population range etc.
  • Not usually possible to calculate
  • Might be given to you if available
128
Q

Factors in deciding sample size?

A

o Design
o Response rate
o Heterogeneity of population

129
Q

What is important to consider when looking at sample size?

A
  • Size matters
  • Sampling error can result if your ample is not large enough
  • Trade off between size and time/cost
130
Q

What is ecological validity?

A

does the behaviour measured reflect naturally occurring behaviour?

131
Q

What is a SND table and how do you use it?

A
  • Table that provides values of areas underneath the SND in different ranges
  • Find z-score (first column) then decide if you want the area above or below this score
  • If z-score is negative, use the positive value in the table but be careful when choosing above or below because the scores will be flipped
    o E.g., z-score = -2 and you want the area below. On table you will use z-score 2 but use the area above
  • If you have a range that is bounded e.g., 70
132
Q

How do you obtain a z-score?

A
  • Obtained by subtracting the population mean from x and then dividing by the population SD – (x-µ)/σ
133
Q

What is a z-score?

A
  • Z measures how far away your sample is from the population mean in multiples of the SD
  • If you were to find z-scores for all points on a normal distribution, you would find that it would form a normal distribution with mean 0 and SD 1 – N (0, 1)
  • The area underneath a normal distribution above/below some variable value of x EQUALS the area underneath N (0, 1) above/below z
134
Q

What is conditional probability?

A
  • Probability of an event given that something else is known/assumed e.g., A|B
135
Q

What is probability?

A

– a measure of how likely it is that an uncertain event will occur

136
Q

What is the normal distribution?

A
  • Bell-shaped
  • Symmetric about the centre
  • Tails never reach 0 – go towards infinity
  • The area under the centre is always equal to 1
  • Very close to 0 by the time it gets to 3 SD from the mean – can use this to draw a rough idea of a normal distribution
137
Q

What is the danger of bimodal data?

A
  • Danger – mean is not representative

- Tends to suggest an issue with your experiment – more than one underlying population

138
Q

How do you find a z-score for a SDM?

A

z-score = (x-µ)/(σ/√N)

139
Q

What is central limit theorem?

A
  • Given a population with a mean and SD, the sampling distribution of the mean approaches a normal distribution with a mean and SD sigma/ square root N as N increases
  • This is true regardless of the underlying distribution – so even if your population is not normal, the distribution of means sampled from it will be
140
Q

What is SDM a distribution of?

A

SDM is a distribution of sample means for samples of size N drawn at random from the parent population

141
Q

What is a parent population a distribution of?

A

Parent population is a distribution of individual scores x (e.g., from an individual person or thing)

142
Q

What are the properties of the sampling distribution of the mean (SDM)?

A
  • Mean which is the same as the parent population
  • SD is different to that of the parent population – find by calculating σ (of p pop)/√N (sample size)
  • SD is called the standard error of the mean (s.e.m.) or standard error (s.e.)
  • S.e.m. must be smaller than SD of the parent population because you are diving by something that is bigger than one
143
Q

What does the sampling distribution tell us?

A
  • Tells us important info about how a statistic changes from sample to sample
  • What is the mean value of the statistic over all samples?
  • How variable is the statistic over all samples?
  • What shape is the distribution of the statistic over all samples?
144
Q

How do we generate a sampling distribution?

A
  • Take a sample (size N) from a population
  • Calculate a sample statistic (e.g., mean, SD etc.)
  • Add the new statistic to a frequency plot (a histogram) of the sample statistic
  • Repeated the above 3 steps multiple times
145
Q

What does the magnitude of a sampling error depend on?

A

The sample size

  • Bigger sample = big sampling error less likely
  • Smaller sample = big sampling error more likely
146
Q

Why do sampling errors occur?

A
  • It occurs because in our sample we don’t have all the members of the population
147
Q

What is a sampling error?

A

Sampling error – the error associated with examining statistics calculated from a sample rather than the population

148
Q

True or False, if centred on sample mean, there is a 5% chance that the population mean falls outside of this range and vice versa (for a 95% confidence interval)?

A

TRUE

149
Q

True or False, if centred on sample mean, there is a 95% chance that the population mean is also in the range and vice versa (if looking for a 95% confidence interval)?

A

TRUE

150
Q

What does a 95% confidence interval mean?

A

A 95% confidence level means that if we repeated our sampling many times and worked out a new CI each time centred on our new sample mean we would expect the population mean to be in the interval on 95% of those repeats

151
Q

For a sample drawn at random from a normal population N (µ, σ) with known s.d. σ ,the 95% CI for the population mean is centred on the sample mean m and goes from?

A

m – (1.96 x σ√N) to m + (1.96 x σ√N)

152
Q

What is a confidence interval?

A

– describes an interval (e.g., a range) of values for our population parameter, together with a specified level of confidence that the parameter is in that range

153
Q

What is an interval estimate?

A

– a range of possible values of a population parameter e.g., confidence interval

154
Q

What is a point estimate?

A

– a single value estimate of a population parameter e.g., sample mean

155
Q

What is the p-value?

A

p-value = the conditional probability associated with your sample statistic

156
Q

What is the value of α in stats?

A

α = 0.05

157
Q

What do values of p < α suggest?

A

suggest inconsistent with H0: reject the null

158
Q

What do values of p > α suggest?

A

suggest not inconsistent with H0: fail to reject null

159
Q

True or false, If we fail to reject the null (H0) then we can claim to have evidence for the null hypothesis?

A

False

160
Q

True or false, If we were able to reject the null (H0) in favour of the research hypothesis (H1) then we can claim to have evidence for the research hypothesis?

A

True

161
Q

What are the steps for null hypothesis testing?

A
- Formulate research hypothesis  	
o	Null hypothesis (H0)
o	Research hypothesis (H1)
- Collect data
- Evaluate inconsistency with H0 and data
o	How inconsistent are the data with H0?
- Reject or fail to reject H0?
- Interpret in context
162
Q

What do values of p > α suggest?

A

suggest not inconsistent with H0: fail to reject null

163
Q

How do you conduct a z-test?

A
  • Use NHST framework
  • Calculate inconsistency with mean by calculating the z-score, use the table to find the associated p-value and compare this to 0.05 to decide whether to reject or fail to reject the null hypothesis
164
Q

When is a z-test used?

A
  • To check if a sample mean that has been obtained is different from some population mean
165
Q

What is a 1 tailed hypothesis that is right hand tailed?

A
  • Something is better than the population
  • H1: sample mean > population mean
  • Looking for p-value above score
166
Q

What is a 1 tailed hypothesis that is left hand tailed?

A
  • Something is worse than the population
  • H1: sample mean < population mean
  • Looking for p-value below score
167
Q

What is a two tailed hypothesis?

A
  • Something is different than the population
  • H1: sample mean =/= to population mean
  • Looking for p value above and below score – have sample mean and then also find another value the same distance away from the population mean but on the other side. E.g., population mean = 67.5, sample mean = 70.7, the difference is 3.2 so the other value you should consider is 64.3 (z-score will be the same for the two)
  • Conditional probability = 2 x p-value
168
Q

When can you formulate a 1 tailed hypothesis?

A
  • There is previous research

- You can predict the effect

169
Q

What is a type I error? Why does it occur?

A
  • Rejecting the null hypothesis when it was correct – occur due to sampling error
170
Q

What is a type II error? Why does it occur?

A
  • Failing to reject the null hypothesis when it was incorrect
  • Arise due to a number of reasons such as a biased sample, an error in the experimental task, sample size was too small etc.
171
Q

Why do we use α = 0.05?

A
  • It is small so it is difficult to reject the null hypothesis but not so small that it is impossible to do so
  • It is a compromise between type I and type II errors
172
Q

How is a student’s t distribution similar to SND?

A
  • Bell-shaped, symmetric, uni-modal
173
Q

How is a student’s t distribution different to SND?

A
  • Has a lower peak, higher tails, have more variance
174
Q

When is a student’s t distribution used?

A
  • When population s.d. is unknown
175
Q

Does student’s t distribution include a variety of t tests?

A

yes

176
Q

How do you find the t statistic?

A

T(m) = (m-µ) / (s/√N)

177
Q

How do you find the estimated standard error?

A

(s/√N) – estimated standard error

178
Q

When using t distribution table, what value should you use for v?

A

When using t table – t (v = N-1) – subtract 1 off of sample size

179
Q

How do you find confidence intervals when population s.d. is unknown?

A
  • For 95% of repeat sample mean m would be within:
    o Some number c e.s.e.’s of µ
    o (µ- (c x s/√N) to µ+ (c x s/√N))
  • To find c:
    o Find t value for 0.025% in one tail (or 0.05% for 2 tails)
180
Q

How do you conduct a 1 sample t test?

A
  • Same as a z test except:
  • Work out e.s.e.
  • Find t statistic
  • Find if t stat is inconsistent with critical value for corresponding t(n) and significance level
  • Reject or fail to reject H0
  • Interpret in context
181
Q

When do you use a 1 sample t test?

A
  • Use to test whether sample mean you have is different from some given or hypothetical population mean