All topics Flashcards
What makes a good theory?
- Falsifiability
- Parsimony (elegance of theory – simplest explanation is best)
- coherence
- correspondence with reality (more likely to have a high pay-off)
Reliability of Measures
- Test-retest – administering a test twice
- inter-rater reliability – extent to which 2 raters (judges) obtain the same result using the same measure
- Split-half reliability – a test is split in 2 and the scores from each half are compared with each other
Validity of measures:
- Face validity – the extent to which an assessment measures the variable/construct it purports to measure
- Content validity – the extent to which a measure covers all aspects of the construct it is meant to assess
- Construct validity – 2 types:
– convergent – when 2 tests that purport to measure the same thing are highly related
– divergent (discriminant) – tests that measure different but related constructs should not be highly correlated (eg. IQ for spatial v reading)
Research method:
- Experimental
- quasi-experimental – (manipulation of IV but cannot randomly assign participants) eg. male v female, smoker v non-smoker. Cannot draw cause-and-effect conclusions
- Correlational
What are the different kinds of research design?
- Between subjects – different participants assigned to each condition
- within subjects or repeated measures design – each participant exposed to both conditions
- matched pairs – different participants assigned to each group but matched on particular characteristics
What is difference between descriptive and inferential statistics?
- Descriptive statistics – summarise data eg. mean, median, mode, variance, SD,
- Inferential statistics – help us test hypotheses. Allow us to make generalisations about populations of interest based on samples eg. correlation, regression, ANOVA
Define:
- Mean
- median
- Mode
- Reliability
- Validity
Mean – average
Median – the middle score in a distribution
Mode – score that occurs the most often
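Python's standard-library `statistics` module can illustrate all three on a small made-up set of scores:

```python
import statistics

scores = [2, 3, 3, 5, 7]
mean_val = statistics.mean(scores)      # (2+3+3+5+7)/5 = 4
median_val = statistics.median(scores)  # middle of the sorted scores = 3
mode_val = statistics.mode(scores)      # most frequent score = 3
print(mean_val, median_val, mode_val)
```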
Reliability v validity
Reliability – consistency of a measure
Validity – accuracy of a measure (measures what it purports to measure)
How do you find the median with an even number of scores?
- add the two middle scores and divide by 2 – ie. average them
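`statistics.median` does exactly this averaging when the number of scores is even – a quick sketch with made-up scores:

```python
import statistics

scores = [3, 5, 7, 9]            # even number of scores
med = statistics.median(scores)  # averages the two middle scores: (5 + 7) / 2
print(med)  # 6.0
```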
Describe different scales of measurement
- Nominal – consists of categories with no underlying scale or order. Eg. religious affiliation – Christian, Buddhist, Hindu, Muslim etc.
- Ordinal – Consists of categories that are ORDERED, but don’t know what the distance is between ranks (ie. The distance between scale values is unknown). Eg. police ranks.
- Interval – Meaningful distances between points on the scale eg. temperature. Interval scales lack a true zero point (zero does not mean the absence of the quantity – you can still feel temperature at zero degrees)
- Ratio – All the characteristics of an interval scale plus a true zero point – weight and length are examples
Discrete v continuous variable
Discrete – takes on whole numbers only
Continuous – can take any fractional or non-whole value
Shape of Distribution
- normal – bell shaped
- positively skewed – tail pointing to the right
- negatively skewed – tail pointing to the left
Research ethics (1q)
- informed consent
- voluntary participation
- passive deception (don’t tell whole truth but don’t tell lies)
- active deception (deliberately mislead the participant with information)
- withdrawal anytime
Central Tendency (3 q’s)
the tendency for the values of a random variable to cluster around its mean, median, or mode
mean/median/mode – which would be the best to use?
- mean is affected by outliers and can be skewed
- median – less affected by outliers and skewed data
- mode (most frequent) – normally used for categorical data – problematic when 2 categories share the highest frequency
- not a good measure when the most common value is far away from the rest of the data in the set
- when data is skewed – the median is the best representative of the central location of the data
Type of variable and best measure of central tendency?
Nominal - Mode
Ordinal - Median
Interval/Ratio (not skewed) - Mean
Interval/Ratio (skewed)- Median
Population v sample
Population – all the individuals of interest
- population values are called parameters
Sample – the individuals selected from the population used in study
- sample values are called statistics
Sampling error
the discrepancy between population parameter and sample statistic
What is the relationship between sample statistics and population parameters?
A sample is a part or portion of a population
- parameter is a measure of describing whole population
- statistic is a measure of a sample/portion of a target population
What is standard deviation?
a measure of variability – how spread out are the scores?
Variability 3 (qs) What does SS denote?
- sum of squares = sum of squared deviation from the mean
Variability?
- how much scores vary from each other and from the mean
Variance
- the average of the squared differences from the mean
Standard deviation?
- numerical depiction of variability
- under a normal distribution 68% of scores fall within ±1 SD of the mean (95.44% within 2 SD, 99.73% within 3 SD)
Define and describe the relationship between variance and SD?
- As variance increases so does standard deviation
- low variability in data set = low standard deviation
What are the degrees of freedom?
In a sample, N-1 scores are free to vary. For example, if we have a sample of 3 scores and we know the first 2 scores and the mean, we know what the 3rd score must be. So 2 scores are free to vary but the third is not, thus N-1.
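The "last score is fixed" idea can be shown in a couple of lines (made-up numbers):

```python
# With a known mean, N-1 scores are free to vary; the last is determined.
mean = 10
n = 3
known = [8, 12]                # the first N-1 scores (free to vary)
third = mean * n - sum(known)  # the last score is fixed by the mean
print(third)  # 10
```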
Why do we adjust degrees of freedom in a sample?
We use n-1 because of sampling error – the sample may not be representative of the population
What is a z-score?
- a standardised score (transformation of distribution of raw scores into z-score distribution)
- Z-score will always have mean of 0 and SD of 1
- Z-score is expressed in standard deviation units
What do you need to know to calculate z-score?
X – individual score
M – sample mean
SD – sample standard deviation
If you convert all the raw scores to z-scores what do you get?
- mean = 0
- SD = 1
- distribution is same shape as before ALWAYS (ie. Still normal/skewed etc.)
Benefit? – allows you to compare scores from different distributions
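A minimal sketch of the z-transformation on made-up scores (using the population SD here for simplicity):

```python
import statistics

scores = [50, 60, 70, 80, 90]
m = statistics.mean(scores)     # 70
sd = statistics.pstdev(scores)  # population SD, used here for simplicity
z = [(x - m) / sd for x in scores]
# A z-score distribution always has mean 0 and SD 1
print(round(statistics.mean(z), 10), round(statistics.pstdev(z), 10))  # 0.0 1.0
```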
What does a z-score of +1 mean? What does z-score of -2 mean?
- The score is one SD above mean
- the score is 2 SD below the mean
Why do we hypothesis test?
- To get around heuristics (mental shortcuts – availability/representative) and human biases (hindsight/cognitive)
What is a theory?
- a ‘model’ that describes how certain phenomenon work
What is a hypothesis?
- A statement derived from a theory or theories about the relationship between variables or differences between groups
What is the null hypothesis and alternative hypothesis?
null - states there is no effect
alternative - states there is a difference
Error Types
Type I – Reject the null hypothesis when it is TRUE (false positive) (alpha - 5% chance)
Type II – Accept the null hypothesis when it is FALSE (false negative) (beta – 20% chance)
Type III – (only applicable to a directional hypothesis; H1) – predicting the inverse of a true relationship
What does p < .05 mean?
In NHST p < .05 means that there is less than a 5% chance of obtaining the results (or more extreme) if the null hypothesis were true
What factors affect the p-value?
- Size of mean differences – larger differences increase the probability of rejecting the null
- Variability of scores – greater variability decreases the probability of rejecting the null
- Sample size – a larger sample increases the probability of rejecting the null
What is correlation?
- when 2 variables are related to each other a correlation exists
- measures the relationship between 2 variables
- correlation allows prediction, NOT causation
What is the correlation coefficient? (r)
- numerical index of strength and direction of relationship
- expressed as a number between -1 and +1
- direction can be positive or negative (as one goes up the other goes up OR as one goes up the other goes down)
- magnitudes closer to 1 indicate a stronger relationship
What does positive/negative/no correlation look like on scatter plot?
- positive – slopes up from left to right
- negative – slopes down from left to right
- no correlation – no pattern (ill defined scatter)
Perfect correlation
- perfect linear relationship – every change in x is accompanied by a corresponding change in y variable
What is small/medium/large correlation? (coefficient (r))
- small – 0.1 to 0.3
- medium – 0.3 to 0.5
- large – 0.5 to 1.0
What is the coefficient of determination?
- is the correlation coefficient squared
- the percentage variation in one variable that can be predicted based on the other variable
- as the magnitude of the correlation increases, our ability to predict one variable based on knowledge of the other variable increases
Calculate the coefficient of determination from r = .70 and what does the result mean?
r² = .70 × .70 = .49
Means that variable X can account for 49% of the variation in variable Y
The higher the correlation coefficient the higher the coefficient of determination will be
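A quick sketch with scipy's `pearsonr` on made-up scores – squaring r gives the coefficient of determination:

```python
from scipy.stats import pearsonr

x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r, p = pearsonr(x, y)
r_squared = r ** 2  # coefficient of determination
print(round(r, 2), round(r_squared, 2))  # 0.8 0.64 -> 64% of variance in y predictable from x
```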
What is the 3rd variable problem?
- As correlation is a prediction not a causation, the observed relationship may be accounted for by some other third variable eg. size of foot might be strongly correlated to IQ in children, but the 3rd variable – age may account for the relationship
What are the assumptions for correlations?
- Independence – each participant should participate only once in the research and should not influence the participation of others
- Normality – each variable should be normally distributed – ie. Data form a symmetrical bell-shaped curve about the mean. To assess normality we can look at Skewness and kurtosis
- Linearity – should be a linear (straight line) relationship between the variables. If the relationship is not linear it will not be adequately captured and summarised by Pearson’s r.
- Homoscedasticity – the error variance is assumed to be the same at all points along the linear relationship. That is the variability in one variable should be similar across all values of the other variable.
What is skewness?
- is a measure of symmetry of distribution
- when the skewness statistic is 0 the distribution is perfectly symmetrical
What is kurtosis?
- how peaked or flat is the distribution
- a kurtosis statistic of 0 (plus a skewness statistic of 0) indicates the distribution is normally distributed.
How to calculate normality?
- Use the Shapiro-Wilk test if N is lower than 50 – if p > .01 the null hypothesis is not rejected, indicating the distribution is normal (use the Kolmogorov-Smirnov test for N = 50+)
- or divide the skewness/kurtosis statistic by its standard error – if the result falls within ±3.29 the distribution is normal; if it falls outside this range it is not
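A sketch of these checks with scipy on a small made-up sample (N < 50, so Shapiro-Wilk applies):

```python
from scipy import stats

scores = [4.9, 5.1, 5.0, 4.8, 5.2, 5.0, 4.7, 5.3, 5.1, 4.9]  # N < 50
W, p = stats.shapiro(scores)      # Shapiro-Wilk test for small samples
skewness = stats.skew(scores)     # 0 for a perfectly symmetrical distribution
kurt = stats.kurtosis(scores)     # excess kurtosis; 0 for a normal curve
print(p > .01)  # True here: no evidence against normality
```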
What is the difference between bivariate correlation and partial correlation?
Bivariate – used to measure the linear association between 2 continuous variables
Partial – used to measure the linear association between 2 continuous variables after controlling for a third (and fourth, fifth etc.) continuous variable
What is the t-statistic?
t = the actual difference between the sample mean (from the data) and the population mean (hypothesised from H0) / the estimated standard error (the estimated standard distance between sample mean and population mean)
How to calculate probability in one sample t-test?
In a one-sample t-test the sample mean is compared with a predetermined value (the test value – the population mean under H0)
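A minimal one-sample t-test sketch with scipy on made-up scores, assuming a test value of 50:

```python
from scipy import stats

sample = [51, 49, 52, 53, 50, 48, 54, 52]
test_value = 50  # the predetermined value (population mean under H0)
t_obt, p = stats.ttest_1samp(sample, popmean=test_value)
print(t_obt > 0)  # True: the sample mean (51.125) is above the test value
```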
What is the definition of degrees of freedom?
- the number of scores that can vary given a constant mean
Eg. If you only have a set amount of money to pay 50 people in your company you could allow 49 of them to set their own salary but the unlucky last would have to get a very small salary or even pay to work in your company to make up for 49 that gave themselves high pay. - thus only 49 (N-1) could vary but one is fixed by total amount
What is the t-distribution table showing?
- The numbers are the values of t that separate the tail from the main body of the distribution.
How to find if result statistically significant from t-distribution table?
T-critical value is found in table, t-obtained value is calculated from results – if t-obtained is greater than value found in table – results are statistically significant.
two types of 2 sample t-test:
- Independent samples – aka independent groups, between groups, between subjects
- Paired Samples t-test – aka repeated measures, within samples, matched samples, dependent samples
Relationship between t-statistic, sample variance and statistical significance:
- When the variance increases, so does the standard error. Since the standard error occurs in the denominator of the t statistic, when the standard error increases, the value of the t decreases.
- when the t-statistic decreases there is less probability of getting a t-obtained greater than t-critical, so the p-value increases. So when variance increases the p-value increases.
independent sample t-test
- uses a between-groups design – compares sample mean 1 with sample mean 2
- t = the actual or observed difference divided by the estimated standard error
- when n for sample 1 is not equal to n for sample 2, the variance must be pooled!
- to pool the variance: (sum of squared deviations from the mean in sample 1 + SS for sample 2) / (degrees of freedom for sample 1 + degrees of freedom for sample 2)
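The pooled-variance formula in a few lines, with hypothetical sums of squares and degrees of freedom:

```python
def pooled_variance(ss1, ss2, df1, df2):
    """Pooled variance = (SS1 + SS2) / (df1 + df2)."""
    return (ss1 + ss2) / (df1 + df2)

# hypothetical values for two samples
sp2 = pooled_variance(ss1=50, ss2=30, df1=9, df2=7)
print(sp2)  # (50 + 30) / (9 + 7) = 5.0
```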
Independent Samples t-test assumptions:
- Scale of measurement – DV should be interval or ratio data
- Independence – Each participant should participate only once in the research, and should not influence the participation of others
- Normality – each group of scores should be approximately normally distributed
- Homogeneity of variance – There should be an approximately equal amount of variability in each set of scores
How does adjusting the degrees of freedom make it harder to detect a significant result?
- looking at the t-distribution table – the smaller the df = larger t-critical value. This means you would need a larger t-obtained value to get a statistically significant result. So harder to get a statistically significant result with smaller df.
What is Cohen’s D?
- A measure of effect size
- Measures the extent to which the 2 sample distributions overlap – measured in standard deviations. If Cohen’s d were zero there would be complete overlap between the 2 populations
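A sketch of Cohen's d for two equal-sized made-up groups, assuming the pooled SD is taken as the square root of the average of the two sample variances (valid when n is equal):

```python
import statistics

group1 = [10, 12, 11, 13, 14]
group2 = [15, 17, 16, 18, 19]
m1, m2 = statistics.mean(group1), statistics.mean(group2)
# pooled SD: square root of the average of the two sample variances (equal n)
sp = ((statistics.variance(group1) + statistics.variance(group2)) / 2) ** 0.5
d = (m2 - m1) / sp  # effect size in standard deviation units
print(round(d, 2))  # 3.16
```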
How to measure homogeneity?
- Using Levene’s test.
- For this course, if Levene’s test gives p < .001 the assumption of homogeneity has been violated – the result is significant, meaning there is a difference in variance between the groups.
- If p > .001 the result is not significant and there is no difference in variance between the groups.
PAIRED SAMPLES t TEST
- Measure participants before and after treatment
- Compare the performance (DV) of males and females (IV) that are matched on particular criteria
How to report results APA style:
participants had a higher recall score in the images condition (M = 26.00, SD = 4.71) than the no images condition (M = 18.00, SD = 4.22), t(9) = 3.24, p = .010 (two-tailed).
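The statistics in that APA sentence come from a paired-samples t-test; a sketch with scipy on hypothetical recall scores for the same participants in two conditions:

```python
from scipy import stats

# hypothetical recall scores, same participants in both conditions
images    = [28, 24, 30, 22, 26]
no_images = [20, 18, 22, 15, 19]
t_obt, p = stats.ttest_rel(images, no_images)
print(t_obt > 0, p < .05)  # higher recall in the images condition
```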
Notion of pooled variance
- when the sample size of the first sample differs from that of the second (n1 is not equal to n2), the variance is pooled to eliminate the disparity
Difference between one sample and independent t-test
One-sample t-test – the actual difference between the sample mean and the population mean versus the difference expected by chance (error).
Independent t-test – the actual difference between 2 sample means versus the difference expected by error.
Why is it called a one-way ANOVA?
- because there is one independent variable which might have 2 or more groups or levels.
Eg. IV – temperature had 3 groups/levels at 50 degrees, 70 degrees or 90 degrees
Why use ANOVA?
- to compare more than two treatments
- Anova = Analysis of variance – all about variance
What is sample variance?
Sample variance is the sum of squares divided by N-1. To get the sum of squares: subtract the mean from each score, square each deviation, and sum them.
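The same calculation step by step on made-up scores, checked against the standard library:

```python
import statistics

scores = [2, 4, 6, 8]
m = statistics.mean(scores)             # 5
ss = sum((x - m) ** 2 for x in scores)  # sum of squared deviations = 20
sample_var = ss / (len(scores) - 1)     # SS / (N - 1)
print(ss, round(sample_var, 3))
```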
What is the key to understanding ANOVA?
- understanding between group variability versus within group variability
What does the alternative hypothesis state in ANOVA?
- that one or more pairs of treatment means will be different from each other
What is the total variability made up of in ANOVA?
- Between-treatment variance, which is due to:
- treatment effects
- chance
- Within-treatment variance, which is due to:
- chance/individual differences
What is the F value and how do you calculate it?
F is a ratio of variability: F = variance between treatments / variance within treatments
If the null hypothesis is true, the treatment effect is 0, so F = (0 + error (individual differences + other)) / error (individual differences + other) = 1
So if the null hypothesis is true, F will be 1 or close to 1; if there is a treatment effect, F will be greater than 1.
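A sketch of a one-way ANOVA with scipy, using three hypothetical temperature groups where the between-group differences are large relative to within-group variability:

```python
from scipy import stats

# three temperature conditions (hypothetical scores)
g50 = [50, 52, 48, 51]
g70 = [70, 68, 72, 69]
g90 = [90, 88, 91, 89]
F, p = stats.f_oneway(g50, g70, g90)
print(F > 1, p < .05)  # large between-treatment variance -> F well above 1
```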
How to calculate total number of participants and number of groups from df?
- Total – go to total df then +1 = total number of participants
- look at df for between groups and +1 = total number of groups
What are the assumptions for One-way between groups ANOVA?
- Scale of measurement – DV should be interval or ratio data
- Independence – each participant only participates once
- Normality – each group of scores should be approximately normally distributed
- Homogeneity of Variance – there should be approximately equal amount of variability in each set of scores
Why do we have to use a post-hoc test with ANOVA and what is most commonly used in psychology?
- ANOVA can tell us there is a difference but doesn’t tell us where the difference lies.
- Tukey post-hoc test is most often used
When would we use a 2-way ANOVA?
When we have more than one independent variable in the analysis
Example of 2 way ANOVA
Example used is whether the effects of puppet-type are the same for binge eaters and non-binge eaters. – 3 different kinds of puppet – cookie monster, counts and warrens
- each of these are either – binge eaters or non-binge eaters
= 3 x 2 factorial ANOVA (6 factorial combinations)
- DV is how many cookies eaten
Can you use a 1-way ANOVA when you have 2 independent variables?
No, must use 2-way ANOVA when have 2 factors
When does a significant interaction effect occur in 2 way ANOVA?
- An interaction occurs when the effect of one factor on the dependent variable is not the same at all levels of the other. You can tell this by looking at the graph: if the lines are not parallel there is an interaction effect
Assumptions of 2way ANOVA
- Independence
- Population distributions are normal
- Homogeneity of variance (s2)
What is partial ETA?
A measure of effect size
What is effect size?
- A measure of the magnitude of a treatment effect
- independent of sample size
- can be standardised – measured in standard deviations. (how many standard deviations are the means apart)
- referred to as d or cohen’s d
How is effect size measured in a correlation?
= r (or Pearson’s r) – effect size is the strength of the association between 2 variables
- ranges between -1.0 and +1.0
- Small: r between -.10 and +.10
- Medium: between .10 and .40
- Large: r > .40
What is r ²?
- is the proportion of variance explained – the proportion of variance in one variable that can be explained by the other variable. r² is like an apple with a bite taken out of it: a small bite accounts for a small percentage of the variability in the apple, while many bites that leave only a core account for a much larger percentage of the variability explained.
what are the conventions of r²?
0.01 < r² < 0.09 - small
0.09 < r² < 0.25 - medium
r² > 0.25 - large
Large effect is 25% of the variability in the dependent variable can be explained by the independent variable. What’s the chance of a single independent variable accounting for 100% of the variability in the DV? – not high as normally there are multiple factors accounting for the variability.
What is Cohen’s f?
It is commonly used in power analysis and is the ratio of variance explained to variance unexplained.
- similar to η² and R²
- used in power analysis to answer: what sample size will you need to detect a statistically significant effect?
What is power?
- the probability of correctly detecting a statistically significant effect if one exists.
How do you calculate power?
Power is determined jointly by effect size (ES), sample size (N), and the alpha level
What are the error types?
Type I – Reject the null hypothesis when the null hypothesis is true (False positive)
Type II – Accept the null hypothesis when it is false (False negative)
Type III – predicting the inverse of a TRUE relationship
What is the type I error rate set at ?
It is set at the alpha level of p < .05 – meaning the probability of a type I error is 5%. (so the type I error rate is also alpha)
So what is power?
1 − beta, where beta is the type II error rate (accepting the null hypothesis when it is false)
Cohen has set power at .80 – an 80% power of detecting an effect if it exists.
What is the relationship between Type I error, Type II error and statistical power?
When we increase α, we decrease β and increase our statistical power. This is because increasing alpha moves the significance threshold closer to the centre of the distribution, enlarging the rejection region; this shrinks the area of beta (under the normal distribution) and increases statistical power.
- When beta decreases, power increases.
- When alpha increases, beta gets smaller and power gets larger
What is power determined by?
- sample size
- effect size
- P-level
What will result in an increase in power? *
- large effect size (large differences between means and small SD)
- Large sample size (as standard error decreases and sample mean closer to pop mean)
- High alpha level (.05 or .10) (the larger the alpha the bigger the rejection region, the more chance to reject the null hypothesis, high alpha – high power)
- One-tail test
- Within subjects design (as decreases variability – any variability in the DV is not due to individual differences)
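Two of the factors above (effect size and sample size) can be seen in a small simulation – a sketch that estimates power for an independent-samples t-test by repeatedly drawing made-up samples:

```python
import random
from scipy import stats

random.seed(1)

def sim_power(n, effect, alpha=0.05, reps=1000):
    """Estimate power of an independent-samples t-test by simulation."""
    hits = 0
    for _ in range(reps):
        a = [random.gauss(0, 1) for _ in range(n)]       # control group
        b = [random.gauss(effect, 1) for _ in range(n)]  # treatment group
        _, p = stats.ttest_ind(a, b)
        if p < alpha:
            hits += 1
    return hits / reps

# A larger sample size raises power for the same effect size (d = 0.5)
power_small_n = sim_power(n=20, effect=0.5)
power_large_n = sim_power(n=64, effect=0.5)
print(power_small_n < power_large_n)  # True
```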
What decreases power?*
- Small Effect size (small associations or small differences between means and large SD)
- Small N (less likely to be representative of the population)
- Low alpha level (.01 or .001) (the smaller the alpha, the smaller the rejection region – few scores will fall in it = less power)
- two-tail test
- Between subject design (more variability due to individual differences – error is obscuring the treatment effect making it harder to detect)
What is the relationship between alpha and z-score?
If we increase alpha (from .05 to .10) we reduce the critical z-score (the cutoff moves closer to the mean)
What happens to power with directional v non-directional hypothesis?
With a non-directional (two-tailed) hypothesis the critical rejection region is split between both tails. This means there is .025 probability in each tail (a smaller alpha level per tail); with a smaller alpha per tail you get a larger critical z-score, and with a larger critical z-score you get less power. A directional (one-tailed) test puts the whole .05 in one tail, so it has more power.
If you are completing a research topic and know that a particular effect size is small what can you do to increase power?
- One-tailed hypothesis
- Increase sample size to large
- Have a within subjects design (as limits variability or error between groups)
CHI SQUARE – Non-parametric tests for nominal data
What is meant by non-parametric?
- Distribution-free tests
- Are not normally distributed – may be positively/negatively skewed
- may display kurtosis:
- Mesokurtic (normal distribution)
- Leptokurtic (long-tailed, like a kangaroo)
- Platykurtic (flatter, like a platypus)
Why are parametric tests preferred?
- in general, for the same number of observations, they are more likely to lead to the rejection of a false null hypothesis
- they have more statistical power
What is nominal data used for?
- Labels used for CATEGORIES of data
- No meaningful underlying scale
- eg. religious affiliation
Assumptions for Chi-square (χ²)?
- no assumption of homogeneity of variance
- no assumption about the population distribution
What is chi-square dealing with?
- Frequencies
- Does not have the typical variance, SD, M etc. as it deals with frequencies, not numerical scores
What is the chi-square test used for?
- Goodness of fit (comparing frequencies of one nominal variable to theoretical expectations)
- Independence (comparing frequencies of one nominal variable for different values of second nominal variable)
Describe what is meant by goodness of fit?
- How do the frequencies that we have observed fit the frequencies that we expected?
- If fit is good then χ2 will be small. We want there to be a big difference between observed and expected frequencies in order to reject the null hypothesis.
- We want a bad fit to reject the null hypothesis
What are the 2 ways goodness of fit can be evaluated?
- No preference – no variation from one category of the nominal variable to the next: the same frequency value is expected in every category
- No difference from a comparison population – the researcher examines the literature to see what frequencies to expect in each category of the single nominal variable, as specified by the null hypothesis
What is meant by the term observed frequency?
The actual data obtained. Observed frequencies are always whole numbers, as we are dealing with individuals and not parts of individuals
What is meant by the term expected frequency?
- Calculated based on proportions – can be fractions
- those based on the null hypothesis
Chi-square formula?
- χ² = Σ (O − E)² / E – chi-square is about comparing observed frequencies (O) with expected frequencies (E)
- under the null hypothesis expected frequencies are the same across all categories
- a bad fit means probably statistically significant – a good fit probably not
How to tell if chi-square is significant?
If chi-square obtained (24.30) is greater than chi-square CV (9.49) then it is significant!
(same as in the t-test: if t-obtained is greater than t-critical, reject the null and state it is significant)
- the result above is stating there is a bad fit between observed frequencies and expected frequencies
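A goodness-of-fit sketch with scipy, using made-up frequencies for 5 categories and equal expected frequencies under the null:

```python
from scipy.stats import chisquare

observed = [30, 10, 10, 10, 40]  # observed frequencies in 5 categories
expected = [20, 20, 20, 20, 20]  # equal frequencies under the null
chi2_obt, p = chisquare(observed, f_exp=expected)
print(round(chi2_obt, 2), p < .05)  # 40.0 True -> bad fit, reject the null
```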
How would you report results APA style?
χ2(4, N = 100) = 24.30, p < .05, φ = .49
Tell me more about the standardised residual and its contribution to significance?
- there is a standardised residual for each category – looking at this can tell us which category contributed to the chi square being significant
- If the absolute value for R > 2 then that category contributes to overall significance
- in example above category C and HD are > 2, meaning these categories have contributed to overall significance
Chi-square test for contingencies (independence)
- same formula as goodness of fit but have a Second nominal variable to consider
- eg – gender and grades – there would be 2 equivalent variations on null hypothesis
– there is no relationship between grade and gender
– there is no grade difference for gender
How to calculate degrees of freedom for chi-square?
df = (R-1)(C-1)
R = rows (how many categories for gender = 2)
C = columns (how many categories for grades = 5)
df = (2-1)(5-1) = 4
What to use for effect size for chi-square?
- for a 2x2 table use the phi coefficient (2x2 means each of the two nominal variables has 2 categories)
- for larger tables such as 2x3 or 3x3 use Cramér’s V (a modification of phi)
- for Cramér’s V, the df used is the SMALLER of (rows - 1) and (columns - 1)
What are the conventions for reporting phi?
- small (0.10), medium (0.30), large (0.50)
How to report the results APA style?
χ2(4, N = 100) = 13.31, p = .01 (two-tailed), φ = .37
Chi square v Phi?
- Chi-square tells you if the result is significant or not
- Phi and Cramér’s V tell you the size of the relationship or the size of the difference
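A sketch tying the two together: run a test of independence on a hypothetical 2x2 table with scipy, then compute phi from the chi-square value (Yates' correction is disabled so the hand formula matches):

```python
from scipy.stats import chi2_contingency

# hypothetical 2x2 contingency table of frequencies
table = [[30, 20],
         [10, 40]]
chi2, p, df, expected = chi2_contingency(table, correction=False)
n = sum(sum(row) for row in table)  # total frequency = 100
phi = (chi2 / n) ** 0.5             # phi coefficient for a 2x2 table
print(df, round(phi, 2))            # df = 1; phi ~ .41 (a large effect)
```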