Paper 2 data handling + statistical testing Flashcards
Mean
the mean is calaculated by adding up available data scores, and dividing by the number of actual data scores
S - Representative of all the scores/values collected.
L- distorted by extreme values / No values should be ignored
Mode
most frequently ocurring score in a data set, and this becomes your mode
S - easy to calculate to identify/calculate
L - The mode does not consider all the values in the data.
There can be more than one mode or no mode for the data.
Median
rank all scores from smallest to largest, and then work out which is the middle value. 2 scores that sit in the middle, then the median is the sum of the 2 scores, divide by 2.
S - easy to identify and extreme scores do not affected it
L - Not all values are included
Range
highest value - lowest value
s - easy to calculate
l - only takes into account the most extreme values unrepresentative of the dataset as a whole.
Negative skew
long tail on the negative left side of the peak.
most of the distribution is concentrated on the right
the mean is lower than the median/mode
Postive skew
long tail on the positive right side of the peak
the mean is higher than the mode
Distributions
Normal
symmetry on both sides of the curve
mean, median, mode are equal
Display of quantitaive data
table - converted to descriptive statistics
bar chart - bars are separate to show we are dealing with separate conditions
scattergram - depict associations between co-variables
histogram - distribution of s continuous data set
Line graph - displays continuous data and use points connected by lines to show how something changes in value over time.
standard deviation is a meaure of dispersion that shows the spread of scores around the mean.
is a meaure of dispersion that shows the spread of scores around the mean.
the greater the SD the greater the spread of scores around the mean.
S- precise measure of dispersion than the range as it included all the values
L - only takes into account the extreme values - not giving a full representation of the spread of scores - it can be distorted by a single extreme value , extreme values cant be revealed.
Experimental/alternative Hypothesis
A clear, precise testable statement/prediction about the relationship between the variables in the investigation
Directional (one-tailed)
States the direction of the investigation
Non-directional (2-tailed)
States that there will be a difference between 2 variables but not what the difference will be
Null Hypothesis
Written along side alternative/experimental hypothesis
states that there is no difference between variables/conditions and any differences is due to chance
Significance level
psychologists can never be 100% certain aboout anything. So they have settled on a 95% confidence level .
this is written as p<0.05
this means there is a 5% or less that the results are due to chance and we must reject our null hypothesis
alternative you can say there is a 95% likelihood/confidence that the findings are not due to chance
if the null hypothesis has been rejected the results are significant
if the null hypothesis has been accepted the results are insignificant
Type 1 error
false positive
**D: A type 1 error It is where you accept the alternative/experimental hypothesis when it is false.
- occurs when a researcher incorrectly rejects a true null hypothesis
**
5% chance of making a type 1 error
psychologist use a 0.05 level of significance as it is halfway between type 1 and type 2 error, because 0.01 and 0.1 level of significance so the chance of making an error is reduced
too lenient - optimistic level of significance 10%
type 2 error
False Negative
happens when you accept the null hypothesis when it should actually be rejected.
occurs if the investigator fails to reject a null hypothesis that is actually false
A type 2 error occurs when the p value is set too low, for example, p<0.01. - too strict - pessimistic
How to use statistical tests
- calculate the calculated value using the correct statistical test
- Compare the calculated values with the critical value using a critical value table.
- if the calculated value - equal to or greater then or smaller than
Choosing a statistical test
- am i looking for
difference/relationship - What is experimental design?
- independent group design - unrelated - Ps in each condition are different
repeated measures - same ps used in all conditions
matched pairs design - Ps are matched together - Which level of measurement is the data?
Nomial counting things and putting them in categories - mode
Ordinal - measuring things and data has been put in rank order of size - median/range
Interval. - measuring things on an equal scale that has equal units - mean/standard deviation
Sign test is used when ?
- test of difference
- Nomial data
- Repeated measures-The same Ps are used in more than one condition
Sign test step 1
- calculate the observer/calculated value using the sign test
- data need to be converted to nomial data (named categories
- calculate the difference between 2 scores of the Ps
- if there is an increase between condition 1 and condition 2 write a +
- if there is a decrease between conditions 1 and 2 write a -
- if there’s no increase/decrease write 0 - theses scores will not be included in the investigation
- add up all the + and -
- count the number of the less frequent sign this is the calculated/observer value
- for example if there are 9+ and 4- then the observer/calculated value is 4
Sign test step 2 + 3
- use the critical value table to work out the critical value for the investigation
- the critical value needs to be compared with the observer/calculated value
the observer/calculated value must be equal to or less than the critical value for the investigation/findings to be considered significant (the null hypothesis needs to be rejected)
Spearman’s Rho
- testing for a correlation- a relationship between 2 variables
- data is related ( comes from same person ) - repeated measures design
- level of measurement: Ordinal (put iin order) or interval
- Needs number of Ps
- Significant = OV must be greater / equal than CV= Null hypothesis rejected
Pearson’s r
Predicts correlation-a relationship between two co-variables.
Data is related
level of measurement: Interval data
Significant = OV must be greater / equal the CV= Null hypothesis rejected
Chi-squared
Predicts a difference between 2 conditions
data is unrelated / independent
Independent group design used
level of measurement: Nominal data
Degree of freedom needed
Significant = OV must be greater / equal the CV= Null hypothesis rejected