Statistics Flashcards
Positive skew
skewed to the left
negative skew
skewed to the right
what is kurtosis
Pointy/flat. How heavily the tails of a distribution differ from the tails of a normal distribution
what is a hypothesisa
a predictable theory to test
what is a alpha level
the significance level
level which you would be happy to make an error rejecting the null hypothesis if it were really true
what does it mean if If p < alpha
null hypothesis is rejected this is a statistically significant result
what is a type 1 error
• False positive
• There wasn’t really a difference/ relationship but you said there was
• You rejected the null hypothesis when it was actually true
• A higher alpha level = more type 1 errors (the acceptable level of error is higher and therefore you will make more errors)
• Also depends on the type of stats you use and sample
Alpha too high
what is a type 2 error
• False negative
• There was a difference/ relationship but you said there wasn’t
• You accepted the null hypothesis and missed the difference/ relationship
• A lower alpha level = more type 2 errors (acceptable level of error is low and therefore you will miss differences/ relationships because you think these are errors)
• Also depends on the type of stats you use and sample
Alpha too low
what is power
the chance of spotting an error
A powerful test is one that spots the difference if there is one
what are Parametric tests
• Interval/ ratio scale, data normally distributed, data from multiple groups have the same variance, data are independent
• More powerful
• Greater range of tests
Easier to understand and compute
give the equation for the regression line
Y=mx + c where m is gradient, c is intercept on the y axis
what is r
R = 0 means there is no correlation
what is r squared
R squared is a good indicator of goodness of fit
what is regression
• Regression puts the line through and gives the equation so you can calculate y
• Regression is basically the same thing as correlation but with more information
• Linear regression describes strength and direction of the relationship between 2 variables where you measure one and manipulate or control the other variable as basis for prediction
• —-> Bivariate regression
• Linear regression helps to establish the correlational relationship
• Residual = the difference between a predicted score and an actual score. Residuals are - if the line is perfect (measure of error)
Bivariate regression - need to make sure residuals are normally distributed (Shapiro Wilk), homoscedasicity
what is linear regression
• Describes strength and direction of the relationship between variables where you measure one and manipulate or control another
• Helps to establish the correlational relationship as a basis for prediction
• To check linear regression is significant, use the ANOVA table
When you run linear regression analysis in SPSS, Pearson’s correlation coefficient is calculated as part of the analysis
what is multiple linear regressoin
• Normality of residuals • At least 2 IV • Each IV has at least 10 participants • Linearity Normal distribution of all unstandardized residuals combined together
what is linear bivariate regression
• Residuals normally distributed
• Only looking for linear relationship
• Need to make sure the residuals are not related to each other
One outlier can cause a big proble
what is multiple regression
• When there are more variables
How several things are relate
what is Standard error of the estimate
- Tells us how accurate your predicted value is
- Used to build a 95% confidence interval
- To calculate how much error there would be if you used the linear regression to predict a score
- 95% confidence interval is +/- 1.96 x standard error of the estimate
- Confidence interval= the range of the scores what the y will be in because the line doesn’t fit exactly and therefore we might be slightly off with our prediction
- Confidence interval means that we can be 95% sure that the predicted scores will ie within the identified range of scores
- To calculate the confidence interval, multiply the standard error by 1.96
what are beta weights
- Allow you to compare the importance of different IV which are usually calculated using different measurement units
- Standardised scores of your raw values which allow for easier comparisons to be made
- The highest beta weight is the most important variable in your multiple regression
- Absolute value = ignorewhether it is positive or negative
what are error bars
Give you an idea how precise the results are (indicate error)
what is the absolute difference
• Simply find a difference between the two means and ignore if it is positive or negative
• Percentage of overall mean usually asked in the context of absolute difference between the means of the 2 groups. Need to find the absolute difference between the means of 2 groups. You then need to find the overall mean of 2 groups and express as a percentage
Absolute difference % = absolute difference / overall mean x 100
when would you use a one way between subjects ANOVA
- Interval/ ratio
- At least 3 levels of IV
- Normality of DV within every level of IV
- Homogeneity of variance