study notes Flashcards

1
Q

Crosstabs

A

H0: no association between variables A and B
H1: association between A and B

Assumptions:
1. two nominal variables
2. cells need an expected count greater than 5 (at least 80% of the cells)

SPSS: Analyze -> Descriptive Statistics -> Crosstabs -> Statistics (Phi and Cramers V)
Cells -> Observed + Expected + Row, Columnn, Total
Residuals: standardized

if expected count and count have a big distance we can reject H0 more easily

-> Cramers V measures the strength of the relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

one sample t-test Assumptions

A

H0: the populations mean equals the specified mean value
H1: the population mean is different from the specified mean value

Assumptions:
1. Normal distribution (approximately)
H0: normal distribution
H1: no normal distribution

SPSS: Analyze -> descriptive statistics -> explore (add outliers)

  1. no outliers -> remove them if there are outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

one sample t-test (the test)

A

-> when the assumptions are met do the test

SPSS: Analyze -> Compare Means -> One-sample t-test

to reject H0 -> p < 0,05
t > 2 and confidence intervall does not cross 0 -> results are statistically relevant

Cohen´s d: size of the effect ->
0 - 0,4 small effect
0,5 - 0,8 medium effect
0,9 - 1 large effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Independent sample t-test Assumptions

A

H0: all means are equal
H1: at least one mean is different

Assumptions
1. normal distribution
H0: normal distribution
H1: no normal distribution

SPSS: Statistics -> explore -> outlier + normality plots

  1. no outliers
    -> if there are outliers remove them with 1,5 IQR

SPSS: Explore -> Statistics -> Percentiles
IQR: Q3 - Q 1
lower outliers: Q1 - 1,5IQR
upper outliers: Q3 + 1,5
IQR
-> take them out with select cases and if condition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

independent sample t-test

A
  1. Homogeneity of variances - run the test
    SPSS: Analyze -> compare means -> independent sample t-test
    t > 2, confidence intervall cannot cross the 0 line -> then the results are statistically relevant

p < 0,05 -> we can reject H0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

paired sample t-test

A

H0: the population mean differences between the paired values is equal to zero
H1: the population mean difference between the paired values is NOT equal to zero

Assumptions to test
1. normal distribution
-> we need to compute a new variable from the difference of the variables
SPSS: transform -> compute variables
test normality with Analyze -> explore
H0: normal distribution
H1: no normal distribution

  1. test for outlliers
    -> if there are none you can do the study
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

paired sample t-test: conduct the test

A

SPSS: Analyze -> compare means and proportions -> paired sample t-test

Use the old variables

t > 2, confidence interval cannot cross 0 -> then results are statistically relevant

cohen´s d:
0 - 0,3 = small effect
0,3 - 0,5 = medium effect
0,5 - 0,8 = large effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

one way ANOVA

A

H0: all population means are equal
H1: at least one mean is different

Assumptions to test:

  1. Normal distribution
    Analyze -> Descriptive Statistics -> Explore
  2. no outliers
  3. Homogeneity of variances (conduct the test)
    Analyze -> compare means -> one way ANOVA

Statistics: click Descriptive, Homogeneoty of variance test, Welch test, means plot
options: turkey´s b and Games-Howell

H0: there is homogeneity
H1: there is no homogeneity

conduct test and look at Sig.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Correlation (Pearson)

A

1 = perfect correlation
0 = no correlation
-1 = negative correlation

Assumptions:
1. Linear relationship
SPSS: Graphs -> chart builder -> scatter plot

  1. Normal distribution
    SPSS: analyze -> descriptive statistics -> explore -> both variables in the dependent list
  2. outliers
    -> remove outliers
    skewness value is more important than Shapiro wilk test -> skewness between 1 and -1

Test:
Analyze -> Correlate -> Bivariate
0,1 - 0,3 = small
0,3 - 0,5 = medium
0,5 - 1 = strong

H0: no association
H1: there is an association

-> if there is no linear relationship we cannot use Pearson

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Spearman´s correlation

A

Assumptions:
1. paired observation measured at an ordinal or continuous scale
2. relationship does not need to be linear -> it needs to be monotonic

SPSS: Analyze -> correlate -> bivariate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Kendall´s Tau

A

Data requirements:
1. Two variables
2. Paired observations
H0: the two variables are independent
H1: There is an association between the variables

SPSS: Analyze -> Correlate -> Bivariate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Simple linear regression (Assumptions 1)

A

independent variable = x-axis
dependent variable = y-axis

  1. Linear relationship
    SPSS: graphs -> chart builder -> scatter plot -> add line of best fit
  2. independence of observations
    Durbin Watson -> value between 1,5 and 2,5
  3. no significant outliers
  4. Data needs to show homoscedasticity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Simple linear regression test

A

SPSS: Analyze -> Regression -> Linear

Statistics: Estimates, Confidence Intervals, Durbin-Watson, Casewise diagnostics

Plots
Y = ZRESID
X = ZPRED
Histogramm and Normality plots

  1. the residuals of the regression line need to be normally distributed -> see in Histogramm

H0: our x variable has no explanatory power
H1: x variable has powers to predict changes

Model summary: Adjusted R-squared tells us how many % of the changes are explained by the dependent variable

Model:
Constant + B *x + e

Statistically relevant if
t> 2, p (Sig) < o,o5 and confidence intervall does not cross the 0 line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Binned data

A

SPSS: Transform -> Visual Binning -> binned variable name

-> make cutpoints -> first cutpoint: minimum, number of cutpoints or width -> apply -> make labels -> analyze -> descriptive statistics ->frequencies -> bar chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly