Introduction to statistics Flashcards

1
Q

What are the two types of statistics?

A
  • Descriptive Statistics
  • Inferential Statistics
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the purpose of descriptive statistics?

A
  • Describe data
  • Summarize data
    For example
  • How many people got each score
  • The standing of a score relative to other scores
  • Graphically summarizing set of scores
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the types of descriptive statistics?

A
  • Frequency distribution
  • Central tendency
  • Variability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Frequency Distribution

A
  • Number of participants in total or each category
  • A full glance without overwhelmed by raw scores
  • Visual assessment
  • The sum of frequency should be equal to n
  • Possible score and frequency of occurrence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the characteristics of distribution shapes?

A
  • Modality
    Number of humps in a distribution
  • Skewness
    Symmetrical or not (leaning on to one side over the other?)
  • Kurtosis
    The relative peakedness or flatness of a distribution compared to normal distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a normal distribution?

A
  • Bell-shaped curve
  • The majority of the scores in the centre
  • Skewness and kurtosis less than +/- 1(strict)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are statistical assessment one can do to check normal distribution?

A
  • Kolmogorov-Smirnov Test
    If n is larger than 50
  • Shapiro-Wilk test
    N smaller than 50
  • The tests should not be significant, if they are the groups are too different from known populations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What happens if there is no normal distribution?

A
  • Mann Whitney test (independent groups)
  • Wilcoxson test (paired groups)
  • Non-parametric data, instead of t-tests
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are different frequency shapes that is not normally distributed?

A
  • Positive skew
    On the right side, tail pointing toward the higher score
  • Negative score
    On the left side, tail pointing toward lower score
  • Leptokurtic
    Symmetrical in shape but central peak is higher; more frequent scores near the mean, thus less variability
  • Platykurtic
    Symmetrical, the frequency of most values are the same so a flatter curve
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What can happen when data is not normally distributed?

A
  • Positively skewed; inflated mean
  • Negatively skewed; deflated mean
  • Leptokurtic; off little variation in the data, so too little differences between people
  • Platykurtic; too much variation
  • Less confidence in the outcome of parametric tests
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Central Tendency

A
  • Describe the average score on a variable
  • Ideally a singly value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the three common measures of central tendency?

A
  • Mean
    The average score in the distribution
  • Median
    The middle score
  • Mode
    Most frequent occurring score in the distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Variability

A
  • The differences between the samples are with respect to variability
  • How spread out are the scores in a distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are different measures of variability?

A
  • Range
  • Interquartile Range
  • Standard Deviation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is standard deviation?

A
  • Conceptually it is an average deviation score
  • How big one step is from the mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What rule applies if data is normally distributed?

A
  • 68, 95 and 99.7 % are 3 steps away from the mean
17
Q

Inferential Statistics

A

Analyses one conduct in order to draw conclusion from their data and to be able to test hypothesis

18
Q

Indirect Approach - Hypothesis Testing

A
  • Obtain sample from population
  • Compute statistics
  • Infer relations in population from the sample
19
Q

Null Hypotheses

A

That there is no effect
Example; KBT has no effect on depression
- Often what gets tested
- Less than 5% means that the likelihood of getting our finding by change is less than 5%
- 95% confidence its not random

20
Q

Alternative Hypotheses

A

That there is an effect

21
Q

Type 1 Error

A
  • False positive
  • Detect a significant result
  • 2-tailed tests often eliminate these errors
  • Set significant to .01
  • Too many comparisons
22
Q

Type 2 Error

A
  • False negative
  • Saying there is no effect despite being one
  • Saying there is no effect despite there being one
  • Increase sample size
  • .80 Power
23
Q

Effect sizes

A

The actual magnitude of the difference between groups or the magnitude of the association between variables
- Might be statistically significant but is the effect size big enough?
- Strenght of the relationship; size of association or group means

24
Q

What are the 4 general assumptions of parametric tests?

A
  • Dependent variable is normally distributed
  • Homogeneity of variance
  • Outcome variable is continuous
  • Independence of observations
25
Q

Assumption - Dependent variable is normally distributed

A
  • Problematic with smaller sample sizes
  • Check skewness, kurtosis, histogram and conducting tests
26
Q

Assumption - Homogeneity of variance

A

Variances of the dependent variable are equal across different groups or conditions in statistical tests like ANOVA.
- Important in t-test and ANOVA
- Should not be statistically significant
- Levene’s test
- Falsey rejecting null if violated

27
Q

Assumption - Independence of observations

A

Data obtained from participants is independent
- No observation bias

28
Q

Assumption - Outcome variable is continious

A
  • Should be measured on an interval or a ratio scale
    Likert scale
    “1.Disagree - 5.Agree”
29
Q

What is standard error and its relation with sample size?

A
  • Calculated error between my sample and the whole population
  • The higher sample size, the smaller the error i.e closer to real population