Statistiek 1-5 Flashcards

1
Q

5 clean data checks

A
  • Missing
  • Range checks
  • Outlier checks
  • Distribution checks
  • Logical checks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

verschil disreet en contious

A

discreet alleen hele getallen. continous kan alle getallen aannemen.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Nominaal vs ordinaal?

A

nominal: > 2 categorieen niet geordenend: bloedgroep
ordinal: > 2 categorieen wel geordend: pijnscore 1-5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

mode, mean, median

A

Modus: getal wat meest voor komt. (meerdere of geen modus mogelijk

(arithmetic) mean: disproportionately affected by outliers. distributie checken van tevoren
median: center of the data set (ranked highest to lowest. less effected by outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

def: variance, def SD

A

Variance: mean of squared deviatons
SD: wortel(variance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Voordeel: mean + sd?

A

gebruikt alle informatie, makkelijk te inetrpreteren. cave: gevoelig voor outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Voor/nadeel van assymetrische verdelingen standaaarden

A

Range: makkelijk e verrihten, easily distorted
Percentiles: niet zo snel distorted by outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

5 eigenscappen normale verdeling

A
  • bell shaped
  • symmetrical around its mean
  • mean and median are equal
  • one top
  • fully described by mean and SD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

3 manieren van normaliteits bepaling

A

summary statisstics

visual inspection

formal tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Q-Q plot?

A

shows quantiles. X as = quantilen eigen data, Y as = verwacht als normale distributie gevolgd zou worden

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Downsides Kolmogorov Smrinov test and shapiro wilk test

A
  • in small samples insufficiient power

- large samples, smal deviates are flagged as significantly deviant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly