Probability, Sampling and Distributions Flashcards

1
Q

what is a Gaussian distribution

A

another name for the normal distribution: the symmetric, bell-shaped curve described by its mean and standard deviation

2
Q

what's positive skew

A

the bulk of the data (mode, median and mean) sits on the left, with a long tail stretching to the right; typically mode < median < mean

3
Q

what's negative skew

A

the bulk of the data (mean, median and mode) sits on the right, with a long tail stretching to the left; typically mean < median < mode

4
Q

how do you measure skew

A

Pearson's coefficient of skew

uses the difference between the mean and the median

when the 'tail' of the data is on the left of the mean then PCOS (Pearson's coefficient of skew, not the ovary thing) is negative

when the tail of the data is on the right of the mean then PCOS is positive
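
A minimal sketch of the idea, assuming the median-based form of Pearson's coefficient, 3 × (mean − median) / standard deviation. The card only says the mean and median are compared, so the exact formula, the function name and the two data sets are assumptions for illustration.

```python
import statistics

def pearson_median_skew(data):
    """Pearson's median skewness: 3 * (mean - median) / sample SD."""
    mean = statistics.mean(data)
    median = statistics.median(data)
    sd = statistics.stdev(data)  # sample standard deviation
    return 3 * (mean - median) / sd

# Hypothetical data: one set with a long right tail, and its mirror image.
right_tailed = [1, 2, 2, 3, 3, 3, 4, 20]
left_tailed = [-x for x in right_tailed]

print(pearson_median_skew(right_tailed))  # positive: tail on the right
print(pearson_median_skew(left_tailed))   # negative: tail on the left
```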

5
Q

what are parametric tests

A

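they use population parameters, estimated from the data, such as the mean and standard deviation

they assume the mean and standard deviation accurately represent the population distribution of the data (usually that the data are roughly normally distributed)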

6
Q

how and why do we transform data

A

we perform the same mathematical operation on every data point we have

it helps to reduce the impact of outliers and skew

e.g. we take the log of each data point and then perform the statistical test on the logged values

also useful for viewing data in a standardised format
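
A small sketch of a log transform pulling in an outlier. The values are made up, and base-10 logs are an arbitrary choice (any base works, but every value must be positive).

```python
import math
import statistics

# Hypothetical data with one large outlier.
raw = [1, 2, 3, 4, 5, 1000]

# Apply the same operation (log10) to every data point.
logged = [math.log10(x) for x in raw]

# Before: the outlier drags the mean far away from the median.
print(statistics.mean(raw), statistics.median(raw))

# After: mean and median are much closer together.
print(statistics.mean(logged), statistics.median(logged))
```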

7
Q

what is a z-score

A

it tells us how many standard deviations a value is above or below the mean: z = (value - mean) / standard deviation
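
As a quick worked example, assuming a scale with mean 100 and standard deviation 15 (numbers chosen purely for illustration):

```python
def z_score(value, mean, sd):
    """Number of standard deviations that value lies from the mean."""
    return (value - mean) / sd

print(z_score(130, 100, 15))  # 2.0  (two SDs above the mean)
print(z_score(85, 100, 15))   # -1.0 (one SD below the mean)
```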

8
Q

what is sampling error

A

we take a sample at random from a population of data in order to estimate parameters of the whole population

but the mean of each sample differs from the true mean of the population

this is sampling error

e.g. trying to find out the mean age of 50 people in a room by picking 10 at random and asking them
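
The room example can be simulated as a rough sketch. The ages are randomly generated stand-ins, and the fixed seed is only there so the run is repeatable.

```python
import random
import statistics

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical room: 50 people with made-up ages between 18 and 80.
room_ages = [rng.randint(18, 80) for _ in range(50)]
true_mean = statistics.mean(room_ages)

# Pick 10 people at random (without replacement) and average their ages.
sample = rng.sample(room_ages, 10)
sample_mean = statistics.mean(sample)

# Sampling error: how far this sample's mean is from the true mean.
sampling_error = sample_mean - true_mean
print(true_mean, sample_mean, sampling_error)
```

Re-running with a different seed gives a different sample mean each time, which is the variation the card describes.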

9
Q

what is the standard error (usually standard error of the mean)

A

tells us how much a statistic (usually the sample mean) is likely to vary between samples

effectively a measure of how confident we can be that the sample mean is close to the true population mean

it depends on: the variability of the original data (standard deviation of the population) and the amount of data used to create the sample mean
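
The two dependencies can be seen in the usual formula for the standard error of the mean, SD / √n. A minimal sketch; the function name and the numbers are illustrative.

```python
import math

def standard_error(sd, n):
    """Standard error of the mean: standard deviation / sqrt(sample size)."""
    return sd / math.sqrt(n)

print(standard_error(10, 25))   # 2.0
print(standard_error(10, 100))  # 1.0 (four times the data halves the error)
print(standard_error(20, 100))  # 2.0 (more variable data, larger error)
```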

10
Q

what's a confidence interval

A

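similar to standard error but more intuitive: a range around the sample mean that is likely to contain the true population mean, e.g. a 95% confidence interval is roughly the mean ± 1.96 standard errors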
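
A rough sketch of how a confidence interval can be built from the sample mean and its standard error. It assumes a normal approximation; for small samples a t-distribution would give a somewhat wider interval. The function name and data are hypothetical.

```python
import math
import statistics

def confidence_interval(sample, level=0.95):
    """Normal-approximation confidence interval for the mean."""
    mean = statistics.mean(sample)
    se = statistics.stdev(sample) / math.sqrt(len(sample))
    # Critical value: about 1.96 for a 95% interval.
    z = statistics.NormalDist().inv_cdf(0.5 + level / 2)
    return mean - z * se, mean + z * se

ages = [23, 31, 35, 38, 41, 44, 47, 52, 58, 61]  # made-up sample
print(confidence_interval(ages))        # 95% interval
print(confidence_interval(ages, 0.99))  # 99% interval is wider
```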

11
Q

what's a key principle to remember about error bars

A

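as a rule of thumb, if they overlap then it suggests there isn't a significant difference (a guide rather than a formal test, and it depends on whether the bars show standard error or a confidence interval)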
