BAP lab 3 - statistics workshop Flashcards
what is parametric data ?
parametric data is normally distributed data which follows a symmetrical bell shape
what is non parametric data ?
non-parametric data does not follow a normal curve. It may take many different forms, for example uniform or skewed distributions
what is descriptive statistics?
statistical processes or tools used to calculate values from a dataset (such as the mean, median and standard deviation) that describe the shape of the dataset's distribution plot, for example its centre and spread
what is inferential statistics?
statistical processes used to draw conclusions from data, typically by calculating values that represent a comparison between 2 or more sets of data
what is the mean ?
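the arithmetic average of a dataset (the sum of the values divided by the number of values). In a parametric dataset it is also the centre and most prevalent value.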
what is standard deviation?
standard deviation is a value that represents the spread/width (technically the variation or dispersion) in a parametric dataset. About 68% of the datapoints in a normal distribution lie within one standard deviation either side of the mean
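A minimal sketch of the 68% rule, using only Python's standard library. The dataset is simulated, and the mean of 100 and SD of 15 are arbitrary values assumed for illustration, not figures from the lab.

```python
import random
import statistics

random.seed(1)  # fixed seed so the run is repeatable

# Simulated parametric dataset: 10,000 draws from a normal distribution.
# The mean of 100 and SD of 15 are arbitrary choices for illustration.
values = [random.gauss(100, 15) for _ in range(10_000)]

mean = statistics.mean(values)
sd = statistics.stdev(values)  # sample standard deviation

# Fraction of datapoints within one SD either side of the mean.
inside = sum(1 for v in values if mean - sd <= v <= mean + sd)
fraction_within_one_sd = inside / len(values)

print(f"mean = {mean:.1f}, SD = {sd:.1f}")
print(f"fraction within mean +/- 1 SD = {fraction_within_one_sd:.3f}")
```

The printed fraction should come out close to 0.68 for any sufficiently large normally distributed sample.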
what are confidence intervals?
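Confidence intervals are values that represent the spread/width in a dataset (can be used for parametric and non-parametric datasets). For parametric datasets the 90% interval is the mean ± 1.64 x SD, and the 95% interval is the mean ± 1.96 x SD (approximately 2 x SD). Strictly, these ranges cover 90% or 95% of the individual datapoints; a confidence interval for the mean uses the standard error (SD/√n) in place of the SD. Confidence intervals can be determined for non-parametric distributions but they are more complicated to calculate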
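The 1.64 and roughly 2 multipliers can be recovered from the standard normal distribution. This is a small sketch using `statistics.NormalDist` from Python's standard library; the summary values (mean 50, SD 4) are hypothetical and only show how the multipliers are applied.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

# A central 90% range leaves 5% in each tail, so look up the 95th percentile.
z_90 = standard_normal.inv_cdf(0.95)
# A central 95% range leaves 2.5% in each tail, so look up the 97.5th percentile.
z_95 = standard_normal.inv_cdf(0.975)

print(f"90% multiplier = {z_90:.2f}")
print(f"95% multiplier = {z_95:.2f}")

# Applying the multiplier to a hypothetical dataset summary (mean 50, SD 4).
mean, sd = 50.0, 4.0
range_95 = (mean - z_95 * sd, mean + z_95 * sd)
print(f"95% of values expected between {range_95[0]:.1f} and {range_95[1]:.1f}")
```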
what is the central limit theorem?
when a process that is being measured has lots of independent components, then even if the individual components are non-parametric, the final output of the process (their sum or average) will give measurements/values that are approximately normally distributed
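A rough simulation of the idea, assuming each "component" is a uniform random number and the "output" is the average of 30 of them (both choices are illustrative assumptions). The check used here is the share of values within one SD of the mean: about 0.58 for a flat distribution, about 0.68 for a normal one.

```python
import random
import statistics

random.seed(2)  # fixed seed so the run is repeatable

def fraction_within_one_sd(data):
    """Share of values lying within one SD either side of the mean."""
    mean = statistics.mean(data)
    sd = statistics.stdev(data)
    return sum(1 for x in data if abs(x - mean) <= sd) / len(data)

# Individual components: uniform (flat, non-parametric) random numbers.
components = [[random.random() for _ in range(30)] for _ in range(5_000)]

# Output of the process: the average of the 30 components in each run.
outputs = [statistics.mean(run) for run in components]

raw_values = [x for run in components for x in run]
raw_fraction = fraction_within_one_sd(raw_values)
output_fraction = fraction_within_one_sd(outputs)

print(f"uniform components: {raw_fraction:.2f} within 1 SD")
print(f"averaged outputs:   {output_fraction:.2f} within 1 SD")
```

The averaged outputs should behave like a normal distribution even though no single component does.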
what is the median?
the central value in a non-parametric dataset, so that (if sorted in numerical order) half the datapoints would be below the value and half above.
what is the mode ?
the most prevalent value in a non-parametric dataset
what are quartiles?
Quartiles are values that represent the spread/width (technically the variation or dispersion) in a non-parametric dataset. If the dataset was sorted in numerical order, 25% of the values would be below Q1 and 75% below Q3 (Q2 is the median).
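The median, mode and quartiles can be computed with Python's `statistics` module. The dataset below is hypothetical and deliberately right-skewed; note that several quartile conventions exist, and this sketch simply uses the module's default one.

```python
import statistics

# Hypothetical right-skewed dataset, invented for illustration.
data = [1, 2, 2, 3, 4, 5, 7, 9, 12, 20, 35]

median = statistics.median(data)
mode = statistics.mode(data)
# quantiles(n=4) returns the three cut points Q1, Q2 (the median) and Q3.
# Its default "exclusive" method is one of several quartile conventions.
q1, q2, q3 = statistics.quantiles(data, n=4)
mean = statistics.mean(data)

print(f"median = {median}, mode = {mode}")
print(f"Q1 = {q1}, Q3 = {q3}, interquartile range = {q3 - q1}")
print(f"mean = {mean:.2f}")  # pulled above the median by the long right tail
```

In a skewed dataset like this one the mean is dragged towards the tail, which is why the median and quartiles are the usual descriptors for non-parametric data.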
what is the null hypothesis?
a hypothesis for inferential statistical tests which states that there is no difference between the groups being compared. The P-value (see below) is the probability of obtaining a result at least as extreme as the one observed if the null hypothesis were true; it is not the probability of the null hypothesis itself being true
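what is the P-value?
a numerical value between 0 and 1 which is the probability of obtaining a result at least as extreme as the one observed, assuming a statement (for example the null hypothesis) is true. A small P-value (commonly below 0.05) is taken as evidence against that statement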
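A small worked example of a P-value, under an assumed scenario that is not part of the lab: a coin is flipped 10 times and lands heads 9 times, and the null hypothesis is that the coin is fair. The P-value is the probability of a result at least this extreme if the null hypothesis holds.

```python
from math import comb

flips = 10
observed_heads = 9

# Null hypothesis: the coin is fair, so every sequence of flips is equally likely.
total_outcomes = 2 ** flips

# One-sided P-value: probability of 9 or more heads under the null hypothesis.
as_extreme = sum(comb(flips, k) for k in range(observed_heads, flips + 1))
p_one_sided = as_extreme / total_outcomes

# Two-sided P-value: also count results equally extreme in the other direction
# (1 or fewer heads); the fair-coin distribution is symmetric, so it doubles.
p_two_sided = 2 * p_one_sided

print(f"one-sided P = {p_one_sided:.4f}")
print(f"two-sided P = {p_two_sided:.4f}")
```

Both values fall below the commonly used 0.05 threshold, so this result would usually be taken as evidence against the coin being fair.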
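what is the Shapiro-Wilk test?
a test of whether a dataset is normally distributed. Its null hypothesis is that the data are normally distributed, so a small P value (commonly below 0.05) suggests the dataset is not normal, while a larger P value means normality cannot be ruled out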
what is Student's t-test?
an inferential statistics tool for comparing the means of two parametric datasets (comes in different forms, for example paired and unpaired)
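A minimal sketch of one form of the test, the unpaired t-test with pooled variance, written out by hand with the standard library. The two groups are hypothetical data invented for illustration, and the critical value 2.306 is the two-tailed 0.05 threshold for 8 degrees of freedom from a standard t-table. In practice a statistics package would normally report the P-value directly.

```python
import math
import statistics

# Hypothetical measurements for two groups, invented for illustration.
control = [5.1, 4.9, 5.0, 5.2, 4.8]
treated = [5.6, 5.8, 5.5, 5.9, 5.7]

n1, n2 = len(control), len(treated)
mean1, mean2 = statistics.mean(control), statistics.mean(treated)
var1, var2 = statistics.variance(control), statistics.variance(treated)

# Pooled variance: assumes both groups share the same underlying variance.
pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2)
standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))

t_stat = (mean1 - mean2) / standard_error
degrees_of_freedom = n1 + n2 - 2

# Two-tailed critical value for 8 degrees of freedom at the 0.05 level,
# taken from a standard t-table.
T_CRITICAL = 2.306
reject_null = abs(t_stat) > T_CRITICAL

print(f"t = {t_stat:.2f} with {degrees_of_freedom} degrees of freedom")
print(f"reject the null hypothesis at the 0.05 level: {reject_null}")
```

Here the difference between the group means is large relative to the spread within each group, so the null hypothesis of no difference is rejected.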