Parametric test assumptions Flashcards
Define
Parametric test
tests that make assumptions about the parameters of the population distribution from which the sample is drawn
Define
Outlier
a data point that differs significantly from other observations
Define
Linear transformation
a function from one vector space to another that respects the underlying (linear) structure of each vector space
Define
Non-parametric test
tests don’t assume that your data follow a specific distribution
Define
Central limit theorem
states that if you have a population with mean μ and standard deviation σ and take sufficiently large random samples from the population with replacement , then the distribution of the sample means will be approximately normally distributed
Define
Normality
the sampling distribution of the mean is normal or that the distribution of means across samples is normal
Define
Homogeneity of variance
the assumption that all groups have the same or similar variance
Define
Independence
means that your data isn’t connected in any way (at least, in ways that you haven’t accounted for in your model)
Define
Residual
The difference between the observed value of the dependent variable (y) and the predicted value (ŷ)
Define
Kurtosis
a measure of the combined weight of a distribution’s tails relative to the center of the distribution
Define
Leptokurtic
having greater kurtosis than the normal distribution; more concentrated about the mean
Define
Mesokurtic
having the same kurtosis as the normal distribution
Define
Platykurtic
a statistical distribution in which the excess kurtosis value is negative
Define
Shapiro Wilkes Test
a test that examines if a variable is normally distributed in a population
Define
Q-Q Plot
a scatterplot created by plotting two sets of quantiles against one another
Define
Univariate outlier
outlier when considering only the distribution of the variable it belongs to
Define
Bivariate outlier
outlier when considering the joint distribution of two variables
Define
Multivariate outlier
outliers when simultaneously considering multiple variables
Define
Log transformation
A type of transformation that can be used to reduce positive skew and stabilise variance and is only defined for positive values > 0
Define
Square root transformation
A type of transformation that can be used to reduce positive skew and stabilise variance. It is defined for zero and positive values
Define
Reciprocal transformation
A type of transformation that can reduce the impact of large scores and stabilize variance. Transformation reverses the scores, but can be avoided by reversing the scores before transforming
Definition
tests that make assumptions about the parameters of the population distribution from which the sample is drawn
Parametric test
Definition
a data point that differs significantly from other observations
Outlier
Definition
a function from one vector space to another that respects the underlying (linear) structure of each vector space
Linear transformation
Definition
tests don’t assume that your data follow a specific distribution
Non-parametric test
Definition
states that if you have a population with mean μ and standard deviation σ and take sufficiently large random samples from the population with replacement , then the distribution of the sample means will be approximately normally distributed
Central limit theorem