Theory Section part 1. Flashcards
What is “Kurtosis”?
The kurtosis value provides information about the “Peakedness” of the distribution
Positive KS values indicate that the distribution is rather peaked (clustered in the centre), with long thin tails
Negative KS values indicate that the distribution is relatively flat (too many cases in the extremes)
What is “Skewness”?
The Skewness value provides an indication of the symmetry of the distribution
If mean = median = mode, then it is a symmetric distribution
What is “Standard Deviation”?
Is the square root of the variance
What is “Variance”?
Variance of a set of observations is the average squared deviation of the data points from the mean
What is “Simple Linear Regression”?
Technique used for determining how one variable of interest (the response variable) is affected by changes in another variable (the explanatory variable).
What is “Correlation”?
Correlation describes the relationship between two continuous variables, in terms of both the strength of the relationship and the direction
What is “Confidence Level”?
Det seier noko om kor sikre vi ynskjer å vere når vi generaliserer til populasjonen. Eit sikkerheitsnivå på 95% betyr at vi generaliserer i 95% av tilfella, men at vi tar feil i 5% av tilfella
What is “Mode” (Typetal)?
The value that occurs most frequently
Døme: Det skonummeret det er flest av
What is “Average” (Gjennomsnitt)?
The average of a set of observations
Døme: Alle skonummera lagt saman og delt på antallet
What is “Median”?
An observation in the center of a data set (Halt of the data lies below, half lies above)
Døme: Det skonummret som er midt i rekkja av sko når dei er ordna frå minst til størst
What is a Postive Skewness?
Positive SN values indicates clustering of scores at the low-end (left-handed side of the graph). Then the mean will be to the right of the median, median will be to the right of mode
What is a Negative Skewness?
Negative SN values indicates a clustering of scores at the high-end (right-handed side of the graph). Then the mean will be to the left of median, median will be to the left of mode
What is the formula for Skewness?
Dersom svaret > 1 er utvalet “Positively Skewed”/”Skewed to the right”
(Då er avstanden mellom XMAX og medianen større enn avstanden mellom XMIN og medianen)
Dersom svaret < 1 er utvalet “Negatively Skewed”/”Skewed to the left”
(Då er avstanden mellom XMax og medianen
mindre enn avstanden mellom XMin og medianen)
Why do we use standard deviation instead of the variance?
Both variance and standard deviation measures the deviation of data points, but we need standard deviation compared to variance because it gives the value in the same unit as the observations
What are the three main purposes of simple linear regression?
- To describe the linear dependence of one variable on another
- To predict values of one variable from values of another, for which more data are available
- To correct for the linear dependence of one variable on another, in order to clarify other features of its variability