W2 Univariate Analysis Flashcards
What is univariate analysis?
Univariate analysis explores one variable at a time. It does not deal with relationships between variables and is the simplest form of data analysis.
What is a measure of central tendency?
Measures of central tendency describe the middle or centre of a set of numbers. They include the mean, median and mode.
What is the mode?
The mode is the most frequently occurring value in the set of data.
What is bimodal data?
Data with two modes, i.e. two values tie for the most frequently occurring value.
What is multimodal data?
Data with more than two modes. (Some texts use the term more loosely for any data with two or more modes.)
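The mode cards above can be checked with a short Python sketch. It assumes the standard-library `statistics.multimode` function; the helper name `describe_modes`, the "unimodal" label and the sample scores are illustrative, not part of the deck. Note that if every value occurs exactly once, `multimode` returns all of them, which many textbooks would instead describe as having no mode.

```python
from statistics import multimode


def describe_modes(values):
    """Return the modes of `values` and a label for how many there are."""
    # multimode returns every value tied for the highest frequency,
    # in the order each value was first encountered.
    modes = multimode(values)
    if len(modes) == 1:
        label = "unimodal"   # assumed label for a single mode
    elif len(modes) == 2:
        label = "bimodal"    # two values tie for the mode
    else:
        label = "multimodal"  # more than two modes
    return modes, label


scores = [2, 3, 3, 5, 7, 7, 9]  # illustrative data: 3 and 7 each occur twice
modes, label = describe_modes(scores)
print(modes, label)
```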
What is the median?
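The middle value in an ordered array of numbers. If the dataset has an even number of values, the median is the average of the two middle numbers. The median is also the ((n+1)/2)th term of the ordered array; when n is even, this position falls halfway between the two middle terms.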
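As a minimal sketch of the median rule, the Python function below sorts the values and then picks the middle term, or averages the two middle terms when the count is even. The function name `median_of` and the sample inputs are illustrative, and the sketch assumes a non-empty list of numbers.

```python
def median_of(values):
    """Median of a non-empty list of numbers."""
    ordered = sorted(values)  # the median is defined on ordered data
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the ((n+1)/2)th term, which is index n // 2 when counting from 0.
        return ordered[mid]
    # Even n: average the two middle terms.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median_of([7, 1, 5]))     # odd number of values
print(median_of([4, 1, 3, 2]))  # even number of values
```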
What is the mean?
The arithmetic average of all values: the sum of the values divided by the number of values.
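A small Python sketch of the mean, assuming a non-empty list of numbers; the name `mean_of` and the sample data are illustrative. The standard library's `statistics.mean` offers the same calculation.

```python
def mean_of(values):
    """Arithmetic mean: the sum of the values divided by how many there are."""
    return sum(values) / len(values)


print(mean_of([2, 4, 6, 8]))
```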
What does capital N represent?
The number of terms in a population.
What does lowercase n represent?
The number of observations in a sample.
What is a measure of location?
Measures of location provide information about certain sections of a set of numbers when organised in ascending order. These include percentiles and quartiles.
What is a percentile?
Percentiles divide ordered data so that a stated fraction of the data falls at or below a given location. E.g. the 10th percentile is the value for which at least 10 percent of the data are equal to or below it and no more than 90 percent (100% - 10%) are above it.
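The percentile definition can be checked directly by counting. The Python sketch below is only an illustration: the helper names and the data (the integers 1 to 20) are assumptions, and it verifies a candidate location rather than computing a percentile.

```python
def fraction_at_or_below(values, location):
    """Share of the data that is equal to or below `location`."""
    return sum(1 for v in values if v <= location) / len(values)


def fraction_above(values, location):
    """Share of the data that is strictly above `location`."""
    return sum(1 for v in values if v > location) / len(values)


data = list(range(1, 21))  # illustrative data: 1, 2, ..., 20
candidate = 2              # candidate location for the 10th percentile

# The candidate fits the definition if at least 10% of the data are at or
# below it and no more than 90% are above it.
print(fraction_at_or_below(data, candidate))
print(fraction_above(data, candidate))
```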
What is interpolation?
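Interpolation is the estimation of an unknown value that lies between known values. For example, when a percentile's position falls between two data points, its value can be estimated from those two neighbouring values.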
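One common form is linear interpolation, sketched below in Python. This is an assumption about which kind of interpolation is meant; the function names, the 1-indexed fractional position and the sample data are all illustrative.

```python
def interpolate(lower, upper, fraction):
    """Linear interpolation: move `fraction` of the way from lower to upper."""
    return lower + fraction * (upper - lower)


def value_at_position(ordered, position):
    """Value at a 1-indexed position in ordered data; position may be fractional."""
    whole = int(position)
    fraction = position - whole
    if fraction == 0:
        return ordered[whole - 1]  # position lands exactly on a data point
    # Position falls between two data points, so estimate from the neighbours.
    return interpolate(ordered[whole - 1], ordered[whole], fraction)


ordered = [10, 20, 30, 40]  # illustrative data, already in ascending order
print(value_at_position(ordered, 2.5))  # halfway between the 2nd and 3rd terms
```

With four values, position 2.5 is the ((n+1)/2)th term, so the interpolated value here coincides with the median.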
What are quartiles?
Quartiles divide ordered data into four parts. The three quartiles are known as Q1, Q2 and Q3. The first quartile (Q1) is equal to the 25th percentile and separates the lowest quarter of the data from the upper three quarters. The second quartile (Q2) is equal to the 50th percentile, separates the second quarter of the data from the third, and is the median of the data. The third quartile (Q3) is equal to the 75th percentile and separates the lowest three quarters of the data from the highest quarter.
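A hedged Python sketch of the three quartiles, using the standard-library `statistics.quantiles` function. The sample data are illustrative, and the choice of `method="inclusive"` is an assumption: textbooks and software use several conventions for locating quartiles, so Q1 and Q3 can differ slightly between them, while Q2 matches the median.

```python
from statistics import median, quantiles

data = [1, 2, 3, 4, 5, 6, 7, 8, 9]  # illustrative data

# n=4 asks for the three cut points that split the data into four parts.
q1, q2, q3 = quantiles(data, n=4, method="inclusive")

print(q1, q2, q3)
print(q2 == median(data))  # the second quartile is the median
```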
What are measures of variability?
Measures of variability describe the spread or dispersion of a set of data. These include range, interquartile range, variance and standard deviation.
What is the range?
The range is the difference between the highest and lowest values in a data set.
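As a quick Python sketch, assuming a non-empty list of numbers (the name `value_range` and the sample data are illustrative):

```python
def value_range(values):
    """Range: the highest value minus the lowest value."""
    return max(values) - min(values)


print(value_range([4, 9, 1, 7]))
```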