Descriptive Statistics Flashcards
What is a population in the context of descriptive statistics?
1) A subset of individuals selected from a larger group
2) The entire group of individuals being studied
3) A sample chosen randomly from a population
4) A fixed set of continuous variables
The entire group of individuals being studied
What is the main difference between a sample and a population?
1) A population includes all possible observations, while a sample is a subset
2) A sample includes only nominal data, while a population includes all data types
3) A population requires statistical analysis, while a sample does not
4) A sample consists of independent variables, while a population consists of dependent variables
A population includes all possible observations, while a sample is a subset
Which of the following is measured on a continuous scale?
1) Hair colour
2) Shoe size
3) Reaction times
4) Test scores ranked from highest to lowest
Reaction times
Which measure of central tendency can be used with nominal data?
1) Mean
2) Median
3) Mode
4) Range
Mode
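A minimal sketch of why the mode suits nominal data: it only needs counts of each label, not arithmetic or ordering. The hair-colour values below are invented for illustration.

```python
from collections import Counter
from statistics import mode

# Hypothetical nominal data: labels with no order and no numeric meaning.
hair_colours = ["brown", "black", "brown", "blonde", "brown", "black"]

# Count how often each label occurs.
counts = Counter(hair_colours)

# The mode is the most frequent label; no sum or sort is required.
most_common_colour = mode(hair_colours)

print(counts)
print(most_common_colour)
```

A mean or median of these labels would be undefined, because the labels cannot be added or ranked.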
What is the formula for calculating the mean?
1) Sum of the data points divided by the total number of data points
2) The difference between the highest and lowest values
3) The middle value in an ordered dataset
4) The frequency of the most common data point
Sum of the data points divided by the total number of data points
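The definition above can be sketched in a few lines of Python. The scores are made-up values, and the variable names are only illustrative.

```python
from statistics import mean

# Hypothetical test scores, invented for illustration.
scores = [4, 8, 6, 5, 7]

# Sum of the data points divided by the total number of data points.
manual_mean = sum(scores) / len(scores)

# The standard library's mean() should agree with the manual calculation.
library_mean = mean(scores)

print(manual_mean, library_mean)
```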
Which measure of central tendency is least affected by outliers?
1) Mean
2) Median
3) Mode
4) Variance
Median
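As a rough illustration, the sketch below adds one extreme value to a small dataset and compares how far the mean and the median move. The reaction times are invented for the example.

```python
from statistics import mean, median

# Hypothetical reaction times in milliseconds.
times = [310, 320, 330, 340, 350]

# The same data with one extreme value appended.
times_with_outlier = times + [2000]

# How far each measure moves when the outlier is added.
mean_shift = mean(times_with_outlier) - mean(times)
median_shift = median(times_with_outlier) - median(times)

print(mean_shift, median_shift)
```

In this example the median moves by only a few milliseconds, while the mean moves by several hundred.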
What does the interquartile range (IQR) measure?
1) The spread of scores in the entire dataset
2) The range of scores within the middle 50% of the data
3) The difference between the highest and lowest values in the dataset
4) The average of squared deviations from the mean
The range of scores within the middle 50% of the data
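One way to compute the IQR is sketched below with Python's statistics.quantiles. The scores are invented, and the quartile convention is an assumption: the default "exclusive" method is only one of several in use, so other tools may give slightly different quartiles for the same data.

```python
from statistics import quantiles

# Hypothetical scores, invented for illustration.
scores = [1, 2, 3, 4, 5, 6, 7, 8]

# With n=4, quantiles() returns the three quartile cut points Q1, Q2, Q3.
q1, q2, q3 = quantiles(scores, n=4)

# The IQR is the distance between the first and third quartiles,
# i.e. the span of the middle 50% of the data.
iqr = q3 - q1

print(q1, q3, iqr)
```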
What is the main disadvantage of using the range as a measure of spread?
1) It is difficult to calculate without software
2) It does not account for extreme values in the dataset
3) It is highly sensitive to outliers
4) It ignores the middle 50% of the data
It is highly sensitive to outliers
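A small sketch of this sensitivity, using invented scores: the two lists differ by a single extreme value, yet the range changes substantially. The helper name value_range is hypothetical.

```python
# Hypothetical scores; the second list adds one extreme value.
scores = [12, 14, 15, 15, 16, 18]
scores_with_outlier = scores + [60]


def value_range(values):
    """Return the difference between the highest and lowest values."""
    return max(values) - min(values)


range_without = value_range(scores)
range_with = value_range(scores_with_outlier)

print(range_without, range_with)
```

Because the range depends only on the two most extreme values, one unusual observation is enough to change it.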
What does the variance measure in a dataset?
1) The difference between the highest and lowest scores
2) The average squared deviation of each score from the mean
3) The number of unique values in the dataset
4) The central point of the dataset
The average squared deviation of each score from the mean
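The definition above can be worked through step by step. The sketch below uses invented scores and divides by the number of scores N, which is the population form. As an aside not stated on the card, the sample variance divides by N - 1 instead.

```python
from statistics import pvariance, variance

# Hypothetical scores, invented for illustration.
scores = [2, 4, 4, 4, 5, 5, 7, 9]

# Step 1: the mean of the scores.
mean_score = sum(scores) / len(scores)

# Step 2: the squared deviation of each score from the mean.
squared_deviations = [(x - mean_score) ** 2 for x in scores]

# Step 3: the average of those squared deviations (population variance).
manual_variance = sum(squared_deviations) / len(scores)

# Library equivalents: pvariance divides by N, variance divides by N - 1.
population_variance = pvariance(scores)
sample_variance = variance(scores)

print(manual_variance, population_variance, sample_variance)
```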
Why is the standard deviation often preferred over the variance as a measure of spread?
1) It is easier to calculate without a computer
2) It uses the same unit of measurement as the data
3) It eliminates the need for squared values
4) It is unaffected by the sample size
It uses the same unit of measurement as the data
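A brief sketch of the units point, assuming invented reaction times in milliseconds: the variance comes out in squared milliseconds, while the standard deviation, its square root, is back in milliseconds.

```python
from math import sqrt
from statistics import pstdev, pvariance

# Hypothetical reaction times in milliseconds.
times_ms = [200, 400, 400, 400, 500, 500, 700, 900]

# Population variance: expressed in squared units (ms squared).
variance_ms2 = pvariance(times_ms)

# Population standard deviation: the square root of the variance,
# expressed in the original unit (ms).
std_dev_ms = pstdev(times_ms)

print(variance_ms2, std_dev_ms, sqrt(variance_ms2))
```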
What does a standard deviation of 0 indicate?
1) The data is normally distributed
2) There is no variability; all data points are identical
3) The data has a high degree of variability
4) The mean and median are equal
There is no variability; all data points are identical
What is the difference between discrete and continuous data?
1) Discrete data can take any value within a range, while continuous data takes only distinct, countable values
2) Discrete data takes only distinct, countable values, while continuous data can take any value within a range
3) Continuous data includes nominal variables, while discrete data does not
4) Discrete data measures only dependent variables, while continuous data measures independent variables
Discrete data takes only distinct, countable values, while continuous data can take any value within a range
What is the primary use of cumulative frequency distributions?
1) To show the variability within a dataset
2) To display the total count or percentage up to a certain value
3) To rank data points from highest to lowest
4) To summarize nominal data
To display the total count or percentage up to a certain value
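A cumulative frequency distribution can be built as a running total of the ordinary frequencies. The sketch below assumes a small set of invented scores.

```python
from collections import Counter
from itertools import accumulate

# Hypothetical scores, invented for illustration.
scores = [1, 2, 2, 3, 3, 3, 4, 5, 5]

# Frequency of each distinct value, listed in ascending order of value.
counts = Counter(scores)
values = sorted(counts)
frequencies = [counts[v] for v in values]

# Running total: how many scores fall at or below each value.
cumulative = list(accumulate(frequencies))

# The same running total expressed as a percentage of all scores.
cumulative_percent = [100 * c / len(scores) for c in cumulative]

for v, c, p in zip(values, cumulative, cumulative_percent):
    print(v, c, round(p, 1))
```

The last cumulative count always equals the total number of scores, so the last cumulative percentage is 100.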
What is the primary drawback of using the mean for skewed data?
1) It ignores extreme values in the dataset
2) It is not a meaningful measure for ordinal data
3) It is highly sensitive to outliers
4) It does not consider the entire dataset
It is highly sensitive to outliers
Which measure of spread is typically reported alongside the median?
1) Standard deviation
2) Variance
3) Range
4) Interquartile range
Interquartile range