Descriptive Statistics Flashcards

1
Q

What is a population in the context of descriptive statistics?

1) A subset of individuals selected from a larger group
2) The entire group of individuals being studied
3) A sample chosen randomly from a population
4) A fixed set of continuous variables

A

The entire group of individuals being studied

2
Q

What is the main difference between a sample and a population?

1) A population includes all possible observations, while a sample is a subset
2) A sample includes only nominal data, while a population includes all data types
3) A population requires statistical analysis, while a sample does not
4) A sample consists of independent variables, while a population consists of dependent variables

A

A population includes all possible observations, while a sample is a subset

3
Q

What type of data is measured on a continuous scale?

1) Hair colour
2) Shoe size
3) Reaction times
4) Test scores ranked from highest to lowest

A

Reaction times

4
Q

Which measure of central tendency can be used with nominal data?

1) Mean
2) Median
3) Mode
4) Range

A

Mode
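
As a small illustration of why the mode suits nominal data, the sketch below counts categories that have no numeric value or order. The hair-colour list is made-up example data, not part of the card.

```python
from statistics import mode, multimode

# Hypothetical nominal data: categories with no natural order or numeric value
hair_colours = ["brown", "black", "brown", "blond", "red", "brown", "black"]

most_common = mode(hair_colours)      # the single most frequent category
all_modes = multimode(hair_colours)   # every category tied for most frequent

print(most_common, all_modes)
```

A mean or median of these labels would be undefined, since the categories cannot be added or ranked.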

5
Q

What is the formula for calculating the mean?

1) Sum of the data points divided by the total number of data points
2) The difference between the highest and lowest values
3) The middle value in an ordered dataset
4) The frequency of the most common data point

A

Sum of the data points divided by the total number of data points
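
A minimal sketch of that calculation, using an invented list of scores:

```python
# Hypothetical scores, chosen only for illustration
scores = [4, 8, 15, 16, 23, 42]

# Mean = sum of the data points / number of data points
mean = sum(scores) / len(scores)

print(mean)
```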

6
Q

Which measure of central tendency is least affected by outliers?

1) Mean
2) Median
3) Mode
4) Variance

A

Median
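
One way to see this is to add a single extreme value to a small dataset and compare how far each measure moves. The numbers below are invented for the comparison.

```python
from statistics import mean, median

# Hypothetical values, then the same values plus one extreme outlier
values = [10, 12, 13, 14, 16]
with_outlier = values + [200]

mean_before, mean_after = mean(values), mean(with_outlier)
median_before, median_after = median(values), median(with_outlier)

print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
```

In this example the mean more than triples, while the median shifts by only half a unit.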

7
Q

What does the interquartile range (IQR) measure?

1) The spread of scores in the entire dataset
2) The range of scores within the middle 50% of the data
3) The difference between the highest and lowest values in the dataset
4) The average of squared deviations from the mean

A

The range of scores within the middle 50% of the data
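
A rough sketch of the calculation is shown below. It assumes the default "exclusive" method of Python's statistics.quantiles; textbooks and software use several quartile conventions, so other tools may give slightly different values for the same data.

```python
from statistics import quantiles

# Hypothetical ordered scores
scores = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]

# n=4 splits the data into quarters and returns the three cut points Q1, Q2, Q3
q1, q2, q3 = quantiles(scores, n=4)

# IQR = Q3 - Q1, the width of the middle 50% of the data
iqr = q3 - q1

print(q1, q3, iqr)
```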

8
Q

What is the main disadvantage of using the range as a measure of spread?

1) It is difficult to calculate without software
2) It does not account for extreme values in the dataset
3) It is highly sensitive to outliers
4) It ignores the middle 50% of the data

A

It is highly sensitive to outliers

9
Q

What does the variance measure in a dataset?

1) The difference between the highest and lowest scores
2) The average squared deviation of each score from the mean
3) The number of unique values in the dataset
4) The central point of the dataset

A

The average squared deviation of each score from the mean
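
The definition can be sketched step by step as follows. The data are invented, and the divisor is an assumption: dividing by N gives the population variance, while many courses divide by N - 1 when estimating from a sample.

```python
from statistics import pvariance, variance

# Hypothetical scores
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(data) / len(data)
squared_deviations = [(x - mean) ** 2 for x in data]

# Population variance: average of the squared deviations (divide by N)
pop_var = sum(squared_deviations) / len(data)

# Sample variance: divide by N - 1 instead
sample_var = sum(squared_deviations) / (len(data) - 1)

# The standard library gives the same results
print(pop_var, pvariance(data))
print(sample_var, variance(data))
```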

10
Q

Why is the standard deviation preferred over variance?

1) It is easier to calculate without a computer
2) It uses the same unit of measurement as the data
3) It eliminates the need for squared values
4) It is unaffected by the sample size

A

It uses the same unit of measurement as the data

11
Q

What does a standard deviation of 0 indicate?

1) The data is normally distributed
2) There is no variability; all data points are identical
3) The data has a high degree of variability
4) The mean and median are equal

A

There is no variability; all data points are identical

12
Q

What is the difference between discrete and continuous data?

1) Discrete data can take any fractional value, while continuous data has fixed values
2) Discrete data takes only distinct, separate values, while continuous data can take any value within a range, including fractions
3) Continuous data includes nominal variables, while discrete data does not
4) Discrete data measures only dependent variables, while continuous data measures independent variables

A

Discrete data takes only distinct, separate values, while continuous data can take any value within a range, including fractions

13
Q

What is the primary use of cumulative frequency distributions?

1) To show the variability within a dataset
2) To display the total count or percentage up to a certain value
3) To rank data points from highest to lowest
4) To summarize nominal data

A

To display the total count or percentage up to a certain value
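
As an illustration, the sketch below builds a cumulative frequency table from a made-up list of scores by keeping a running total of the counts.

```python
from collections import Counter
from itertools import accumulate

# Hypothetical scores
scores = [1, 2, 2, 3, 3, 3, 4, 4, 5]

counts = Counter(scores)
values = sorted(counts)                    # distinct scores in ascending order
freqs = [counts[v] for v in values]        # raw frequency of each score
cum_freqs = list(accumulate(freqs))        # running total up to each score
cum_pcts = [100 * c / len(scores) for c in cum_freqs]

for v, f, cf, cp in zip(values, freqs, cum_freqs, cum_pcts):
    print(f"{v}: freq={f} cumulative={cf} ({cp:.1f}%)")
```

Each row answers "how many scores are at or below this value?", and the last row always reaches the full count, or 100%.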

14
Q

What is the primary drawback of using the mean for skewed data?

1) It ignores extreme values in the dataset
2) It is not a meaningful measure for ordinal data
3) It is highly sensitive to outliers
4) It does not consider the entire dataset

A

It is highly sensitive to outliers

15
Q

Which measure of spread should be used with the median?

1) Standard deviation
2) Variance
3) Range
4) Interquartile range

A

Interquartile range

16
Q

What does the sum of squared errors (SS) represent?

1) The total of the squared deviations of scores from the mean
2) The squared difference between the highest and lowest scores
3) The cumulative frequency of all data points
4) The total variance in a dataset

A

The total of the squared deviations of scores from the mean
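
A brief sketch of how SS is usually computed: each score's deviation from the mean is squared before summing, because the raw deviations always cancel out to zero. The data are invented for illustration.

```python
# Hypothetical scores
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(data) / len(data)
deviations = [x - mean for x in data]

# Raw deviations sum to zero, so they cannot measure total spread
raw_total = sum(deviations)

# SS: square each deviation, then add them up
ss = sum(d ** 2 for d in deviations)

print(raw_total, ss)
```

Dividing SS by the number of scores (or by one fewer, for a sample estimate) gives the variance.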

17
Q

What is the relationship between variance and standard deviation?

1) Standard deviation is the square root of variance
2) Variance is the square root of standard deviation
3) They measure completely different aspects of a dataset
4) Variance and standard deviation are always equal

A

Standard deviation is the square root of variance
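
A quick check of this relationship on invented data, using the population versions of both statistics from Python's statistics module:

```python
import math
from statistics import pstdev, pvariance

# Hypothetical scores
data = [2, 4, 4, 4, 5, 5, 7, 9]

var = pvariance(data)        # population variance, in squared units
sd = math.sqrt(var)          # standard deviation, back in the original units

# The library's own standard deviation agrees with the square root of the variance
print(var, sd, pstdev(data))
```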

18
Q

Which type of data cannot be summarized using the mean?

1) Ratio
2) Ordinal
3) Interval
4) Nominal

A

Nominal

19
Q

Why is the median often preferred over the mean for income data?

1) The median uses all data points, unlike the mean
2) Income data is often nominal and unsuitable for mean calculations
3) The median is less affected by extreme values or outliers
4) The mean cannot accurately summarize large datasets

A

The median is less affected by extreme values or outliers

20
Q

What type of frequency distribution is most useful for datasets with numerous unique values?

1) Cumulative frequency distribution
2) Raw frequency distribution
3) Grouped frequency distribution
4) Percentage frequency distribution

A

Grouped frequency distribution
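
To illustrate, the sketch below groups made-up reaction times into class intervals. The 50 ms bin width and the bin_label helper are assumptions chosen for the example, and the labels assume whole-number values.

```python
from collections import Counter

# Hypothetical reaction times in milliseconds, every value unique
reaction_times_ms = [212, 238, 251, 259, 263, 274, 281, 295, 307, 318, 342, 365]
bin_width = 50  # assumed class-interval width


def bin_label(value, width):
    """Return the class interval (e.g. '250-299') that contains an integer value."""
    lower = (value // width) * width
    return f"{lower}-{lower + width - 1}"


grouped = Counter(bin_label(t, bin_width) for t in reaction_times_ms)

for interval in sorted(grouped):
    print(interval, grouped[interval])
```

A raw frequency table of these data would have twelve rows with a count of 1 each; grouping reduces it to four intervals that show where the scores cluster.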