7 Measures of Central Tendency and Dispersion Flashcards

1
Q

What is a distribution in statistics?

A

A function that describes the probabilities of any theoretical outcome based on evidence from a study.

Distributions help in understanding how likely a specific value is when compared to historical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the purpose of creating a histogram?

A

To visualize the distribution of data and understand its shape.

A histogram allows analysts to see the frequency of different values in a dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a normal distribution?

A

A bell-shaped distribution where the mean, median, and mode are all at the center, and probabilities decrease evenly as you move away from the center.

It is one of the most common distributions in statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does a uniform distribution look like?

A

A flat line where every value has the same probability of occurring.

In a uniform distribution, there is no variability among the data points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Poisson distribution?

A

A distribution that describes the number of times an event occurs in a fixed interval of time or space.

Examples include counting occurrences of events like website hits or roadkill.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the main characteristic of an exponential distribution?

A

It describes the time between events in a Poisson process, typically showing a sharp curve.

Exponential distributions can indicate rapid changes in a variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are Bernoulli distributions?

A

Distributions with only two possible outcomes, such as true/false or success/failure.

They are used to model binary variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the key difference between Bernoulli and binomial distributions?

A

Bernoulli distributions have one trial, while binomial distributions involve multiple trials.

Binomial distributions calculate the probability of a specific number of successes across several trials.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does skew measure in a distribution?

A

The degree of asymmetry of a distribution around its mean.

A distribution can be negatively skewed (left) or positively skewed (right).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is kurtosis?

A

A measure of the ‘tailedness’ of a distribution, indicating how much of the variance is due to extreme values.

Distributions can be leptokurtic (tall and skinny) or platykurtic (short and wide).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the three measures of central tendency?

A
  • Mean
  • Median
  • Mode

These measures summarize a dataset with a single value representing the center.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is the mean calculated?

A

By summing all values and dividing by the number of values.

This represents the average of the dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the median in a dataset?

A

The middle value when the data is ordered from least to greatest.

If there is an even number of values, the median is the average of the two middle numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the mode?

A

The value that appears most frequently in a dataset.

A dataset can have more than one mode or no mode at all.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the mean?

A

The mean is also called the average. To find the average, you take the sum of the values and divide them by the number of values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the steps to calculate the mean?

A
  • Find the sum of the values.
  • Divide the sum by the number of values.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Calculate the mean of the weights: 22 lb, 26 lb, 24 lb.

A

Mean = 24 lb

Sum = 22 + 26 + 24 = 72; Mean = 72 / 3 = 24

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the median?

A

The median is the middle value of a dataset when arranged in ascending order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are the steps to find the median?

A
  • Arrange values in ascending or descending order.
  • Find the value that is in the exact middle.
  • If the dataset has an even number of values, find the mean of the two middle values.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Find the median of the weights: 22 lb, 24 lb, 26 lb.

A

Median = 24 lb

Middle value of 22, 24, 26 is 24.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Find the median of the weights: 22 lb, 24 lb, 25 lb, 26 lb.

A

Median = 24.5 lb

Middle values are 24 and 25; (24 + 25) / 2 = 24.5.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the mode?

A

The mode is the number that occurs most often in a dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the steps to find the mode?

A
  • Arrange values in ascending or descending order.
  • Count the occurrences of all repeating values.
  • Compare the number of occurrences for every repeated value.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the mode of the weights: 22 lb, 25 lb, 25 lb, 24 lb, 24 lb, 23 lb, 25 lb, 22 lb?

A

Mode = 25 lb

25 occurs most frequently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

True or False: The mode can have no value if no number is repeated.

A

True.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

When should you use the mean?

A

Use the mean as a default; it works well with normal distributions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

When is the median preferred over the mean?

A

Use the median when dealing with skewed or asymmetrical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

When is the mode useful?

A

The mode is ideal for handling nominal variables.

29
Q

What is range?

A

Range is the difference between the maximum and minimum values in a dataset.

30
Q

What are the steps to calculate the range?

A
  • Arrange values in ascending or descending order.
  • Identify the minimum and maximum values.
  • Subtract the minimum from the maximum.
31
Q

Calculate the range of the following dataset: 12, 18, 10, 22, 15, 25, 16, 17, 14, 19.

A

Range = 15

Max = 25, Min = 10; 25 - 10 = 15.

32
Q

What are quartiles?

A

Quartiles divide your data into four equal parts.

33
Q

What are the steps to find quartiles?

A
  • Arrange values in ascending or descending order.
  • Find the median (Q2).
  • Split the dataset at the median.
  • Find the median of the lower dataset (Q1) and the upper dataset (Q3).
34
Q

Find the quartiles of the dataset: 12, 18, 10, 22, 15, 25, 16, 17, 14, 19.

A

Q1 = 14, Q2 = 16.5, Q3 = 19

Q2 is the median; Q1 is median of lower half; Q3 is median of upper half.

35
Q

What is the interquartile range?

A

The interquartile range is the difference between Q3 and Q1.

36
Q

Calculate the interquartile range given Q1 = 14 and Q3 = 19.

A

Interquartile range = 5

19 - 14 = 5.

37
Q

What is variance?

A

Variance is a measure of dispersion that looks at the squared deviation of each data point from the mean.

38
Q

What are the steps to calculate variance?

A
  • Find the mean of the dataset.
  • Subtract the mean from each data point.
  • Square the results.
  • Find the sum of the squared results.
  • Divide by the number of data points minus 1 (for samples).
39
Q

Calculate the variance for the dataset: 3, 6, 4, 7, 5, 1, 9, 5, 4, 6.

A

Variance = 4.9

Mean = 5; Squared deviations summed = 44; 44 / (10 - 1) = 4.9.

40
Q

What is standard deviation?

A

Standard deviation is the square root of the variance.

41
Q

Why is standard deviation important?

A

It provides a measure of dispersion in the same units as your dataset.

42
Q

What happens when you apply standard deviation to a normal distribution?

A

One standard deviation above the mean includes 34.1% of your data points.

43
Q

What is the square root of the variance called?

A

Standard deviation

44
Q

Why is standard deviation important?

A

It puts the standard deviation in the same units as your dataset

45
Q

What percentage of data points are within one standard deviation of the mean in a normal distribution?

46
Q

What percentage of data points are within two standard deviations of the mean in a normal distribution?

47
Q

What percentage of data points are within three standard deviations of the mean in a normal distribution?

48
Q

What is a common cutoff for outliers in data analysis?

A

Three standard deviations

49
Q

What is the first step in calculating standard deviation?

A

Find the mean of the dataset

50
Q

What do you do after finding the mean in the standard deviation calculation?

A

Subtract the mean from each data point

51
Q

What do you do after subtracting the mean from each data point?

A

Square the results

52
Q

What is done after squaring the results in the standard deviation calculation?

A

Find the sum of the squared results

53
Q

What do you divide the sum of squared results by?

A

Number of data points minus 1

54
Q

What do you take the square root of in the standard deviation calculation?

55
Q

What is the mean of the dataset: 4, 5, 3, 4, 2, 2, 6, 4, 6, 4?

56
Q

What is the variance of the dataset: 4, 5, 3, 4, 2, 2, 6, 4, 6, 4?

57
Q

What is the standard deviation of the dataset: 4, 5, 3, 4, 2, 2, 6, 4, 6, 4?

58
Q

What are common types of distributions?

A
  • Normal
  • Uniform
  • Poisson
  • Exponential
  • Bernoulli
  • Binomial
59
Q

What is skew in data distribution?

A

How data is distorted left or right

60
Q

What is kurtosis in data distribution?

A

How data is distorted up or down

61
Q

What are the common measures of central tendency?

A
  • Mean
  • Median
  • Mode
62
Q

What are the simpler methods of measures of dispersion?

A
  • Range
  • Quartiles
63
Q

What are more complicated methods of measures of dispersion?

A
  • Variance
  • Standard deviation
64
Q

What does standard deviation measure?

A

The average distance of every point from the mean

65
Q

True or False: The middle quartile (Q2) is the same as the median.

66
Q

What is the mode of the dataset: 24, 18, 36, 51, 24, 48, 18?

67
Q

What is the range of the dataset: 15, 615, 46, 73, 45, 80, 46?

68
Q

What is the middle quartile (Q2) of the dataset: 10, 24, 13, 9, 15, 7, 19?

69
Q

What is the standard deviation of the sample dataset: 9, 11, 7, 8, 9, 10?