Chapter 3.1 Flashcards

1
Q

What does a measure of central tendency represent in a data set?

A

A measure of central tendency represents the center or middle of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Is a measure of central tendency always a typical value in a dataset?

A

No, not all measures of central tendency are necessarily typical values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define the population mean and its symbol.

A

The population mean, denoted as “μ” (pronounced “mew”), is the average of the population measurements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the formula for calculating the population mean (average)?

A

The formula for the population mean (average) is:

μ (Population Mean) = ΣX / N

Where:

μ is the population mean.
ΣX represents the sum of all individual measurements in the population.
N is the total number of measurements in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the formula for calculating the sample mean (average)?

A

The formula for the sample mean (average) is:

x̄ (Sample Mean) = ΣX / n

Where:

x̄ is the sample mean.
ΣX represents the sum of all individual measurements in the sample.
n is the total number of measurements in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a population parameter in statistics?

A

A population parameter in statistics is a numerical value calculated using the measurements from an entire population. It serves as a descriptive measure that characterizes some aspect of the entire population.

Population parameters provide information about the population’s central tendency, variability, or other characteristics, and they are typically used to make inferences and draw conclusions about the population as a whole.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does “n” represent in the formula for sample mean?

A

“n” represents the number of sample measurements, and it is referred to as the sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How are sample measurements denoted in formulas?

A

Sample measurements are denoted as x₁, x₂, …, xₙ, where x₁ represents the first sample measurement, x₂ represents the second, and so on, with xₙ denoting the last measurement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is summation notation (Σ) used in formulas for sample mean?

A

Summation notation Σ is used for convenience in formulas. It represents the sum of terms that follow the symbol.

In the context of the sample mean, Σxᵢ represents the sum of sample measurements, where “xᵢ” is a generic observation in the dataset, and “i” is the index that ranges from 1 to n, indicating where to start and stop summing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a point estimate in statistics represent?

A

A point estimate in statistics represents a single numerical value that estimates a population parameter, such as the mean, based on a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does it mean when there is sampling error in a point estimate?

A

Sampling error refers to the discrepancy between the point estimate (e.g., the sample mean) and the actual population parameter (e.g., the population mean).

It means that the point estimate is not expected to exactly equal the population parameter due to the randomness in sampling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does the provided point estimate of 31.56 mpg relate to the population mean (μ)?

A

The point estimate of 31.56 mpg is an estimate of the population mean (μ), representing the average mileage of all cars.

However, it is subject to sampling error, meaning it may not exactly equal μ. This estimate provides some evidence that the population mean is at least 31 but does not provide definitive evidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Definition of median

A

median (denotedMd)
A measure of central tendency that divides a population or sample into two roughly equal parts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Definition of mode

A

mode (denoted Mo)
The measurement in a sample or a population that occurs most frequently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Typical relationship between mean, meadian and mode.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the advantage of using the Mean as a measure of central tendency?

A

The advantage of using the Mean is that it is a reliable measure because it takes into account every entry of a data set.

17
Q

What is the disadvantage of using the Mean?

A

The disadvantage of using the Mean is that it is greatly affected by outliers, which are data entries that are far removed from the other entries in the data set.

18
Q

Why is the Median often preferred over the Mean?

A

The Median is often preferred over the Mean because it is not sensitive to extreme values or outliers, making it a robust measure of central tendency.

19
Q

When is the Mode useful in data analysis?

A

The Mode is useful for determining data that is more likely to occur or identifying the most frequently observed value in a dataset.

20
Q

What is the definition of an outlier in statistics?

A

An outlier in statistics is an individual data point or observation that significantly differs or deviates from the overall pattern or distribution of the data.

Outliers are data values that are unusually high or low compared to the majority of the data points and can potentially affect the accuracy of statistical analysis and summary measures like the mean and standard deviation.

21
Q

What are the effects of an outlier on the mean, median, and mode in a dataset?

A

Mean: An outlier can significantly influence the mean, pulling it toward the extreme value of the outlier. If the outlier is very large or small, it can result in a mean that does not accurately represent the central tendency of the majority of the data.

Median: The median is less affected by outliers because it is not dependent on the actual values of the data points but rather their position in the ordered list. An outlier does not have as much impact on the median, making it a more robust measure of central tendency in the presence of outliers.

Mode: The mode represents the most frequently occurring value in a dataset. Outliers typically do not affect the mode unless the outlier itself becomes the new mode. In most cases, the mode remains relatively stable even when outliers are present.

22
Q

What are weighted means in statistics, and how are they calculated?

A

Weighted means in statistics are used when some measurements within a dataset are more important or carry greater significance than others. To calculate a weighted mean, you assign numerical “weights” to each data point, which represent the relative importance of each value. The weighted mean is then calculated using the formula:

Weighted Mean = Σ(wᵢ * xᵢ) / Σwᵢ

Where:

Weighted Mean is the calculated average.
wᵢ represents the weight assigned to the i-th measurement (xᵢ).
Σ denotes summation, indicating that you sum the products of the weights and corresponding measurements, and divide by the sum of the weights.