hevdb s nd Flashcards

1
Q

What are measures of location?

A

Measures of location indicate at what numerical values certain characteristic points of the distribution are located.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the mean of a data set?

A

The mean is commonly referred to as the average of all the data values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a sample statistic referred to as?

A

A sample statistic is referred to as the point estimator of the corresponding population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the formula for the sample mean?

A

Sample Mean = Sum of the values of the n observations / Number of observations in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the population mean denoted as?

A

The population mean is denoted as m.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the weighted mean?

A

The weighted mean is computed by giving each data value a weight that reflects its importance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the formula for computing the weighted mean?

A

Weighted Mean = Σ (y_i * W_i) / Σ W_i.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the median?

A

The median is the value in the middle when the data items are arranged in ascending order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is the median calculated for an odd number of observations?

A

Position of the median: i = (n + 1) / 2; Value of the median: Me = y_i.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is the median calculated for an even number of observations?

A

Position of the median: i = (n + 1) / 2; Value of the median: Me = (y_i-0.5 + y_i+0.5) / 2.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the mode of a data set?

A

The mode is the value that occurs with greatest frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a bimodal data set?

A

A bimodal data set has exactly two modes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are percentiles?

A

Percentiles are cut-off values that separate the lower p% of the data from the upper (100 - p)%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the formula to compute the position of the pth percentile?

A

i = (p / 100) * n.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the first quartile?

A

The first quartile is the 25th percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the second quartile?

A

The second quartile is the 50th percentile, also known as the median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the third quartile?

A

The third quartile is the 75th percentile.

18
Q

What is the five-number summary?

A

The five-number summary includes the smallest value, first quartile, median, third quartile, and largest value.

19
Q

What is a box plot?

A

A box plot visualizes the five-number summary and displays the interquartile range.

20
Q

What is the interquartile range (IQR)?

A

The IQR is calculated as the difference between the first and third quartiles: IQR = Q3 - Q1.

21
Q

What defines the normal range in a box plot?

A

The normal range is defined as the interval between the lower and upper limits determined using the IQR.

22
Q

True or False: The median is preferred over the mean in data sets with extreme values.

23
Q

Fill in the blank: The _______ of a data set is the value that occurs with greatest frequency.

24
Q

What is the formula for the sample mean for grouped data?

A

Sample Mean = Σ (f_i * M_i) / Σ f_i.

25
Q

What is a multmodal data set?

A

A multimodal data set has more than two modes.

26
Q

What is the 90th percentile in the given apartment rents example?

A

The 90th percentile is 585.

27
Q

What is the formula for calculating the weighted mean for grouped data?

A

Weighted Mean = Σ (f_i * M_i) / Σ f_i where f_i is frequency and M_i is the midpoint.

28
Q

What does IQR stand for?

A

Interquartile Range

29
Q

How is IQR calculated?

A

IQR = Q3 - Q1

30
Q

What is the formula for calculating the lower limit in a box plot?

A

Lower Limit = Q1 - 1.5 × IQR

31
Q

What is the formula for calculating the upper limit in a box plot?

A

Upper Limit = Q3 + 1.5 × IQR

32
Q

What is considered as normal range in data analysis?

A

The interval between the lower and the upper limits

33
Q

What is the IQR for the given data if Q3 is 525 and Q1 is 445?

34
Q

What are outliers in data?

A

Data outside the normal range

35
Q

What are the lower and upper limits calculated from Q1 and Q3 in the example?

A

[325, 645]

36
Q

What does a box plot use to represent outliers?

A

A suitable symbol, e.g., an asterisk (*)

37
Q

True or False: There are outliers in the apartment rent data provided.

38
Q

What are whiskers in a box plot?

A

Dashed lines drawn from the ends of the box to the smallest and largest data values within the normal range

39
Q

Fill in the blank: The smallest value in the normal range is _______.

40
Q

Fill in the blank: The largest value in the normal range is _______.