CHAPTER 2 QUESTIONS Flashcards

1
Q

Descriptive statistics describe the distribution of a data set in terms of _________________

A

Descriptive statistics describe the distribution of a data set in terms of both central tendency and dispersion.

2
Q

Measures of dispersion include _ _ _ _ _

A

Measures of dispersion include the standard deviation, coefficient of variation (standard deviation divided by the mean), coefficient of dispersion, and range.

3
Q

The median is the ____ percentile or midpoint in the distribution, with half of the sales prices less than it and half greater than it.

The median is a frequently used measure of central tendency in both assessment and single property appraisals.

This is because _ _ _ _ _ _

The ______ percentile is the first quartile and the _____ percentile is the third quartile. These represent the cut-off points for the lowest one-fourth and lowest three-fourths of the data, respectively.

A

The median is the 50th percentile or midpoint in the distribution, with half of the sales prices less than it and half greater than it.

The median is a frequently used measure of central tendency in both assessment and single property appraisals. This is because the mean can be significantly influenced by outliers; for S_PRICE, the influence of some very high-priced sales has resulted in a mean that is larger than the median.

The 25th percentile is the first quartile and the 75th percentile is the third quartile. These represent the cut-off points for the lowest one-fourth and lowest three-fourths of the data, respectively.
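
As a rough illustration of the points above, the following Python sketch compares the mean, median, and quartiles for a small set of made-up sale prices (the figures are hypothetical, not the course's S_PRICE data). Quartile conventions differ between software packages, so the "inclusive" method used here is an assumption and may not match SPSS output exactly.

```python
from statistics import mean, median, quantiles

# Hypothetical sale prices; the last one is a very high-priced sale.
sale_prices = [
    200_000, 210_000, 220_000, 230_000, 240_000,
    250_000, 260_000, 270_000, 1_500_000,
]

avg_price = mean(sale_prices)    # pulled upward by the high-priced sale
mid_price = median(sale_prices)  # 50th percentile

# 25th, 50th, and 75th percentiles (first quartile, median, third quartile).
q1, q2, q3 = quantiles(sale_prices, n=4, method="inclusive")

print(f"mean={avg_price:,.0f} median={mid_price:,.0f}")
print(f"Q1={q1:,.0f} Q2={q2:,.0f} Q3={q3:,.0f}")
```

The single high-priced sale pulls the mean well above the median, while the quartiles are unaffected by it.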

4
Q

Although not shown, the coefficient of variation (COV) can be computed by _ _ _ _ _

A

Although not shown, the coefficient of variation (COV) can be computed by dividing the standard deviation by the mean.
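
A minimal Python sketch of this calculation, assuming the sample standard deviation is the one intended; the sale prices are hypothetical.

```python
from statistics import mean, stdev

# Hypothetical sale prices.
sale_prices = [100_000, 200_000, 300_000]

# Coefficient of variation: standard deviation divided by the mean.
cov = stdev(sale_prices) / mean(sale_prices)

# The COV is often quoted as a percentage.
cov_percent = cov * 100

print(f"COV = {cov:.2f} ({cov_percent:.0f}%)")
```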

5
Q

After finding the mean for a sample, it is a good idea to ask whether the figure is representative of the whole population. This involves calculating something called the __________, then determining ___________

A

After finding the mean for a sample, it is a good idea to ask whether the figure is representative of the whole population. This involves calculating something called the standard error of the mean, then determining confidence intervals around that number.

6
Q

The _____________, often referred to as _______ OR ___, is a measure of how well the mean for a particular sample estimates the mean for the whole population.

A

The standard error of the mean, often referred to as standard error or SE, is a measure of how well the mean for a particular sample estimates the mean for the whole population.

7
Q

SEx̄ = Estimate of the standard error of the mean

s = standard deviation of the sample

n = sample size (the formula divides s by the square root of n)

A

SEx̄ = s / √n

SEx̄ = Estimate of the standard error of the mean

s = standard deviation of the sample

n = sample size (the formula divides s by the square root of n)
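
A minimal Python sketch of this calculation, assuming the usual formula (sample standard deviation divided by the square root of the sample size). The function name and the sale prices are hypothetical.

```python
from math import sqrt
from statistics import stdev


def standard_error(values):
    """Estimate the standard error of the mean: s / sqrt(n)."""
    s = stdev(values)  # sample standard deviation
    n = len(values)    # sample size
    return s / sqrt(n)


# Hypothetical sale prices.
sale_prices = [100_000, 200_000, 300_000]
se = standard_error(sale_prices)
print(f"SE = {se:,.2f}")
```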

8
Q

The standard error of the mean can be used to form confidence intervals around the mean. Adding and subtracting one standard error to and from the mean produces a range of values that typically encompasses approximately ____% of the possible means for the population overall.

A

The standard error of the mean can be used to form confidence intervals around the mean. Adding and subtracting one standard error to and from the mean produces a range of values that typically encompasses approximately 68% of the possible means for the population overall.

9
Q

As stated above, the 95% confidence interval for the mean is closely approximated by _ _ _ _ _ _

A

As stated above, the 95% confidence interval for the mean is closely approximated by adding and subtracting the value of two standard errors of the mean from the mean value.
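
The one- and two-standard-error rules can be sketched in Python as follows. The function name and the sale prices are hypothetical, and with a sample this small the two-standard-error rule is only a rough approximation.

```python
from math import sqrt
from statistics import mean, stdev


def mean_interval(values, num_standard_errors=2):
    """Return (low, high): the mean plus or minus a number of standard errors."""
    centre = mean(values)
    se = stdev(values) / sqrt(len(values))
    margin = num_standard_errors * se
    return centre - margin, centre + margin


# Hypothetical sale prices.
sale_prices = [100_000, 200_000, 300_000]

low_68, high_68 = mean_interval(sale_prices, num_standard_errors=1)  # about 68%
low_95, high_95 = mean_interval(sale_prices)                         # about 95%

print(f"~68% interval: {low_68:,.0f} to {high_68:,.0f}")
print(f"~95% interval: {low_95:,.0f} to {high_95:,.0f}")
```

Because the same margin is added and subtracted, the mean always sits at the centre of the interval.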

10
Q

TRUE OR FALSE?

The mean will always be in the centre of the range of the 95% confidence interval, but this is not true for the median.

A

ANSWER: TRUE

11
Q

While the median value will always be within its 95% confidence interval, the median will not always be at the centre of the confidence interval range. This is because _ _ _ _ _

A

While the median value will always be within its 95% confidence interval, the median will not always be at the centre of the confidence interval range.

This is because the confidence interval for the median is found by counting values greater than and less than the median rather than by addition and subtraction of a value.
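
One common way to do this counting is the binomial (order-statistic) method sketched below in Python. This is an assumption about the approach: the source does not say which procedure the course or SPSS uses. The function name and the sale prices are hypothetical.

```python
from math import comb
from statistics import median


def median_confidence_interval(values, confidence=0.95):
    """Return (low, high) sample values bracketing the median.

    Counts inward from each end of the sorted data, using the
    Binomial(n, 0.5) distribution to decide how far in to go.
    """
    data = sorted(values)
    n = len(data)
    alpha = 1.0 - confidence
    rank = 0           # how many values in from each end (1-based rank)
    cumulative = 0.0   # P(B <= k) for B ~ Binomial(n, 0.5)
    for k in range(n // 2):
        cumulative += comb(n, k) / 2 ** n
        if 2 * cumulative > alpha:
            break
        rank = k + 1
    if rank == 0:
        raise ValueError("sample too small for this confidence level")
    return data[rank - 1], data[n - rank]


# Hypothetical sale prices with a few high-priced sales.
sale_prices = [
    100_000, 110_000, 120_000, 130_000, 140_000,
    150_000, 160_000, 200_000, 400_000, 900_000,
]
low, high = median_confidence_interval(sale_prices)
print(f"median={median(sale_prices):,.0f} interval={low:,} to {high:,}")
```

Here the interval runs from the 2nd-lowest to the 2nd-highest value, so the median lies inside it but much closer to the lower end.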

12
Q

The ASR shows how accurately assessed values relate to actual sale prices by _ _ _ _ _

A

The ASR shows how accurately assessed values relate to actual sale prices by dividing the assessed value (the value predicted by the model) by the actual sales price.
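
A minimal Python sketch of the ratio; the function name and the assessed values and sale prices are hypothetical.

```python
def assessment_sales_ratio(assessed_value, sale_price):
    """ASR: assessed (model-predicted) value divided by actual sale price."""
    return assessed_value / sale_price


# Hypothetical (assessed value, sale price) pairs.
sales = [
    (285_000, 300_000),  # assessed below the sale price
    (210_000, 200_000),  # assessed above the sale price
    (400_000, 400_000),  # assessed exactly at the sale price
]

ratios = [assessment_sales_ratio(assessed, price) for assessed, price in sales]
print([round(r, 2) for r in ratios])
```

A ratio below 1 means the property was assessed for less than it sold for; a ratio above 1 means it was assessed for more.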

13
Q

A “normal” distribution means _ _ _ _ _ _

A

A “normal” distribution means that the data is evenly spread out on either side of the mean, with the bulk of the observations near the mean and trailing off on either side.

14
Q

NOTE ONLY

A normal distribution is required in order to accurately estimate confidence intervals of the mean and to carry out probability estimates for sample data: e.g., “68% of the data will fall between the points ___ and ____, which are one standard deviation on either side of the mean”. See Figure 2.1 for an illustration. However, if the data is not normal, these measures will not be completely reliable.

Quite often, real estate data is not normally distributed, especially residential sales data, as it tends to have a large number of lower-priced sales and then a few high-priced sales that skew the curve. When this is the case, the median is a better indicator of central tendency than the mean, and the confidence intervals of the median are preferable to those around the mean.

A

NOTE ONLY

A normal distribution is required in order to accurately estimate confidence intervals of the mean and to carry out probability estimates for sample data: e.g., “68% of the data will fall between the points ___ and ____, which are one standard deviation on either side of the mean”. See Figure 2.1 for an illustration. However, if the data is not normal, these measures will not be completely reliable.

Quite often, real estate data is not normally distributed, especially residential sales data, as it tends to have a large number of lower-priced sales and then a few high-priced sales that skew the curve. When this is the case, the median is a better indicator of central tendency than the mean, and the confidence intervals of the median are preferable to those around the mean.

15
Q

_____________ allow you to use mathematical and logical operations to create new variables from existing ones.

For example, a database may include information on sale price and house size (square footage). If you were interested in creating a variable that represented sale price per square foot, you would create a transformation for such a variable.

A

Transformations allow you to use mathematical and logical operations to create new variables from existing ones. For example, a database may include information on sale price and house size (square footage). If you were interested in creating a variable that represented sale price per square foot, you would create a transformation for such a variable.
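
The price-per-square-foot example can be sketched in Python as follows. This is only an illustration of the idea, not SPSS syntax; the field names and figures are hypothetical.

```python
# Hypothetical sales records with sale price and house size.
sales = [
    {"sale_price": 300_000, "sqft": 1_500},
    {"sale_price": 450_000, "sqft": 2_000},
]

# Transformation: create a new variable from two existing ones.
for record in sales:
    record["price_per_sqft"] = record["sale_price"] / record["sqft"]

for record in sales:
    print(record)
```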

16
Q

SPSS has a separate module specifically created for carrying out transformations, which you can find on the main menu bar.

The __________ window is where you will create most of the new variables needed for carrying out mass appraisal model building.

A

SPSS has a separate module specifically created for carrying out transformations, which you can find on the main menu bar. You can view some of the transformations that will commonly be used in this course under Transform ⇒ Compute Variable.

The Compute Variable window is where you will create most of the new variables needed for carrying out mass appraisal model building.

17
Q

A _______ is a list of SPSS commands. This can include transformations, regression commands, or pretty much any command SPSS offers.

A

A syntax file is a list of SPSS commands. This can include transformations, regression commands, or pretty much any command SPSS offers.

18
Q

Apart from simple mathematical calculations, the _____ transformation is the most used transformation in this course. ______ allows you to change the value of one or more variables according to specific rules which you supply.

A

Apart from simple mathematical calculations, the Recode transformation is the most used transformation in this course. Recoding allows you to change the value of one or more variables according to specific rules which you supply.
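
As a rough illustration of recoding, the Python sketch below applies a made-up rule to a bedroom-count variable. It is not SPSS syntax, and the rule, function name, and values are hypothetical.

```python
def recode_bedrooms(bedrooms):
    """Hypothetical rule: group 0-1 bedrooms as 1 and 4 or more as 4."""
    if bedrooms <= 1:
        return 1
    if bedrooms >= 4:
        return 4
    return bedrooms  # 2 and 3 are left unchanged


original = [0, 1, 2, 3, 4, 6]
recoded = [recode_bedrooms(b) for b in original]
print(recoded)
```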

19
Q

Boxplots are useful for comparing the distribution of one variable by values of another variable. Normally the second variable will be _ _ _ _ _ _

A

Boxplots are useful for comparing the distribution of one variable by values of another variable. Normally the second variable will be a discrete variable with only a few distinct values.

For example, you might be interested in comparing sales prices by neighbourhood. The boxplot provides some of the statistics that can be found in the descriptive statistics reports, but in a visual context. The “box” or rectangle of a boxplot contains the middle 50% of the cases. The lower and upper ends of the box represent the 25th and 75th percentiles.

The dark line within the box represents the median (50th percentile).

Points which are more than 1.5 box lengths from the edge of the box constitute outliers (values that lie outside the norm) and are represented by circles.

Points which are more than three box lengths from the edge of the box are indicated by asterisks and are termed extremes.
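
The outlier and extreme rules above can be sketched in Python as follows. The function name and sale prices are hypothetical, and the quartile method is an assumption, since software packages compute the box edges in slightly different ways.

```python
from statistics import quantiles


def classify_points(values):
    """Flag outliers (> 1.5 box lengths) and extremes (> 3 box lengths)."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    box_length = q3 - q1  # the box spans the 25th to 75th percentiles
    flagged = {"outliers": [], "extremes": []}
    for value in values:
        # Distance beyond the nearest box edge (zero or negative inside the box).
        distance = max(q1 - value, value - q3)
        if distance > 3 * box_length:
            flagged["extremes"].append(value)
        elif distance > 1.5 * box_length:
            flagged["outliers"].append(value)
    return flagged


# Hypothetical sale prices for one neighbourhood.
sale_prices = [
    200_000, 210_000, 220_000, 230_000, 240_000,
    250_000, 260_000, 270_000, 340_000, 500_000,
]
flagged = classify_points(sale_prices)
print(flagged)
```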

20
Q

Boxplots are a very useful tool for examining a _ _ _ _ _ _ _ _ based on the characteristics of _ _ _ _ _ _ . For example, they can be used to determine if there is a difference in the value of one variable for properties that are in different neighbourhoods, as above, or have a different number of bathrooms, bedrooms, or fireplaces.

A

Boxplots are a very useful tool for examining a continuous variable based on the characteristics of discrete variables.

For example, they can be used to determine if there is a difference in the value of one variable for properties that are in different neighbourhoods, as above, or have a different number of bathrooms, bedrooms, or fireplaces.

21
Q

TRUE OR FALSE? Boxplots only provide a graphic presentation of these relationships and should not be relied upon to determine the precise value of the median or other statistics.

A

ANSWER: TRUE

Boxplots only provide a graphic presentation of these relationships and should not be relied upon to determine the precise value of the median or other statistics.

This is especially true if there are a number of outliers or extremes, as the expanded scale may cause the boxes to be compressed, which may mask differences.
