Chapter 4 - QMB2100 Flashcards

Question 1

Q

What is a dot plot?

Answer

A

It is a type of graph that summarizes the distribution of one variable by stacking dots at points on a number line that shows the values of the variable.

Question 2

Q

What is one advantage of the dot plot?

Answer

A

It groups the data as little as possible and doesn’t lose the identity of an individual observation.

Question 3

Q

When are dot plots more useful than histograms?

Answer

A

They are more useful when using smaller data sets.

Question 4

Q

What are quartiles, deciles, and percentiles?

Answer

A

They are measures of position.

Question 5

Q

What is a quartile?

Answer

A

Values of an ordered data set into four equal parts of observations.

Question 6

Q

What are deciles?

Answer

A

Values of an ordered data set into ten equal parts of observations.

Question 7

Q

What are percentiles?

Answer

A

Values of an ordered data set into one hundred equal part of observations.

Question 8

Q

What is the formula for location of a percentile?

Answer

A

Lp = (n+1)(P/100); where n is the number of observations and P is the Pth percentile.

Question 9

Q

What are the two methods to find the location of a percentile?

Answer

A

Exclusive method and inclusive method.

Question 10

Q

Describe the formula for the exclusive method.

Answer

A

Lp = (n+1)(P/100); where n is the number of observations and P is the Pth percentile.

Question 11

Q

Describe the formula for the inclusive method.

Answer

A

Lp = n(P/100) + 1-(P/100); where n is the number of observations and P is the Pth percentile.

Question 12

Q

What is a box plot?

Answer

A

A graphic display that shows the general shape of a variable’s distribution. It is based on five descriptive statistics: the maximum, minimum, first and third quartiles and the median.

Question 13

Q

What is the interquartile range?

Answer

A

The range of values between the first and third quartiles; 50% of a distribution’s values are located within this range.

Question 14

Q

What are outliers?

Answer

A

A data point that is unusually far from the others. An accepted rule is to classify an observation as an outlier if it is 1.5 times the interquartile range above the third quartile or below the first quartile.

Question 15

Q

What is the formula for upper outlier boundary?

Answer

A

UOB = Q3 + 1.5(Q3 - Q1)

Question 16

Q

What is the formula for lower outlier boundary?

Answer

A

LOB = Q1 - 1.5(Q3 - Q1)

Question 17

Q

What is skewness?

Answer

A

Another characteristic of the shape of the distribution. There are four types: symmetric, positively skewed, negatively skewed, and bimodal.

Question 18

Q

What are the 2 ways to calculate skewness?

Answer

A

Pearson’s coefficient of skewness and software coefficient of skewness.

Question 19

Q

What is the formula for Pearson’s coefficient of skewness?

Answer

A

Sk = [3(x̄-median)] / s; where s is the standard deviation and x̄ is the mean. The coefficient can only range from -3 to 3.

Question 20

Q

What is the formula for software coefficient of skewness?

Answer

A

Sk = n/((n-1)(n-2)) [ Σ ((x - x̄)/s)^3]; where x̄ is the mean, s is the standard deviation and n is the number of observations.

Question 21

Q

What is univariate?

Answer

A

When studying a single variable.

Question 22

Q

What is bivariate

Answer

A

When studying the relationship between two variables.

Question 23

Q

What is a scatter diagram?

Answer

A

Graphical technique used to show the relationship between two variables measure with interval or ratio scales.

Question 24

Q

What is the correlation coefficient?

Answer

A

Is a statistic that can be calculated to measure the direction and strength of the relationship between two variables. It varies from -1 to 1 and the closer the coefficient is to -1 or 1 the stronger the correlation. 0 would mean no correlation.

Question 25

Q

What is the formula for the correlation coefficient?

Answer

A

r = Σ [(x - x̄) (y - ȳ)]/[(n - 1) SxSy]; where x̄ is the mean of x variables, ȳ is the mean of y variables, n is the number of observations, Sx is the standard deviation of x and Syis the standard deviation of y.

Question 26

Q

What are contingency tables?

Answer

A

A table used to classify sample observations according to two identifiable characteristics.

Question 27

Q

What is the contingency table used for?

Answer

A

When you wish to study the relationship of two variables when one or both are nominal or ordinal scale.