Chapter 4 - QMB2100 Flashcards

1
Q

What is a dot plot?

A

It is a type of graph that summarizes the distribution of one variable by stacking dots at points on a number line that shows the values of the variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is one advantage of the dot plot?

A

It groups the data as little as possible and doesn’t lose the identity of an individual observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When are dot plots more useful than histograms?

A

They are more useful when using smaller data sets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are quartiles, deciles, and percentiles?

A

They are measures of position.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a quartile?

A

Values of an ordered data set into four equal parts of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are deciles?

A

Values of an ordered data set into ten equal parts of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are percentiles?

A

Values of an ordered data set into one hundred equal part of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the formula for location of a percentile?

A

Lp = (n+1)(P/100); where n is the number of observations and P is the Pth percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two methods to find the location of a percentile?

A

Exclusive method and inclusive method.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the formula for the exclusive method.

A

Lp = (n+1)(P/100); where n is the number of observations and P is the Pth percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe the formula for the inclusive method.

A

Lp = n(P/100) + 1-(P/100); where n is the number of observations and P is the Pth percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a box plot?

A

A graphic display that shows the general shape of a variable’s distribution. It is based on five descriptive statistics: the maximum, minimum, first and third quartiles and the median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the interquartile range?

A

The range of values between the first and third quartiles; 50% of a distribution’s values are located within this range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are outliers?

A

A data point that is unusually far from the others. An accepted rule is to classify an observation as an outlier if it is 1.5 times the interquartile range above the third quartile or below the first quartile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the formula for upper outlier boundary?

A

UOB = Q3 + 1.5(Q3 - Q1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the formula for lower outlier boundary?

A

LOB = Q1 - 1.5(Q3 - Q1)

17
Q

What is skewness?

A

Another characteristic of the shape of the distribution. There are four types: symmetric, positively skewed, negatively skewed, and bimodal.

18
Q

What are the 2 ways to calculate skewness?

A

Pearson’s coefficient of skewness and software coefficient of skewness.

19
Q

What is the formula for Pearson’s coefficient of skewness?

A

Sk = [3(x̄-median)] / s; where s is the standard deviation and x̄ is the mean. The coefficient can only range from -3 to 3.

20
Q

What is the formula for software coefficient of skewness?

A

Sk = n/((n-1)(n-2)) [ Σ ((x - x̄)/s)^3]; where x̄ is the mean, s is the standard deviation and n is the number of observations.

21
Q

What is univariate?

A

When studying a single variable.

22
Q

What is bivariate

A

When studying the relationship between two variables.

23
Q

What is a scatter diagram?

A

Graphical technique used to show the relationship between two variables measure with interval or ratio scales.

24
Q

What is the correlation coefficient?

A

Is a statistic that can be calculated to measure the direction and strength of the relationship between two variables. It varies from -1 to 1 and the closer the coefficient is to -1 or 1 the stronger the correlation. 0 would mean no correlation.

25
Q

What is the formula for the correlation coefficient?

A

r = Σ [(x - x̄) (y - ȳ)]/[(n - 1) SxSy]; where x̄ is the mean of x variables, ȳ is the mean of y variables, n is the number of observations, Sx is the standard deviation of x and Syis the standard deviation of y.

26
Q

What are contingency tables?

A

A table used to classify sample observations according to two identifiable characteristics.

27
Q

What is the contingency table used for?

A

When you wish to study the relationship of two variables when one or both are nominal or ordinal scale.