Module 2: Descriptive Statistics: Tabular, Graphical and Numerical Methods Flashcards

1
Q

Frequency Distribution

A

A tabular representation of the summary data that shows the numerical count of items in each class in the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

TRUE or FALSE:

In a frequency distribution, classes can overlap.

A

FALSE: For any frequency distribution, the classes must not overlap. An overlap will result in double counting an item and will yield erroneous results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Relative Frequency

A

Relative frequency is a calculated value that represents the proportion of the items in each class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the Relative Frequency equation?

A

Frequency of Class
Relative Frequency of a Category = ________________
n

(where n is the total count of all the classes being compared)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can you convert relative frequency relative percentage?

A

Multiply the relative frequency by 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Frequency distribution tables are a way to help us understand data

a) numerically
b) visually
c) philosophically

A

a) numerically

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Charting is a means to represent frequencies

a) numerically
b) visually
c) auditorily

A

b) visually

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

TRUE OR FALSE:

You must arrange charts in descending order

A

FALSE

You do not have to arrange your information in descending order, however it can be very helpful to do so.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Pie charts are most helpful when representing distributions focused on what?

A

Proportions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

TRUE OR FALSE:

There are usually many ways to define classes for numerical data.

A

TRUE

There are usually many ways to define classes for numerical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the three steps to produce an effective frequency distribution table?

A

Step One: Determine the number of classes to be evaluated.
Step Two: Determine the width of the classes.
Step Three: Determine each class’s limits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When creating a frequency distribution table, how many classes should you create?

A

Use the fewest number of classes possible to effectively explain your data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the formula to calculate the approximate width of your classes?

A

Largest data value - smallest data value
Approx. width = _______________________________
number of categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

TRUE OR FALSE:

Cross-tabulations can only be used to compare qualitative-to-quantitative data.

A

FALSE
The use of cross-tabulations is an effective method for comparing qualitative-to-qualitative data, qualitative-to-quantitative data, and quantitative-to-quantitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Scatter Diagram

A

A two-dimensional plot, or graphical display, of data. It helps to determine whether there is relationship between two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

On a scatter diagram, what indicates a POSITIVE RELATIONSHIP between variables?

A

When data points have an increasing slope as you move from left to right.
It indicates that if one variable is decreased, the other variable will also decrease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What 3 types of relationships do we see in scatter diagrams?

A

1) Positive relationship
2) Negative relationship
3) No relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

On a scatter diagram, what signifies NO RELATIONSHIP between variables?

A

Usually indicated by a horizontal pattern (or close to horizontal pattern) on the scatter diagram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

On a scatter diagram, what indicates a NEGATIVE RELATIONSHIP between variables?

A

When the graphed data points have a downward slope, as you move left to right.
As one variable increases in value, the other variable decreases in value, and vice versa.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Measure of Central Tendency

A

“typical” or “average” value of a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

TRUE OR FALSE:

There are only 3 types of measures of central tendencies?

A

FALSE
There are many measures of central tendencies.

The 3 we examine in this section are:

1) Mean
2) Median
3) Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Mean

A

Average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the 2 types of means?

A

1) Sample Mean

2) Population Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the mathematical formula for

SAMPLE MEAN?

A

Σ xi
x̄ = —–
n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What does x̄ (x-bar) represent?

A

x̄ stands for the “sample mean”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What does xi (x subscript i) represent?

A

xᵢ means “all of the x-values”

The subscript i will equal 1 to n; therefore:
x₁ = the first term in the set,
x₂ = the second term in the set,
and so on until the last term xn is entered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What does n represent?

A

n means “the total number of items in the sample”

28
Q

What does μ (“mu”) represent?

A

μ (“mu”) stands for population mean

29
Q

What is the mathematical formula for

POPULATION MEAN?

A

Σ xᵢ
μ = —–
N

30
Q

What does N represent?

A

N means “the total number in the population”

31
Q

Median

A

The midpoint of all the points in the data set.

32
Q

How do you find the median of a data set?

A

FIRST, arrange the data in ascending order.
Find the midpoint:

a) If there are an ODD number of data points in the set, the median is the middle point in the set.
The median is the point that has an equal number of points below and above it.

b) If there are an EVEN number of data points in the set, the median is the mean (or average) of the middle two points in the data set. Add the two middle points together, then divide by 2.

33
Q

Mode

A

The value that appears most frequently in the data set.

34
Q

TRUE OR FALSE:

Usually the mode will be the only measure of central tendency for qualitative data

A

TRUE

In many cases, the mode will be the only measure of central tendency for qualitative data.

35
Q

Bimodal

A

When a data set has two modes

36
Q

Multimodal

A

When a data set has more than two modes

37
Q

No Mode

A

When no data point occurs more than once

38
Q

What is the formula to determine the LOCATION OF A SPECIFIC PERCENTILE?

A

100
i = (——-) n
p

39
Q

What do all the letters represent in the formula for determining the location of a specific percentile?

A
i = index
p = the desired percentage 
n = the number of observations in the data set
40
Q

How do you find the location of a specific percentile?

A

FIRST arrange all data points in ascending order
Then use the formula:
100
i = (——-) n
p
n will give you the numerical location within your ascending list of data points.

41
Q

Quartiles

A

Dividing up the data into four equal parts

42
Q

When dividing data into quartiles, what percentile does each quartile represent?

A

The first quartile (Q1) is at the 25th percentile
The second quartile (Q2) is at the 50th percentile
**Q2 is also the median
The third quartile (Q3) is at the 75th percentile

43
Q

How do you find quartiles?

A
  1. Put the data in ascending order.
  2. Find the median, M, of the data. This median is the second quartile Q2.
  3. Separate the data into two halves. (That is, data below the median and data above the median.)
  4. Find the median of the data below the median.
    This is Q1.
  5. Find the median of the data above the median.
    This is Q3.
44
Q

What are “Measures of Dispersion”?

A

Measure the extent to which data is spread out or dispersed.

45
Q

Range

A

The span between the smallest value in the set and the largest

46
Q

What is the formula for RANGE?

A

Range = Largest Value - Smallest Value

47
Q

What is the crudest way to measure dispersion?

A

Range

48
Q

Interquartile Range

A

This calculation focuses on the middle 50% of the data.

49
Q

What is the INTERQUARTILE RANGE formula?

A

1QR = Q3 - Q1

50
Q

Variance

A

A measure of dispersion that uses all of the data in the set and is based on the difference between each data point in a data set and the mean of the data set.

51
Q

What are the two different type of variances?

A

Sample Variance
and
Population Variance

52
Q

σ²

A

The Greek lowercase letter sigma.

It represents population variance.

53
Q

A

Represents sample variance.

54
Q

What is the formula for POPULATION VARIANCE?

A

Σ ( xᵢ - μ )²
σ² = —————-
N

55
Q

What is the formula for SAMPLE VARIANCE?

A

Σ ( xᵢ - x̄ )²
s² = —————–
n - 1

56
Q

What is “deviation about the mean”?

A

The calculated difference between the data value and the mean.

57
Q

TRUE OR FALSE:

The larger the variance, the less dispersed (or spread out) the data.

A

FALSE:

The larger the variance, the MORE dispersed (or spread out) the data.

58
Q

Why do we use n - 1 as the divisor in the sample variance formula?

A

The n - 1 value assists the formula in providing an unbiased estimate of the population variance.

59
Q

Standard Deviation

A

The positive square root of the variance

60
Q

Population Standard Deviation

A

= σ = √σ²

61
Q

Sample Standard Deviation

A

= s = √s²

62
Q

Coefficient of Variation

A

Shows how the size of the standard deviation compares to the mean of the data set
The coefficient of variation may be used to compare the dispersion of one set of data to the dispersion of another set of data.

63
Q

What is the formula for COEFFICIENT OF VARIATION?

A

Standard Deviation
= —————————— * 100
mean

64
Q

z-score

A

aka, the standardized value.

It tells the researcher how many standard deviations an item in the data set lies from the mean of the data set.

65
Q

What is the formula for Z-SCORE?

A

x - μ
z = ———-
σ

66
Q

What does everything stand for in the z-score formula?

A
z = the z-score for x
µ = the mean
σ = the standard deviation