Chapter 2: Descriptive Statistics and Analytics-- Tabular and Graphical Methods Flashcards

1
Q

T or F: visualizing our data helps us to understand data easily, and see trends/outliers.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

T or F: With qualitative data, we can count this data and put in different categories. This data can be summarized using a frequency distribution.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

a table that summarizes the number of items in each of several non-overlapping classes; is a useful summary that makes our data easier to understand

A

frequency distribution (look at pic in camera roll of frequency distribution)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

________ frequency summarizes the proportion of items in each class.

A

relative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the formula to calculate relative frequency?

A

Frequency / total (or n– total # of observations)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the formula to calculate percent frequency?

A

Relative frequency x 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the formula to calculate and assign degree of pie slice for a pie chart?

A

Relative frequency x 360

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The sum of RELATIVE frequency is always equal to ______.

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The sum of PERCENT frequency is always equal to _______.

A

100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

From frequency data, we can make what two kinds of charts?

A

Bar charts and pie charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

T or F: Bar charts/graphs must have spaces between bars to indicate that each class is separate from each other.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

a vertical or horizontal rectangle represents the frequency for each category; height can be frequency, relative frequency, or percent frequency

A

bar chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

a circle divided into slices where the size of each slice represents its relative frequency or percent frequency

A

pie chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

a bar chart having the different kinds of defects listed on the horizontal scale; tells us how many errors/defects we have in our data

A

Pareto chart (look at camera roll for pic of pareto chart)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

T or F: A pareto chart is a chart that can be made from quantitative data.

A

False; QUALITATIVE data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In a pareto chart:
- bar height represents the ______ of occurrence
- bars are arranged in (increasing/decreasing) height from left to right
- sometimes augmented by plotting a _________ percentage point for each bar

A
  • frequency
  • decreasing
  • cumulative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

A dot plot is used for (quantitative/qualitative data).

A

quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

A histogram can be made for (qualitative/quantitative) data.

A

quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

a graphical display of a frequency distribution, relative frequency distribution, or percentage frequency distribution. It divides measurements into classes and graphs the frequency, relative frequency, or percentage frequency for each class.

A

histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are the 5 steps to build a histogram?

A
  1. Find the number of classes (formula= 2^k > or equal to n); (n= total number of observations, k= number of classes, which tells us the number of bars we need to build in histogram)
  2. Find the class length (length of our bars)
  3. Form non-overlapping classes of equal width (take lowest number and keep adding class length)
  4. Tally and count (frequency)
  5. Graph the histogram
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the formula for finding the number of classes for building a histogram?

A

Formula = 2^k > (or equal to) n

n = total # of observations
k = # of classes (tells us the number of bars we need to build in histogram)

(look over example 2.2 in notes for histogram example notes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

T or F: Anything to the power of 0 is 1.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

The number of classes (or bars in a histogram) is represented by the letter ____.

A

K

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the formula for calculating class length (length of our bars in a histogram)?

A

Class Length = (largest value in our data - smallest value in our data) / classes (or k)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

T or F: For class length, you always round down.

A

False; always round UP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

T or F: With a histogram, there is space between the bars.

A

False; NO SPACE between bars (bars on a histogram touch to represent continuous data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

T or F: The base (x-axis) of a histogram represents the class length.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

T or F: In a histogram, the height represents: a.) the frequency in a frequency histogram, or b.) the relative frequency in a relative frequency histogram.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

With histograms, if you are making a PERCENT frequency histogram, make sure to take the ______ of the class length for the base label.

A

average

(ex: if class length is 3, and smallest number in data is 10, you would do (10+13)/2 = 11.5… 11.5 would be your base label for the first bar on the histogram)

30
Q

T or F: The frequencies on a bar graph represent the counts from categories (qualitative data) while the frequencies on a histogram represent the counts of the quantitative data values grouped into the “classes”.

31
Q

the right tail of the histogram is longer than the left tail

A

right-skewed

32
Q

the left tail of the histogram is longer than the right tail

A

left-skewed

33
Q

the right and left tails of the histogram appear to be mirror images of each other

A

symmetrical

34
Q

cumulative distributions are also called ________ _______.

A

running total

35
Q

Another way to summarize a distribution is to construct a cumulative distribution. To do this, use the same number of _______, class lengths, and class boundaries used for the ________ _______. Rather than a count, we record the number of measurements that are ______ ______ the upper boundary of that class.

A

classes; frequency distribution; less than

(look over picture of cumulative distribution/frequency on camera roll)

36
Q

a graph of a cumulative distribution

A

ogive

(plot a point above each upper class boundary at a height of the cumulative frequency; connect the points with line segments; can also be drawn using cumulative relative frequencies and cumulative percent frequencies)

37
Q

Which type of distribution shape does this describe:
two humps, the left of which may or may not look like the right one, nor is each hump required to be symmetrical

A

double peaked

38
Q

Which type of quantitative graph is useful in detecting outliers and easy to read?

A

Dot plots

(the horizontal axis spans the range of observations or measurements, and dots represent the observations)

39
Q

a graphical portrayal of a data set that shows the data set’s distribution by using stems consisting of leading digits and leaves consisting of trailing digits

A

stem-and-leaf display (for quantitative data)

(look in camera roll for pic of this)

40
Q

T or F: Stem-and-leaf display shows the shape of distribution and shows the value of individual measurements.

41
Q

Stem-and-leaf displays are best used for what size data distributions?

A

Small to moderately sized data distributions

42
Q

What is the purpose of a stem-and-leaf display?

A

to see the overall pattern of the data, by grouping the data into classes

(the variation from class to class, the amount of data in each class, the distribution of the data within each class)

43
Q

What are 3 advantages of stem-and-leaf displays?

A
  1. Displays all the individual measurements and puts data into numerical order.
  2. Simple to construct.
  3. Tells us from where the data is starting and where it’s ending (the min. and max. points)
44
Q

T or F: For stem-and-leaf displays, there are no rules that dictate the number of stem values, and you can split the stems as needed

45
Q

a table consisting of rows and columns that is used to classify data on two dimensions

A

contingency table

(look at camera roll for example pic of a contingency table)

46
Q

Contingency tables…
- classify data on _____ dimensions.
- Rows classify according to one dimension and columns classify according to a second dimension.

A
  • two

(association between 2 variables– one will be giving us info about rows, one will be giving us info about columns)

47
Q

Contingency tables require what 3 variables?

A
  1. Row variable
  2. Column variable
  3. Variable counted in the cells
48
Q

What is the purpose of contingency tables?

A

To investigate possible relationships between variables

49
Q

_______ _____ are used to study the relationship between two quantitative variables.

A

Scatter plots

(place one variable on the x-axis, place a second variable on the y-axis, place dot on pair coordinates)

50
Q

Types of Relationships– Scatter Plots…
a straight line relationship between the two variables

51
Q

Types of Relationships–Scatter Plots…
when one variable goes up, the other variable goes up

A

positive (ex: as x goes up, y goes up… OR as x goes down, y goes down)

52
Q

Types of Relationships–Scatter Plots…
when one variable goes up, the other variable goes down

A

negative (ex: as x goes up, y goes down… OR as x goes down, y goes up)

53
Q

Types of Relationships–Scatter Plots…
there is no coordinated linear movement between the two variables

A

no linear relationship

(when slope = 0)

54
Q

when all dots are in one line and going up

A

perfect positive correlation

55
Q

when all dots are in one line and going down

A

perfect negative correlation

56
Q

when dots are going up but not in one line (weak relationship)

A

low degree of positive correlation

57
Q

when dots are going down but not in one line (weak relationship)

A

low degree of negative correlation

58
Q

when dots are close to line and going up (strong positive relationship)

A

high degree of positive correlation

59
Q

when dots are close to line and going down (strong negative relationship)

A

high degree of negative correlation

60
Q

The purpose of constructing a scatter plot is to look at relationships between two quantitative variables. We can see what 3 things?

A
  1. Direction (positive or negative)
  2. Form (strong or weak relationship)
  3. Linear or non-linear relationships
61
Q

T or F: A graph is misleading if the vertical axis does not start from 0.

62
Q

T or F: A graph is misleading if all bars are not of equal width.

A

True (all bars should we equal width)

63
Q

T or F: Pie charts are misleading when the percentages do not equal 100 when added up.

A

True (sum of all percentages must be = to 100%)

64
Q

T or F: Graphs are misleading when they are missing titles or axis titles.

A

True (all graphs should be labeled)

65
Q

T or F: Graphs can be misleading when data isn’t consistent or missing (doesn’t give any info about data or a clear story of data).

A

True (ex: on x-axis of a bar chart, for some of the bars it is labeled with the year, then it switches to the month and year)

66
Q

provides a graphical presentation of the current status and historical trends of key performance indicators; gives us info about all descriptives on one page (tells us what has happened)

67
Q

What are the 4 things included in a dashboard?

A
  1. Gauges
  2. Bullet Graph
  3. Treemaps
  4. Sparklines
68
Q

Included in dashboard…
charts that present data similar to a speedometer

69
Q

Included in dashboard…
features a single measure that extends into ranges representing qualitative measures of performance

A

bullet graph

70
Q

Included in dashboard…
display information in a series of clustered rectangles

71
Q

Included in dashboard…
line chart drawn without axes to embed in text (makes data easier to see)

A

sparklines

72
Q

T or F: With a bullet graph, the line tells you whether you’ve achieved your criteria or not… if the line doesn’t meet criteria line, then the criteria has not been met.