EDA: Stem-and-Leaf Diagrams and Boxplots Flashcards

0
Q

An EDA figure that displays the distribution of a batch of data by listing the trailing digit of each value beside its leading digits

A

Stem-and-Leaf Diagram

1
Q

A method and philosophy of data analysis, begun by John Tukey, that is designed to uncover information in data without interference from outlying values.

A

Exploratory Data Analysis

2
Q

An EDA property in which a calculation is not highly affected by outlying data values

A

Resistance
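
As a rough illustration of resistance, the sketch below adds one outlying value to a small batch and compares how far the mean and the median move. The batch values are hypothetical and chosen only for illustration.

```python
from statistics import mean, median

batch = [12, 14, 15, 16, 18]      # hypothetical batch of data
with_outlier = batch + [95]       # the same batch plus one outlying value

# How far each summary moves when the outlier is added.
mean_shift = mean(with_outlier) - mean(batch)
median_shift = median(with_outlier) - median(batch)

print(f"mean moves by {mean_shift:.2f}, median moves by {median_shift:.2f}")
```

The median barely moves while the mean is pulled toward the outlier, which is what it means for the median to be the more resistant of the two.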

3
Q

A stem-and-leaf diagram in which the leaves of each stem are shown on one line.

A

One-Line Summary
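
A minimal sketch of a one-line display, assuming non-negative integers whose units digit serves as the leaf and whose remaining digits form the stem. The function name `stem_and_leaf` and the sample batch are hypothetical.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Return one display line per stem, leaves in ascending order."""
    leaves = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)    # leading digits -> stem, units digit -> leaf
        leaves[stem].append(str(leaf))
    lines = []
    # Walk every stem from smallest to largest so empty stems stay visible.
    for stem in range(min(leaves), max(leaves) + 1):
        lines.append(f"{stem:>2} | {''.join(leaves.get(stem, []))}")
    return lines

for line in stem_and_leaf([23, 25, 31, 38, 38, 42, 47, 59]):
    print(line)
```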

4
Q

A stem-and-leaf diagram in which the leaves of each stem are shown on two lines. The symbols * and . mark the lines holding leaves 0-4 and 5-9, respectively.

A

Two-Line Summary
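
The two-line layout can be sketched as follows, again assuming non-negative integers with the units digit as the leaf. The function name `two_line_stem_and_leaf` and the sample batch are hypothetical.

```python
def two_line_stem_and_leaf(values):
    """Split each stem into a '*' line (leaves 0-4) and a '.' line (leaves 5-9)."""
    rows = {}
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        symbol = "*" if leaf <= 4 else "."
        rows.setdefault((stem, symbol), []).append(str(leaf))
    stems = [stem for stem, _ in rows]
    lines = []
    for stem in range(min(stems), max(stems) + 1):
        for symbol in "*.":
            lines.append(f"{stem:>2}{symbol} | {''.join(rows.get((stem, symbol), []))}")
    return lines

for line in two_line_stem_and_leaf([21, 23, 27, 29, 30, 34, 35]):
    print(line)
```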

5
Q

A stem-and-leaf diagram in which the leaves of each stem are shown on five lines. The symbols *, t, f, s, and . mark the lines holding leaves 0 and 1, 2 and 3, 4 and 5, 6 and 7, and 8 and 9, respectively.

A

Five-Line Summary
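
A sketch of the five-line layout under the same assumptions (non-negative integers, units digit as leaf). Integer division of the leaf by 2 selects the symbol. The function name `five_line_stem_and_leaf` and the sample batch are hypothetical.

```python
def five_line_stem_and_leaf(values):
    """Split each stem into five lines labelled *, t, f, s and . (two leaf digits each)."""
    rows = {}
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        # leaf // 2 maps 0-1 -> '*', 2-3 -> 't', 4-5 -> 'f', 6-7 -> 's', 8-9 -> '.'
        rows.setdefault((stem, "*tfs."[leaf // 2]), []).append(str(leaf))
    stems = [stem for stem, _ in rows]
    lines = []
    for stem in range(min(stems), max(stems) + 1):
        for symbol in "*tfs.":
            lines.append(f"{stem:>2}{symbol} | {''.join(rows.get((stem, symbol), []))}")
    return lines

for line in five_line_stem_and_leaf([20, 21, 22, 24, 25, 26, 28, 29]):
    print(line)
```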

6
Q

An EDA schematic diagram composed of a box and two lines (whiskers) that shows the distribution of data.

A

Box plot

7
Q

An EDA term denoting how far a number lies, counting inward, from the nearer end (highest or lowest number) of a batch of data. The median has the greatest depth.

A

Depth of a number
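
One way to compute depths, assuming the count starts at 1 at each end of the sorted batch and each value takes the smaller of its two counts. The function name `depths` and the sample batch are hypothetical.

```python
def depths(values):
    """Pair each sorted value with its depth, counted in from the nearer end."""
    data = sorted(values)
    n = len(data)
    # rank counts up from the low end; n + 1 - rank counts down from the high end.
    return [(x, min(rank, n + 1 - rank)) for rank, x in enumerate(data, start=1)]

print(depths([3, 7, 8, 12, 15]))
# [(3, 1), (7, 2), (8, 3), (12, 2), (15, 1)]
```

In this batch of five values the median, 8, sits at depth 3, the largest depth in the batch.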

8
Q

An observation that is numerically distant from the rest of the data

A

Outlier
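
The card does not say how distant a value must be to count as an outlier. A common convention, assumed here rather than taken from the cards, flags values lying more than 1.5 hinge spreads beyond either hinge. The function name `fences`, the multiplier `k`, and the sample batch are hypothetical; the hinges are computed with the middle value shared by both halves when the batch size is odd, which is also an assumption.

```python
from statistics import median

def fences(values, k=1.5):
    """Return (low, high) cutoffs placed k hinge spreads beyond the hinges."""
    data = sorted(values)
    half = (len(data) + 1) // 2             # both halves share the middle value when n is odd
    lower_hinge = median(data[:half])
    upper_hinge = median(data[-half:])
    step = k * (upper_hinge - lower_hinge)  # k times the hinge spread
    return lower_hinge - step, upper_hinge + step

batch = [2, 4, 5, 7, 8, 10, 11, 13, 40]
low, high = fences(batch)
outliers = [x for x in batch if x < low or x > high]
print(outliers)   # [40]
```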

9
Q

Two stem-and-leaf diagrams placed next to each other that use a common set of stems.

A

Side-by-side Stem-and-Leaf Diagram
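
A rough sketch of a side-by-side display for two batches of non-negative integers. Printing the left batch's leaves so that they grow outward from the shared stem column is a layout assumption, not something the card specifies. The function name `side_by_side` and the sample batches are hypothetical.

```python
def side_by_side(left, right):
    """Return display lines for two batches that share one column of stems."""
    def group(values):
        out = {}
        for v in sorted(values):
            stem, leaf = divmod(v, 10)
            out.setdefault(stem, []).append(str(leaf))
        return out

    l, r = group(left), group(right)
    lo = min(min(l), min(r))
    hi = max(max(l), max(r))
    width = max(len(leaves) for leaves in l.values())   # pad the left column to its longest row
    lines = []
    for stem in range(lo, hi + 1):
        left_leaves = "".join(reversed(l.get(stem, [])))  # smallest leaf sits nearest the stem
        right_leaves = "".join(r.get(stem, []))
        lines.append(f"{left_leaves:>{width}} | {stem} | {right_leaves}")
    return lines

for line in side_by_side([12, 15, 21, 24, 28], [18, 23, 31, 35]):
    print(line)
```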

10
Q

An EDA principle in which the display of data is aided by the use of nonlinear transformations, such as a logarithm or square root.

A

Re-expression
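
To illustrate re-expression, the sketch below takes a base-2 logarithm of a hypothetical right-skewed batch and compares the gap between mean and median before and after. The batch values are made up for the example.

```python
import math
from statistics import mean, median

batch = [1, 2, 4, 8, 16, 32, 64, 128, 1024]   # hypothetical right-skewed batch
logged = [math.log2(x) for x in batch]        # re-expressed on a log scale

raw_gap = mean(batch) - median(batch)         # large: the long upper tail drags the mean up
log_gap = mean(logged) - median(logged)       # small: the re-expressed batch is far more symmetric

print(f"raw gap: {raw_gap:.2f}, log gap: {log_gap:.2f}")
```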

11
Q

The difference between a measurement and the value of the measurement that is predicted by some mathematical model

A

Residual
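
A small sketch of the calculation, assuming a hypothetical straight-line model y = 2x + 1 and made-up measurements. Each residual is the measured value minus the model's prediction.

```python
def predict(x):
    """Hypothetical model used only for illustration: y = 2x + 1."""
    return 2.0 * x + 1.0

observations = [(1, 3.2), (2, 4.9), (3, 7.4)]   # (x, measured y) pairs, made up

# residual = measured value - value predicted by the model
residuals = [y - predict(x) for x, y in observations]

print([round(r, 2) for r in residuals])
```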

12
Q

The primary goal of EDA: seeing the information carried by one’s data

A

Revelation

13
Q

An image that communicates information without words

A

Glyph

14
Q

An average that is the middle number in an ordered set of data. The median has half the data below it and half above it.

A

Median
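
A minimal sketch of the calculation. For a batch with an even number of values there is no single middle number, so the usual convention, assumed here, is to average the two middle values. The function name `batch_median` is hypothetical.

```python
def batch_median(values):
    """Return the middle value of the sorted batch (mean of the two middle values if n is even)."""
    data = sorted(values)
    n = len(data)
    mid = n // 2
    if n % 2:                              # odd count: a single middle value exists
        return data[mid]
    return (data[mid - 1] + data[mid]) / 2 # even count: average the two middle values

print(batch_median([7, 1, 5]))      # 5
print(batch_median([7, 1, 5, 3]))   # 4.0
```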

15
Q

Upper hinge: an EDA term for the median of the upper half of a batch of data. Lower hinge: an EDA term for the median of the lower half of a batch of data.

A

Upper and lower hinges

16
Q

An EDA term that is the difference between the upper and lower hinges. The hinge spread is often called the fourth spread.

A

Hinge Spread
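
The hinges and the hinge spread can be sketched as below. The cards do not say whether the middle value belongs to either half when the batch size is odd; this sketch assumes it is shared by both halves. The function names `hinges` and `hinge_spread` and the sample batches are hypothetical.

```python
from statistics import median

def hinges(values):
    """Return (lower hinge, upper hinge): the medians of the lower and upper halves."""
    data = sorted(values)
    half = (len(data) + 1) // 2      # both halves share the middle value when n is odd
    return median(data[:half]), median(data[-half:])

def hinge_spread(values):
    """Return the difference between the upper and lower hinges."""
    lower, upper = hinges(values)
    return upper - lower

batch = [2, 4, 5, 7, 8, 10, 11, 13, 20]
stretched = batch[:-1] + [200]       # same batch with its largest value pushed far out

print(hinges(batch))                 # (5, 11)
print(hinge_spread(batch), hinge_spread(stretched))   # 6 6
```

Pushing the largest value from 20 to 200 leaves the hinge spread at 6, while the range grows from 18 to 198, so the hinge spread is a resistant measure of spread.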