Assignment 1 Flashcards

1
Q

Exploratory Data Analysis

A

A method and philosophy of data analysis begun by John Tukey which is designed to uncover information in data without interference of outlying values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Resistance

A

An EDA property in which a calculation is not highly affected by outlying data values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Re-expression

A

An EDA principle in which the display of data is aided by the use of nonlinear transformations, such as a logarithm or square root.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Residuals

A

The difference between a measurement and the value of the measurement that is predicted by some mathematical model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Revelation

A

The primary goal of EDA in which one can see information carried by one’s data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Glyph

A

An image that communicates information without words.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Median

A

An average that is the middle number in an order set of data. The median has half the data below it and half above it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Upper & Lower Hinges

A

An EDA term for the median of the upper half of a batch of data (upper hinge) and median of the lower half of a batch of data (lower hinge).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Hinge Spread

A

An EDA term that is the difference between the upper and lower hinges. The hinge spread is often called the fourth spread.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Stem-and-Leaf Diagram

A

An EDA figure that displays a distribution of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Side-by-side stem-and-leaf diagram

A

Two stem-and-leaf diagrams placed next to each other that use a common set of stems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

One-line summary

A

A stem-and-leaf diagram in which the leaves of each stem are shown on one line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Two-line summary

A

A stem-and-leaf diagram in which the leaves of each stem are shown on two lines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Five-line summary

A

A stem-and-leaf diagram in which the leaves of each stem are shown on five lines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Box plot

A

An EDA schematic diagram comprised of a box and two lines that show the distribution of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Depth of a number

A

An EDA term to denote how far a number is in from the highest or lowest number in a batch of data. The greatest depth is the median.

17
Q

Outlier

A

An observation that is numerically distant from the rest of the data.