Week 4 Flashcards

1
Q

Frequency definition

A

How often a value appears in data
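
As a minimal sketch (the sample values are made up), frequencies can be tallied with Python's standard-library Counter:

```python
from collections import Counter

data = [2, 3, 3, 5, 3, 2, 7]  # made-up sample values

# Counter maps each distinct value to the number of times it appears
freq = Counter(data)

for value, count in sorted(freq.items()):
    print(value, count)
```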

2
Q

Histogram definition

A

A bar chart that visualizes how data are distributed by counting the values that fall into each bin (interval)
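
A rough text-only sketch of the idea, assuming made-up sample values and an arbitrary bin width of 10: each value is assigned to a bin, and the count per bin is drawn as a bar.

```python
from collections import Counter

data = [3, 7, 12, 15, 18, 21, 25, 29, 34]  # made-up sample values
bin_width = 10  # assumed bin width, chosen for illustration

# Assign each value to the lower edge of its bin, then count per bin
bins = Counter((x // bin_width) * bin_width for x in data)

# Draw one row of '#' characters per bin
for edge in sorted(bins):
    print(f"{edge:>2}-{edge + bin_width - 1:<2} {'#' * bins[edge]}")
```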

3
Q

Mode

A

The value that appears most often in the data (the value with the highest frequency, not the highest value)
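
A small sketch with made-up data, using the standard library's multimode, which returns every value tied for the highest frequency:

```python
from statistics import multimode

data = [2, 3, 3, 5, 3, 2, 7]  # made-up sample values

# The mode is the most frequent value, which need not be the largest value
modes = multimode(data)

# When several values tie for the highest frequency, all of them are returned
tie = multimode([1, 1, 2, 2, 3])

print(modes, tie)
```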

4
Q

Median

A

The middle value of the sorted data, dividing it into two groups with the same number of data points
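
A short sketch with made-up values. With an even count there is no single middle value, and the standard-library median averages the two middle ones:

```python
from statistics import median

odd_data = [9, 1, 5, 3, 7]   # made-up values, odd count
even_data = [1, 3, 5, 7]     # made-up values, even count

# median() sorts internally, so the input order does not matter
mid_odd = median(odd_data)    # the single middle value
mid_even = median(even_data)  # average of the two middle values

print(mid_odd, mid_even)
```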

5
Q

Mean

A

Sum of data values / number of data points
(1st moment)
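
The formula written out directly, as a sketch with made-up values:

```python
data = [2, 4, 6, 8]  # made-up sample values

# Mean = sum of the data values divided by the number of data points
mean = sum(data) / len(data)

print(mean)
```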

6
Q

Spread

A

How widely the data are dispersed. The simplest measure is the range: the difference between the highest and lowest values in the data set

7
Q

Quantiles

A

Quantiles are the cut points that divide sorted data into sections containing the same number of data points.
There can be any number N of sections

8
Q

Quartiles

A

When there are 4 sections in total, the cut points are called quartiles
The median is the 2nd quartile
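
A sketch using the standard library's quantiles function on made-up data. Note that several interpolation conventions exist for quartiles; this uses the function's default method, so other tools may give slightly different cut points.

```python
from statistics import median, quantiles

data = list(range(1, 12))  # made-up data: 1, 2, ..., 11

# n=4 asks for the 3 cut points that split the data into 4 sections
q1, q2, q3 = quantiles(data, n=4)

print(q1, q2, q3)
print(q2 == median(data))  # the 2nd quartile is the median
```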

9
Q

Percentiles

A

When there are 100 sections in total, the cut points are called percentiles.
The median is the 50th percentile
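
A sketch on made-up data, again relying on the default interpolation method of the standard library's quantiles function (other conventions can differ slightly):

```python
from statistics import median, quantiles

data = list(range(1, 12))  # made-up data: 1, 2, ..., 11

# n=100 yields 99 cut points; index k-1 holds the k-th percentile
percentiles = quantiles(data, n=100)
p50 = percentiles[49]

print(len(percentiles), p50)
```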

10
Q

Variance

A

Sum of (squared distance from the mean) over all data points / number of data points
2nd moment (taken about the mean)
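
The definition written out as a sketch with made-up values. It divides by the number of data points, which matches the standard library's pvariance (population variance); statistics.variance divides by n - 1 instead and would give a different result.

```python
from statistics import pvariance

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up sample values

mean = sum(data) / len(data)

# Average of the squared distances from the mean
variance = sum((x - mean) ** 2 for x in data) / len(data)

print(variance, pvariance(data))
```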

11
Q

Standard Deviation

A

Standard deviation (SD) is the square root of variance
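
A sketch with made-up values, assuming the population form of variance (dividing by the number of data points):

```python
import math
from statistics import pstdev, pvariance

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up sample values

# SD is the square root of the (population) variance
sd = math.sqrt(pvariance(data))

print(sd, pstdev(data))
```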

12
Q

Z Score

A

Z-score enables fair comparisons of deviations across data sets with different scales
Z-score = (Value - Mean) / SD
The larger the absolute value of the z-score, the further the value deviates from the mean
Outlier if z-score is greater than 3 or less than -3
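
The formula as a sketch. The helper name z_score and the sample values are made up for illustration, and the population SD is assumed:

```python
from statistics import pstdev

def z_score(value, data):
    """Hypothetical helper: (value - mean) / SD, using the population SD."""
    mean = sum(data) / len(data)
    return (value - mean) / pstdev(data)

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up values: mean 5, SD 2

z_high = z_score(9, data)  # two SDs above the mean
z_mean = z_score(5, data)  # exactly at the mean

print(z_high, z_mean)
```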

13
Q

Skewness

A

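Skewness measures the degree of asymmetry of a distribution
3rd moment
Sum of (distance from mean cubed) over all data points / number of data points, usually divided by SD cubed to standardize
Positive skewness: longer tail on the right, bulk of the data shifted left
Negative skewness: longer tail on the left, bulk of the data shifted right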
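
A sketch assuming the common standardized form (third moment about the mean divided by SD cubed). The function name and the sample values are made up for illustration:

```python
def skewness(data):
    """Hypothetical helper: third central moment divided by SD cubed."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n  # population variance
    m3 = sum((x - mean) ** 3 for x in data) / n  # third central moment
    return m3 / m2 ** 1.5

right_tail = [1, 2, 2, 3, 10]            # long tail on the right
left_tail = [-x for x in right_tail]     # mirror image, long tail on the left
symmetric = [1, 2, 3, 4, 5]

print(skewness(right_tail), skewness(left_tail), skewness(symmetric))
```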

14
Q

Kurtosis

A

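Kurtosis measures the sharpness of the peak and the heaviness of the tails
4th moment
Sum of (distance from mean to the 4th power) over all data points / number of data points, usually divided by SD to the 4th power to standardize
Kurtosis is always positive by definition, but normally we subtract 3, the kurtosis of a normal distribution. The result is known as excess kurtosis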
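
A sketch assuming the common standardized form (fourth moment about the mean divided by SD to the 4th power). The function name and the sample values are made up for illustration:

```python
def kurtosis(data):
    """Hypothetical helper: fourth central moment divided by variance squared."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n  # population variance
    m4 = sum((x - mean) ** 4 for x in data) / n  # fourth central moment
    return m4 / m2 ** 2

data = [1, 2, 3, 4, 5]  # made-up, flat-looking sample

k = kurtosis(data)
excess = k - 3  # negative here: flatter than a normal distribution

print(k, excess)
```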

15
Q

Outliers

A

Extreme values relative to the bulk of values in a data set
Outlier if z-score is greater than 3 or less than -3
Outlier if the value is more than 1.5 x IQR above the 3rd quartile, or more than 1.5 x IQR below the 1st quartile (IQR = 3rd quartile - 1st quartile)
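
A sketch of the IQR rule on made-up data. Quartiles come from the standard library's quantiles function with its default method; other quartile conventions may move the fences slightly.

```python
from statistics import quantiles

data = [10, 12, 12, 13, 12, 11, 14, 13, 15, 102]  # made-up values, one extreme

q1, _, q3 = quantiles(data, n=4)
iqr = q3 - q1  # interquartile range

# Fences sit 1.5 x IQR below the 1st quartile and above the 3rd quartile
low_fence = q1 - 1.5 * iqr
high_fence = q3 + 1.5 * iqr

outliers = [x for x in data if x < low_fence or x > high_fence]

print(low_fence, high_fence, outliers)
```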

16
Q

Box plots

A

A box plot is a plot summarizing the quartile-based statistics of a data set
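
A sketch of the numbers a box plot is typically built from, assuming the usual five-number summary (minimum, 1st quartile, median, 3rd quartile, maximum). The data are made up, and no plotting library is used:

```python
from statistics import quantiles

data = [7, 1, 9, 3, 11, 5, 2, 8, 4, 10, 6]  # made-up sample values

# quantiles() sorts internally and returns the three quartile cut points
q1, q2, q3 = quantiles(data, n=4)

summary = {
    "min": min(data),
    "q1": q1,       # lower edge of the box
    "median": q2,   # line inside the box
    "q3": q3,       # upper edge of the box
    "max": max(data),
}

print(summary)
```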