Describing Data Flashcards

1
Q

Qualitative description properties - 3

A
  • Shape
  • Location
  • Dispersion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Shape - 5 (DEF + LIST)

A
  • Make a smooth approximation of the histogram
  • Shape of smooth curve can give idea of data distribution
    • Symmetrical
    • Right/Left Skewed
    • Uniform
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Location - 1

A
  • The position of the peak on the x axis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Dispersion - 1

A
  • How much the data spreads on the x axis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Numerical summeries - 1

A
  • Describe the data distribution with numerical values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Measures of center - 4 (DEF + LIST)

A
  • Value which stands at the center or middle of the data set
    • Mean
    • Median
    • Mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Mean - 4

A
  • Mean is average, computed by sum of all elements divided by number of elements.
  • Mean is not robust, it is affected from outliers
  • Sample mean is x bar
  • Population mean is μ (Mu)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Median - 2

A
  • Middle value of the data set after sorting

- Is robust, not affected from outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Mode - 4

A
  • Value that occurs with highest frequency
  • Mostly used for nominal data, not as much for numerical
  • The number of modes defines how modal a data set is:
    • 1 = Unimodal
    • 2 = Bimodal
    • 3 + = Multimodal
  • When a graph has multiple peaks it follows the same logic
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Measures of variation - 3

A
  • Standard deviation
  • Variance
  • Range
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Variance - 3

A
  • Is the average quadratic deviation from the average
  • The sample variance is s^2
  • The population variance is σ^2 (Sigma squared)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Standard deviation - 4

A
  • Measures how much the values deviate from the sample mean.
  • It is the square root of the variance
  • The sample standard deviation is s
  • The population standard deviation is σ (Sigma)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Range - 2

A
  • Maximum - minimum

- It is very sensitive to extreme values because it uses only two values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Percentiles and Quartiles - 5 (DEF + LIST)

A
  • Percentile Pi indicates that i% of data is smaller than Pi and (100 - i)% is larger than Pi
  • Quartiles divide data set in four groups, which approximately have 25% of values
    • Q1 = P25
    • Q2 = P50 = Median
    • Q3 = P75
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

5 Number Summary - 6 (DEF + LIST)

A
  • Graphical representation is boxplot
    • Minimum
    • First Quartile
    • Median
    • Third Quartile
    • Maximum
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Interquartile range - 1

A
  • IQR is the difference between the third and first quartile
17
Q

Boxplot - 2

A
  • Is graphical representation of 5 num summary
  • Provides information on distribution (If median is not center of box and/or outliers than distribution is asymmetrical)
18
Q

Whiskers Boxplot - 2

A
  • Lines extending from the box

- Are defined as a value which exceeds x * IQR

19
Q

Outliers - 1

A
  • Points which are not included between the whiskers