Numerical summaries (mean, median, variance, sd, IQR). Bivariate data: correlation, two-way tables Flashcards

1
Q

Mean μ

A

The arithmetic average of all data points.

Add all the values together, then divide by the number of values.
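
As a minimal sketch of the rule above, the Python below computes a mean by hand and compares it with the standard library's `statistics.mean`. The values in `data` are made up for illustration.

```python
from statistics import mean

# Hypothetical sample values, chosen only for illustration.
data = [2, 4, 4, 5, 10]

total = sum(data)            # add all the values together
average = total / len(data)  # divide by the number of values

# The standard library gives the same result.
library_average = mean(data)
```

Here the total is 25 and there are 5 values, so the mean is 5.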

2
Q

Median

A

The middle value of the data once it is sorted in order. With an even number of values, take the mean of the two middle values.
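
One way to sketch this in Python, assuming the usual convention of averaging the two middle values when the count is even. The function name `median_of` and the sample lists are hypothetical.

```python
from statistics import median

def median_of(values):
    """Return the middle value of the sorted data (hypothetical helper)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle value exists.
        return ordered[mid]
    # Even count: average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_result = median_of([7, 1, 3])       # sorted: 1, 3, 7
even_result = median_of([7, 1, 3, 5])   # sorted: 1, 3, 5, 7
```

`statistics.median` from the standard library follows the same convention, so it can be used directly in practice.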

3
Q

Mode

A

The value that occurs most often in a data set. A data set can have more than one mode.
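
A short Python sketch of finding the most frequent value, using `collections.Counter` to count occurrences and `statistics.multimode` (Python 3.8+) to handle ties. The sample values are invented for illustration.

```python
from collections import Counter
from statistics import multimode

# Hypothetical sample: 2 and 3 each appear twice.
data = [1, 2, 2, 3, 3, 5]

counts = Counter(data)                  # value -> number of occurrences
highest_count = max(counts.values())    # the largest frequency

# multimode returns every value tied for the highest count,
# in the order each first appears in the data.
modes = multimode(data)
```

With this sample two values tie for most frequent, so both are reported as modes.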

4
Q

Standard deviation σ

A

A measure of how spread out the values are around the mean. It is the square root of the variance.
Low σ means the values are close to the mean.
High σ means the values are spread out from the mean.
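
To illustrate low versus high σ, the sketch below compares two made-up data sets that share the same mean but differ in spread. It assumes the population form of the standard deviation (`statistics.pstdev`), which matches the σ notation on this card.

```python
from statistics import mean, pstdev

# Two hypothetical data sets with the same mean (10).
tight = [9, 10, 11]   # values close to the mean
wide = [0, 10, 20]    # values far from the mean

same_mean = mean(tight) == mean(wide)

sigma_tight = pstdev(tight)  # small: values cluster around 10
sigma_wide = pstdev(wide)    # large: values are spread out
```

The means are identical, so σ is what distinguishes the two sets.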

5
Q

Variance σ^2

A

The average of the squared distances of each value from the mean. Its square root is the standard deviation σ.
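
A worked sketch in Python, assuming the population form (divide by the number of values), which matches the σ^2 notation on this card. The sample form divides by one less than the number of values instead. The data values are invented for illustration.

```python
from math import sqrt
from statistics import pvariance

# Hypothetical data set with mean 5.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mu = sum(data) / len(data)

# Squared distance of each value from the mean.
squared_deviations = [(x - mu) ** 2 for x in data]

# Population variance: the average of the squared distances.
variance = sum(squared_deviations) / len(data)

# Standard deviation is the square root of the variance.
sigma = sqrt(variance)
```

For this data the squared distances sum to 32, so the variance is 32 / 8 = 4 and σ is 2.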

6
Q

Range

A

Difference between highest and lowest values.

max - min = range

7
Q

Interquartile range (IQR)

A

The spread (width) of the middle 50% of the data.

Q3 - Q1 = IQR
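
A rough Python sketch of Q3 - Q1. Quartile conventions differ between textbooks and software; this one assumes the "median of each half" method, leaving out the middle value when the count is odd. The function name `iqr` and the data are hypothetical.

```python
from statistics import median

def iqr(values):
    """Interquartile range using the median-of-halves convention (an assumption)."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # For an odd count, skip the middle value when forming the upper half.
    upper = ordered[half:] if n % 2 == 0 else ordered[half + 1:]
    q1 = median(lower)  # median of the lower half
    q3 = median(upper)  # median of the upper half
    return q3 - q1

data = [1, 3, 5, 7, 9, 11, 13, 15]
result = iqr(data)  # Q1 = 4, Q3 = 12
```

Other tools may interpolate quartiles differently and give a slightly different IQR for the same data.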

8
Q

Correlation coefficient r

A
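r always lies between -1 and 1.
r = -1 : perfect negative linear relationship (as x increases, y decreases)
r = 0 : no linear association (a non-linear relationship may still exist)
r = 1 : perfect positive linear relationship (as x increases, y increases)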
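
The three cases can be checked with a small hand-written Pearson correlation in Python. The function name `pearson_r` and all data values are made up for illustration. The last example is a U-shaped pattern, included to show that r = 0 rules out only a linear relationship.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of paired data (hypothetical helper)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sums of squared deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

x = [1, 2, 3, 4]
r_positive = pearson_r(x, [2, 4, 6, 8])   # y rises in a straight line with x
r_negative = pearson_r(x, [8, 6, 4, 2])   # y falls in a straight line as x rises

# y = x squared: a clear pattern, but not a linear one.
u_x = [-2, -1, 0, 1, 2]
u_y = [4, 1, 0, 1, 4]
r_u_shape = pearson_r(u_x, u_y)
```

The first two results sit at the extremes of the scale, while the U-shaped data gives r = 0 despite y being fully determined by x.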
9
Q

How to best represent data for 1 qualitative and 1 quantitative variable?

A

Side-by-side boxplot

10
Q

How to best represent data for 2 quantitative variables?

A

Scatterplot

11
Q

Bivariate data

A

Data in which each observation is a pair of values: every x value has a corresponding y value.
