Measures of Central Tendency Flashcards

1
Q

What formula do you use to calculate the mean?

A

x̄ = ∑X
______
N

N= number of scores
x̄= mean
∑= summation
∑X= add up all of the scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How do you find outliers?

A

Need to calculate the interquartile range by finding the smallest and largest 25% of scores. In the middle 50% take away the lowest score from the highest score. Multiple this by 1.5. Whatever this new score is, take that away from the lowest and highest score in the 50%. If any scores fall outside of these new scores, they’re outliers.

To find extreme outliers, multiply by 3 rather than 1.5.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are variables?

A

Things we measure, they vary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What types can data be classified into?

A

Nominal: more qualitative, things we can’t express as meaningful numbers
Numerical: things can be expressed in meaningful numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is nominal data?

A

Categorical data
Things we can name e.g. gender, countries
Can’t be ranked

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is interval data?

A

Equal intervals between each number on our scale
No true 0
e.g. temperature

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is ratio data?

A

Equal intervals between each number on our scale
0 indicates the absence of the thing we are measuring
The 0 is true
Other measurements might have a label of 0 but that doesn’t mean the absence e.g. temperature whereas a ruler with 0cm means an absence of zero
Can measure the ratio of things

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is ordinal data?

A

Numerically measured and ranked
Each difference in position doesn’t mean they have the same meaning
e.g. language ability, rank people as beginner, intermediate and fluent
e.g. likert questions most to least satisfied

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are frequency tables, what are the symbols?

A

A count of how many times a certain response occurs in a data set
X: variable name
F: frequency of each value
∑= summation
∑F= sum of all frequencies
N: sample size

We don’t have to just use numerical data for frequency tables, we can use nominal data too.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do we do for larger data sets?

A

For larger data sets we group our data in the frequency tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why do we use histograms?

A

Distribution of our data
Group our data, usually 10 or fewer so data is easy to understand
Helps us see if we have issues with our data, one issue could be that data is skewed (positive or negative.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are positive/negative skews and bell curves?

A

positive skew= skew is on the right end
negative skew= skew is on the left end
Normal distributions usually have a bell curve, symmetrical, this is ideal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the mean?

A

The average
Add all values, divide by how many values there are

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the mode?

A

Most frequent
Number that appears most which can be assessed in frequency tables
2 modes= bimodal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the median?

A

halfway between lowest and highest value (middle value)
Order data lowest to highest and find the data in the middle
If there are 2 mediums, we add them together and divide by 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the advantages/disadvantages of the mean?

A

+ find the value to represent the whole data set
- affected by outliers

17
Q

What are the advantages/disadvantages of the mode?

A

+ can be found for nominal data as well
- can be more than one mode, or none at all

18
Q

What are the advantages/disadvantages of the median?

A

+ not affected by outliers
- when we have 2 middle items, we need to add them up and divide by 2, the number we get after may not actually be a part of our data set

19
Q

When to use mode, median and mean?

A

Mean: most preferred
Median: used when there are outliers or distribution is heavily skewed. Also better for ordinal data
Mode: best for nominal data or when we can’t use the median or mean