module 1 intro Flashcards

1
Q

levels of measurement

A

categorical and scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

sometimes referred to as discreet data, where numbers are used to represent categories of information, qualitative in nature. the types are nominal and ordinal data

A

categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

a numerical measurement of something in which the difference between values has meaning. quantitative in nature. referred to as continuous data. two types are interval and ratio

A

scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

data that is grouped in unordered categories. only label holds value. number assigned is meaningless. can be binary or non binary

eg binary yes or no non binary blue eye green eyes brown eyes hazel eyes

A

nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

data that is grouped in ordered categories, the number assigned to the category means something. calculations cannot be performed on. these numbers. referred to as ranked data

eg disagree somewhat disagree neutral somewhat agree

A

ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

a numerical measurement on a scale where each point is placed at an equal distance from one another. there is no true zero

eg temperature, 0 degrees doesn’t mean absence of heat

A

interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

measurement of something where the numbers are not restricted to certain values and there is a true zero

eg amount of money in pocket 0 = no money

A

ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

true or false nominal data has no inherent order or ranking

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

true or false interval data has equal spacing between values but no true zero

A

false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

true or false ordinal data can tell you how much more one value is than another

A

false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

true or false ratio data can be used to calculate meaningful ratios example twice as much

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

identify the level of measurement

type of pet

A

nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

star rating at restaurant 1-5

A

ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

temperature in celsius

A

interval, equal intervals but no true zero, 0 doesn’t mean absence of heat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

age in years

A

ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

time of day in a clock

A

interval, no true zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

exam scores as percentages

A

ratio, percent scores have a true zero 0% =0 no points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

happiness level rated as unhappy neutral happy

A

ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

relative frequency =

A

absolute frequency/ sum of all frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

measures of central tendency

A

mean median and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

the sum of all values divided by the number of values in the data set. denotes as x bar. also known as average

22
Q

most sensitive to outliers

23
Q

best measure of central tendency for nominal data

24
Q

best measure of central tendency for ordinal data

A

median or mean depending on category

25
Q

best measure of central tendency for scale data

A

mean median or mode

26
Q

the variability of a set of data is also referred to as ___

27
Q

two common ways of describing the spread of data

A

range and interquartile range (IQR)

28
Q

range

A

a way of measuring the spread of data by describing the difference between the minimum and the maximum value in a data set.

range = maximum value - minimum value

29
Q

interquartile range

A

a way of measuring the spread of data by dividing the data set into quartile as. by finding the quartiles l you can identify the range of values that way within 50% of the median of a data set.

30
Q

how to calculate interquartile range

A

order data
calculate median
calculate median of lower half
upper half
findings are IQR= Q3-Q1

31
Q

the larger the IQR value the ____

A

more spread out the data is

32
Q

how to calculate variance

A

identify the distance between each data point and the mean and then averaging those distances (subtract the mean from each value)

33
Q

degrees of freedom

A

the number of values free to vary in a data set, total observations minus 1

n - 1

34
Q

standard deviation

A

a way to calculate variance around the mean. calculated by taking a positive square root of variance or s2. when you take the positive square root of something that is swuared it cancels out the square

35
Q

how do you calculate standard deviation

A

it’s just square rooted variance squared

36
Q

what does a high standard deviation indicate?

A

the data points are more spread out from the mean, a high sd means more variability data is more spread out

37
Q

true or false if all the data points in the set are the same the sd cannot be calculated

A

false, there is is no spread if the data is the same, no spread so sd is 0

38
Q

a small standard deviation means the data values are clustered closely around the mean true or false

39
Q

absolute number

A

the raw numbers collected during the days acquisition process

40
Q

relative number

A

the absolute numbers shown as a proportion or percentage

41
Q

true or false variance is measured in the same unit as the original data

A

false, variance is in squared units not the same as data

42
Q

true or false the standard deviation can be negative

A

false, it is always zero or positive never negative

43
Q

true ir false SD is a measure of central tendency

A

false, it is a measure of spread

44
Q

why might a public researcher want to look at standard deviation in a survey response?

A

to understand how much variation or spread there is in the response. even if the mean is the same between two groups, the sd tells them how consistent or inconsistent people’s answers are.

45
Q

if two datasets have the same mean but different standard deviations what does that tell you

A

it tells you that one dataset is more spread out than the other, even though they centre around the same average. the data set with the larger Sd has more variability meaning individual data points tend to deviate more from the mean. the dataset with the smaller sd is more consistent and tightly clustered around the mean true

46
Q

the

A

it tells you that one dataset is more spread out than the other, even though they centre around the same average. the data set with the larger Sd has more variability meaning individual data points tend to deviate more from the mean. the dataset with the smaller sd is more consistent and tightly clustered around the mean

47
Q

a small sd means and large sd means what

A

small sd means people gsve very similar answers and large means people have a wide range of answers

48
Q

why is SD important in real life

A

low sd means most people consistently liked it
high sd means some people loved it and some people hated it. can influence deductions such as as why there is so mu fb variability

50
Q

to avoid having values canceled out when using variance on symmetrical data, use

A

variance squared. (s2) makes all values positive so they don’t cancel each other out