Lec. 27 - Measurement Theory and Statistics Flashcards

1
Q

What does this lecture deal with?

A

1- mini-history of measurement theory
2- scale levels and transformations
3- implications for statistical practice
4- debate between measurement and statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q
  1. Mini-history of measurement - general first steps
A
  • 500 years ago there were no sciences
  • only few people were studying science
  • the basis became “empirical research” (check ideas against observations)
  • Galileo Galilei→ you can represent relations in nature through mathematical equations, in a systematic way
    > people started measuring nature (e.g. trying to distinguish white and black pieces and how far they had to go to be able to distinguish it)
    = arose idea that subjective qualities (e.g. sight) can be measured with units
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Francis Galton
- what did he discover?

A
  • measure intelligence, regression coefficient, fingerprint, weather map, eugenics (to make better race)
  • “anthropometric laboratory” → he stated that he could measure human intelligence

> the important part is that he tried to measure human intelligence (not in the same way as we do now of course); the other discoveries are not as important to know

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Charles Spearman
- what did he discover?

A
  • positive manifold of intelligence
    > on average, if you’re good at one intelligence task, you are good at all of them (e.g. verbal intelligence, spatial skills, …)
    > all correlations are positive between the subparts of intelligence
    ! it goes for many things in life, not just intelligence
  • coined the concept of “general intelligence”, which can be measured in degrees, and explains positive manifold
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are the consequences of Spearman’s discoveries?

A
  • psychologists start talking about “measurements”
    > e.g. total score of IQ-test is interpreted as general intelligence
  • some psychollogists hold that there is no essential difference between physics and psychology
  • psychophysics researchers measurements to assess observable differences to infer subjective qualities (e.g. sight)
    → physicists disagree with psychologists using measurements without meeting necessary scientific requirements
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Campbell
- what was he arguing against?

A
  • one of the more vocal physicists against measurements used in psychology
    > “there is a very specific way in which real measurements are formed and it is not met in psychology”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Campbell
- what are measurements a result of ?
- what is this known as?

A
  • measurement is the result of a process of assigning numbers such that:
    > each object is represented by a single number
    > the sum of two assigned numbers represents the results of an empirical combination of objects (e.g. putting two stones on a weighting scale)
    → concatenation operation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Fundamental Measurement - what is it?

A
  • measurement with no prior quantification
    > balance scale: can be used to decide whether one object is heavier than the other (qualitative scale; no numbers)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How can we use a qualitative system to construct a numerical representation (that has the properties that we normally associate with measurement)?

A
  • e.g. with balance scale
    > we decide which object is heavier without assessing any number to them
    → qualitative assessment
    > we can also concatenate massess (put them in same arm of balance)
    → here we empirically add things, without the use of mathematics
    > by putting two units on one side and putting a mass that is equal in weight on the other side, we are rcreating a standard sequence
    → now we can assign numbers to sequence, and predict what would happen when we manipulate reality based on the mathematical manipulation of the numbers
    (see picture 1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Standard sequence

A
  • sequence of weights where difference between each weight is one unit
  • units are arbitrary (we could choose any unit)
    (see picture 2)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Representational measurement theory

A
  • numbers are assigned such that relations between numbers mirror relations between objects
  • numerical structures are mirroring empirical structures
    (see picture 3)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
  1. Scale levels and transformations
    - what happens when we transform the numbers and not account for it?
A
  • e.g. we square the numbers (1, 2, 3, 4 → 1, 4, 9, 16)
    > we can’t do that because the difference between 1&2 and 2&3 … is always 1, while the difference between 1&4 and 4&9 … is not the same
    → prediction is that it will not balance out = we lose the direct representation of the empirical relation (break the mirror between empirical and numeric)
    (see picture 4&5)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What was Campbell’s claim on measurement?
- How did the British Association respond?

A
  • fundamental measurement always requires concatenation (must have additive structure)
    > this means that measurement is impossible in psychology (no concatenation operations)
  • the British Association for the Advancement of Science failed to reach an agreement on whether psychological measurement is possible
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can concatenation be explained in everyday life?

A
  • you can concatenate length and mass
    > 1km+5km=6km
  • not true for temperature
    > 20°+40°=30° (average)
    → “temperature is not measurable”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

look at picture 6
- is this claim right or wrong?

A
  • wrong: measurement is not overlooked by physicists
    > they can study and manipulate measurements precisely, as they are assessing events and objects
    > in psychology instead, measurement is more complicated because we assess humans
  • right: in psychology, we don’t actually know that our measurements are precise
    > e.g. we don’t know for sure how to measure intelligence, or depression, ….
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

S. S. Stevens - what did he invent?

A
  • one of the most influential psychologists of all times
  • he invented the scale levels
    > nominal, interval, ordinal, …
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How did Stevens define measurement?

A
  • measurement is the assignement of numerals according to rule
    > any systematic rule would work
    > the rule determines what kind of measurement we have
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How should a rule be structured?

A
  1. determine equality
  2. determine order
  3. determine scale levels
    = one is always measuring, the question is at which level
    - some things are inflexible in measurement, like mass; other things’ measurements can be manipulated more easily
    > therefore, important is what “breaks the mirror”
    > e.g. you can’t just square the numbers because mathematical relationship does no longer reflect empirical relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The representational solution

A
  • measurement involves an act of representation
  • measurement attepts to capture the structure of an attribute in symbolic form
  • numeric relations are isomorphic to empirical relations
  • scale levels are defined by the set of transformations that leave the isomorphism intact
    (this is what was in the slide; he didn’t explain further)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Steves’ measurement levels

A
  • nominal
  • ordinal
  • interval
  • ratio
    > we can choose scale that we want, as long as we stick to it
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Nominal scale

A
  • numbers only represent equivalence
    > the objects that get the same number have the same property
    > e.g. postal codes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Ordinal scale

A
  • number represent order
    > objects that are assigned the higher numbers have more property represented
    > e.g. rankings
    > e.g. hardness scale of metals (metals in order for which can scratch other metals and not be scratched, …)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Interval scale

A
  • numbers represent order, but distances between assigned numbers also have a definite meaning that is the same at all levels of the scale
    > the zero-point is arbitrary (you assign zero to anything)
    > e.g. temperature in degrees celsius (zero does not mean absence of temperature), IQ
24
Q

Ratio scale

A
  • numbers represent order, distances between assigned numbers have meaning, and the ratio of numbers have meaning as well
    > zero-point is not arbitrary, but indicates absence of property
    > e.g. mass in kg, time, length
25
Q

So, is measurement possible after all?
What did Mitchell debate?

A
  • still debatable
    > Mitchell wrote many books saying that psychology is overstepping the boundaries of measurement
    > he tried to measure many things that are not measurable, and made sense out of them (proving that psychology might be doing the same)
26
Q

what are admissible transformations in the nominal scale?

A
  • all one-to-one transformations are admissible
    > e.g. divide females and males as 1&2, or red&blue, or 35&40, …
27
Q

what are admissible transformations in the ordinal scale?

A
  • all monotonic transformations are admissible
    > = all transformations that leave the order of numbers intact
    > e.g. multiply all numbers by 2
28
Q

what are admissible transformations in the interval scale?

A
  • all transformations that preserve order and distances between numbers are admissible
    > all linear transformations X’-a+bX, where b is a positive constant
  • the stronger the scale level, the less you can do to the assigned numbers without breaking the mirror
    (see picture 7)
29
Q

what are admissible transformations in the ratio scale?

A
  • all transformatinos that preserve the ratio between numbers are admissible
    > = tranformations of the form X’=bX, where b is a positive constant
    > the fact that there is no translation as in interval level above implies an absolute zero
30
Q
  1. Statistical implications
    what are the statistical implications of changing the scale?
A
  • parametric tests such as t-test and ANOVA are sensitive to transformations that change distances between scale points
  • as a result, nonlinear transformations will change one’s conclusions
31
Q

Why can’t we do a t-test on nominal numbers?

A
  • e.g. we have an IQ scale, and we want to square the results
  • it seems at first that the order of the results would remain the same, because they are just squared
  • if we square the results though the distribution changes, and a t-test on the changed distribution would give different results
  • therefore, if we do a non-linear transformation, we would get different results
    (see picture 8)
32
Q

Important takeaway !

A
  • if we have nominal or ordinal scale, it implies that if we transform the data according to admissible transformations (which is ok given said scale level), we will get different results
    → results are not robust under scale transformations

= we are allowed to transform every scale (look at rules in flashcards 27-29), but when we transform nominal and ordinal scale, the distance between points change. Because of this, the distributions are also different, and when you do a statistical analysis the results will change. With interval and ratio scales however, the rules follow linear models, so we can statistically analyse the results

33
Q

Logical process of Q&As about statistical transformation

A
  • Q: when does the t-test have a definite interpretation?
    A: only if you don’t transform the data such that the distances between scale points change
  • Q: so why wouldn’t I be allowed to do that?
    A: because the data are at the interval level of measurement, so only linear transformations are admissible

= we can’t square numbers when it’s an interval square, or the distance between points will change and so will the distribution
= ! no t-test if the data aren’t at the interval level of measurement !

34
Q

Methods police vs Statisticians

A
  • Methods police: no t-test on non-quantitative data! → you can use nonparametric statistics instead
  • Statisticians: “there is no assumption in t-test that says that there should be a certain level of measurement, people can do t-test whenever”
35
Q

The statistical treatment of football numbers
- paper by Frederic Lord
(it’s a long flashcard, but it’s just a story, so don’t worry!)

A

Professor X works in a university, and he is a hardcore measurement theorist. He was in charge of giving out numbers to football players in the uni team, and he created a vending machine to get assigned a number. Younger students complained that they were getting lower numbers than the older students, and the professor didn’t know how to analyse data (those numbers identified players, and were nominal data). The professor went to a statistician, and statistician calculated mean and standard deviation of numbers as if they were not nominal. When attacked, the statistician replied “the numbers don’t know that! They behave the same way regardless”, and carried out with his statistical analysis.
- that line is super famous, used in conferences etc.
(see picture 9, and look at lecture for theatrical representation of dialogue)

36
Q

What is the conclusion of this story?

A
  • Lord’s professor draws a sensible conclusion on the bases of illegal statistics
  • thus, prohibitions like “don’t run ANOVAs on nominal data” are not always justified
    = no clear connection between the measurement level and statistical level
37
Q

Moral of the story

A
  • the reasoning behind Steven’s rules is strong, and makes a lot of sense
  • also is a good argument for non-parametric statistics, because these are not sensitive to nonlinear transformations
  • however, as Lord shows, clever people who know what they are soing can still obtain sensible results
    (from slides)
38
Q
  1. Dialogue between measurement theory and statistics
    - what are the main take-aways?
A
  • dialogue is ongoing (very slow discussion, new argument every many years)
    1. reasoning behind the rules is not trivial, it’s quite good
    > e.g. good reason why you should be careful about doing t-test on ordinal data
    2. you should understand that there are many cases where we can work with data even though it might not be “legal”
    = so, how do we deal with this situation?
39
Q

The pragmatic solution

A
  • concept of robustness
    1. transform your data according to transformations you think should not matter
    2. if we don’t know what the right analysis is, we can run them all
    > not p-hacking (we don’t just pick the one with the results that we want)
40
Q

What happens when we run many analysis and the results are the same/different?

A
  • same results: robustness proved
  • different results: we must figure out why and reconsider scale levels
41
Q

Discussion

A
  • methodological rules are not written in stone, but are based on assumptions and arguments
  • rules do exist, but were proposed and argued for by people (on basis of ideas, arguments and analysis)
    > therefore rules contain errors and assumptions can be questioned
  • no methodological rules are safe from debate
42
Q

Chapter 2 - Scaling
- The property of identity

A
  • objects or events can be sorted into categories that are based on similarity features
  • it informs about whether two individuals are similar or different (no more info)
43
Q

what are the rules to follow when sorting people into categories?

A

1- the people within a category must satisfy the property of identity
2- the categories must be mutually exclusive
> items can fit only in one of the two categories
3- the categories must be exhaustive
> the number of categories must be the same as the number of possibilities of those features

44
Q

What is an example to explain the property of identity? How does it apply to the aforementioned characteristics?

A
  • teachers want to differentiate students into a group with behavioral probems, and a group without
    1> each student in each group has or doesn’t have behavioural problems, depending on the group they are in
    2> the categories are mutually exclusive because a student can’t both have and not have behavioural problems
    3> the categories are exhaustive because there are no other possibilities outside of having or not having behavioral problems (no third option)
45
Q

The property of Order

A
  • it conveys information about the relative amount of an attribute that people possess
  • numerals serve as labels (we could also use letters instead)
    ! it does not tell us the degree of difference
46
Q

The property of Quantity

A
  • provides information about the magnitude of differences between people
  • numerals reflect real numbers
    > e.g. number 1 reflects the size of the basic unit on any particular scale, while other numbers are multiples or fractions of one
  • real numbers: scalar, metric, cardinal, quantitative values (these are just synonyms of real numbers)
47
Q

The number 0
- what are the two meanings of zero?

A

a) the attribute of an object or event has no existence
> called absolute zero
> e.g. 0cm long → no length
b) arbitrary quantity of an attribute
> called arbitrary zero
> e.g. 0°C → arbitrary choice of that temperature as zero degrees; does not mean that there is zero temperature

48
Q

What is the dilemma about the interpretation of zeros?

A
  • when someone scores zero on a test, is it absolute or abitrary zero?
    ! properties and meanings of zero are fundamental to assess the answer
    > property of identity→ most fundamental one
    > property of order→ after identity, and fundamental for quantity
    > property of quantity
49
Q

Units of Measurement
- how can measurement units be arbitrary?

A
  1. the unit size can be arbitrary
  2. some units of measurement are not tied to any one type of object
    > e.g. cm can measure pencils, tables, pizzas, …
  3. when taking a physical form, some units of measurement can be used to measure different features of objects
    > e.g. using a ruler to measure the weight of a pineapple, by putting rulers on a weight so that they weight like a pineapple
50
Q

do units of measurement apply all characteristics?

A
  • usually, units of measurement are only arbitrary in size, but they are tied to a particular object
    > e.g. unit of measurement for IQ is specific only to IQ
  • however, standard measures can be used to measure psychological attributes (e.g. reaction time)
51
Q

Addivity - what does it require?

A
  • assumption that unit size does not change
  • the size of measurement unit should not change as the conditions of measurement change
    > conjoint measurement
    > e.g. size of a meter should remain constant no matter the day and time
52
Q

Does counting count as measurement?

A
  • some say that counting qualifies as measurement only when one is counting to reflect the amount of some feature or attribute of an object
    > e.g. when counting the amount of correct responses, you are measuring the proportion of knowledge
53
Q

Scaling - what is it?

A
  • particular way in which numbers are linked to behavioral observations to create a measure
    > some people mistake measurement and scaling
54
Q

Interval scales

A
  • have arbitrary zero
    > e.g. temperature
    > in psychology, vast majority of intelligence tests, personality tests, achievement tests, developmental tests are treated as interval scales
  • the size of unit is constant and additive, but the scale does not allow multiplicative interpretations
    → e.g. you can add two degrees to 40°, and you would get 42°, however by multiplying 40x2=80°, it’s wrong to interpret that 80° is twice as warm as 40
55
Q

Ratio scales

A
  • absolute zero point
    > e.g. physical distance
  • allow additivity as well as multiplicity
    > e.g. right to interpret 8km as twice as far as 40km
  • there are no ratio scales in psycology
    → this is because e.g. when measuring reaction time, it’s impossible for the individual to have 0 reaction time. Therefore, we must distinguish the instrument’s vs the people’s absolute zero