Thematic Mapping: Choropleths Flashcards

1
Q

Chorochromatic map

A
  • Nominal Data (Name)
  • Simple presence/absence
  • Map areas differentiated by colour/shading/pattern
  • No meaning behind symbol choice
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the breakdown of the word Chorochromatic?

A

Choros: Area
Chroma: Colour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What can we use thematic mapping of areas for?

A
  • Compare and contrast areas
  • Relationship between areas
  • Quantitative and qualitative data
  • Data that has a range where the class boundaries influence the map message
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why do we choose scientific and statistical class boundaries?

A
  • So that they are defendable and not arbitrary (Tells lies!)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What happens when you change the classes in a range?

A

Changes the map message!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are some colour guidelines for a chorochromatic map?

A
  • Remove colour or change colour and map message doesn’t really change
  • Don’t use shades of the same colour because that could imply degrees of value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the breakdown of the word Choropleth?

A

Choro: Area
Pleth: Value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is used to map quantitative areal data?

A

Choropleth maps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is used to map Qualitative/Nominal Data?

A

Chorochromatic maps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Choropleth Map

A

Data value is mapped and colour/symbolization has meaning

  • Magnitude of data is proportional to a cartographic attribute (colour, shade, texture)
  • Shades of one colour where dark=high and light=low values
  • Colour gives message, Legend gives value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the 2 types of choropleth map?

A
  • Classless

- Range-Graded

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Classless choropleth map

A
  • Spectrum of colour with no classes

- Each number is directly attached to it’s own unique colour on spectrum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the problems with a Classless choropleth map?

A
  • Not trusted
  • Numbers on spectrum are invisible
  • Can’t determine message with ease
  • Numbers are infinitely available/ divisible
  • Can mitigate with boundaries behind the colour but that negates some of the purpose
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the potential purpose of using a Classless choropleth map?

A
  • Maybe useful to remove distracting boundaries when there are many polygons
  • Removing boundaries helps reduce psychological effect of large polygons seeming to have more influence
  • But without boundaries value is impossible to determine for audience
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Range-Graded Choropleth

A

Data grouped into classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why does Dent argue that Classless Choropleths aren’t really a choropleth?

A

Because of the loss of polygon boundaries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the key to a successful range-graded choropleth?

A

Successful class boundaries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the 6 steps in Choropleth Mapping?

A

1) Judge data suitability
2) Order/Rank data
3) Classless or Range-graded
4) Determine # of classes
5) Determine class intervals
6) Determine Symbology (visual clues to differentiate between classes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Choropleth mapping: Data suitability

A
  • Must have data everywhere (continuous)
  • Must be area
  • Must be derived not absolute
  • No missing or unreported data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Derived vs. Absolute data

A

Absolute: Theoretical situation of equal areas (data independent of area size)
Derived: Changed or calculated from absolute (per, %, avg, etc.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Example of Derived vs. Absolute data

A

Absolute number of dog licenses is not related to number of people and can artificially show an area as having more dog ownership.
Take percentages of licenses per population of area to derive data and give appropriate message of percentage dog ownership

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Choropleth mapping: Ordering Data

A
  • minimum to maximum
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Choropleth mapping: Choosing Classless or Range-Graded

A

Range-Graded is the convention

- Classless is complicated and difficult to interpret

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Choropleth mapping: When do you increase the number of Classes?

A
  • Smarter audience
  • Technology and colour availability (vs. pattern fill = less classes)
  • More enumeration units
  • Regular distribution (Irregular = fewer classes)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Choropleth mapping: Rules of thumb for number of classes

A
  • 4-6 is good
  • 3 is too simplistic (maybe)
  • 7 becomes complicated (maybe)
  • 2 is too few (bivariate)
  • 12 is too many (for most audiences)
26
Q

Choropleth mapping: When might it be possible to use 2 classes?

A

When doing an above/below map with one variable

27
Q

Choropleth mapping: Fewer vs. more classes

A

Fewer: Louder message
More: Message may get confused

28
Q

What are the 4 basic and most used class boundary types?

A

1) Natural Breaks (Jenks)
2) Quantile
3) Equal Interval
4) Standard Deviation

29
Q

Class Boundaries: Natural Breaks (Jenks)

A
  • Based on natural clusters inherent in data
  • Group similar values to maximize differences between classes
    Disadvantage: Based on how many classes are chosen and may end up breaking clusters
30
Q

Class Boundaries: Quantile

A
  • Classes contain equal number of features irrespective of distribution
  • Designed to distribute data
  • Designed for uniformly distributed data
  • Used more often than they should be as data is often nonuniform
31
Q

Class Boundaries: Quantile Disadvantages

A
Disadvantages: features with wildly different values can be put in same class even though they are more related to another class
- Map can be misleading graphically
32
Q

Class Boundaries: Equal Interval

A
  • Classes based on equal sized subranges
  • You specify # of intervals and breaks
  • Ex. 0-10, >10-20 etc.
33
Q

Class Boundaries: Standard Deviation

A
  • Classes based on how much the values differ from a mean using standard deviations from the mean
  • Normally Distributed Data!
  • Tough for most audiences, best for statisticians
  • Used for above/below maps (about the mean)
  • Based on mean not the actual attributes themselves
34
Q

What does changing the class boundary type do to the map?

A

Changes the message, sometimes drastically, even when # of classes remains the same

35
Q

Choropleth Mapping: Primary decision Criteria for class boundary choice

A
  • Maximize variation between classes
  • Minimize variance within classes
  • Highlight critical values?
36
Q

How is class boundary optimization measured?

A

With a Goodness Variance Fit test (GVF)

37
Q

SDDBG

A

Part of GVF

= Sum of Squared Deviation Between Groups (classes)

38
Q

SSDTotal

A

Part of GVF
= Total Sum of Squared Deviation
SSDTotal = Sum of (each attribute - mean)^2

39
Q

GVF equation

A

GVF = SSDBG/SSDTotal

40
Q

GVF

A
  • Helps determine if the chosen classification system matches the data
  • Use for each classification type to find highest GVF
  • Use for justification, it helps decision but doesn’t make the decision
41
Q

Why do we look at data distribution when choosing a class boundary and interpreting GVF?

A
  • Standard Deviation requires normal distribution

- If data is not normal and even if it has highest GVF, the standard deviation still cannot be chosen

42
Q

How to derive SSDBG

A

SSDTotal = SSDBG + SSDWG

–>Then SSDBG = SSDTotal - SSDWG

43
Q

SSDWG

A

Sum of Squared Deviation Within Groups

SSDWG = (for each class) the Sum of (each attribute in class - mean of class)^2

44
Q

What is optimal GVF?

A

GVF = 1

Classification technique closest to 1 minimizes within group variation and maximizes between group variation

45
Q

What are the steps and why do we organize data?

A
  • Organize to visualize data to see distribution
    Step 1) Rank Order
    Step 2) Draw a histogram
    Step 3) Draw graphic array (scatterplot)
46
Q

What are the 4 constant step methods?

A
  • Common difference
  • Quantiles
  • Normal Distribution
  • Nested Means
47
Q

What are the 2 Systematically Unequal step methods

A
  • Arithmetic Series
  • Geometric Series
  • Basically nature of how curve changes
48
Q

What are the 2 Irregular step methods?

A
  • Graphic Method (Natural Breaks)
  • Iterative Method (Jenks Optimization)
  • To see where changes occur
49
Q

How is the Common Difference

(CD) Calculated?

A
  • Calculate Range (Highest - Lowest)
  • Calculate CD (Range/# of observations
  • Upper Class Limit = Lowest + 1 * CD, Lowest + 2*CD, etc until # reaches # of classes
  • Data is evenly distributed
50
Q

How are Quantiles Calculated?

A
  • Step 1) # data values/# classes
  • Step 2) Make adjustments for odd number results
  • Step 3) Separate classes based on number of observation sin each class
51
Q

What are advantages of Quantile Classification?

A
  • No Empty classes
  • Class Limits are computed manually
  • Same Percentage of observations in each class
  • Discussion of mapped data more simplistic
  • Good for Ordinal data (by definition this data is ranked
52
Q

Disadvantages of Quantile Classification?

A
  • Fails to consider data distribution, ranking can change distribution
  • Values far apart can end up in same class
  • Identical values on class breaks
  • Odd numbers will need to adjust for which class will have more numbers
  • Creates gaps (Highest in one class is not lowest in net class
53
Q

How does Normal Distribution Classification work?

A
  • Calculate mean and standard deviations and use as class breaks
  • Use when data are normally distributed
  • Even # of classes
  • No more than 6 classes (+/- 3 std. dev.)
  • Class boundaries are arrayed around a central value instead of ascending from lowest value
54
Q

What are some disadvantages of Normal Distribution classification?

A
  • Data must be normal
  • Even glasses only
  • Can confuse audience with what one standard deviation represents
  • Class boundaries are arrayed around a central value instead of ascending from lowest value
55
Q

How does Nested Means Classification work?

A
  • Calculate mean and use to divide data into two classes
  • Take mean of the two classes above and below mean and use to create new classes
  • Normally distributed data
  • Can only have even # of classes
  • Can only have number of classes to power of 2, 4, 8, 16 etc.
56
Q

What are some disadvantages to using Nested Means?

A
  • Only for normally distributed data
  • Can only have even # of classed to powers of 2, 4, 8, 16, etc.
  • Mean can end up as one of the values and and cause uneven # of classes
57
Q

How do Systematically Unequal Stepped class limits work?

A
  • X = (Hightest - Lowest)/Progression value sum
  • Lowest + Pregression value * X
  • Can do Ascending or Descending
  • Use where frequency of data is skewed in one direction
58
Q

What is does positive skewness relate to?

A
  • Mean > Median

- Use ascending arithmetic (1, 2, 3, 4, etc) or geometric (1, 2, 4, 8, etc.)

59
Q

What does negative skewness mean?

A
  • Mean
60
Q

What is the progression for Geometric Series?

A
  • 2^0, 2^1, 2^2, 2^3, etc

- L + 1x +2x+4x+8x = H

61
Q

How does the Irregular Step Method (natural breaks) work?

A
  • Best for irregular data
  • Finds major gaps or groupings, or sharp breaks in histogram slope
  • Take difference of neighbouring data points and see where largest differences occur to find class breaks
62
Q

How are visualization symbols selected in choropleth mapping?

A
  • Symbols have meaning related to magnitude
  • Symbols must be selected to carefully communicate proper relationship and magnitude
  • Usually a shading pattern or colour