Spatial Patterns and Cartographic Danger Flashcards
3 basic spatial patterns
Random, uniform and clustered
Cartographic Danger
Aggregate regional counts, often to preserve confidentiality
Counts lose some resolution
Different aggregations can lead to different results
Need to be critical with aggregated data maps
Question: what the underlying data is saying
E.g. county v ZIP Code v Census tract → changes what the message the map conveys
Can obscure heterogeneity at the larger spatial unit → smaller the spatial unit = more variation
Modifiable areal unit problem (MAUP)
different aggregations can lead to diffrent results
Gerrymandering: Regional data and the modifiable areal unit problem
Congressional districts - measure number of people per district for representation in govt
Re-district to divvy up district to political advantage
Massaging teh data
Do not use congressional districts for health data as unreliable
Aiming to get good counts
Happens at state level
Why is confidentiality important?
do not want to “out” someone if a disease with a lot of stigma + low prevalence → e.g. mask data or choose not to use household level data - choose your audience
Areal Units: Census Geography
More Areal Units
States (US), counties, tracts (most commonly used by Dr Clennon)
Block groups
Blocks
We use areas because
Often summaries are only available for areas
Most common background demographic information in US is from the decennial census
Census data is free
FIPS Code
Federal Information Processing Standard Codes
Hierarchical and can be broken apart or concatenated (combined)
Allows you to do table joins
ZIP Code Areas
Designed to deliver mail, not tabulate the population
Commonly used because -
Can extract ZIP from billing address
Would have to geocode address to find tract, block, block-groups
Can approximate census tracts but not the same
Measuring geographic distribution: Mean Centre
A way to mask data
Measures centre tendency
Average x + y values for input
Input locations need to be projected to accurately measure distances
Can compute locations or weight the locations by some numeric values
Mean centres can be compared to look at differences between population, case and incidence distribution
Measuring Geographic Distribution: Median Centre
Measures central tendency but less influenced by outliers than the mean
Measures the distribution of location values around statistical mean
Uses an iterative algorithm to locate the point minimising the distance to all locations
Measuring Geographic Distribution: Standard Deviation Ellipse
Shows the directional distribution - measures dispersion
Allows you to look at trends
1 standard deviation ellipse polygon will cover 68% of the points, 2SD = 95%, 3 SD = 99%