Lecture 6 Flashcards
Uses of GIS
Lots of choropleth maps are produced to show new data
Consider what is shown, scale (population), colour scheme
Red and green should never be seen (red and green colour blindness)
Aims
Error and uncertainty in data and models Sources of error and uncertainty Managing error and uncertainty Understanding MAUP - Scale - Zones
Error- why it matters
GIS analysis relies on quality data- but also sensible technicians who can identify inaccuracies and the source of error.
Error can render the analysis useless, meaningless or cause inappropriate decisions to be made based on faulty outputs.
We bring together lots of diverse datasets (a huge benefit), which create more chances for error to be introduced.
Introduction to error and uncertainty
What do we mean by ‘error’?
Everyday language: A mistake Quantitative sciences: Difference between a predicted or measured value and the true value Error = measured value - true value Error = predicted value - true value
Limiting Digitizing Error
Heads-up digitizing (using a mouse) can lead to user error
Potential sources of digitizing error:
Overshoots and undershoots
Dangling segment
Sliver polygons
Closeness of output to reality is unknown…
Therefore errors introduce uncertainty into outputs (maps, statistics etc…)
Accuracy
The extent to which an estimated data value represents its true value
Precision
The dispersion of a measured or predicted value also specificity (decimal places)
Bias
Is any systematic difference between the true value and the measured or predicted value (e.g overprediction)
Three key types of accuracy in GIS: Positional, attribute and topological
Positional accuracy: are features in the right place?
Attribute accuracy: do features have the correct value (continuous/ categorical)?
Topological accuracy: Do features relate to each other
Common errors
Misalignment errors in overlay
Aggregation/ disaggregation
Classification (especially of continuous data to categorical)
Errors will propagate through
Understanding scale effects
Fotheringham and Wong (1991) explain the increase in the coefficient of correlation through the fact that aggregation into larger and larger area is equivalent to smoothing and therefore a reduction in variance for the variables of interest.
This outcome is also affected by the presence of spatial dependence among the initial observations: positive spatial autocorrelation moderates the fall in variance from smoothing, while negative spatial autocorrelation exacerbates it.
Zonation
How you draw the boundary of an area may influence the resulting measure of health outcomes
Often zones are created for convenience, based on historic boundaries
Major differences in correlations
Sources of error and uncertainty in GIS analysis
Data collection
Data encoding
Data editing/ conversion
Data analysis