Spatial data quality and uncertainty Flashcards
what is uncertainty?
difference between the contents of a dataset and the phenomena that the data are supposed to represent
what are some sources of uncertainty?
- users (variation in understanding how a real thing differs from its representation)
- units of analysis (i.e. how do u measure health?)
- proxy and indirect indicators (i.e. judging income based on home location)
- geographic phenomena
- GIS data model (i.e.raster vs vector)
Who sets standards for data quality?
- International Standards Organization (ISO)
- governemnt legislation (i.e. US national map accuracy standards, Canada national topographic database standards)
how is data quality differnt from the user and the producer perspective?
producer: degree to which data in a dataset conforms to the capture specification and to product specifications for the dataset (internal data quality)
- user: data quality=fitness for purpose of data. external data quality
What are some elements of spatial data quality?
- completeness (i.e. omission)
- logical consistency (does it make sense topologically, conceptually)
- positional accuracy (i.e. location of lake in feature class vs location in a TIFF image)
- thematic accuracy
- temporal quality
What are errors?
- deviations between measured value and its true value
- classes: human, systematic (biases in equipment use, defects in measuring equipemtns), random errors
Are accuracy and precision the same thing?
- accuracy: how close the recorded values are to “true” values.
- precision: how exact data measurement and storage are (i.e. # of decimal place)
What document sets standards for measuring and evaluating quality from a producer’s perspective?
-ISO 19157
completeness, logical consistency, positional accuracy, thematic accuracy
Explain positional error
- error in locational coordinates (x,y)
- Root Mean Square error: tells us how far points are from their true location on average
What are some sources of positional error?
- map projection, datum
- improper representation of objects or phenomena (i.e. boundaries)
- equipment errors
- human errors (i.e.bad sampling procedures)
- media related (i.e. bad source quality like CanVec data)
How can we determine if the data we use is good enough?
- check metadata!!
- consider validity of authors
- compare with the real thing
Explain map topology
- temporary relationships between FCs or shapefiles within a ArcMap edit session
- i.e. reshaping polygon edges
Explain geodatabase topology
- used to check data integrity relative to the rues and FCs that are selected by the geoDB designer
- rules govern relationships between features within an FC or between several FCs
- topology rules are saved as table object inside feature dataset
- relationships in each rule are checked against the participating data sets