Topic 8 Flashcards

1
Q

what is a histogram

A

graph showing the frequency of measurements/obeservations plotted against the range of observations

an important data exploration and summary tool

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

explain modality and symmetry

A

median - 50% higher and 50% lower
skew… what way is the tail facing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

symmetrical distribution

A

mode, median, mode are coincident

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

modality

A

when there are more than one value with a high frequency

greatly impacts the use of median and mean measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

sample mean vs population mean

A

divided by the sample number or entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

we use deviation from mean ro deviation from the median

A

median is more popular because it doesnt get so easily affected by outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

standard deviation is important with data transformations T/F?

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is data normalization

A

raw totals (Numerator) are standardized against a denominator
min-max scaling
comparing something to make it comprehensible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

standardization (z-score normalization)

A

types of normalization
denominator is standard deviation
max is 1
min is 0
goes to 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

min-max scaling

A

important for rasters
range of data
min and max values
does not go to 0 only to the min value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

how do you know when you need to standardize your data?

A

know your data before normalizing it. Normalizing unrelated data is like mixing apples and oragnes. It makes fruit salad, not a good analysis

not all variables need to be normalized

results can be proportions or percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

data classification considerations

A

grouping of numerical data into classes for mapping, with each class represented by an individual symbol

class interval: where to put breaks in the data
number of intervals : 4-7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

describe equal intervals

A

equal intervals or steps along the number line
determine data range
not very good
susceptible to outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

describe quantiles

A

each class contains the same number of observations/values
easy tp understand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

describe mean standard deviation

A

derive classes from the descriptive statistics of overall data distribution
worst method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

maximum breakes (defined interval)

A

derive classes from groups of similar data values according to local citerion
calculation of classes order data from low to high
use largest differences as class breaks
can be good
susceptible to outliers
you dont see contrast

17
Q

natural breaks

A

subjective, visual/manual determination of logical breaks in data distribution in dispersion graph or histogram
depends on what you want to highlight

18
Q

geoeetrical intervals

A

class breaks are based on a geoetric series
good for highly skewed data
good for computers

19
Q

optimal (fisher-jenks)

A

computational approaches to mimimizing classification error
most common method
indentifies low points in data

20
Q

rating of classification methods major points

A

quantiles is only good for ordinal data
optimal is only good one for helping assist with selecting number of classes

21
Q

enumeration and spatial fallacies

A

areal aggregation
census tracts
best when units are similar sizes

MAUP
depending on the geometry of spatial organization impacts your outcome and how it will look on the map
change the area = change results

22
Q

jenks (optimal) tends to stay away from using mean as a central measure T/F?

A

True

23
Q

what is multivariate mapping

A

encoding two or more variables into the symbolization

trade off between the information content and the complexitiy of the map

two main groups
inter-symbol encoding
intra-symbol encoding

bivariate choropleth maps
bivariate normalization (value by alpha)

24
Q

what is inter-symbol encoding

A

symbolize 2 symbols concurrently (complimentary symbols)

25
Q

what is intra-symbol coding

A

multiple visual variables in one symbol
combination of size and hue could be applied

26
Q

selection of colour for classed maps

A

kind of data
colour vision impairment
simultaneous contrast
colour associations - cognitive
aesthetics
purpose - exploration vs presentation
cost of production

27
Q

uni polar vs bipolar data characteristics

A

unipolar - sequential
lightness usually preferred - varying saturation cna enhance visual contrast

bipolar - diverging
two hues diverge from a common light hue or grey

28
Q

representing uncertainty

A

accuracy and precision
uncertainty - difference between the real geographic phenomena and the users understanding of the phenomena

ex. inaccuracy of inerpolated maps
completeness of census data
nature of data collection - standardized methods or not

29
Q

grouping techniques

intrinsic-extrinsic

coincident-adjacent

A

intrinsic - extrinsic
intrinsic = vary existing object
extrinsic = use new object
coincident-adjacent
coincident = shown in same frame
adjacent = small multiple
static-dynmic
dynamic = animations or interactivity

30
Q

colour schemes and classified maps

A

map readers process colour by seeing differences in hue, stauration, and value

mapping different “things”
saturation (changing lightness)
hues ( best used to identify map featueres and differentiate)
value (lightness )

31
Q

Look up tables

A

LUTs are used in lots of ways
data structures that map values of attributes to something else
colour scale can be lenghtened by adding saturation
keep hue constant but vary saturation and lightness

look up tables = enhancement

32
Q

what type of data is pseudo colour table used on

A

unclassed data

33
Q

what is an anthrom?

A

human controlled ecosystem

34
Q

3 type of histogram graphs with LUTs

A

linear stretch (min-max enhancements)
exponential stretch
logarithmic stretch

35
Q

are stretch and enhancement synonymous terms?

A

yes

36
Q

what method is best used with contrast stretch

A

standard deviation