Analysis of ecological communities Flashcards

1
Q

Biodiversity data reduction

A

Summarization of how sample units relate to each other, Two types:

  1. Categorization.
  2. Creating synthetic variables (e.g., ordination).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Problem with using traditional multivariate tools in community ecology

A

Data are accompanied by unwanted properties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Questions asked by comunity ecology

A

What are the species living in an area and why?
What controls diversity?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Ordinal variables

A

Categorical variables of a ranked nature

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Nominal variables

A

Categorical variables of a qualitative nature with no inherent rank

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Dummy variables

A

Categorical variables with n possible states transformed into binary variables in which 1 or 0 represent presence or absence of something categorical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Discontinuous variables

A

Variables with fixed numerical values (e.g., percent)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Continuous variables

A

Variables with potential infinite numerical values. In practice, seldom true (e.g., set graduations in a measuring device).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Routes for analyzing a species matrix

A

Q route: Relationships among sample units
R: Relationships among species

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Transposing (matrices)

A

Flipping a matrix vertically or horizontally.
In a normal matrix species are treated as axes;
In a transposed matrix, species are treated as points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Calculated matrices

A

Matrices wholes values are calculated from multiplying variables from an original matrix.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Metaobjects

A

A sample containing objects sich as species, environment, treatment, and species traits.
Columns and rows of a matrix are considered objects.

Each pair of objects can potentially form a different matrix, altough not always conceptually appealing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sparse matrix

A

A matrix containing many zero values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Relativization

A

Standarizing a number of variables into a single scale (e.g., number of standard deviations from the mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Focal species

A

A species in a community with special interest (e.g., a rare species interesting for conservation).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Focal species-habitat relationship

A

Two ways to examine:
1. Species-centeres approach: predict species performance from habitat variables (e.g., regression)

  1. Habitat-centered approach: describe variation in habitat variables and evaluate species position within that variation (e.g., ordination).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Sampling (McCune’s definition)

A

The process of selecting objects of study from a larger number of those objects, where each object is subjected to one or more measurements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Examples of species abundance measurements

A
  1. Cover
  2. Frequency
  3. Counts
  4. Biomass
  5. Basal area
  6. Importance values (average of >1 sampling measurements)
  7. Presence/absence
  8. Size-class data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Types of sampling schemes

A
  1. Random
  2. Stratified random
  3. Regular (sample units are spaced at regular intervals)
  4. Arbitrary without bias (haphazard)
  5. Subjective (e.g., we sampled the most diverse landscapes)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Sources of random numbers

A

Random number tables, stopwatch, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Types of cover sampling units

A
  1. Fixed area: circles, squares, rectanlges
  2. Point intercept: percent cover is calculated as proportion of hits by points
  3. Line intercept: percent cover measured as proportion of a line superimposed on a species
  4. Distance method: Distance from a randomly chosen point to nearest tree (based on the concept that distances can be used to calculate the mean area occupied by an object).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Number of sample units (rule of thumb)

A

For every imporant controlling factor, 20 additional sample units are needed;

Alternatively, a species accumulation curve can be used to calculate needed number of samples (when a curve flattens out, addiitonal sampling stops being beneficial).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Measures of sample adequacy

A

Precision: Increases with addiitonal digits.
Accuracy: How close a measurement is from reality
Bias: Any systematic directional error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Measuring accuracy

A

A sample taken as “the truth” is taken by intensive sampling and resampling an area with dufferent observers and a large number of sampling units.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Sample size-number tradeoff
Many small samples: results in an incomplete species list Few large samples: results in overestimating rarer species
26
Pseudoreplication
When replicates are not trully independent (e.g., spatially non-independent plots)
27
Repeated measures
Succesive measurements from one sample result in correlation (e.g., samplig an area on different dates).
28
Pseudo-turnover
Seemingly change in a community due to sampling biases (e.g., differet observers)
29
Nesting (aka multistage sampling)
Subsampling within a sample unit
30
Paired sampling
Samples pairs are identical except for the condition of interest. Only really achievable under controlled conditions.
31
Topographic variables
1. Aspect of a slope (direction that it faces; degrees need to be transformed) 2. Heat load: Amount of solar energy a slope receives 3. Slope correlation of distances (physical distances in a slope need to be transformed)
32
Diversity (formula)
where *α* is a constant assigned; *S* = number of species; *Dα* = diversity measure based on constant l *Pi* = proportion of individuals in a particlar species *i*
33
Diversity and uncertainty
Maximum diversity yields maximum diversity in the species that one will draw next from a sample.
34
Three applications of B-diversity in relation to environmental gradients
1. Direct gradient: the environmental gradient is directly measured 2. Indirect gradient: a gradient is presumed from measures of a species 3. No specific gradient: compositional heterogeneity without reference to a gradient
35
Rate of change, *R*
Refers to the steepness of a species response in relation to a gradient. Can be graphed against the gradient (see image).
36
Gleasons
Unit of species change. Simiar to R, masures steepness of a species response curve. It's the sum of slopes of individual species at a point along the gradient axis.
37
Half-change
The amount of change in a community resulting in 50% similarity. A gradient that is 3 half-changes long means that the community has changed by half three times.
38
Beta turnover
Measurment of species change in a commuity with presene/absence data. Uses gained species, lost species, and α.
39
Pielou's *J*
Measure of species evenness calculated as: *J* = *H'*/log*S* , where H' is the Shannon inex
40
Distance curve
Works by calculating the average distance between the centroid of a subsample and the centroid of a whole sample. The more representative the sample, the shorter the distance. Distance curves have an inverse distribution to richness.
41
Problems in species response curves in response to gradients (in relation to real data)
1. Zero truncation: Abbundance cannot be lower than 0 2. Curves are solid: Species is usually less abundant than potential 3. Curves are complex: Can be polymodal, asymmetric, discontinuous, etc.
42
Frequency distribution of species abundance (most common characteristic)
Usually resemble negative binomial distribution (strongly skewed).
43
Bivariate distributions
Used to examien the relationship between the distributions of two species. A) positively associated B) Negatively associated
44
Dust bunny distribution
Commonly observed in a scatterplot of one species versus another. Most sample units lie along the corners of the p-dimensional space. The more strongly correlated the variables, the more elongate the curve.
45
B in relation to species matrices
If β = 1, the matrix is full; If β = 3, the mean sampling unt has 1/3 of all species.
46
Double zero problem
many common methods for examining relationships between species take absence as an indication of a positive relationship. Therefore, as (0,0) values are added to a dataset, negative association move toward positivity. This issue prodcues data misinterpretation.
47
The flexibility of using distance to analyze biological communities
1. Resemblance can be measured as dissimilairty (distance) or similarity 2. Distance measures can be converted into similarities and vice-versa 3. Distance can be applied to binary and quantitative data
48
Graphical representation of the following dataset in ordination
\* Alterntively, sample units as points along species axes
49
Three main categories of distance measures
Metric, semimetric, nonmetric
50
Metric distance measures
Obbey thre rules: 1) min values is 0 for two identical items 2) distance is possitive for two differing items 3) Ssymmetry: distance from A to B = distance B to A 3) Triangle inequality axiom: with three objects, the distance between two objects cannot be longer than the sum of the other two
51
Semimetric distance measures
Violate the triangle inequality axiom
52
Nonmetric distance measures
Violate one or more metric distance measure rules
53
Euclidean distance
A distance measure based on the Pythagorean theorem
54
Manhattan (city block) distance
Distace measure calculated while moving along one direction at a time
55
Correlation coefficient (*r*) (distance measure)
Distance measure calculated by using the angle beteen lines connecting two poins with a centroid; It eliminates scale incompatiblities between values (i.e., different orders of magnitude between values) *r* = cos(angle) Rescaling: *r*distance = (1 - *r*)/2
56
Proportion coefficients
Distance method consisting of city bock distance expressed as proportions of the maximum distance possible; Includes **Sorensen** coefficient (aka **Bray-Curtis** coefficient): 2*w*/(A+B) and **Jackard** coefficient: *w*/(A+B-*w*), Where *w* is the area overlapping between curves. *w* can be represented by the overlap between two curves of species abundance against an environmental gradient.
57
Quantitative symmetric dissimilarity (QSK)
Throws different results with raw vs relativized data. When data is relativized, gives the same result as Sorensen.
58
Relativized Sorensen
Distance measure that is equivalent to Bray-Curtis coefficient when data are relativized by sampling unit (SU) total.
59
Relative Euclidean distance (RED)
Distance measure designed to put differently-scaled variales on the same scale. Ranges from 0 to 1.
60
Chi-square distance
Distance measure that usually performs poorly, but can be sueful for some ordination techniques.
61
Mahalanobis distance (*D*2)
Corrects for correlation in original variables; Can be used to test outliers by calculating distance between each point and remaining points in the cloud; Functions similarly to an F ratio;
62
Distance: Loss of sensitivity (concept)
All distance measures show a degree of curvilinearity as the environmental gradient axis elongates, losing seinsitivity (this effects is strongest for correlation coefficient measure).
63
Redundancy
Where a pair of species fails to be informative, another pair of species will be.
64
Nonmetric multidimensional scaling (NMS) (advantage over other techniques)
It's based on ranked distances. This allows to linearize the relationship between distance in species space and distance in an ordination space.
65
Criteria for choosing distacne measures
1) Availability in stastistical packages 2) Compatibility with a particular multivariate analysis 3) Theoretical basis (e.g., what the distance measure represents) 4) Intuitivity (ease of interpetability)
66
Geodesic distance method vs shortest path method
While aiming to use the shortest distance possible, the *shortest path method* considers only one point at a time, while the *geodisic method* considers all points at a time
67
For the following dataset, calculate Euclidean distance and Manhattan distance for sites AB
Euclidean distance AB = sqrt(1+1+1+1+81) = 9.165 Manhattan distance AB = 1+1+1+9 = 12
68
Outliers
Sample units with extreme values. for individual variables (univariate outliers) or multiple variables (multivariate outliers). Note multivariate outliers are not necessarily univariate outliers.
69
Ways of detecting multivariate outliers
1) Mahalanobis distance 2) Calculate number of standard deviations for distance 3) Look for isolated entities in clusters
70
Types of data transformations for multivariate analysis
1) Monotonic transformations 2) Beals smoothing 3) Relativization 4) Double relativization
71
Monotonic transformations (examples)
1) Power (xp): p = 0 gives presence absence; square root (p =0.5) and other roots. 2) Logarithmic: log(x). Requires special treatment for zeros (not recommended) 3) Arcsine: (2/π) \* arcsin(x); requires proportion data 4) Arcsine squareroot (2/π) \* arcsin(√x): Spreads the ends of the scale for proportion data
72
Beals smoothing
Designed for community data. Replaces each cell in the community matrix with the proability of the target species occuring in that particular sample, based on the joint occurrences. Replaces presence/absence data with quantitative values.
73
Relativization
Rescales individual rowsor column in relation to one criterion (e.g., maximum, mean, median, SD...) Most effective when row totals are unequal. (consistency of row totals can be evaluated by the coefficient of variation: CV = 100\*(SD/mean))
74
Weighting by ubiquity
A type of relativization that gives more weight to species tat occur in many samples.
75
Information function of ubiquity
Based on information theory, maximum weigt is applied to species occurring in 50% of the samples because they provide the most information.
76
Double relativization
Relativizations applied serially in a given combination.
77
Deleting species
Deleting species based on very low occurrences is ok for studies interested in correlation. The aim in this is trying to reduce noise. Correlation coefficients peaks at an intermediate level of species retention. Deleting species when the aim is to examine patterns of species diversity is not correct.
78
Combination of entities
In community matrices, sampling units can be combined (averaged) provided they are similar enough.
79
Difference between two dates
Before and after data on species abundance can be analyzed by difference instead of original values *b*ij = *a*ij2 - *a*ij1
80
Grouping strategies for multivariate analysis
1) Hierarchical (groups nested witin groups) 2) Nonhierarchical (groups not related to a particular level of grouping) 3) Polythetic: Use diferent species for decigin groupings/divisions 4) Monothetic: Grouping based on one species (or variable) of interest 5) Agglomerative: Groups formed by fusion 6) Divisive: Starts with a group of everything and sequentialy divides
81
Dendrogram grouping and information
Information is lost as groups are made. The *objective function* measures informaton lsot at each step and allows rescaleing from 0 to 100% information. The goal is to maximize information and minimize grouping (groups should still be interpretable ecologically).
82
Multivariate regression trees
Hierarchical grouping method. Impurity of groups is the sum of the squared Euclidean distance from individual samples units to the centroid.
83
Nonhierarchical grouping
Groups are specified and then items are placed into those groups.
84
Evaluating quality of groups
Chaining is the addition of single items to existing groups. Chaining increaes path lenght, so the goal is to reduce chaining.
85
Circularity
testing for differences among groups using variables that were used to define groups.