Analysis of ecological communities Flashcards

1
Q

Biodiversity data reduction

A

Summarization of how sample units relate to each other, Two types:

  1. Categorization.
  2. Creating synthetic variables (e.g., ordination).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Problem with using traditional multivariate tools in community ecology

A

Data are accompanied by unwanted properties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Questions asked by comunity ecology

A

What are the species living in an area and why?
What controls diversity?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Ordinal variables

A

Categorical variables of a ranked nature

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Nominal variables

A

Categorical variables of a qualitative nature with no inherent rank

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Dummy variables

A

Categorical variables with n possible states transformed into binary variables in which 1 or 0 represent presence or absence of something categorical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Discontinuous variables

A

Variables with fixed numerical values (e.g., percent)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Continuous variables

A

Variables with potential infinite numerical values. In practice, seldom true (e.g., set graduations in a measuring device).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Routes for analyzing a species matrix

A

Q route: Relationships among sample units
R: Relationships among species

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Transposing (matrices)

A

Flipping a matrix vertically or horizontally.
In a normal matrix species are treated as axes;
In a transposed matrix, species are treated as points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Calculated matrices

A

Matrices wholes values are calculated from multiplying variables from an original matrix.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Metaobjects

A

A sample containing objects sich as species, environment, treatment, and species traits.
Columns and rows of a matrix are considered objects.

Each pair of objects can potentially form a different matrix, altough not always conceptually appealing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sparse matrix

A

A matrix containing many zero values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Relativization

A

Standarizing a number of variables into a single scale (e.g., number of standard deviations from the mean).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Focal species

A

A species in a community with special interest (e.g., a rare species interesting for conservation).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Focal species-habitat relationship

A

Two ways to examine:
1. Species-centeres approach: predict species performance from habitat variables (e.g., regression)

  1. Habitat-centered approach: describe variation in habitat variables and evaluate species position within that variation (e.g., ordination).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Sampling (McCune’s definition)

A

The process of selecting objects of study from a larger number of those objects, where each object is subjected to one or more measurements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Examples of species abundance measurements

A
  1. Cover
  2. Frequency
  3. Counts
  4. Biomass
  5. Basal area
  6. Importance values (average of >1 sampling measurements)
  7. Presence/absence
  8. Size-class data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Types of sampling schemes

A
  1. Random
  2. Stratified random
  3. Regular (sample units are spaced at regular intervals)
  4. Arbitrary without bias (haphazard)
  5. Subjective (e.g., we sampled the most diverse landscapes)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Sources of random numbers

A

Random number tables, stopwatch, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Types of cover sampling units

A
  1. Fixed area: circles, squares, rectanlges
  2. Point intercept: percent cover is calculated as proportion of hits by points
  3. Line intercept: percent cover measured as proportion of a line superimposed on a species
  4. Distance method: Distance from a randomly chosen point to nearest tree (based on the concept that distances can be used to calculate the mean area occupied by an object).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Number of sample units (rule of thumb)

A

For every imporant controlling factor, 20 additional sample units are needed;

Alternatively, a species accumulation curve can be used to calculate needed number of samples (when a curve flattens out, addiitonal sampling stops being beneficial).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Measures of sample adequacy

A

Precision: Increases with addiitonal digits.
Accuracy: How close a measurement is from reality
Bias: Any systematic directional error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Measuring accuracy

A

A sample taken as “the truth” is taken by intensive sampling and resampling an area with dufferent observers and a large number of sampling units.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Sample size-number tradeoff

A

Many small samples: results in an incomplete species list
Few large samples: results in overestimating rarer species

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Pseudoreplication

A

When replicates are not trully independent (e.g., spatially non-independent plots)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Repeated measures

A

Succesive measurements from one sample result in correlation (e.g., samplig an area on different dates).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Pseudo-turnover

A

Seemingly change in a community due to sampling biases (e.g., differet observers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Nesting (aka multistage sampling)

A

Subsampling within a sample unit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Paired sampling

A

Samples pairs are identical except for the condition of interest.
Only really achievable under controlled conditions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Topographic variables

A
  1. Aspect of a slope (direction that it faces; degrees need to be transformed)
  2. Heat load: Amount of solar energy a slope receives
  3. Slope correlation of distances (physical distances in a slope need to be transformed)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Diversity (formula)

A

where α is a constant assigned; S = number of species; Dα = diversity measure based on constant l Pi = proportion of individuals in a particlar species i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Diversity and uncertainty

A

Maximum diversity yields maximum diversity in the species that one will draw next from a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Three applications of B-diversity in relation to environmental gradients

A
  1. Direct gradient: the environmental gradient is directly measured
  2. Indirect gradient: a gradient is presumed from measures of a species
  3. No specific gradient: compositional heterogeneity without reference to a gradient
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Rate of change, R

A

Refers to the steepness of a species response in relation to a gradient. Can be graphed against the gradient (see image).

36
Q

Gleasons

A

Unit of species change. Simiar to R, masures steepness of a species response curve.
It’s the sum of slopes of individual species at a point along the gradient axis.

37
Q

Half-change

A

The amount of change in a community resulting in 50% similarity.
A gradient that is 3 half-changes long means that the community has changed by half three times.

38
Q

Beta turnover

A

Measurment of species change in a commuity with presene/absence data. Uses gained species, lost species, and α.

39
Q

Pielou’s J

A

Measure of species evenness calculated as:

J = H’/logS

, where H’ is the Shannon inex

40
Q

Distance curve

A

Works by calculating the average distance between the centroid of a subsample and the centroid of a whole sample. The more representative the sample, the shorter the distance.

Distance curves have an inverse distribution to richness.

41
Q

Problems in species response curves in response to gradients (in relation to real data)

A
  1. Zero truncation: Abbundance cannot be lower than 0
  2. Curves are solid: Species is usually less abundant than potential
  3. Curves are complex: Can be polymodal, asymmetric, discontinuous, etc.
42
Q

Frequency distribution of species abundance (most common characteristic)

A

Usually resemble negative binomial distribution (strongly skewed).

43
Q

Bivariate distributions

A

Used to examien the relationship between the distributions of two species.
A) positively associated
B) Negatively associated

44
Q

Dust bunny distribution

A

Commonly observed in a scatterplot of one species versus another.
Most sample units lie along the corners of the p-dimensional space.
The more strongly correlated the variables, the more elongate the curve.

45
Q

B in relation to species matrices

A

If β = 1, the matrix is full;
If β = 3, the mean sampling unt has 1/3 of all species.

46
Q

Double zero problem

A

many common methods for examining relationships between species take absence as an indication of a positive relationship.

Therefore, as (0,0) values are added to a dataset, negative association move toward positivity.

This issue prodcues data misinterpretation.

47
Q

The flexibility of using distance to analyze biological communities

A
  1. Resemblance can be measured as dissimilairty (distance) or similarity
  2. Distance measures can be converted into similarities and vice-versa
  3. Distance can be applied to binary and quantitative data
48
Q

Graphical representation of the following dataset in ordination

A

* Alterntively, sample units as points along species axes

49
Q

Three main categories of distance measures

A

Metric, semimetric, nonmetric

50
Q

Metric distance measures

A

Obbey thre rules:

1) min values is 0 for two identical items
2) distance is possitive for two differing items
3) Ssymmetry: distance from A to B = distance B to A
3) Triangle inequality axiom: with three objects, the distance between two objects cannot be longer than the sum of the other two

51
Q

Semimetric distance measures

A

Violate the triangle inequality axiom

52
Q

Nonmetric distance measures

A

Violate one or more metric distance measure rules

53
Q

Euclidean distance

A

A distance measure based on the Pythagorean theorem

54
Q

Manhattan (city block) distance

A

Distace measure calculated while moving along one direction at a time

55
Q

Correlation coefficient (r) (distance measure)

A

Distance measure calculated by using the angle beteen lines connecting two poins with a centroid;

It eliminates scale incompatiblities between values
(i.e., different orders of magnitude between values)

r = cos(angle)

Rescaling: rdistance = (1 - r)/2

56
Q

Proportion coefficients

A

Distance method consisting of city bock distance expressed as proportions of the maximum distance possible;

Includes Sorensen coefficient (aka Bray-Curtis coefficient): 2w/(A+B) and
Jackard coefficient: w/(A+B-w),

Where w is the area overlapping between curves.

w can be represented by the overlap between two curves of species abundance against an environmental gradient.

57
Q

Quantitative symmetric dissimilarity (QSK)

A

Throws different results with raw vs relativized data.
When data is relativized, gives the same result as Sorensen.

58
Q

Relativized Sorensen

A

Distance measure that is equivalent to Bray-Curtis coefficient when data are relativized by sampling unit (SU) total.

59
Q

Relative Euclidean distance (RED)

A

Distance measure designed to put differently-scaled variales on the same scale. Ranges from 0 to 1.

60
Q

Chi-square distance

A

Distance measure that usually performs poorly, but can be sueful for some ordination techniques.

61
Q

Mahalanobis distance (D2)

A

Corrects for correlation in original variables;

Can be used to test outliers by calculating distance between each point and remaining points in the cloud;

Functions similarly to an F ratio;

62
Q

Distance: Loss of sensitivity (concept)

A

All distance measures show a degree of curvilinearity as the environmental gradient axis elongates, losing seinsitivity (this effects is strongest for correlation coefficient measure).

63
Q

Redundancy

A

Where a pair of species fails to be informative, another pair of species will be.

64
Q

Nonmetric multidimensional scaling (NMS) (advantage over other techniques)

A

It’s based on ranked distances. This allows to linearize the relationship between distance in species space and distance in an ordination space.

65
Q

Criteria for choosing distacne measures

A

1) Availability in stastistical packages
2) Compatibility with a particular multivariate analysis
3) Theoretical basis (e.g., what the distance measure represents)
4) Intuitivity (ease of interpetability)

66
Q

Geodesic distance method vs shortest path method

A

While aiming to use the shortest distance possible, the shortest path method considers only one point at a time, while the geodisic method considers all points at a time

67
Q

For the following dataset, calculate Euclidean distance and Manhattan distance for sites AB

A

Euclidean distance AB = sqrt(1+1+1+1+81) = 9.165

Manhattan distance AB = 1+1+1+9 = 12

68
Q

Outliers

A

Sample units with extreme values.
for individual variables (univariate outliers) or multiple variables (multivariate outliers).

Note multivariate outliers are not necessarily univariate outliers.

69
Q

Ways of detecting multivariate outliers

A

1) Mahalanobis distance
2) Calculate number of standard deviations for distance
3) Look for isolated entities in clusters

70
Q

Types of data transformations for multivariate analysis

A

1) Monotonic transformations
2) Beals smoothing
3) Relativization
4) Double relativization

71
Q

Monotonic transformations (examples)

A

1) Power (xp): p = 0 gives presence absence; square root (p =0.5) and other roots.
2) Logarithmic: log(x). Requires special treatment for zeros (not recommended)
3) Arcsine: (2/π) * arcsin(x); requires proportion data
4) Arcsine squareroot (2/π) * arcsin(√x): Spreads the ends of the scale for proportion data

72
Q

Beals smoothing

A

Designed for community data.

Replaces each cell in the community matrix with the proability of the target species occuring in that particular sample, based on the joint occurrences.

Replaces presence/absence data with quantitative values.

73
Q

Relativization

A

Rescales individual rowsor column in relation to one criterion (e.g., maximum, mean, median, SD…)

Most effective when row totals are unequal.

(consistency of row totals can be evaluated by the coefficient of variation: CV = 100*(SD/mean))

74
Q

Weighting by ubiquity

A

A type of relativization that gives more weight to species tat occur in many samples.

75
Q

Information function of ubiquity

A

Based on information theory, maximum weigt is applied to species occurring in 50% of the samples because they provide the most information.

76
Q

Double relativization

A

Relativizations applied serially in a given combination.

77
Q

Deleting species

A

Deleting species based on very low occurrences is ok for studies interested in correlation. The aim in this is trying to reduce noise. Correlation coefficients peaks at an intermediate level of species retention.

Deleting species when the aim is to examine patterns of species diversity is not correct.

78
Q

Combination of entities

A

In community matrices, sampling units can be combined (averaged) provided they are similar enough.

79
Q

Difference between two dates

A

Before and after data on species abundance can be analyzed by difference instead of original values
bij = aij2 - aij1

80
Q

Grouping strategies for multivariate analysis

A

1) Hierarchical (groups nested witin groups)
2) Nonhierarchical (groups not related to a particular level of grouping)
3) Polythetic: Use diferent species for decigin groupings/divisions
4) Monothetic: Grouping based on one species (or variable) of interest
5) Agglomerative: Groups formed by fusion
6) Divisive: Starts with a group of everything and sequentialy divides

81
Q

Dendrogram grouping and information

A

Information is lost as groups are made.
The objective function measures informaton lsot at each step and allows rescaleing from 0 to 100% information. The goal is to maximize information and minimize grouping (groups should still be interpretable ecologically).

82
Q

Multivariate regression trees

A

Hierarchical grouping method. Impurity of groups is the sum of the squared Euclidean distance from individual samples units to the centroid.

83
Q

Nonhierarchical grouping

A

Groups are specified and then items are placed into those groups.

84
Q

Evaluating quality of groups

A

Chaining is the addition of single items to existing groups. Chaining increaes path lenght, so the goal is to reduce chaining.

85
Q

Circularity

A

testing for differences among groups using variables that were used to define groups.