Compositional Data Flashcards

1
Q

Compositional Data

A
  • multivariate sets of non-negative components
  • measured directly as proportions that sum to one or measured in absolute terms with different totals
  • interest is in the size of components relative to the total and relative to each other
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Historic Compositional Data

A
  • non-negative vectors that are subject to a uni-sum constraint
  • proportions that sum to 1
  • sample space - simplex
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Spurious Correlation

A
  • early work of compositional data based upon
  • arises due to sum constraint
  • increase in one component realtive to the total reduces the share of the other components, induces a negtaive correlation in the relative values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Spurious Correlation - outside simplex

A

when outside simplex this is not the case
number of Ford crashes is not negatively correlated with the number of VW crashes. however, when consider the proportions - a negative correlation can emerge. An increase in the proportion of Ford crashes relative
to the total would automatically reduce the proportion of Volkswagen crashes, even if the
actual number of Volkswagen crashes remains unchanged.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Scale Invariance

A
  • ratios between components unchanged under rescaling

multiplying by constant - ratios stay the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Subcompositional Coherence

A
  • relantionships between parts remain valid even when analysing a subset of components

A,B,C - analysing just A,B should not give conflicting conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Subcompositional Dominance

A
  • if one component dominates the full composition, it should dominate in any subcomposition

if A always greater than B in A,B,C - then should remain true in A,B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Permutation Invariance

A
  • order of the components should not affect the analysis

A,B,C should give the same results as B,A,C

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ternary Diagram

A
  • graphical visualtion of three components
  • near vertex - high concentration of that component
  • near centre - equal proportions of all components
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

ALR

A

additive log-ratio
* takes log of components to one reference component
* dependent on choice of divisor
* asymmetric

issue if there is not one non-zero component

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

CLR

A

centered log-ratio
* takes log of components to the geometric mean
* covariance singular as all components retained - determinant equal to 0

robustness issue during to singularity - some tehchniques not suitable (discriminant analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

ILR

A

isometric log-ratio
* uses orthonormal coordinates to transform components
* creates independent, orthogonal coordinates

harder to interpret and complex to construct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Rounded Zeros

A
  • represent values that fall below some detection limit
  • not true zero values
  • due to measurement error or below detection limit
  • treated by replacing the zero values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Structural Zeros

A
  • true zeros
  • actual zero or absence of component
  • informative
  • carry important information
  • model-based approaches
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Restrictive Definition of Compositional Data

A

compositional data in broader context

  • confining compositional data to simpelx too restrictive
  • assumes compositions must be analysed through relative terms rather than absolute values
  • compositional data can often orginate from counts / absolute values
  • absolute values can carry important inforamtion as they can influence the variance and overall dynamics of the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly