6 - Phylogenetic Inference: Trees and Distances Flashcards

1
Q

List four things that HMMs can do in bioinformatics

A
  1. Build and refine multiple alignments or profiles in a mathematically rigorous way
  2. Assess the probability that a sequence belongs to a given family of sequences used to train the HMM
  3. Gene finding/annotation in genome sequences
  4. Regulatory region (promotor0 finding in genomes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Give four terms used in trees

A
  • Root
  • Branch
  • internal node (eg. branch point/hypothetical ancestor)
  • Terminal node
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Draw the tree for the following Newick tree file format

((A,B),C),(D,E)

A

Each comma is a node, the comma between brackets is the root.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are cladograms and phylograms?

A

Cladogram: a branching diagram showing the cladistic relationship between a number of species.

Eg. the branchlengths don’t mean anything.

Phylogram: Scaled branch lengths indicate divergence. Ultrametric trees are often called ‘strict molecular clock trees’ because their branchlengths are directly proportional to time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can you tell if a tree is rooted or not?

A

Ask yourself where is the root (must come to a point) and how was it determined?

Most tree building methods will not explicitly root the tree unless you tell it to.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Give three ways for rooting a tree

A

Ad hoc: Historical precedent or other knowledge. You decide placement of root in the tree

Intrinsic: Eg. parsimony methods, likelihood, UPGMA etc. Midpoint rooting is where you find the point of the tree that is equidistant from all the tips (in an ultrametric tree) or some approximation to this.

Outgroup rooting: Add species/sequences to your analysis that you know based on some other criteria (fossil data, morphological phylogeny, previous molecular phylogenetic results etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are monophyletic groups?

A

A clade on a tree. Clades are subtrees of larger trees. The set of all clades in a tree can be expressed by the cladogram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is polyphyly and paraphyly?

A

Polyphyly: Multiple origins for the groups, the group does not contain the common ancestor

Paraphyly: SIngle origin for the group, but the group does not contain all the descendents of the common ancestor

A paraphyletic group often represents a grade on the tree (ie. a kind of morphological organization in a paraphyletic group).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are three broad reasons for gene trees not necessarily being species trees?

A
  • Coalescence of alleles within species
  • Paralogy and orthology
  • Lateral gene transfer (xenology)

Eg. paralogs result from gene duplication and orthologs result from speciation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly