Complexity of Biological Systems & Molecular Functions (1) Flashcards

1
Q

What is the equation for complexity?

A

C= f (S; E)

S: adjacency matrix
E: entropy matrix
Measured in cbit: complexity bit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Maxam and Gilbert DNA sequencing?

A

Chemical sequencing- based on chemical modification and scission
Involved multiple steps, including radio-labelling
Dependent on other DNA technology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Sanger DNA sequencing?

A

Chain termination- based on a biological activity (DNA replication)
Used a processive enzyme (DNA polymerase)
Simple and amenable to improvement
Amenable to automation
First of “sequence-by-synthesis” (SBS) technologies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How has Sanger sequencing been improved and automated?

A

Fluorescently labelled chain terminators (single lane)
Capillary electrophoresis separation (extension)
Automated data transfer to computer (collection)
Incorporation of PCR technology (sensitivity)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are examples of first generation genomic sequencing?

A

Maxam & Gilbert sequencing
Sanger sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is sequence data from Maxam and Gilbert sequencing interrogated?

A

Identify what you need to sequence then sequence it

Restriction mapping
Fragment isolation
End-labelling
Two secondary cleavages
Chemical sequencing
Find sequence overlap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is sequence data from Sanger sequencing interrogated?

A

Sequence randomly and puzzle data together

Sequencing
Sequence assembly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is pairwise sequence alignment?

A

Compare 2 similar sequences
Simple dot matrix: match/mismatch -> scoring system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What algorithms are used in bioinformatics?

A

Dynamic (recursive) algorithms: break down the problem into multiple steps and consider all possible alignments
Heuristic algorithms: find best answer using a scoring system (variable)
Scoring system: values for matches and mismatches, penalties for gaps

Must be efficient: computing time and memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does global alignment with a linear gap penalty work?

A

Matrix-based algorithm, recursive for each cell in matrix
Alignment of two sequences over entire length

Fill table from top left and read from bottom right

Sequence broken and regions of similarity identified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does local alignment with a linear gap penalty work?

A

Looks for high-scoring matches between regions of two sequences

Fill from top left and read from the highest value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the terms of the Needleman-Wunsch algorithm?

A

H (i,j) =
diagonal: H (i-1, j-1) + S (ai, bj)
vertical: H (i-1, j) - g
horizontal: H (i, j-1) - g

IN LOCAL ONLY: start again: 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the differences between global and local Needleman-Wunsch alignment?

A

Initialisation:
In global, first row and column could be subject to gap penalty.
In local, first row and column are 0

Scoring:
In global, scores can be negative
In local, any negative scores are set to 0

Traceback:
In global, read from bottom right to top left
In local, start at highest score and end at 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the PAM matrix?

A

Point accepted mutation matrix
Used for aligning protein sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is combined opening and extending gap penalty?

A

W (l) = g open + g ext (l-1)
g ext > g open usually

Trace between 3 matrices

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is progressive pairwise alignment for multiple sequences?

A

Initial pairwise alignment for each pair of sequences
Calculate ‘relatedness’ for each pair of sequences
Construct a guide tree (phylogenetic) based on relatedness
Global pairwise alignment of most closely related sequences
Successive alignment of aligned clusters using average scores

17
Q

What is D in DNA sequence evolution?

A

D = Distance
A fraction that differs
Not linear or additive
e.g., when time (t) is small, D increases linearly with t
3/4 when t is large

18
Q

What is d in DNA sequence evolution?

A

Evolutionary distance (d) as estimated number of substitutions per site

19
Q

What is Nx in de novo genome assembly?

A

The % of the sequence in contigs

sequences -> overlaps -> contigs -> scaffolds -> draft genome

20
Q

What are examples of 2nd generation DNA sequencing methods?

A

Solexa sequencing (Illumina)
Pyro-sequencing (454 Life Sciences)
SOLiD (ABI)

21
Q

What are examples of 3rd generation DNA sequencing methods?

A

Single molecule real-time sequencing (SMRT)
Zero-mode waveguides (ZMWs)
PacBio: (parallel ZMWs)
Protein Nanopores
Oxford Nanopore