Lecture 1 Flashcards

1
Q

What are dideoxynucleotides

A

They are structural analogues of deoxynucleotides - lack OH group on the sugar

They can’t join on extending nucleotides and stops the chain extending any further.

They can incorporate into a DNA strand during DNA synthesis but do not permit further extension of the chain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What did sanger sequencing involve

A

The method exploited dideoxynucleotides. Its an example of sequencing by synthesis (replicating the strand as we sequence it)

  1. A DNA sample is denatured and as a new strand is synthesised using a mixture of dNTPs and ddNTPS, the new DNA strand extends an existing short DNA molecule of a known sequence called a primer.
  2. As each nucleotide us added to the chain, there is a chance that a terminator nucleotide will be added instead so no more bases can be added and end up with a truncated sequence
  3. The products are run side by side on a polyacrylamide gel to separate them according to size and the sequence can be read off the gel from the bottom upwards
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What was a later development of the Sanger sequence

A

The use of fluorescently-labelled dideoxy bases which meant that sequencing could be performed in a single capillary tube with the bases distinguished by the colour of the fluorescent label

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When was the first microbial genome published

A

It was the bacteriophage OX174 genome in early 1977, the image shows an electron micrograph of the virus which is a pathogen of the bacterium e.coli

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the features of Bacteriophage ox174

A

It has a small genome, consists of a circular DNA molecule of 5386 base pairs.

The genome is extremely compact and had overlapping open reading frames (same section of DNA encodes two different proteins- increasing gene density)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Whats the largest single DNA molecule to have been sequenced

A

Phage lambda genome (48502 base pairs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What did Fred Blattner propose

A

The possibility of sequencing an entire bacterial genome of 5Mb. This demonstrated considerable ambition as the total amount of DNA sequenced worldwide was only 2.3Mb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is E.coli K-12

A

Originally isolated from a convalescent diphtheria patient in 1922.

It became adopted as the standard E.coli for lab studies, referred to as the workhorse of molecular biology (high rate of replication, doubling size of 20 minutes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

E.coli K-12 clone by clone sequencing

A

Section of the E.coli genome were clonsed into bacteriophage lambda and ordered based on informed from a genetic map.

The clones each contained about 250kb of the E.coli genome and there were a series of publications of the sequences of individual segments between 1993 and 1995

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What was the first genome to be completed

A

A team led by Craig Venter published the 1.8Mb genome sequence of haemophilus influenzae Rd in 1995.

Instead of a clone-by-clone approach, they adopted a shot gun sequencing strategy involving the sequencing of random sections of genome followed by computational assembly of the complete chromosome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why was H.influenzae chosen to be sequenced

A

Because it’s a small non-motile, gram-negative bacterium of the family Pasterurelleceae.

Its a human commensal found in the upper respiratory tract and also cause infections in the respiratory tract and middle ear.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does Sanger-based shot gun seqeuncing involve

A

The bacterial chromosome is randomly fragmented through enzymatic means (e.g. using restriction enzymes) or physical methods such as sonication.

The fragments are cloned into a evtor such as a plasmid or bacteriophage and inserted into E.coli or another organism to grow many copies of each individual clone.

Individual copies are picked and stored as a library of clones which are sequenced using Sangers method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we assemble shot gun sequencing data

A

The reads are obtained from either end of each DNA fragment. They can be computationally assembled to produced long sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why is a complete chromosome unlikely to be obtained directly from shot gun sequencing

A

There are likely to be gaps either due to regions of the genome which could not be cloned or due to repetitive sequences which could be resolved during assembly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Whats the finishing step in sanger sequencing

A

Manually closing the gaps in the alignment through the use of molecular biology methods such as PCR, southern blots and sequencing

17
Q

Whats the second bacterial genome

A

Mycoplasma genitalium- gram positive pathogen

At the time was the smallest known genome of free-living organism (580kb and was the found to contain 470 predicted protein coding genes)

18
Q

Comparison of H.influenzae and M. genitalium

A

There were 240 genes conserved across both, this list of genes did not represent mininmal gene set for cellular life.

22 genes encoding non-orthologous replacement proteins were identified

19
Q

What was the proposed minimal gene set

A

256 genes

20
Q

Features of E.coli K-12 genome

A

4.6Mb and contained 4288 protein coding genes- hailed as a landmark for molecular biology

21
Q

What are FUN genes (function unknown)

A

Large number of open reading frames which had no known function and no similarity to previously characterised sequences

22
Q

What is campylobacter jejuni

A

One of the major causes of food poisoning worldwide. It’s a helical shaped gram-negative bacterium and its compact genome and importance as a pathogen has made it a strong candidate for an early genome sequencing project

23
Q

What did sequencing of C. jejuni reveal

A

The presence of a large number of hypervariable sequences in the genome, usually poly G:C tracts.

These regions can vary in length during DNA replication due to a strand slippage and this can result in changes in the reading frame

24
Q

What are hypervariable regions

A

Long tracks of C and Gs in a row, they vary in length during DNA replication which can change reading frames of proteins, genes are found associated with synthesis of protein structure.

These phage variable regions allow C.jejuni to adapt to lots of different environments

25
Q

What did sequencing M.tuberculosis provide

A

Provided insights into biology of the organism including genes associated with antigenic variation and pathogenicity and the large portion of the genome devoted to lipid metabolism

26
Q

What was the reason some genes could not be cloned

A

Because they encoded gene products which are toxic to e.coli. Such toxic genes are of considerable potential in development of biotechnological applications and novel antimicrobial therapies

27
Q

What are comparative genomics

A

Analyses of diverse new bacterial genomes revealed differences in genomic composition and organisation from e.coli paradigm.

28
Q

What are homoplymeric repeats

A

Tandem repeats of the same base

29
Q

What did C.jejuni contain

A

Several dozen hypervariable homoplymeric repeats concentrated in genes that were responsible for biosynthesis or modification surface structures

30
Q

What does reductive genome evolution mean

A

Occurs when sexually isolated lineages adapt to a restricted niche - characterised by the creation of non functional pseudogenomes followed by a complete loss of sequences that are no longer needed for bacterial survival

31
Q

What did genomic mining of plant pathogens such as pseudomonas syringae reveal

A

Many new type 3 secretion system effectors

32
Q

What did virulence factors encoded in genomes of non pathogens lead to

A

A new “eco-evo perspective” - genomic analyses of pathogens and commensals were embedded in non ecological and evolutionary context. Takes into account lifestyle shifts

33
Q

What did sequencing the genomes of Shewanella oneidensis lead to

A

Provided insights into the process of metal ion reduction in the environment and opened up new possibilities for bioremediation

34
Q

What do SNP based comparisons allow

A

Between the genomes of a sensitive parent strain and a resistant daughter strain can be used to identify the targets of new drugs

35
Q

What does nano pore sequencing allow

A

A single molecule long read sequencing for the masses - can generate reads that are long enough to span large scale repeats and early proof of principle studies suggest that it can deliver genome scale assemblies for bacteria

Its also portable - enables near patient or in the field sequencing but is less accurate

36
Q
A