Bacterial Genomics Flashcards

1
Q

Microbial genomes

A

small
high density with small intergein regions with little or no intros with very small amount of repeat or non coding dna

short protein coding genes

operons with promoters just upstream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Bacterial Genome organisation

A

Chromosomes
Generally a single circular chromosome (always DNA)

Some species have linear chromosome(s) - Borrelia, Streptomyces

Some species have two chromosomes - Vibrio cholerae

Both circular and linear - Agrobacterium tumefaciens, Borrelia burgdoferi

Plasmids
Independent autonomous replicon - circular or linear

may integrate into chromosome

copy number varies 1 to 10s

often carry non-essential genes that confer an adaptive advantage in certain conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

sanger method history

A

Sanger chain-termination method, developed in the late 1970s, second Nobel Prize for Fred Sanger.

Technical improvements in the 1980s led to increased sequence read length from 400 to over 800 bp

Principal limitation of Sanger sequencing is requirement for cloning step in library construction with non-clonable regions not present in library

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

how the sanger method is done

A

A DNA primer is radiolabeled

The primer is annealed to the template DNA

The primer is extended by DNA polymerase
Incorporation of a deoxynucleotide - further extension possible
Incorporation of a dideoxynucleotide – chain termination

Four reactions set up
ddATP, dATP, dCTP, dGTP, dTTP
ddCTP, dATP, dCTP, dGTP, dTTP
ddGTP, dATP, dCTP, dGTP, dTTP
ddTTP, dATP, dCTP, dGTP, dTTP

then results shown on autoradiograph (read from bottom)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The development of Automated singer sequencing

A

Replaced radioisotopes with fluorescent dyes

  • Safer
  • Each of the four DNA bases assigned a different colour
  • All four reactions could be run on a single lane rather than four as previously
  • The migration of the dye could be read because of the fluorescence
  • This information allowed automatic gel reading

Further improvements were made

  • Improved dye chemistry using fluorescent dideoxy-terminators
  • Replacing slab gels with capillaries
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

whole genome shotgun sanger sequencing

A
  1. random shearing of genome (cutting)
  2. size selection and put in plasmid vector
  3. clone and pick colonies to make shotgun library
  4. sequence each insert with two primers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Era of high through put sequencing

A
100x faster, 100x cheaper!
Different technologies available
- 454 (Roche)
- Illumina
- Ion Torrent
- PacBio

Fundamentally different from Sanger sequencing

  • Solid-phase amplification of clonal templates
  • New chemistries for sequence reading
  • > 454: pyrophosphate detection on base addition
  • > Illumina: reversible de-protection of fluorescent bases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

High throughput shotgun sequencing

A
  1. shear randomly
  2. size exclusion
  3. add adapters
  4. amplify
  5. sequence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

stages in bacterial genome sequencing project

A

Sequencing

  • Shotgun sequencing of randomly generated fragments – restriction enzymes or more commonly physical fragmentation
  • many fold coverage to ensure completeness and few ambiguities – very quick with modern approaches

Assembly
- Ordering of sequences through identifying overlaps, computationally intensive, may yield complete continuous genome sequence, often “gaps” that need filling manually – time intensive

Annotation

  • From sequence data to gene list, automated or manual – very time intensive
  • Allows display in more visually appealing way
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Genome annotation

A

Annotation is the addition of information about the predicted sequence features to the flat file of DNA code

Identification of potential coding sequences - CDS

Homology searches to predict function

Other features can be annotated as well (rRNAs/tRNAs, Promoters, Small non-coding RNAs, Repeat sequences, Insertion sequences (ISs), transposons, gene fragments)

Location of the origin of replication

Determination of the number of bases, genes and G+C%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what have we found out about bacterial genomes?

A
  1. Variety of genome size
  2. Some genomes are shrinking
  3. A genome is not representative of a species
  4. Genome sequence data as the ultimate epidemiological tool
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Massive gene decay in the leprosy bacillus

A
  • has pseudogenes (27% inactive reading frame)
  • 50% protein coding
  • remaining 23% noncoding (mutated beyond recognition and regulatory sequences)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Beyond to genome to the pan genome

A

“one bacterial species - one genome sequence” is no longer the paradigm

Bacterial genomes comprise:

  • core sequences — genes that encode proteins involved in essential functions, such as replication, transcription and translation
  • dispensable sequences that encode proteins which facilitate organismal adaptation

Dispensable sequences are characterized by:

a variable pattern of presence or absence in different bacterial isolates.

high rates of nucleotide sequence variability

association with pathogenicity islands - virulence and resistance to antimicrobials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

whats a genome

A

The entire hereditary information of an organism that is encoded by its DNA (or RNA for some viruses).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

whats the pan genome

A

The global gene repertoire of a bacterial species that comprises the sum of the core and the dispensable genome (from the Greek ‘pan’, meaning whole).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

whats the core genome

A

The pool of genes that is shared by all the strains of the same bacterial species.

17
Q

whats the meta genome

A

The global genetic repertoire of an environmental niche that is constituted by diverse organisms such as free-living microorganisms in the wild or the commensals of a particular niche in a mammalian host (from the Greek ‘meta’, meaning beyond).

18
Q

genome sequencing and epidemiology - Clostridium difficle

A

most common cause of antibiotic associated diarrhoea

associated with hospitalization and antibiotic treatment

Gram-positive, spore forming and toxigenic

Infection can be asymptomatic or cause diarrhoea, pseudomembranous colitis and death

500,000 cases per year in US hospitals and long-term care facilities with 15,000 to 20,000 patient deaths

an emergent clone

stain responsible for large number of deaths and sine has been documented

sequenced 151 strains and located 1 SNP to see where it had originated from by phylogenic tree

19
Q

Genome sequencing and epidemiology - Cholera

A
  • hati outbreak
  • caused by vibrio cholerae in dirty water
  • spreading in dominican republic and cuba

found out by genome sequencing that source was from UN camp sewage