Lecture 22 - Genomics and Bacterial Evolution Flashcards

1
Q

Functional genetics

A

Work out how a protein works from the genetic code, and experimental data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Study of a single genome

A

Genomic analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Study of several genomes

A

Comparative genomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What did Fred Sanger initially sequence?

A

PhiX bacteriophage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Size of PhiX

A

~5Kb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
Sanger sequencing method
1)
2)
3)
4)
A

1) Break sequence of interest into fragments
2) Place in test tube with dideoxynucleotides, each with an individual dye. ddnucleotides terminate chain elongation
3) Run fragments on a polyacrylomide gel, which can resolve to individual base pair level
4) The dye colour of nucleotides is read

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Read size of sanger sequencing

A

~600bp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Read

A

The length of a single piece of DNA that can be sequenced by a particular method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Read assembly

A

Reads are placed together, according to consensus sequences.

This forms a contig, which is a sequence of reads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Contig

A

Where read sequences overlap, make a sequence of consensus sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Gap

A

When a computer can’t find a match in reads to make a contig

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why can gaps occur?
1)
2)

A

1) DNA polymerase can’t extend sequence for some reason

2) If there is a repeated region, and the read size is smaller than the size of the repeat.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Automated Sanger sequencing method

A

Capillary electrophoresis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
Illumina sequencing 
1)
2)
3)
4)
5)
6)
7)
8)
9)
A

1) Break DNA of interest into fragments
2) Adaptors of known sequence are added, ligate to the ends of dsDNA
3) A glass slide is prepared, with sequences complementary to primers adhered to surface
4) Hybridisation of primers, adhered complementary sequences
5) Add unlabelled nucleotides, DNA polymerase. Bridge amplificaiton
6) DNA synthesis, bridges become double stranded
7) Denaturation, to ssDNA
8) PCR to make high-density DNA clusters
9) Bases tagged with fluorescent dyes added. When a base is added, emits fluorescence which is detected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Key difference between Sanger and Illumina

A

Illumina sequencing can continue on same strand after dye-tagged base is added.

Fluorescent part is cleaved off when base is incorporated, so it doesn’t interfere with further elongation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

MiSeq output per run

A

15Gb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

NextSeq500 output per run

A

120Gb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

HiSeq2500 output per run

A

1000Gb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

MiSeq read number

A

25 million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

NextSeq500 read number

A

400 million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

HiSeq2500 read number

A

4000 million

22
Q

MiSeq read length

23
Q

NextSeq500 read length

24
Q

HiSeq2500 read length

25
MiSeq time for run
~4 hours
26
Most inexpensive sequencing method
Illumina
27
PacificBio RS output per run
375Mb
28
PacificBio RS read number
~45,000
29
PacificBio RS read length
Over 20Kb
30
``` What is PacificBio RS? 1) 2) 3) 4) 5) ```
1) Single molecule, real time sequencing 2) DNA synthesis by immobilised DNA polymerase 3) Phospholinked nucleotides release light when incorporated 4) No amplification 5) Under 180 minutes per run
31
``` PacificBio RS method 1) 2) 3) 4) 5) ```
1) Don't fragment DNA of interest too much (reduces read length) 2) Repair ends 3) Adaptor ligation to DNA ends 4) DNA is polymerised by DNA polymerase fixed in a 0-mode waveguide well 5) When a phosphonucleotide is incorporated, light is emitted and detected. Each base has a different dye, and emits a different wavelength of light
32
Size of wells used in PacificBio RS
Zeptolitre quantities
33
Why is it better to not have an amplification stage in sequencing?
Not all DNA is amplified at equal levels. | This can affect results
34
What are long read lengths useful for?
For complex sequences of DNA, such as repeat regions.
35
Sanger output per run
9600bp
36
Read number of sanger
96
37
Sanger run time
3 hours
38
PacificBio RS run time
30 minutes - 3 hours
39
Sanger cost per Mb
$2400
40
Illumina cost per Mb
$0.15
41
PacificBio RS cost per Mb
$1
42
Genome annotation
A process which locates genes in a genome map
43
How to annotate a genome 1) 2)
1) Identify open reading frames | 2) Experimentally identify gene function, or compare to other genes
44
Open reading frame
Over 100 codons that are uninterrupted by a stop codon. | See if there is an obvious ribosomal binding site at the 5' end, terminator sequence at 3' end
45
Bioinformatics 1) 2) 3)
1) Analysis of a genome using computers 2) Generates information of genome structure, content, arrangement 3) Uses annotation to determine location of genes on newly-sequenced genome
46
Significance of an open reading frame
Presumed to encode a protein
47
BLAST
Basic local alignment search tool
48
A tool used in bioinformatics
BLAST
49
BLAST role
Compares primary sequence information from different genomes
50
Type of sequencing methods that Illumina and PacificBio RS are
Sequencing by synthesis