Lecture 22 - Genomics and Bacterial Evolution Flashcards

1
Q

Functional genetics

A

Work out how a protein works from the genetic code, and experimental data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Study of a single genome

A

Genomic analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Study of several genomes

A

Comparative genomics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What did Fred Sanger initially sequence?

A

PhiX bacteriophage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Size of PhiX

A

~5Kb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
Sanger sequencing method
1)
2)
3)
4)
A

1) Break sequence of interest into fragments
2) Place in test tube with dideoxynucleotides, each with an individual dye. ddnucleotides terminate chain elongation
3) Run fragments on a polyacrylomide gel, which can resolve to individual base pair level
4) The dye colour of nucleotides is read

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Read size of sanger sequencing

A

~600bp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Read

A

The length of a single piece of DNA that can be sequenced by a particular method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Read assembly

A

Reads are placed together, according to consensus sequences.

This forms a contig, which is a sequence of reads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Contig

A

Where read sequences overlap, make a sequence of consensus sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Gap

A

When a computer can’t find a match in reads to make a contig

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why can gaps occur?
1)
2)

A

1) DNA polymerase can’t extend sequence for some reason

2) If there is a repeated region, and the read size is smaller than the size of the repeat.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Automated Sanger sequencing method

A

Capillary electrophoresis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
Illumina sequencing 
1)
2)
3)
4)
5)
6)
7)
8)
9)
A

1) Break DNA of interest into fragments
2) Adaptors of known sequence are added, ligate to the ends of dsDNA
3) A glass slide is prepared, with sequences complementary to primers adhered to surface
4) Hybridisation of primers, adhered complementary sequences
5) Add unlabelled nucleotides, DNA polymerase. Bridge amplificaiton
6) DNA synthesis, bridges become double stranded
7) Denaturation, to ssDNA
8) PCR to make high-density DNA clusters
9) Bases tagged with fluorescent dyes added. When a base is added, emits fluorescence which is detected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Key difference between Sanger and Illumina

A

Illumina sequencing can continue on same strand after dye-tagged base is added.

Fluorescent part is cleaved off when base is incorporated, so it doesn’t interfere with further elongation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

MiSeq output per run

A

15Gb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

NextSeq500 output per run

A

120Gb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

HiSeq2500 output per run

A

1000Gb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

MiSeq read number

A

25 million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

NextSeq500 read number

A

400 million

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

HiSeq2500 read number

A

4000 million

22
Q

MiSeq read length

A

2x300bp

23
Q

NextSeq500 read length

A

2x150bp

24
Q

HiSeq2500 read length

A

2x125bp

25
Q

MiSeq time for run

A

~4 hours

26
Q

Most inexpensive sequencing method

A

Illumina

27
Q

PacificBio RS output per run

A

375Mb

28
Q

PacificBio RS read number

A

~45,000

29
Q

PacificBio RS read length

A

Over 20Kb

30
Q
What is PacificBio RS?
1)
2)
3)
4)
5)
A

1) Single molecule, real time sequencing
2) DNA synthesis by immobilised DNA polymerase
3) Phospholinked nucleotides release light when incorporated
4) No amplification
5) Under 180 minutes per run

31
Q
PacificBio RS method
1)
2)
3)
4)
5)
A

1) Don’t fragment DNA of interest too much (reduces read length)
2) Repair ends
3) Adaptor ligation to DNA ends
4) DNA is polymerised by DNA polymerase fixed in a 0-mode waveguide well
5) When a phosphonucleotide is incorporated, light is emitted and detected. Each base has a different dye, and emits a different wavelength of light

32
Q

Size of wells used in PacificBio RS

A

Zeptolitre quantities

33
Q

Why is it better to not have an amplification stage in sequencing?

A

Not all DNA is amplified at equal levels.

This can affect results

34
Q

What are long read lengths useful for?

A

For complex sequences of DNA, such as repeat regions.

35
Q

Sanger output per run

A

9600bp

36
Q

Read number of sanger

A

96

37
Q

Sanger run time

A

3 hours

38
Q

PacificBio RS run time

A

30 minutes - 3 hours

39
Q

Sanger cost per Mb

A

$2400

40
Q

Illumina cost per Mb

A

$0.15

41
Q

PacificBio RS cost per Mb

A

$1

42
Q

Genome annotation

A

A process which locates genes in a genome map

43
Q

How to annotate a genome
1)
2)

A

1) Identify open reading frames

2) Experimentally identify gene function, or compare to other genes

44
Q

Open reading frame

A

Over 100 codons that are uninterrupted by a stop codon.

See if there is an obvious ribosomal binding site at the 5’ end, terminator sequence at 3’ end

45
Q

Bioinformatics
1)
2)
3)

A

1) Analysis of a genome using computers
2) Generates information of genome structure, content, arrangement
3) Uses annotation to determine location of genes on newly-sequenced genome

46
Q

Significance of an open reading frame

A

Presumed to encode a protein

47
Q

BLAST

A

Basic local alignment search tool

48
Q

A tool used in bioinformatics

A

BLAST

49
Q

BLAST role

A

Compares primary sequence information from different genomes

50
Q

Type of sequencing methods that Illumina and PacificBio RS are

A

Sequencing by synthesis