Sequencing genes and genomes Flashcards

1
Q

What are the two approaches to DNA sequencing?

A

Chain termination and chemical degradation - chemical degradation is not used much anymore as there are automated methods that are based on chain termination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What forms the basis of chain termination?

A

During the synthesis of the DNA strand the formation of the phosphodiester bond between the growing strand and the incoming nucleotide requires a 3’ hydroxyl group as well as the 5’ tri-phopshate. Without the 3’ hydroxyl, the bond cannot form.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do dideoxynucleotides lack?

A

The 3’ hydroxyl group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What happens if dideoxynucleotides are incorporated into the chain?

A

The ddNTP cannot form another phosphodiester bond and the chain terminates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What happens in chain termination sequencing?

A

DNA polymerase (Klenow or more specialised enzymes) is mized with the single-stranded template, primers, a supply of dNTPs and a small amount of ddNTPs (of all different bases). The result is four families of molecules that are stopped at a particular ddNTP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can the sequence be read?

A

Polyacrylamide gel electrophoresis can separate the single stranded DNA molcules. If in a capillary system, it is possible to resolve ssDNA of all lengths up to 1500 nucleotides.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What happens after the size separation?

A

The fragments are run past a fluorescence detector to determine if the molecule is ATC or G .The data is fed to a a computer and the sequence is available as a nucleotide letter sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the problem if the data is represented graphically?

A

There is some noise underneath the peaks. This needs to be looked at to determine how reliable the sequence data is.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why would internal primers be used instead of a universal primer?

A

The maximum size you can sequence is 500-750 base pairs, but many internal primers based on the sequence can be synthesised for longer sequences and the total sequence is built up from the partial sequences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is there a size limit?

A

Resolving single stranded DNA molecules by electrophoresis gets more difficult as they are longer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can errors be identified?

A

Sequence the same area several times and in both directions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the two approaches to genome sequencing?

A

Shotgun and clone contig.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What happens in the shotgun sequencing method?

A

Random fragments are sequenced and the overlaps are identified. This information is used to build up a continuous genome sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What happens in the clone contig approach?

A

Overlapping clones are identified and a map is generated prior to the sequencing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a potential problem with shotgun sequencing?

A

It is random and the building up of the fragments is based only on computer programming, and is hard to tell if it’s 100% correct.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are other problems associated with shotgun sequencing?

A

It is based for smaller genomes as the computational aspects are complex and any identical repeated sequences can ruin the data.

17
Q

What is the pre-sequencing phase in the clone contig approach?

A

The fragments are cloned and organised into an ordered overlapping series of cloned DNA fragments.

18
Q

What is the clone contig method best for?

A

Larger genomes, but it is more expensive and time consuming.

19
Q

What cloning vectors are used in the clone contig method?

A

Vectors capable of taking very large fragments such as bacterial artificial chromosomes (BACs).

20
Q

What is the chromosome walking method used in BACs?

A

One clone is used as a hybridisation probe to the whole BAC library. The clones in which this clone hybridises with are then used to determine further overlapping clones to build up a clone contig. These clones are then each sequenced by the shotgun method.

21
Q

What method should be used if the genome of the species in question has already been sequenced?

A

Shotgun method as it is the easiest and cheapest method. The reference genome can help with the sequence assembly.

22
Q

What can happen after a genome has been sequenced?

A

Genes and their function can be identified. Most is done on a computer and requires expert systems based around knowledge of molecular biology coupled with computer power.

23
Q

How can genes potentially be identified?

A

A computer can search for an initiation codon (ATG) and a stop codon (TAA, TAG, TGA) in the same reading frame. These are likely to be genes.

24
Q

Why is identifying genes in eukaryotes harder?

A

Lots of the sequence is non-coding DNA, repeated sequences and the introns disrupt ORFs.

25
Q

How can genes be identified in eukaryote sequences?

A

Some good signposts include GC-rich regions usually occur at the start of genes, there may be codon bias that can help identify ORFs. Intron-exon boundaries can also be identified, upstream regulatory consensus sequences can also be recognised, along with training expert systems on common features of genes from a particular species or group of species.

26
Q

How can homology searches identify genes?

A

Looking at similar DNA based on common evolutionary histories of species and genes. The gene can also be translated into a sequence of amino acids and compare those.

27
Q

What are orphans?

A

Genes of unknown function.

28
Q

How can orphan genes be identified?

A

Using reverse genetics and making gene knockouts to see if there is a phenotypic change that can provide a clue of their function. This is easier in some species, such as yeast, compared to others.