L14 - Genetic Analysis of Gene Function Flashcards
Explain how the Sanger sequencing method has been automated
Rather than radioactively labelling primers, the ddNTPs used in sequencing are tagged with fluorescent dyes, eye base with a different colour. Then the dsDNA template strand is denatured by heating to break the hydrogen bonds between complimentary base pairs. DNA polymerase and a batch of primers are then added to the mixture and DNA synthesis can occur. At random points in the newly synthesised DNA strand a ddNTP will be incorporated into the strand resulting in termination of DNA synthesis. This will produce DNA strands with ddNTPs at every position in the sequence. These strands can then be separated on a gel allowing the sequences to flow continuously. A camera then measures the fluorescence at a fixed point in the gel as the DNA sequences flow underneath. This camera records the colour of the fluorescence corresponding to each base at each position in the sequence. The results are then presented as a graph showing the intensity of fluorescence over time, this is known as a trace and allows the computer to determine the DNA sequence base-by-base.
What is the main limitation of automated sequencing
Automated sequencing is restricted to 1000-1500 base pairs at a time
Progressive sequencing is one method of sequences genome sequences greater than 1kbps, explain how this process works
You start with a genomic library and genomic sequences. Automated sequencing is then used to determine the 1000bps at either end of the genomic insert, adjacent to the vector sequence. The ends of each clone are sequences using primers that have been designed for the vector sequence. Once the first 1000bps at either end of the genomic insert have been determined, primers are then designed for each of these sequenced regions. Another round of sequencing is then performed to determine the next 1000bp sequence of the genomic insert adjacent to the already determined regions. This process repeats until the sequences produced overlap in the middle.
To sequence large fragments of DNA greater than 1kbp, plasmids are used as a vector, T or F
F – bacterial artificial chromosomes are used
Other than progressive sequencing, what other method has been used to sequence DNA fragments greater than 1kbps and how does this work
Shotgun sequencing. First, a genomic plasmid library is created. The ends clone present in the vector plasmids are then sequenced at random using primers for the vectors. This produces short random sequences from each plasmid that are then assembled by a computer program stitching overlapping regions together to produce a contig. This contig represents the assembled sequence of overlapping regions.
What is are the main advantages of shotgun sequencing
Shotgun sequencing requires no thought and only requires a batch of primers predesigned for the vector sequence rather than needing the design and synthesis of multiple sets of primers. The process can thus be automated
What are the main disadvantages of shotgun sequencing
Because of the random sequences produced it requires the sequencing of more the 6x the size of the genome and is hence very inefficient. It will lead to the sequencing of some regions multiple times and there are always gaps. These gaps require progressive sequencing to be filled in.
Which method was used to sequence the human genome
A combination of shotgun and progressive sequencing
How many base pairs are there approximately in the human genome
3.2x109 – 3bn
How many genes are there in the human genome
23000
When was the human genome started and completed
Started in 1990, finished in 2003
Roughly how long does it take to sequence a human genome now
56 hours
The price of human genome sequencing has decreased from $100m to $8k today, T or F
T
How can gene prediction software be used to identify genes in a sequenced region of DNA
Prediction software can be used to analyse newly sequenced regions of DNA scanning for promoters, start/stop sequences and intron splice sites which would implicate a gene.
What is the problem with gene prediction software
You cannot be sure that predictions are always true and a gene has or hasn’t been identified