Earlier genomic analysis approaches; Whole genome sequencing Flashcards
Early in the 20th century…prior to the development of early methods for DNA sequencing
Geneticists typically followed a two-part classical genetics approach to identify and characterize all of the genes in an organism’s genome:
- they identified spontaneous mutations or collected mutants induced by chemical or physical agents.
- they generated linkage maps using these mutant strains.
Identified genes in model organisms like Drosophila, mice, maize, yeast, bacteria and viruses.
The genomics era was introduced by
Sanger et al. when they sequenced the 5400 bp genome of the phage ɸX174 in 1977
In the 1980s, geneticists interested in mapping human genes began using recombinant DNA technology to map DNA sequences to specific chromosomes. Most of these sequences were not actually full-length genes but marker sequences such as restriction fragment length polymorphisms (RFLPs). Once assigned to chromosomes, these were used in pedigree analysis to establish linkages between the markers and disease phenotypes for genetic disorders. This was called positional cloning and was used to map, isolate, clone, and sequence genes for cystic fibrosis, neurofibromatosis, and many other disorders.
In the 1990s it was estimated that there were approximately 100,000 genes in the human genome which was later found to be inaccurate. Rapid advances in DNA-sequencing methods now enable sequencing larger and more complex genomes of eukaryotes, including the human genome. These DNA-sequencing technologies are responsible for modern genomic analysis.
Many disadvantages:
require a mutation in a gene before a linkage map can be constructed; very slow; only in non-human organisms.
In the 1980s, recombinant DNA technology was used to map human DNA sequences to specific chromosomes.
These sequences were not full-length genes but marker sequences such as restriction fragment length polymorphisms (RFLPs).
Once these markers were assigned to chromosomes, it could be used to establish linkages between the markers and disease phenotypes for genetic disorders.
Allowed for more than 3500 genes and markers to be mapped to human chromosomes.
In the 1990s, human genes estimated at ±100 000 – impossible to map and clone using traditional methods
Genomics allows sequencing of
entire genomes
Most widely used strategy for sequencing and assembling an entire genome involves
variations of a method called whole-genome sequencing (WGS) or shotgun cloning.
In whole-genome sequencing (WGS) or shotgun cloning, what are the purpose of restriction digests?
(or sonication) of whole chromosomes generate thousands to millions of overlapping DNA fragments.
Whole-Genomes are Sequenced and assembled using Bioinformatic applications
Software that create DNA sequence alignments.
Alignments identifies overlapping sequences, which can be used to map onto chromosomes.
Overlapping sequences are adjoining that together form a continuous DNA fragment, called a contig.
The WGS shotgun method was developed by
J. Craig Venter at The Institute for Genome Research (TIGR) in 1995, when they sequenced the 1.83-million-bp genome of the bacterium Haemophilus influenzae.
This was the first complete genome sequence from a free-living (i.e. nonviral) organism demonstrating “proof-of-concept” that shotgun sequencing could be used to sequence an entire genome.
In the example, alignment software has identified an overlap between three fragments of sequenced DNA (contigs 1, 2 and 3) from human chromosome 2. The software is able to assemble the three sequences into one much larger sequence using the overlaps. In this way, the sequence of the entire chromosome can be assembled in silico.
While possible, it is a time consuming and costly exercise to sequence an entire genome by the Sanger method.
The major technological breakthrough that made genomics possible was the advent of computer-automated sequencers (high throughput) like the 454 sequencing method.
High-Throughput Sequencing and Its Impact on Genomics
Conventional sequencing is too slow for WGS…
Discovered High-Throughput Sequencing (HTS)
Computer-automated DNA sequencers:
Designed for high-throughput sequencing, thus making genomics possible
Essential for Human Genome Project
Sequencers contained multiple capillary gels (96)
Generated over 2 million bp per day