Sequencing and Bioinformatics Flashcards
dNTP vs ddNTP?
What do these abbreviations stand for?
Similarities and Differences?
Nickname for ddNTP?
Both can incorporate into a newly synthesised DNA strand
Deoxy Nucleotide (dNTP) - Oxygen present on 3’ C allows for chain extension
Dideoxy Nucleotide (ddNTP) - 3’ oxygen removed preventing chain extension; Terminator nucleotide
Explain Sanger Dideoxy sequencing
It is an example of ‘sequencing by _______’?
Sequencing by Synthesis
DNA is denatured, and new strand is synthesised with a mixture of dNTPs and ddNTPs
New DNA strand extends a known primer
As each nucleotide is added to the chain, there’s a chance that a terminator nucleotide will be added
If this occurs then no more bases can be added
The products are then run on a gel to figure out the sequence
Name of sequence once a ddNTP is added
Truncated sequence
Alternate ways of carrying out Sanger sequencing?
Running 4 reactions, each with a different ddNTP i.e. ddATP, ddGTP etc.
Explain Dye-Terminator Sequencing
What are the benefits?
Using fluorescently labelled ddNTPs
- This allows all the products to be run in the same lane in capillary gel electrophoresis
Each base is identified depending on the colour/wavelength of the fluorescent tag
This process can be automated and scaled up to industrialise the process of sequencing to sequence large mounts of DNA
How many bp can the ABI 370 sequencer read at a time?
800bp
Problems with sequencing human genome?
3 billion base pairs; Takes a long time if reading 800bp at a time
Piecing together the genome is also a challenge
Process the Human Genome Project Strategy (IHGSC) used? (hint: BACs)
- Extract human genomic DNA from multiple people
- Fragment DNA so they are small enough for the sequencers to read
- Size selection of 100-200kb fragments
- Clone fragments into Bacterial Artificial Chromosomes (BACs)
What are BACs and what do they do?
They are plasmids which contain sequence elements which trick E. coli into replacing/copying them during the cell cycle (as if they were a native plasmid)
Explain how BAC cloning is performed
BAC cloning amplifies DNA
BACs are transformed into E. coli cells and these grow colonies
These colonies grown contain millions of copies of the same fragment
What are genetic and physical maps used for in the sequencing of the genome?
Established a dense set of genetic markers across the human genome
What do genetic maps rely on?
What is the relationship between distance between markers and recombination frequency?
They rely on recombination frequency between markers
The further apart markers are from one another, the higher the recombination frequency
How do they link BAC clones to the genetic and physical maps?
Clones are tested for PCR markers that have known locations on genetic and physical maps
How are overlapping clones identified?
What is the likely origin for such clones?
The end of the insert, in the BAC containing the marker, is sequenced
Using the sequence used to design PCR primers, BAC clones containing that end sequence are looked for
Such BAC clones are likely to come from a neighbouring genetic locus
How are BAC clones organised?
Which types of BAC clones are identified?
They are ordered relative to the genetic map, with PCR primers
- These PCR primers are complementary to the sequence of a particular marker, allowing us to test and identify BAC clones that overlap the genetic marker
- Once we know that a BAC clone overlaps a marker, we know roughly where to place it on the genetic map
How is sequence data from the end of the insert obtained?
What is done with this data?
Sequence from the vector backbone into the BAC insert
This sequence data is used to design a new set of primers
How are these primers used with the BAC library?
Where are the clones identified likely to be?
They are ran with the BAC library to identify clones which contain the end sequence but not the original genetic marker
These are likely to be derived from a genomic region adjacent to the source of the original BAC clone
How is this BAC insert sequencing process repeated?
What can be done in the other direction?
The end of the second BAC clone insert is sequenced, allowing for another set of primers to be designed to identify a third BAC clone with an insert that overlaps the second
The same thing can be done at the other end of the original clone which overlaps the marker, to obtain overlapping BAC clones in the other direction
What is this whole approach of identifying and ordering overlapping BAC inserts called?
Chromosome walking
Allows a library of clones to be built up in the correct order relative to the genetic map
How does shotgun sequencing work?
Many fragments are sequenced at random and then assembled
Size of BAC inserts used in shotgun sequencing and how are the BAC clones generated?
BAC clone DNA is fragmented into smaller 5-10kb fragments and cloned into plasmid vectors
How are primers used in shotgun sequencing to derive plasmid insert sequence?
The sequence of the plasmid that the insert is in is known, so primers could be designed to sequence the insert
Done many times to derive a consensus of the insert sequence
What did Celera want to do with the human genome data?
Patent and commercialise it
What was the sequencing approach Celera used?
How is this different to IHGSC?
They used a shotgun Whole Genome Sequencing (WGS) approach
Instead of creating BAC clones, Celera fragmented into much smaller fragments of 2-50Kbp