1-24 prework Genetic Testing Flashcards
Human Genome Project
3 billion base pairs of “reference sequence”, therefore dipoloid genome 6 billion
techniques: mapping mRNA and expressed sequences to genome, computational tools to predict coding sequence, comparative genomics to other known species genomes, ENCODE project to ID regulatory sequences within the genome
20,000 genes mapped to human genome, less than people anticipated in 2001- only 1.5% is highly conserved coding, 3% highly conserved other, 45% transponson based repeats, 6% heterochromatin, 45% other non-conserved
lots of “junk” “genomic dark matter” DNA, poorly understand this dna
Mitochondrial genome
93% highly conserved coding, 5% highly conserved other, less non-conserved
clinical utility
evidence of improved measureable clinical outcome, requires “clinical validity” (relationship between the test result and the disease”,
clinical validity requires “analytical validity”, which is test accuracy
Methods to detect known genetic variations
- primer/probe design (requires prior knowledge of sequence)
- amplification (target or signal)
- detection (size or sequence variation)
polymerase chain reaction
first amplification method
realtime pcr- quantitative PCR, can make standard curve and quantify amount of starting material in unknown samples
methods to detect novel sequence variation
- sanger dideoxy chain terminating: 10% of nucleotides dideoxy so sequence gets terminated and you run on gel or flourophore- you cannot quantify amount of mutant allele presence and you can only detect down to 10% of mutant allele is present (mosaicism), you can detect novel variants in coding region, you cannot detect whole exon dels/dups
- Pyrosequencing: similar to sanger but via release of light is used to track, this also gives you quantitative data, useful for short sequences
- Massively parallel sequence (NextGeneration, NGS): used pyrosequencing in a massively parallel region, 1. create library 2. ligate adaptors 3. capture and amplify 4. sequence 5. trim and align 6. analyze; can be quantitative, has low limit of detection, and removes cis-trans ambiguity, there are some errors so you want enough reads of strands to remove those erros
Sequencing limitations
many known mutations are non-conding, non-splice site so we won’t detect them by traditional exon sequencing
primers are designed to amplify segmentsi ncluding coding sequence and slice sites only, therefore large deletions will not be detected
Massively parallel sequence (NextGeneration, NGS):
used pyrosequencing in a massively parallel region, 1. create library 2. ligate adaptors 3. capture and amplify 4. sequence 5. trim and align 6. analyze; can be quantitative, has low limit of detection, and removes cis-trans ambiguity, there are some errors so you want enough reads of strands to remove those errors
not at clinical grade detection of single nucleotide changes but getting better
limitations: certain muts are not detectable, certain regions are not covered, analytical validity does not equal clinical validity
Transition Transversion missense nonsense silent
purine for purine (vice versa) purine for pyrimidine (vice versa) change in single AA codes for stop codon (stop gain) codes for same aa (synonymous): this could still give a splice site change