18.02.15 NGS applications Flashcards
What is WGS?
Whole genome sequencing = sequencing of the entire genome - mtDNA + nuclear DNA.
What is WES?
Whole exome sequencing covers coding sequences of all annotated protein-coding genes (~23,000). Equivalent to 1-2% of the total haploid genomic sequence (~30Mb).
Contains 85% of all DNA mutations that have an effect on human disease.
What are the advantages of WGS?
- Allows examination of SNVs, indels, SV and CNVs in coding and non-coding regions of the genome. WES omits regulatory regions such as promoters and enhancers.
- WGS has more reliable sequence coverage. Differences in the hybridisation efficiency of WES capture probes can result in regions of the genome with little or no coverage.
- Coverage uniformity with WGS is better than WES
- Regions of the genome with low sequence complexity restrict the ability to design useful WES capture baits resulting in off-target capture effects
- PCR amplification isn’t required during library prep, reducing potential of GC biase. WES frequently requires PCR amplification as the bult input amount needed to capture is generally around 1ug of DNA
- Sequencing read length isn’t a limitation. Most target probes for exome-seq are designed to be less than 120nt long, making it meaningless to sequence using a greater read depth
- A lower average read depth is required to achieve the same breadth of coverage as WES
- WGS does not suffer from reference bias. WES capture probes/baits tend to preferentially enrich reference alleles at heterozygous site producing false SNV calls.
- In WES the RefSeq collection is targeted, hence current capture probes only target exons that have been identified so far. The exome will change as our understanding of the human genome improves.
What are the advantages of WES?
- WES is targeted to protein-coding regions, so reads represent less than 2% of the genome. This reduces the cost to sequence a targeted region at a high depth and reduces storage and analysis costs (although the cost of WGS is likely to decrease more rapidly than WES).
- It reduces the data fatigue associated wtih WGS in interpreting the large amount of data produced, when only a relatively small subset of the variation that WGS detects have demonstrated health consequences.
- Reduced costs make it feasible t increase the number of samples to be sequences, enabling large population based comparison.
- Until very recently the time, cost and technical expertise required to generate and analyse WGS data largely precluded serious consideration of its use outside of research settings, but the situation is changing rapidly and this can no longer be assumed to be the case.
Which types of diseases are more suited to WES/WGS technology?
WGS: suitable for mendelian and complex trait geen identification as well as sporadic phenotypes caused by de novo SNVs or CNVS
WES: GOod for hihgly penetrant mendelian disease gene identification.
What types of variants are detected in WGS vs WES?
WGS: Uncovers all genetic and genomic variation (SNVs and CNVs). Discovery of functional coding and noncoding variation. ~3.5 million variants.
WES: Focuses on ~1% of the genome. LImited to coding and splice-site variants in annotated genes. ~20,000 variants.
How were disease traditionally mapped?
~3000 loci identified through positional mapping e.g. karyotyping, linkage,, CNV analysis followed by Sanger sequencing.
Difficulty identifying rare disease loci due to small number of cases/families to study, reduced penetrance, locus heterogeneity and diminished reproductive fitness.
What other methods have been used for gene discovery? Briefly summarise.
GWAS: have contributed to the discovery of loci involved in complex traits but in almost all cases, they collectively account for only a small fraction of the observed heritability of the trait. Little is known about the extent to which rare alleles contribute to the heritability of complex traits.
NGS: one-step approach but present a challenge invariant interpretation. The number of variants identified in WGS/WES varies depending on exome enrichment kit, sequencing platform, algorithms used for mapping and variant calling, quality filters and % of ready showing the variant.
How should variants be prioritised for suspected pathogenicity?
- Unique in patients/very rare in general population
- Located in protein-coding gene
- Directly affecting the function of the protein encoded by the mutated gene
Strategies for finding causal alleles against this background depend on factors such as the mode of inheritance of a trait; the pedigree or population structure, whether a phenotype arise owing to a de novo or inherited variant, extend of locus heterogeneity for a trait.
Which disorders does exome sequencing have the potential to aid in the accurate diagnosis of?
Mendelian disorder which
1) Present with atypical manifestations
2. Are difficult to confirm using clinical or laboratory criteria alone e.g. when symptoms are shared among multiple disorders
3. Require extensive or costly evaluation e.g. a long list of candidate genes, such as for NSHL and or CMT/IPN, ID syndromes
What are the ethical considerations with WES?
Increased the chance of incidental findings; clinically useful results unrelated to the primary aim of investigation e.g. susceptibility to cancer.
Give examples of how WES strategies can be used to aid diagnosis in patients with rare disease.
- Sequence and filter across multiple, unrelated, affected individuals. This approach is used to identify novel variants in the same gene(s).
- Sequence and filtering among multiple affected individuals from within a pedigree to identify a gene(s) with a novel variant in a shared region of the genome.
- Sequencing parent-child trios to identify de novo mutations.
- Sampling and comparing the extremes of the distribution for quantiative phenotype.
What are the advantages of targeted NGS vs WGS/WES?
1) Slightly cheaper
2) Coverage better as can fill gaps with Sanger
3) Can interpret and report fully the importance of each variant identified, whereas variants have to be filitered in WES/WGS as there are so many
4) Less chance of incidental findgins.
What are the general applications of targeted NGS?
Hereditary disorder
Therapeutic decision making for some somatic cancer
Infectious disease testing
Tumour profiling
MRD
Amplicon sequencing
Prenatal testing and screening - NIPT/PGD
What are the advantages and disadvantages of using targeted NGS
Only genes with a known disease relationship are included.
Inflexibility of target panels mean redesign is required to incorporate novel targets.
The reduction in the cost of exome capture and sequencing reagents makes a virtual exome approach more viable; sequencing an exome and masking all but the desired data.