The 100,000 Genomes Project; Cardiovascular GeCIP Flashcards
How is library preparation pursued to facilitate WGS?
- DNA sample first retrieved, extract Extract nucleic acids from blood, tissue, saliva, etc.
- Shear dsDNA into fragments
- Attach adapters to fragments
How is the library sequenced?
Library taken and loaded onto a flowcell: similar to a microscope slide
HTS machine processes a flowcell containing lanes
Each lane may contain multiple samples (indexed with a DNA barcode)
Read group (RG) specifies a combination of sample and flowcell
DNA libraries deposited on flowcell
-> amplified to form clusters
Used method to amplify fragments to clusters which is a variation of pcr- called bridge amplification because we are performing this process on the surface of the flow cell
Once flowcell and clusters prepared, loaded onto sequencing machine and sequencing by synthesis performed, fragments sequenced and different coloured reversible terminators used to determine base pair sequence of each of the fragments on flow cell
Results in enormous amount of short read sequences representative of all our library or genomic DNA sample
How is NGS data analysis facilitated?
- Sequence read alignment: Taking reads and aligning against reference genome in our case human reference genome and identifying all the mapping locations of these reads, along that reference genome
- Once process complete, can perform quality control of that data look at coverage, how well is the genome covered?
- Once pleased with data, variant calling pursued. Looking for variants in data relative to reference genome
Typically, we will detect 3 Million variants in any given human genome
True of false
True
What does all this genetic variation mean?
We need extensive annotation tools to make informed decisions about variants
What is the order of importance of functional consequences?
- NonSense
- Frameshift causing indels
- Splice site
- NonSynonmous
- Non frameshift causing indels
- Synonymous
- NonCoding
The function of non genetic variants is more difficult to elucidate
What can be used?
Functional predictions programs
- SIFT (protein altering)
- PolyPhen
What is the 100,000 genomes project
100,000 genomes project
Bring direct benefit to patients
Enable new scientific discovery and medical insights
Create an ethical and transparent programme based on consent and engagement
To kick-start the development of a UK genomics industry
England- wide collection
GMCs (genomic medicine centres)
What is the NHS?
NHS was established in 1948
Worlds largest publically funded health service
One of the largest employers in the world (1.5 Million)
100k doctors, 300k nurses
1.4 million patients treated every 24 hours
116 Billion annual budget
Who/what is being sequenced in by Genomics Health England?
Rare diseases - families
Cancers – germline and tumour samples
What is the role of technology in Genomics health england?
Illumina have been commissioned to sequence the DNA of participants
Whole genome sequences returned to
Genomics England ready for analysis
Dedicated team of bioinformaticians for clinical interpretation.
How does genomics health england use data?
The data is analysed within secure infrastructure (information governance)
Participant privacy and confidentiality
Anonymised data
Scientists and clinicians will access the data for research
Improve diagnosis rate of rare diseases
Understand how genomics impacts health and healthcare
Suggest possible new treatments (precision medicine)
Understand the causes of disease
How are we interpreting the mutations ?
This is the most difficult part
We can recruit patients and generate genetic data faster than we can interpret it
Genetic findings can be actionable
What is used to aid variant interpretation, apart from ACMG?
Genomics England Panel App
Community driven genetic interpretation
Crowdfunding research
‘Experts’ develop lists of possible genes than can cause a disease panels are reviewed by the community
Diseases have specific sets of virtual gene panels as a first port-of-call to look for pathogenic mutations
What is is the classification of variants by genomics England designed to do?
Maximise diagnostic efficiency
Balance sensitivity and specificity
Variants within virtual panel divided into three tiers-
Tier 1 variants
Known pathogenic
Protein truncating
Tier 2 variants Protein altering (missense) Intronic (splice site)
Tier 3 variants
Loss-of-function variants in genes not on the disease gene panel
Expert review is required