The 100,000 Genomes Project; Cardiovascular GeCIP Flashcards

1
Q

How is library preparation pursued to facilitate WGS?

A
  1. DNA sample first retrieved, extract Extract nucleic acids from blood, tissue, saliva, etc.
  2. Shear dsDNA into fragments
  3. Attach adapters to fragments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is the library sequenced?

A

Library taken and loaded onto a flowcell: similar to a microscope slide

HTS machine processes a flowcell containing lanes

Each lane may contain multiple samples (indexed with a DNA barcode)

Read group (RG) specifies a combination of sample and flowcell

DNA libraries deposited on flowcell
-> amplified to form clusters

Used method to amplify fragments to clusters which is a variation of pcr- called bridge amplification because we are performing this process on the surface of the flow cell

Once flowcell and clusters prepared, loaded onto sequencing machine and sequencing by synthesis performed, fragments sequenced and different coloured reversible terminators used to determine base pair sequence of each of the fragments on flow cell

Results in enormous amount of short read sequences representative of all our library or genomic DNA sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How is NGS data analysis facilitated?

A
  1. Sequence read alignment: Taking reads and aligning against reference genome in our case human reference genome and identifying all the mapping locations of these reads, along that reference genome
  2. Once process complete, can perform quality control of that data look at coverage, how well is the genome covered?
  3. Once pleased with data, variant calling pursued. Looking for variants in data relative to reference genome
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Typically, we will detect 3 Million variants in any given human genome

True of false

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does all this genetic variation mean?

A

We need extensive annotation tools to make informed decisions about variants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the order of importance of functional consequences?

A
  1. NonSense
  2. Frameshift causing indels
  3. Splice site
  4. NonSynonmous
  5. Non frameshift causing indels
  6. Synonymous
  7. NonCoding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The function of non genetic variants is more difficult to elucidate

What can be used?

A

Functional predictions programs

  • SIFT (protein altering)
  • PolyPhen
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the 100,000 genomes project

A

100,000 genomes project

Bring direct benefit to patients

Enable new scientific discovery and medical insights

Create an ethical and transparent programme based on consent and engagement

To kick-start the development of a UK genomics industry

England- wide collection

GMCs (genomic medicine centres)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the NHS?

A

NHS was established in 1948
Worlds largest publically funded health service
One of the largest employers in the world (1.5 Million)
100k doctors, 300k nurses
1.4 million patients treated every 24 hours
116 Billion annual budget

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Who/what is being sequenced in by Genomics Health England?

A

Rare diseases - families

Cancers – germline and tumour samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the role of technology in Genomics health england?

A

Illumina have been commissioned to sequence the DNA of participants

Whole genome sequences returned to
Genomics England ready for analysis

Dedicated team of bioinformaticians for clinical interpretation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does genomics health england use data?

A

The data is analysed within secure infrastructure (information governance)

Participant privacy and confidentiality

Anonymised data

Scientists and clinicians will access the data for research

Improve diagnosis rate of rare diseases

Understand how genomics impacts health and healthcare

Suggest possible new treatments (precision medicine)

Understand the causes of disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How are we interpreting the mutations ?

A

This is the most difficult part

We can recruit patients and generate genetic data faster than we can interpret it

Genetic findings can be actionable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is used to aid variant interpretation, apart from ACMG?

A

Genomics England Panel App

Community driven genetic interpretation

Crowdfunding research

‘Experts’ develop lists of possible genes than can cause a disease panels are reviewed by the community

Diseases have specific sets of virtual gene panels as a first port-of-call to look for pathogenic mutations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is is the classification of variants by genomics England designed to do?

A

Maximise diagnostic efficiency
Balance sensitivity and specificity

Variants within virtual panel divided into three tiers-

Tier 1 variants
Known pathogenic
Protein truncating

Tier 2 variants 
Protein altering (missense)
Intronic (splice site)

Tier 3 variants
Loss-of-function variants in genes not on the disease gene panel

Expert review is required

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the objectives of Cardiovascular GeCIP?

A

To provide the genetic diagnosis to patients with cardiovascular diseases

To identify new causative genes for phenotypes included within the project

To functionally characterise causative genetic variants and mechanisms of disease

To examine genotype/phenotype relationships and identify factors relevant to prognosis and response to treatment
If possible, to identify potential for orphan drug or novel therapeutic strategies

To equip trainees with the necessary genomic medicine knowledge and skills for the genomics era

17
Q

The range of diseases covered by the Cardiovascular GECIP is…

Therefore Cardiovascular GeCIP is divided into what?

A

broad, involving conditions of the myocardium, blood vessels, lymphatics, and also lipid metabolism leading to premature atherosclerosis

‘sub-domains’

  • Cardiomyopathy
  • Arrhythmias
  • Familial thoracic aortic aneurysms and dissection
  • Congenital heart disease
  • Primary lymphoedema
  • Familial hypercholesterolaemia
  • CADASIL negative small vessel cerebral disease
18
Q

What three main arrythmia syndrome phenotypes eligible for WGS?

A

Long QT Syndrome (LQTS)
Brugada Syndrome (BrS)
Catecholaminergic polymorphic ventricular tachycardia (CPVT)

19
Q

What does the data access include?

A

The dataset includes de-identified, linked information for each participant:

  • Genome sequence data
  • Variant call files
  • Phenotype/clinical data
  • Hospital Episode Statistics (HES)

GeCIP members need to belong to a participating institution e.g. St Georges

Access to the 100,000 Genomes dataset is via a remote desktop hosted by the Genomics England datacentre – the Research Environment

After logging on, a virtual desktop will open in a window of your internet browser

No sequencing or clinical data is available for download

On board high-performance computer (HPC) for genome analysis