Lecture 12: DNA STRUCTURAL VARIATION: COPY NUMBER VARIANTS (CNVS) Flashcards

Question

The Infinium Whole Genome Genotyping Workflow = process..

Answer 1

1. genomic DNA 200-400ng DAY 1: 1. Make amplified DNA 2. Incubate amplifies DNA DAY 2: 3. Fragment amplified DNA 4. Precipitate and Resuspend 5. Prepare BeadChip 6. Hybridise samples on BeadChip DAY 3: 7. Image BeadChip 8. Genotype and LOH/CN analysis

Answer 2

image labelled of the software and what is in it on slide 34

Answer 3

1. LOG R RATIO (LRR) - Copy number - Log R Ratio (LRR) is a normalised measure of the total signal intensity for the SNP. - Any deviations from zero for LRR are evidence for a copy number change. 2. B ALLELE FREQUENCY (BAF) - BAF is a measure of the 'allelic intensity ratio' - 'Proportion of hybridized sample that carries the B allele as designated by the Infinium Assay'

Answer 4

CN = 2 NORMAL CN=1 DELETION CN =3 DUPLICATION

Answer 5

DIAGRAM ON SLIDE 38 - No copy number change - LogR 0.00 - No heterozygous BAF

Answer 6

'Uniparental disomy (UPD)' ...2◦ Usually a single large ROH (or a couple of large ROH) on same chromosome ...3.◦ May not overlap the imprinted region 'Identity by descent' ...4◦ Multiple ROH across different chromosomes ...5.◦ Arises when close ancestry/isolated ethnic population

Answer 7

Benign variant found in the normal population OR Clinically significant CNV associated with the patient’s phenotype

Answer 8

1. DGV (Database of Genomic Variants) http://dgv.tcag.ca ◦ Compare patient CNV to normal individuals 2. DECIPHER https://decipher.sanger.ac.uk ◦ Compare patient CNV to other patient genotype/phenotype details 3. ClinGen https://clinicalgenome.org ◦ Resource that defines the clinical relevance of genes and variants ◦ Dosage sensitivity scores for curated genes/regions (HS and TS) ◦ Database of patient CNVs categorised by clinical significance 4. UCSC Genome Browser http://genome.ucsc.edu ◦ View RefSeq/OMIM genes and numerous other information tracks 5. PubMed http://www.ncbi.nlm.nih.gov/pubmed ◦ Research publications relevant to a particular phenotype/gene of interest 6. gnomAD https://gnomad.broadinstitute.org/ ◦ Genome Aggregation Database with exome and genome sequencing data from large scale projects ◦ V2.1.1 data from 125,748 exomes and 15,708 whole genomes from unrelated individuals ◦ Various disease specific and population studies

Answer 9

1. Pathogenic * Variant contributes to the development of disease. 2. Likely Pathogenic * High likelihood that this variant is disease-causing 3. Uncertain * Not enough information at this time to support a more definitive classification of this variant 4. Likely Benign * Not expected to have a major effect on disease; however, the scientific evidence is currently insufficient to prove this conclusively 5. Benign * This variant does not cause disease

Answer 10

1 ➢Testing family members may be appropriate. 2 ➢Test parents for recurrence risk. 3 ➢Rule out balanced rearrangement 4 ➢Genetic counselling recommended

Answer 11

1 ➢Testing parents may be helpful 2 ➢Genetic counselling may be appropriate.

Answer 12

1. ➢Small sequence variants, balanced rearrangements, Low-level mosaicism not detected 2 ➢If a specific disorder is suspected then further testing may be appropriate.

Answer 13

1. *Much higher resolution and therefore higher diagnostic yield 2 * i.e 10-15% more significant abnormalities detected in patients with intellectual disability, developmental delay, autism and multiple congenital abnormalities 3 *Cheaper than karyotyping because of ease of automation 4 *Robust technology 5 *Tissue culture not needed

Answer 14

1 *Unable to detect balanced rearrangements 2 *No positional information for a duplication – do not know if tandem or transposed elsewhere in the genome 3 *Can be time-consuming to analyse and report 4 *Many CNVs are novel with unknown clinical significance 5 *Can detect incidental findings (eg deletion of cancer suppressor genes)

Answer 15

1. Multiplex PCR method to investigate the copy number of ~60 targets in one MLPA reaction MLPA technique 2. Wide variety of commercial kits available (>350) 3. Routine diagnostic tests used globally in many laboratories 4. - Only requires a thermocycler and capillary electrophoresis equipment 5. - Up to 96 samples can be processed simultaneously 6. - Results available within 24 hours. 7. - Targeted to specific regions of interest NOT whole genome analysis

Answer 16

1. MPLA probe mixes have probes that target specific genomic sequences. 2. An MLPA and RPO consist of 2 parts; a left and a right probe oligonucleotide (LPO and RPO). 3. The LPO and RPO contain PCR primer and DNA hybridisation sequences. 4. An additional stuffer sequence in the RPO gives each probe a unique length.

Answer 17

1. Sample denaturation and probe hybridisation - In the first step, purified sample DNA is denatured. This is followed by overnight incubation with MLPA probe oligos. The LPO and RPO parts of each probe hybridise to immediately adjacent target DNA sequences. 2. Probe ligation -The second step is the ligation of probe oligos that hybridised to immediately adjacent target sequences. No mismatches around the ligation site are permitted, making the ligation reaction highly specific. The number of probe ligation products is a measure for the number of target sequences in the sample. 3. Probe amplification - In the third step, ligated probes are amplified in a multiplex PCR using a single universal primer pair. Only ligated probes are exponentially amplified, making the removal of unbound and non-ligated probes unnecessary. 4. Fragment separation -The fourth step is fragment separation by capillary electrophoresis. PCR products are loaded onto a capillary electrophoresis device and separated by length. Each fragment corresponds to a specific MLPA probe. Step 4: Separation by capillary electrophoresis - each peak is the amplification product of a specific probe - samples are compared to a control sample - a difference in relative peak height or peak area indicates a copy of the probe-target sequence ...... 5. Data analysis The final step is data analysis by Coffalyser.Net™. Relative copy numbers are determined by comparing the relative peak heights of reference probes and target probes in the test samples with those in reference samples with a known normal copy number. Advanced quality checks help to recognize unreliable data.

Answer 18

diagram slide 56

Answer 19

PROBLEM!!! - Structural variants usually occur in regions with complex genomic architecture: flanked by repetitive sequence elements (LCRs). - Difficult to sequence. Maybe misidentified or missed entirely by existing methods using short-read sequencing

Answer 20

1 *CNVs detected with lower confidence than SNVs/indels 2 *Inconsistencies among different methods 3 *Lack of a high-quality reference for CNVs from ES data 4 *Most algorithms use depth of coverage assuming read depth is linearly correlated with the underlying true copy number 5 *However, read depth in NGS is variable .... 6 * sample batching, GC content, PCR duplication bias, targeted depth, sequencing efficiency, and mappability 7 *Difficult to differentiate between technical artefacts and the real signal for a true copy number change. 8 *Detecting CNVs in polymorphic regions of the genome is challenging

Lecture 12: DNA STRUCTURAL VARIATION: COPY NUMBER VARIANTS (CNVS) Flashcards

(46 cards)