18.02.06 SNP arrays Flashcards

1
Q

What is a SNP and what are their genomic distribution?

A

Single nucleotide polymorphism (SNP) - a DNA sequence variation occurring commonly within a population (e.g. >1%) in which there is a single nucleotide change e.g. C to T. Approximately 50 million SNPs have been found in the human genome.

The genomic distribution of SNPs is not homogenous; SNPs occur in non-coding regions more frequently than in coding regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How can SNP density be predicted?

A

SNP density can be predicted by the presence of microsatellites: AT microsatellites are potent predictors of SNP density, with long (AT)(n) repeat tracts tending to be found in regions of significantly reduced SNP density and low GC content

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the different types of SNP in coding DNA?

A
  1. Synonymous SNPs, which do not affect the protein sequence, but still can affect its function.
  2. Nonsynonymous SNPs which do change the amino acid sequence of protein. The nonsynonymous SNPs are of two types: missense and nonsense.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe the three principles of SNP arrays.

A
  1. An array containing immobilized allele-specific oligonucleotide (ASO) probes.
  2. Fragmented nucleic acid sequences of target, labelled with fluorescent dyes.
  3. A detection system that records and interprets the hybridization signal.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the signal intensity in a SNP array depend on?

A

The amount of target DNA in the sample (to detect copy number), and

The affinity between target DNA and probe (binding will occur more efficiently between sequences with 100% of matched bases than those with 90% matched bases due to a SNP mismatch).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In general, what are SNP arrays used for?

A

In general, information obtained from the array data on the presence/absence of more than one allele is used to look for UPD and LOH, whilst the combined information on the number of alleles at individual SNP sites is used for determining copy number.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Describe the underlying priciples of Agilent and Illumina SNP arrays.

A

Both techniques require specialised equipment to measure signal intensity of the probe following hybridisation to the target sequence.

Computational algorithms are used to convert probe signal intensity into genotypes. Genotyping accuracy is quoted at over 95.5%.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe the Illumina SNP array.

A

Attached to each Illumina bead is a 50nt long sequence complementary to the sequence adjacent to the SNP sit.

The single base extension (T or G) that is complementary to the allele carried by the DNA (A or C) then binds and results in the appropriately-coloured signal (red or green).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the resolution of the Illumina SNP array?

A

Illumina latest array is the CytoSNP-850k. As name suggests, has 850,000 SNPs

Has 3262 targeted regions (inc ISCA and CCMC genes)

> 50kb backbone resolution

> 10kb resolution in ISCA regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the Agilent SNP array.

A

The assay consists of probes, 25 nucleotides long for both alleles.

The locations of the SNP locus varies from probe to probe

The target DNA binds to both probes regardless of teh allele it carries.

When the target sequence is complementary to all 25 bases of the probe the signal strength is strong

The presence of a SNP results in reduced affinity between the probe and the target sequence and a weaker signal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the resolution of the Agilent SNP array?

A

> Affymetrix array is the Genome-Wide Human SNP Array 6.0

> 1.8 million markers. 946,000 for CNV detection, 906,600 SNPs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is the term ‘absence of heterozygosity’ used in SNP array analysis?

A

Terminology, ‘absence of heterozygosity’ important as it does not imply any mechanism, see only AA, or BB pattern, nothing in the AB heterozygosity track indicating heterozygosity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why might a combined SNP and oligo approach be used?

A

SNPs do not occur evenly across the genome leading to arrays with uneven coverage, therefore the current “Gold Standard” for copy number detection is aCGH. To overcome the problem of lower overall resolution, combined SNP and Oligo arrays have been designed, whereby standard oligos are added to the SNP design to fill in the gaps and improve coverage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the applications of SNP arrays in the postnatal setting?

A

Microarray testing is now the first line test for patients with developmental delay and/or dysmorphism in the majority of cytogenetic labs in the UK. With experience and a defined workflow the problems of interpretation and high signal-noise ratio can be overcome in order to get the benefits of a combined SNP/oligoarray platform.

Although SNP arrays provide LOH data, it is only relevant to report such cases with careful consideration – for example if a UPD syndrome is suspected in the clinical details (e.g. Prader-Willi syndrome). It is not usually relevant to report consanguinity (suggested by patches of LOH across the genome). Any UPD result should be confirmed by MS-MLPA or equivalent test.

Long stretches of Homozygosity may be useful to unmask potential recessive diseases in families where the clinician suspects/knows the involvement of a particular gene – further testing is needed to confirm a homozygous mutation in the suspected gene.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the applications of SNP arrays in the prenatal setting?

A

Prenatal testing by SNP array needs even more careful consideration. In-house experience (usually from postnatal microarray testing), a platform with minimal false positive calls, a fast turnaround time, a defined workflow, access to internal and external databases, extensive literature searching, and good communication with clinical geneticists are vital in order to provide a good service. A combined SNP/oligoarray platform provides extra reassurance for each call. SNP arrays do not require sex-matched controls, therefore the sex of the prenatal sample is not needed before set up.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the applications of SNP arrays in oncology?

A

The combination of SNP array genotyping combined with genomic copy number analysis has been the method of choice for tumour genetic studies where detecting copy number changes and acquired uniparental disomy are informative e.g. CLL, MDS, ALL, AML, multiple myeloma, renal cell carcinoma.

Can be used to detect:

  1. LOH
  2. DNA CNV
  3. Combined CNV and LOH
  4. Methylation analysis
  5. Allelic specific gene expression
17
Q

Give an example of why it is useful to be able to detect LOH and CNV in oncology samples via SNP array.

A

LOH and Allelic Imbalance (AI) – originally SNP arrays were used to find AI which allowed further refinement/differentiation of subgroups of cancer e.g. small cell lung cancer can be differentiated from non-small cell lung cancer by LOH

DNA Copy Number Aberration Analysis – e.g. copy number gain of the 8q region, where the prostate cancer overexpressed androgen regulated gene, TPD52 is located.

18
Q

What is the advantage of a combined SNP and oligo array approach in cancer?

A

Combined LOH and DNA Copy Number Analysis – better than a SNP array alone.

Analyzing both the LOH and DNA copy number changes pioneered the application of SNP analysis in the study of LOH without loss of DNA copy number in human malignancies

E.g1. In acute myeloid leukemia a high frequency (20%) of UPD, and UPD regions on chromosome 19 coincided with a previously identified homozygous mutation in the CEBPA gene.

Further mutation screening identified mutations at four distinct loci, WT1, FLT3, CEBPA, and RUNX1, within the UPD regions in 7 out of 13 cases of acute myeloid leukemia.

19
Q

What are the principles of using SNP array for methylation detection?

A

SNP arrays have been adopted for epigenetic and gene expression analysis. The current principle of Affymetrix array analysis is to apply selected restriction enzyme digested DNA samples for hybridization. It can be easily adjusted for DNA methylation studies by comparing the presence and absence of DNA sequences between the methylation sensitive and non-sensitive enzymes.

20
Q

How can allelic specific gene expression be detected using SNP arrays?

A

Using cDNA instead of genomic DNA as starting material, allelic specific gene expression can also be detected by SNP array analysis. Allelic specific gene expression may be important in tumourigenesis

21
Q

What are the limitations of SNP arrays?

A
  1. Unable to detect balanced rearrangements/gene fusions and whole genome ploidy changes.
  2. Mosaicism is not always reliable under 20-30% (depending on size and direction) i.e. so not good for Minimal residual disease (MRD) detection, or specific nucleotide mutations.
22
Q

What other applications of SNP arrays are there?

A
  1. In both pre and postnatal applications SNP alleles with give evidence of maternal cell contamination (MCC) or mosaicism if present. Levels of mosaicism need to be 20%+ to be reliably detected, however lower levels have been recorded.
  2. Markers – can be used to determine the origins of small supernumerary marker chromosomes.
  3. Allows sample comparison should a sample mix-up need to be excluded.