W3 - Microarrays Flashcards

1
Q

What is a microarray?

A

And ordered assembly of nucleic acids immobilised on a solid support.

Support - glass - like a microscope slide
If you look at it closely with a microscope and illuminate it with a scanner, will be able to see multicoloured spots. Each colour corresponds to measurements of a specific DNA hybridisation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the array compose of?

A

 Probes are the short pieces of single-stranded DNA immobilised on the surface of the array
 They are oligonucleotides
 Each spot on the array consists of
thousands of probes with the same sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does a Microarray work?

A

They each have 6 1/2 million locations on each array.
On each location, we have millions of DNA strands built up all with the same sequence (single stranded).

If we did in a DNA fragments that were tagged with fluorescent molecules, anything that they are complementary to will hybridise to correctly.

When you shine a laser light on them, the fluorescence tag will glow. This will be visible through the microscope or scanner or detector. If nothing hybridises, no glow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does gene expression microarray work?

A

What are the expression levels of all genes in my samples?
 The transcriptome - combination of all RNA
 Which genes are expressed at different levels between two different types of samples?

-Discover the biology of your samples
-Classify samples
-Predict which class a sample belongs to

  • lots of copies of the same probe on the spot.
  • each spot gives the relative expression for one transcript
  • detects all known transcripts in one sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How does expression profiling workflow work?

A

one colour and two colour array

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does the two colour array work?

A

We have a test sample and our control sample.
The isolate the RNA and make mRNA
We label one in red = Cy5
And the control in green = Cy3
We create cDNA using reverse transcriptase.
We mix the two and hybridise them.
We scan it and get the data
We look at the relative colour.

Roughly the same = yellow
If test use stronger it will be more red vise versa.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What steps are involved in data analysis workflow?

A

feature extraction - getting the raw data from array (CEL file)

Quality control - any issues about concentration or quality of the sample

Normalisation - attempt to make the data look like everybody else’s data.

Differential expression analysis- difference of expression between our test and control samples. 

Biological interpretation- clustering analysis, geneset enrichment, pathway or network analysis.

Submit data to a public repository.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is clustering?

A

Increasing gene expression and increase in body mass index. Organises data with similar patterns into classes - we are looking to see if there are any patterns in the data.

Objects within a class are more similar to each
other than to objects outside the class

Can be displayed on a dendrogram.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are Dendrograms?

A

Dendrograms – “Trees”
Alternative way of displaying
similarity between samples
Distant samples are less similar

Eg. In osteoartheritis, there is a whole group of genes there not expressed in the normal, if we compare them side by side. We can tell if someone has it at one glance.

Eg. This can also be done in identifying cancer.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What happens in qPCR?

A

Reverse transcriptase is used on RNA to form complementary DNA (cDNA) for PCR.
We would PCR the gene of interest. Also PCR housekeeping gene. This is a gene widely expressed in all cells and tissues.

We can make RT-PCR quantitative by counting the number of copies of amplified DNA present. We count the copies by using fluorescent molecules - tags. 

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is the exponential amplification in PCR dealt with?

A

If you begin with one sample, you can end with 17 billion copies after the 35th cycle for example. Virus if you started 15 samples, you could end up with 257 billion copies of the 35th cycle.

To establish a point where we can compare, we pick a common point.

fluorescence above background at 225 copies=Ct (threshold cycle)

So the higher the amount of starting RNA, the lower the Ct value. Ie. We need less cycles to make 225 copies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do you count the number of amplified molecules present?

A

Include a dye in the PCR reaction makes that fluoresces when it burns double-stranded DNA.

Label a probe in the PCR that only fluoresces when it is incorporated in the PCR product.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why do we use qPCR?

A
  • qPCR is used to independently confirm differences in RNA levels between samples
  • Probe binding is noisy and differences can be detected that are not real, especially where differences are small (<2-fold)
  • RNA-Seq is a more accurate measure of RNA transcript abundance, it is more reproducible and works over a much wider range of concentrations…..but it is more expensive

There would be one in the clinic for instance as EndoPredict Risk Estimation for Breast cancer. It tests for whether chemotherapy will be effective or not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Breast Cancer EndoPredict Risk Estimation?

A
  • At a low score (EPclin<3), endocrine therapy (ET) alone is sufficient
  • At higher scores, ET+C is clearly beneficial
    compared to ET alone
  • This test ensures that only patients who will
    benefit from chemotherapy will receive it.
    qPCR tests would be used in the clinic.
    In EndoPredict tests for 12 genes activity.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do SNPs work in Microarrays?

A

 Genome-wide Association Studies are only possible because we can genotype large numbers of SNPs in large numbers of subjects

 This is possible by using microarrays that hybridise with genomic DNA adjacent to SNPs (rather than RNA transcripts)

 The SNP is then extended by one base that is fluorescently labelled and detected using a high definition scanner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is in a Spot?

A

 Lots of copies of the same single-stranded
oligonucleotide – a “probe”
 Each probe is for genotyping one SNP

17
Q

How does genotyping microarrays work?

A

Each spot gives the genotype for one SNP
 Up to 5 million spots per sample on array
 Genome-wide analysis possible

If there was a SNP detecting G->A, Oligos are all the same lengths for each probe are permanently connected to the solid support (glass) Will wash over and hybridise DNA from our single person. All the pieces of DNA hybridise differently and they attach the complementary points so A-T and C-G. Underlined the base where the SNP is on the DNA that’s come in. That base is not present on the probe. We need to extend the DNA by a single base. These bases are fluorescently labelled, typically 2 colours. When the probes attach, can see that some have A or G, so they would get either a complimentary C or T being extended. Can’t extend any further because of the terminator nucleotides that we added. This would mean this particular strip is heterozygous. The combined red and green signal would be yellow under a scanner. If it was homozygous would likely be a red or a green. We have millions of these across the array.

18
Q

How do you read the SNP genotypes?

A

 Software translates the three different colour signals for each probe into genotypes
 A few SNPs are reviewed by hand (<50) but most are not

We can look at these in thousands of people - relatively cheap. Around £30 per person.

19
Q

What are Data Repositories?

A

 Maximise utility of microarray experiments
- share data
- use other people’s data
 If users provide the Minimum Information About a Microarray
Experiment (MIAME) then it is easier to compare results
 ArrayExpress, EBI (European Bioinformatics Institute)
 GEO – Gene Expression Omnibus, NCBI (National Center for Biotechnology Information, USA)

20
Q

What are Transcriptomics?

A

 What are the expression levels of all genes in my samples?
 The transcriptome
 Which genes are expressed at different levels between two different types of samples?
Discover the biology of your samples
Classify samples
Predict which class a sample belongs to

21
Q

What are SNP microarrays?

A

 Genome-wide Association Studies are only possible because we can genotype large numbers of SNPs in large numbers of subjects
 This is possible by using microarrays that hybridise with genomic DNA adjacent to SNPs (rather than RNA transcripts)
 The SNP is then extended by one base that is
fluorescently labelled and detected using a high
definition scanner

22
Q

What is a human Karyotype?

A

Cytogenesis would produce a metophase spread of the chromosomes from a person. They would spend a lot of time trying to identify abnormalities in chromosome pairs.

ACGH makes it much more routine and doesn’t require the same level of skill in detecting these things.

23
Q

What are structural Variants (SV) and Copy Number Variants (CNV)?

A

Variations mean significant changes in the genome.
Reference = ABC
- Deletions = AC
- Insertions= AEBC
- Inversion, where the order is reversed
- Tandem duplication, side by side duplication = AABC
- Dispersed duplication, ABAC
- Copy number varient, AAABC

These are all things that can be picked up via CGH

24
Q

What is Array Comparative Genomic Hybridisation (aCGH)?

A

We mix fluorescently labelled patients and controlled in a server. Cy5 and Cy3 labelling again with green and red.
Mix an equal amount, giving overall yellow solution. Then hybridise it to an array of probe for regions across the whole genome. We scan it for the vast majority of them - we will see yellow spots because equal amounts of patients in a control DNA will hybridise. Occasionally, there will have been a duplication in the patient - more green, Or deletion - more red. All done using a laser scanner.

Most variations are harmless and aren’t necessarily disease causing.

25
Q

What is microdeletion on 16p13.3?

A

This is a disease causing microdeletion.
On an array graph. 0 would be yellow and -1 green and +1 red - to see variations.
Defines a deletion - about 120 kilobytes - well under what can be seen on a karyotype, but large. Gives rise to Rubinstein Taybi syndrome.

26
Q

Prenatal testing?

A

In general array CGH is used.
- Initially just common abnormalities in PCR
- After that, if there are problems, array scanning is done to look for structural variants. It is a good technique.