L17 Genomics in Drug Discovery RP Flashcards
what is a genome?
an organisms complete set of genes and chromosomes.
what is genomics, and what can we achieve by understanding it?
the process of mapping, squencing and analysing genomes. Through understanding genomics, we can identify genes involved in a disease (novel genes).
Cloning genes to identify them may be useful when predicting what they do. This is ideal as ophan drugs can be matched with new treatment indications. e.g Albuterol - doesnt work on mexicans due to polymorphism on B2 receptors
how many base pairs are there in the body?
3 billion base pairs in the body. With 2% encoding genes. Meaning there is only a 2% chance to obtain a mutated gene that codes for something like a protein.
A gene is
a gene is a section of a chromosome in the nucleus of the cell
what is the use of mRNA in genomics?
used to measure expression. mRNA is formed from DNA via RNA Polymerase
what enzyme converts mRNA-> DNA?
Reverse Transcriptase
what is splicing
process where useless genes are removed and the exons (useful genes) are joined together. mRNA splicing occurs to rearrgange the order of exons which is also how different varieties of antibiodies can be made.
The process where useless genes are removed and the exons (useful genes) are joined together is known as
splicing
what are DNA microarrays?
They’re able to measure the expression of 50,000 mRNA transcripts and 100,00 polymorphisms per experiment.
The appropriate oligonucleotide is matched with the opposite sequence of the mRNA strand.
They’re able to measure the expression of 50,000 mRNA transcripts and 100,00 polymorphisms per experiment.
The appropriate oligonucleotide is matched with the opposite sequence of the mRNA strand.
DNA Microarrays
what is reverse transcriptase PCR used to produce?
several copies of DNA from a single mRNA strand.
several copies of DNA from a single mRNA strand is made by a process called
Reverse Transcriptase Polymerase Chain Reaction (PCR)
Explain Microarray Expression Analysis
1) In this experimental setup, the cDNA derived from the mRNA of known genes is immobilized.
2) The probe (sample) has genes from both the normal as well as the diseased tissues.
3 The probe will hybridise onto the cDNA which bears a fluorescent marker.
4) This expression pattern is then compared to the expression pattern of a gene responsible for a disease.
what does cDNA allow us to do?
cDNA allows us to compare healthy mRNA with faulty mRNA,
Healthy mRNA is the control and the regulation of specific genes are determined.
RT PCR is conducted for both healthy mRNA and faulty mRNA
give me a scenario of cDNA fluorescent markers in use
so Green fluorescence can signifty healthy. and Red can signify faulty.
Advantages of Genomics
It is fast as the expression levels for over 10,000 genes can be obtiained from the experiment.
-> It is comprehensive, as it deals with the entire human genome on two chips.
- > Flexible as can identify any SNP and genome
- > Used to identify parent groups that the drug would work in
- > It can be used to revive drugs that were thought to be never used
Disadvantages of Genomics
1) Expensive process
2) Expression of mRNA may not reflect what occurs at the protein level
3) Data analysis process is very complex
4) Huge number of redundant data points
5) Can only use for genetic diseases
In genomics, how do we identify significant genes?
Can be identified using stats. This can be achieved in three ways:
- Ranking can be used to identify the significant genes, by ranking the genes from highest -> lowest and taking the top 1%
- A Threshold can also be used. You can select which genes may have a 2X effect on a disease etc
- Inferrential and Descriptive statistics:
Inferrential Identifies significantly regulated genes, while descriptive indentifies co-regulated genes
in gene identification what is the issue in ranking genes?
- It is likely that most crucial genes may be missed out, which could be towards the bottom
- The Top 1% may not be important.
Describe the main tool used in statistical analysis in genomics
Inferential Stats: T-tests are done to identify the right order of genes to observe and the ones to cut off. This is achieved by conducting t-tests for every gene and accept these with a significance value of <0.05 (less than 0.05) However, some genes provide a false positive. The P value is implemented to correct this.
How do you calculate the P value. And when the P value is calculated what does this even mean?
Original p value(0.05)/ square root of number of all samples. In the cas ethat a few genes gave a value lower than the p value, it needs to be confirmed that genes influence the changes that are expected as the values are lower than the P values, meaning thart the genes are signifnicant.
What is the next step after identifying useful genes?
cloning the useful genes by RT PCR and then western blotting to confirm that those genes influence protein expression
what is cDNA
cDNA is known to be synthesized from mRNA. It is synthesized in a reaction that is catalyzed by the reverse transcriptase and DNA polymerase enzymes.
Compare and Contrast the Genomic and Traditional Drug discovery approach
Genomics is is a Genome wide screen approach that interrogates over 18,000 genes, and allows you to generate thousands of data points per experiment
By Contrast, Traditional DD looks Biological function, Addreses one Target or pathway and generates 1 data point per experiment.
Compare and Contrast HTS with Traditional DD and Genomics
Genomics uses Multi Arrays, and you can generate up to 48 million data points.
Traditional DD produces 96 data points per experiment, using 96 wells for HTS
What was the initial aim of the Human Genome Project?
Identify role of gene products, how they are controlled, how they interact,
and their involvement in human disease
what was the findings of the HGP?
Identity of 90% of genes are known
Structure and function of 75% of these are known
40 genes obtained directly from bacteria
40% of DNA is repetitive
what did we find out about SNPs?
– SNPs define individuality, most are neutral, many are disease causing