KJB - Human Genome Project Flashcards
The HGP consists of how many chromosomes?
22 pairs of autosomes
1 pair of sec chromosomes
Define karyotype.
A chromosome spread - a set of human chromosomes at the stage of mitosis (shortly before the cell divides)
What does Giemsa staining do?
Allows you to see characteristics of banding patterns.
How many bases of DNA are there?
3 billion
How many genes are there?
20, 000 - 30, 000
What version of the human genome sequence are we on?
38th version - still not complete
State the goals of the HGP
- Identify all the genes in human DNA
- Sequence all base pairs that make human DNA
- Store this information in a database
- Improve tools for data analysis
- Transfer technologies to private sector
- Address ethical, legal and social issues that may arise from the project.
Composition of the human genome project. Give %
Of the 3 billion base pairs, only:
- around 3% actually codes for DNA protein
- around 39% = single copy non coding DNA
- around 59% = repetitive DNA (containing transposons)
Humans do not have much more genes than creatures such as nematodes, fruit flies or any other simple organism. Explain why.
In terms of gene numbers we are not that complex compared to a nematode worm but the way that we USE these genes we are a lot more complex in how we switch genes on and off, and how we make proteins from those genes.
Although the sizes of bases in genes vary greatly, what is the average?
3000 bases per gene.
Explain what epigenetic changes are.
Epigenetic changes do not involve DNA sequences but change the chemical structure of bases e.g. by adding a methyl group to C bases. A lot of methyl additions will silence the genes.
Epigenetic changes occur through environmental exposure and are frequently involved in human cancers.
Modern drug discovery process order:
- Target identification
- Determine DNA and protein sequence
- Elucidate structure and function of target
- Proof of concept and assay development
- Assay for high throughput molecular screen
- Mass screening and/ or directed synthesis programme
- Select lead structures
Target identification requires genome information. The healthy cells are compared against diseased cells and the differences are noted. What are the 3 ways in which this can be done?
- Genetic association studies - look at DNA SEQUENCE variation in diseased v.s healthy individuals
- Gene expression studies (transcriptomics) - look for differences in GENE EXPRESSION in diseased v.s healthy individuals (i.e. which genes are ‘on’ and which are ‘off’
- Protein expression studies (proteomics) - look for differences in PROTEIN EXPRESSION and modification in diseased v.s healthy individuals (i.e. which proteins are made and how they are modified)
How do you elucidate the structure of the protein?
X-ray crystallography
What are the two ways of analysing the role of a target protein in a disease?
- OVEREXPRESSION - make lots more of the protein than normal and see what happens OR introduce a transgene i.e. a gene from a different organism
- GENE KNOCKOUT - take the gene away (and therefore stop protein production) and see what happens
IF A PROTEIN IS REALLY INVOLVED IN THE DISEASE (IS A GOOD DRUG CANDIDATE), THEN ONE OF THESE APPROACHES WILL GIVE US AN EFFECT THAT SHOULD LOOK LIKE A PATHOLOGY