Genetics 6: Comparative genomics Flashcards
How was the human genome sequenced in the end compared to original plan
Originally it was linkage mapping, physical mapping and DNA sequencing but Venter said lets do shotgun sequencing : random DNA fragments
What is genomics
The study of genes, their interactions and relationships within a species as well as comparison between species
Describe the process of whole genome shotgun sequencing
- Cut the DNA into overlapping fragments short enough for sequencing
- Clone the fragments in plasmids or other vectors
- Sequence each fragment
- Order the sequence into one overall sequence with computer software
What is bioinformatics
The application of computational methods to the analysis of large biological data sets. ie DNA sequences.
What are the general trends in haploid genome size and number of genes between prokaryotes and eukaryotes and within eukaryotes
- Prokaryotes tend to have smaller haploid genome size/ number of genes than eukaryotes (5000x less)
- In Eukaryotes the number of genes is often lower than expected given the size of their genomes
- There is no relationship in Eukaryotes between the size of genome/ number of genes and phenotypic complexity of organism. Although a threshold has to be hit to get higher organism
What is the trend of Gene density (genes per Mb) for prokaryotes vs eukaryotes
- Prokaryotes have very high gene density compared to Eukaryotes.
- Within Eukaryotes there is a large variation in gene density with Mammals incl humans have lower gene densities
What is unit Mb
Mega base = million base pairs
What is the relationship between genome size and the number of genes
None really. Humans have about the same number of genes as something 10x as small
What accounts for the differences in gene density between prokaryotes and eukaryotes
Most bacterial genomes have genes for proteins, tRNA, rRNA with remaining as non transcribed regulatory regions. However in eukaryotes lots of DNA neither codes for a protein nor is transcribed into RNA of known function.
-This is up to 10000x more than prokaryotes and can be in genes or outside
What is the percentage of repetitive DNA compared to unique sequence in human genome
Repetitive DNA (unrelated to transposable elements < includes transposable elements) takes up 58% where as unique sequence DNA (exons
What are the 4 things that make up unique sequence DNA and what % of the human genome are they (smallest to biggest
- Exons : code for proteins, tRNA or rRNA = 1.5%
- Gene regulatory sequences=5%
- Intergenic unique sequence DNA between functional genes (ie pseudogenes that have resulted from mutations making previously repeated genes non functional)= 15%
- Introns= non translated parts of human genes, varying between genes with some of them transcribed. =20%
What are transposable elements + effects + 2 classes
Mobile DNA sequences that can move from one location in the genome to another.
F: They often cause mutations by direct insertion as they are able to insert into many different locations.
- Also can promote DNA rearrangement eg. chromosome deletion, inversions and duplications
2 classes: retrotransposon and retrotransposon
How do retrotransposon work
- Retrotransposon synthesis a single stranded RNA intermediate of the retrotransposon
- The gene also makes a reverse transcriptase which produces a DNA copy of the RNA strand
- The reverse transcriptase then can make another strand which makes it a mobile copy.
- The mobile copy inserts itself into the genome somewhere else
How does transposable elements cause unequal crossing over
Homologous transposable elements around the genes of non sister chromatids cause them to align to the wrong transposable regions during meiosis.
As a result of crossing over at this point, one chromatid gets a duplicated gene and one loses the gene
How has gene duplication resulted in genome evolution
Through duplication of a gene through evolutionary history a gene family arises.
As the DNA sequence of the genes in the family change, they can have different functions.
Families can spread throughout the genome and code for several proteins, but some can become pseudogenes.