ISALAN - gene & genome duplication Flashcards
Ways for a genome to acquire new genes:
- Horizontal gene transfer – recombination between DNA allows genetic material to be transferred from one species to another (Ex. from virus to bacteria by integrases)
- Exon Shuffling
- Duplication & Divergence (1% chance for 1 gene in 1 million years)
-Originally 1 gene, but is duplicated and diverges to obtain a different function
-Ex. protease genes that have the same underlying proteolytic reaction but diverges to have different protein recognition site
Gene Duplication can occur by:
- Unequal crossing over
- Unequal sister chromatid exchange
- DNA Amplification during replication
- Replication Slippage
- Retrotransposition
Unequal crossing over
- Recombination is initiated by similar sequences that are not in identical places on homologous chromosome pair
- During meiotic recombination, repeated sequences flanking a gene are misaligned between homologous chromosomes, resulting in gene duplication in on one of the chromosomes, and gene deletion in another.
- The resulting daughter gamete will then have a gene duplicate, in which if an evolutionary advantage exists, the duplicate will also get passed on over generations
Unequal sister chromatid exchange
- During meiotic or mitotic recombination, repeated sequences flanking a gene are misaligned between sister chromatids, resulting in gene duplication in on one of the sister chromatids, and gene deletion in another.
- Depending on the species, the duplication may or may not be passed down on to the progeny.
o if duplication happens in mitosis of human somatic kidney cell, will not be inherited (because duplication is not in gametes)
o but if in single-cell species ex. amoeba, mitosis (binary fission) can lead to gene
duplication in an evolutionary context.
DNA Amplification during replication
- An unequal recombination during replication which can occur in haploid organisms that do not have polyploid chromosomes ex. bacteria
- At replication bubble, homologous DNA can exist as polymerases replicate each template strands and form daughter strands in opposite directions. Inappropriate line up of the repeated sequences flanking the gene in the replication bubble can result in gene duplication on one DNA and a deletion in the other.
Replication Slippage
- not common for genes, mostly for short DNA sequences (ex. micro,minisatellites)
- Short repeat sequences’ ability to form hairpin structure during replication can cause some repeat units to be replicated.
Retrotransposition
Certain sequences (Ex. LINES, SINES) can utilize Reverse Transcriptase to revert their RNA back to cDNA which can then be inserted back into the genome at a different position
Outcomes of Gene Duplication
- Conserved Gene duplication
- Pseudogene formation
- Gene Neofunctionalization
Conserved Gene duplication
- If an increased synthesis of the gene (due to its duplication) is beneficial, the duplicated
gene will continue to be synthesized and remain in the genome - lead to formation of multigene family, gene superfamilies
Multi gene family
- a group of similar genes that appear multiple times in the genome due to replication events. The genes can be near each other or spread out among the same or different chromosomes
Ex. RNA genes
Cells have to optimize number of RNA to match the metabolic requirements of different cell types – different cell types = different number of copies (due to different numbers of duplicated copies kept)
i.e. Mycoplasma genitalium: 2 copies;
Xenopus laevis ≥ 500 copies
Gene superfamilies
Large multigene families (more nearly identical copies of the genes)
* ex. Zinc finger domains (400 different locations)
* ex. The Globin Superfamily
o Carry out different functions in different tissues (carry O2 vs carry CO2 in different tissues) – shows how each gene in the superfamily has evolved/ diverged to specialize in different functions for the organism to survive
o Duplications are retained due to Globin’s importance:
-Present in all 3 domains of life
-Used for O2 transport, storage, sensing, detoxification
-Members in humans:
* Members have different structures (duplication patterns)
-Hemoglobin = ⍺2β2 tetramer
-Myoglobin = monomer allow for the differing O2 loading on/off properties required in
different environments they are expressed in
- Hemoglobin’s tetrameric structure allows for allosteric regulation, thus co-operative binding of O2
-At low O2 conc, hemoglobin has difficulty binding to O2, but at a higher O2 conc, each subsequent O2 binding allows for a higher binding affinity of O2 to hemoglobin – results in a sigmoidal curve
-Allows it to readily bind O2 where O2 is in excess, and to readily release O2 where O2 conc. is low - Myoglobin’s monomeric structure does not allow for cooperative binding of O2, so it has a higher affinity for O2 overall
- Hemoglobin is in red blood cells which are used to carry and supply O2 for different tissues while myoglobin is in muscle which allows it to readily load on O2 when hemoglobin reaches the muscle
Formation of pseudogenes
- If an increased synthesis of the gene (due to its duplication) is NOT beneficial, random
mutations can occur within the second copy and inactivate it. As a result, broken genes
or “pseudogenes” exist throughout the genome
(Pseudogene: A DNA sequence that resembles a gene but has been mutated into an inactive form over the course of evolution)
Types of pseudogenes:
- Non-processed pseudogenes:
-Does not go through the processing step of becoming RNA before they were replicated. Duplication happens during DNA replication, resulting in tandem duplication of genomic region only
-Inactivated due to mutations/ incomplete duplication
-Could miss regulatory regions ex. distal enhancers, promotors, but still contains ALL the intron and exon structures - Processed pseudogenes:
-Duplication occurs after the DNA has been processed into mRNA, so the mRNA has to undergo reverse transcription back into cDNA. cDNA then undergoes genome integration in order to make the second duplicated gene copy. (pseudogene as a result of retrotransposition)
-Lacks regulatory regions AND INTRONS, and can also have different exon combinations
-Can contain polyA tail and flanking repeats (UTRs)
-Can integrate into the same or different chromosomes (more spread out)
Gene Neofunctionalization
- Mutations in the duplicated gene can result in gaining of a new function or a sub function (a specialized function) apart from the original gene
- Ex. proteases: trypsin and chymotrypsin (duplicated ~ 1500 million years ago)
>Mutations following duplication result in a different specialized function
between the 2 proteases:
-Trypsin: cuts at arginine & lysine (small, neg charged side chains)
-Chymotrypsin: cuts at phenylalanines, tryptophans & tyrosines (bulky
side chains)
Genome Duplication
- There can be larger duplications than of genes/ segments ex. whole chromosome duplications.
- This will cause gene product imbalance which reduces life expectancies of the affected cell/ individuals
ex. Down syndrome – whole chromosome duplication of human chromosome 21