Human Genome Flashcards
Percentage of genes in genome
2-3%
Correlation between genome size and organism complexity
None
Definition of telomere
Ends of chromosome
Original definition of genome
Collection of genes contained within a haploid chromosome set
Modern definition of genome
Locatable region of genomic sequence corresponding to a unit of inheritance associated with regulatory regions, transcribed regions, and/or other functional sequence regions
Definition of centromere
Central region of chromosome
Arms of chromosome?
Chromatid
Shorter arm of chromosome
P arm
Longer arm of chromosome
Q arm
Number of rings in purine
Two
Number of rings in pyrimidine
One
Which bases are purines?
Adenine, Guanine
Which bases are pyrimidines?
Cytosine, Thymine
Correlation between GC content and melting point
Higher GC content, higher melting point
Which is more preferred for expression? AT or CG?
AT
Correlation between GC content and level of expression
Higher GC content, more silenced genes
Number of bonds in AT linkage
2
Number of bonds in CG linkage
3
Definition of intron and exon
Intron –> region of DNA not expressed
Exon –> expressed region of DNA
Parts of the UTR/Leading sequence
- Enhancer
- Promoter
- Operator
- Consensus sequence
Consensus sequence in a) prokaryotes, b) eukaryotes
a) Pribnow box (TATAAT)
b) Hogness box (TATA)
Parts of coding sequence
- Start codon (AUG)
- Introns and exons
- Stop codon (UAA, UAG, UGA)
Definition of open reading frame
Region of DNA corresponding to coding sequence
Name and define the regions associated with the alternating introns and exons
- Pre Region –> signal region, determines where the protein will go
- Pro Region –> determines posttranslational modification
- Mature Region –> encodes the protein
Parts of the UTR/trailing sequence
- 2nd termination signal
2. Cleavage signal (AAUAA) containing poly-A tail
Central dogma of molecular biology
- Replication –> DNA duplication
- Transcription –> RNA synthesis
- Translation –> protein synthesis
Number of base pairs in human genome
3200 M
Composition of human genome
- Euchromatic portion –> 3000 Mb, transcriptionally active
2. Constituitive heterochromatin –> 200 Mb, transcriptionally inactive
Where can constituitive heterochromatin be found?
- Centromeres
- Long arm of Y chromosome
- Short arm of acentric chromosomes (13, 14, 15, 21, 22)
- Secondary constriction of long arm of 1, 19, 16
Percentage composition of bases in genome
GC –> 41%
AT –> 59%
What are Giemsa bands?
- Dark bands –> low GC, 37%
- Light bands –> hgih GC, 45%
both bands seen in karyotype
Number of genes in human genome
30,000 - 35,000
Percentage of genes coding untranslated RNA
5-10%
Mechanism of gene distribution
Hybridization of CpG islands to metaphase chromosomes
General trend in abundance of euchromatin and heterochromatin in genes
Chromosome 1 has highest amount of euchromatin and heterochromain, amount decreases as chromosome number increases
Composition of nuclear genome
- 95.5% noncoding DNA
2. 4.5% conserved DNA –> 3% untranslated & regulatory, 1.5% coding DNA
Definition of pseudogenes
Multiple copies of genes serving as rescue genes in case original genes get damaged
Number of BP in mitochondrial genome
16,569
Number and characteristics of genes in mitochondria
37 genes –> 22 tRNA, 2rRNA, 13 protein coding
Structure and significance of D-loop in mitochondrial genome
Triple stranded region, origin for replication of mitochondrial genome
Misc. characteristics of mitochondrial genome
- All genes lack introns
- Some coding sequences are overlapping
- Some genes lack stop codons (added posttranscriptionally)
Endosymbiont theory
Mitochondria may have originated from symbiotic prokaryotic bacteria
Significance of mitochondria in apoptosis
Contains cytochrome C –> critical component in apoptosis
Enumerate RNA polymerases and their functions
- RNA Polymerase I –> ribosomal synthesis
- RNA Polymerase II –> synthesis of mRNA, snRNA, microRNA
- RNA Polymerase III –> synthesis of tRNA, rRNA 5S, misc small RNAs
Subunits of nuclear rRNA
- large subunit (18S, 5.8S, 5S)
2. small subunit (18S)
Subunits of mitochondrial rRNA
16S & 23S
Function of tRNA
Carries amino acids during translation
Significance of snRNA
Involved in RNA maturation, spliceosomal RNA
Significance of snoRNA
Guides site-specific base modification in rRNA & snRNA
Subfamilies of snoRNA
- C/D box snoRNA –> guides site-specific methylations in rRNA
- H/ACA snoRNA –> guides site-specific pseudourylation of uridine to produce pseudouridine in rRNA
Significance of miRNA
Antisense regulator of genes –> bind to complementary sequence in 3’-UTR to inhibit translation
Significance of noncoding 7SK RNA
Negative transcriptional inhibition of RNA polymerase II elongation
Significance of SRA1 RNA
Coactivator of steroid receptors
Examples of identical proteins coded by different genes
Histone, ubiquitin
Characteristics of classical gene families
- High degree of sequence homology
2. Products have short conserved amino acid motifs
Example of conserved amino acid motifs in classical gene families
- DEAD (Asp-Glu-Ala-Asp)
2. WD (tryptophan-aspartate)
Characteristics of gene superfamilies
- No significant conserved amino acid motifs
2. Share general common structural features, general
Examples of gene superfamilies
- Immunoglobulin superfamily –> immunoglobulin genes, T cell receptor genes, HLA genes
- Globin superfamily –> alpha and beta globin, neuroglobin, myoglobin
- G protein-coupled receptor superfamily –> mediate ligand induced signaling between extracellular and intracellular environments
Characteristics of pseudogenes
- Copied at the level of genomic DNA by tandem gene duplication
2, Contain all elements of a gene but have inappropriate termination codons in exons
Characteristics of a) perocentromeric and b) subtelomeric genes
a) contain sequences that have been copied recently during evolution and are located on several chromsomes
b) unstable and prone to duplication
Definition of retrogenes
Nonprocessed pseudogenes that are integrated in a chromosome next to a promoter and expressed
Definition of comparative genomics
New field of biological research dealing with the comparison of genomes of different species
BLAST
- Basic local alignment search tool
2. Searches for sets of optimal local alignments based on a query
Ways that a species’ genome may change (4)
- Expansion –> gene duplication
- Genesis –> creation of new genome
- Exchange
- Deletion
Mechanisms which govern ganges at the protein level are most likely due to: _____
Nucleotide substitution, insertion, deletion
Changes may give rise to new genes which become fixed if ____________
they give the organism an advantage in natural selection
Definition of purifying selection / negative selection
Natural selection against deleterious mutations
Significance of purifying selective force
Prevent accumulation of mutation at important functional sites, resulting in sequence conservation
Definition of neutral theory
a() Molecular evolution caused by random genetic drift through mutations that are selectively neutral or nearly neutral.
b) Describes cases in which natural selection is not strong enough to outweigh random events
Definition of positive selection
- Darwinian selection fixing advantageous mutations
- Equal to molecular adaptation and adaptive molecular evolution
- ex. people who have sickle cell anemia are less affected by malaria
Definition of a) synonymous substitution, b) non synonymous substitution
a) substitution in DNA that alters the amino acid sequence of the coded protein
b) substitution in DNA that does not alter the amino acid sequence of the coded protein
The nucleotide sequences of a pair of homologous genes have a (higher or lower?) information content than the amino acid sequences of their coded proteins
Higher, due to the fact that synonymous mutations change the DNA without changing the protein
Types of nucleotide substitution (6)
Given 2 nucleotide sequences, we can ask how their similarities and differences arose from a common ancestor.
- single substitution –> 1 change 1 difference
- multiple substitution –> 2 changes 1 difference
- coincidental substitution –> 2 changes 1 difference
- parallel substitution –> 2 changes, no difference
- convergent substitution –> 3 changes, no difference
- back substitution –> 2 changes, no difference
Transition vs Transversion
Types of substitution
- Transition –> purine for purine, pyrimidine for pyrimidine
- Transversion –> purine for pyrimidine
Which occurs more frequently? Transitions or transversions? Why?
Transitions occur 2x more frequently than transversions –> easier to replace for the same type
Enumerate the types of mutation, and their characteristics
- Based on effect –> lethal vs nonlethal
- Based on size/position –> Point mutation (missense, nonsense, frameshift)
- Based on substitution –> transition vs subversion
Differentiate missense, nonsense, and frameshift mutations
- missense –> single nucleotide change causes a different amino acid to be coded
- nonsense –> single nucleotide change causes premature stop codon
- frameshift –> insertions or deletions that change the reading frame (not divisible by 3)