Structure of Genes and the Human Genome Flashcards
Approximately how much of our genome encodes proteins or structural RNAs? What do these include?
30%
- solitary protein coding genes
- duplicated protein coding gene families
- tandemly repeated structural genes (ribosomal RNAs, 5S RNA)
What makes up approximately 50% of the genome?
repetitive DNA
- simple tandem repeats (one after another)
- interspersed repeats - mobile genetic elements
About how much of the genome does other non-repetitive DNA compose?
20%
About how many protein coding genes are there in humans?
23,000
What three components compose the human genome?
- genes encoding proteins or structural RNAs (30%)
- repetitive DNA (50%)
- non-repetitive DNA (20%)
What are solitary protein coding genes?
the expressed regions (exons) are separated by large non-expressed regions (introns)
On average, what percent of the human genome length contributes to mRNA?
5%
Blocks of very short repeated sequences (usually 5-10 nucleotides per repeat), make up approximately what percent of the human genome?
5-10%
What does the largest human gene encode?
the protein called dystrophin
- gene is more than 2.4million nucleotides in length
- contains more than 80 exons
dystrophin normally connects muscle fibers to the cytoskeleton and the cell membrane
*defects in dystrophin cause Becker and Duchenne Muscular Dystrophies
satellite DNA (recognize word, don't care too much)
simple sequence tandem repeats
LINE
- long interspersed element
- remnants of transposable elements (sequences that have the ability to move into and out of genomic DNA)
- often not full length
Insertion of a LINE into a gene can cause what?
genetic disease
Certain hemophilia patients have novel insertions of _____ in the Factor ____ (clotting factor) gene. This mutation was not present in the parental DNA, indicating that it resulted form a recent insertion.
LINES
VIII
SINEs
- short interspersed elements
- most famous/abundant SINE is Alu (about 300 nucleotides long and is related to the 7SL RNA. The 7SL RNA is part of the normal signal recognition particle involved in protein secretion)
- often not full length
On average, every how many kb of genomic DNA is there an Alu sequence?
5-10 kb
*some Alu elements are also mobile and can be inserted into the genome at random locations, causing disease