Moderately repetitive fraction of the human genome Flashcards
What is the major contributor to this fraction?
Multigene families
Are gene families common in protein coding genes?
Very. 50-75% of protein coding genes belong to a family
What is included in a gene family?
Protein coding genes, pseudogenes, and gene fragments with sequence similarity
What are the 3 types of gene families?
Classical, domain-based, motif based
What is a classical gene family?
A gene family whose members show a high degree of sequence homology over most of the gene length, especially in the coding region
Are genes that are part of the same family clustered in the same area or dispersed through the genome?
Can be either
What type of gene family are the globin and Pax gene families?
Classical
What is a domain based gene family?
A gene family whose members have a high degree of sequence similarity within a protein domain
How similar will the gene sequences between members of a domain-based family be?
Highly similar within the domain, but much lower everywhere else
What type of gene family are the Hox genes?
Domain-based
What is a motif-based gene family?
A gene family whose members have a high degree of sequence similarity within a specific protein motif of conserved AA with a specific function
How similar will the gene sequences between members of a motif-based family be?
High homology over the motif, but not much else
What type of gene family do RNA helicases with a DEAD box belong to?
Motif-based
What is a pseudogene?
Defective copies of genes with much of the sequence intact, but with a pile of accumulated mutations that result in it having no function
What are gene fragments?
A smaller duplicated region of a gene, like a single exon
What are the two types of pseudogenes?
Processed and unprocessed/classical
How do processed pseudogenes happen?
Reverse transcription of a processed mRNA that then got inserted into the genome
What do processed pseudogenes look like?
Only exons, no introns or regulatory sequences. Might find a polyA tail
Are processed pseudogenes expressed?
Not usually
Where will processed pseudogenes typically integrate?
Right in the middle of the genome far away from any regulatory sequences. Most of the time won’t be expressed and will accumulate mutations into retropseudogenes