Gene Structure Flashcards

Question 1

Q

Bacterial genomes

Answer

A

organised into operons with similar genes grouped
distinct boundaries and regulation
highly evolved
economical/efficient

Question 2

Q

Eukaryotic genomes

Answer

A

low numbers of protein coding genes
non coding repeats made up 50% of genome
small genes hard to locate
gene density surprisingly low (introns, repeat regions)
functional RNAs common
uneven gene distribution with gene rich and gene desert regions

Question 3

Q

Chromosome Structure

Answer

A

two sister chromatids held by a centromere

Question 4

Q

Other DNA

Answer

A

B chromosomes: extra chromosomes to standard complement
holocentric chromosomes: entire chromosome is a centromere
extrachromosomal DNA: plasmids/organelle DNA

Question 5

Q

Organelle Genomes

Answer

A

mitochondrial DNA: covalently closed circular DNA
chloroplast DNA: single closed circular DNA
- photosynthesis genes

Question 6

Q

Genome Sectios

Answer

A

unique and repeat regions

- 62% of genome is an intergenic region between genes (transposons, LINEs, SINEs, LTR elements

Question 7

Q

Repeated Intergenic Regions

Answer

A

differs in GC content
may have structural role (like telomeres)
tandemly repeated DNA
interspersed repeats (genome-wide repeats)

Question 8

Q

Satellite DNA

Answer

A

Satellite DNA (satDNA) is the highly repetitive DNA consisting of short sequences repeated a large number of times. It carries a variable AT-rich repeat unit 
- a and B satellite family

Question 9

Q

Tandemly Repeated DNA

Answer

A

much shorter than satellite DNA
minisatellites
associated with structural features
10-100 bp
microsatellites
simple tandem repeats
telomeres or dinucleotide repeats
less than 13 bp

Question 10

Q

Satellite Formation

Answer

A

Replication slippage: daughter strand slips back 1 repeating unit
Tandemly repeated DNA: DNA recombination and unequal crossing over
Variable Number Tandem Repeats: used in identificatoin

Question 11

Q

Interspersed Repeats

Answer

A

genome wide repeats (more than 100 bp)
moderately repetitive
transposon derived repeats: SINE, LINE, DNA transposons, LTR retrotransposons

Question 12

Q

DNA Transposon

Answer

A

target DNA cut by transposases
intermediate DNA inserted and ligated into sequence
flanked by direct repeats

Question 13

Q

Maize Transposon

Answer

A

excision of TE in somatic cells leads to pigment gene
small spots: late excision
large spot: excision early in development
no excision autonomous element not in genome
revertant: element excised and full expression restored

Question 14

Q

LINEs

Answer

A

long interspersed nuclear elements

RNA transcribed and inserted back into genome at new site
ORF1: RNA binding protein
ORF2: reverse transcriptase

Question 15

Q

SINEs

Answer

A

short interspersed nuclear elements

high copy number
no genes
transcribed by RNAPIII
borrows transcriptase synthesized by LINEs
eg. Alu family

Question 16

Q

Retrovirus Related Sequences

Answer

A

LTR Retrotransposons

Human endogenous retrovirus

Question 17

Q

Purpose of Repeats

Answer

A

metabolic burden
rate of propagation must counteract rate of elimination
maintenance suggests value
nearly all genome is transcribed
could cause evolutionary change
could just be yet to be eliminated

Question 18

Q

How a genome acquires new genes

Answer

A

horizontal gene transfer
exon shuffling
duplication

Question 19

Q

Benefits of Duplication

Answer

A

new copy can acquire a new function due to selective pressure advantage

Question 20

Q

Recombination Duplication

Answer

A

unequal crossing over (misalignment)
unequal sister chromatid exchange
DNA amplification during replication
replication slippage (adds extra unit to short repeat)
retrotransposons

Question 21

Q

Successful Gene Duplication

Answer

A

1 copy retains original sequence
2 copies may increase protein synthesis
1 copy may be neutral and become a nonfunctional pseudogene
1 copy may acquire new function (neofunctionalism)

Question 22

Q

Neofunctionalism

Answer

A

gene duplicates gaining mutations
new gene function
expressed in different time/cell type
eg. chymotrypsin and trypsin are both proteases but recognise slightly different residues : duplication of common ancestor

Question 23

Q

Pseudogenes

Answer

A

copies of functional genes with altered regions
may contains frameshift mutations and have regulatory roles
increase genome size
if transcribed may form RNA homologous to functional RNA and regulate via interaction

Question 24

Q

Processed Pseudogene

Answer

A

tandem duplication of genomic region
1 copy lack of selection
inactivating mutations/incomplete duplication
missing regulatory regions

Question 25

Q

Non-processed pseudogene

Answer

A

reverse transcriptase activity
mRNA to cDNA and genome integration
lacks regulatory regions or introns
derived from RNA
eg ribosomal pseudogenes: highly conserved large family

Question 26

Q

Multigene families

Answer

A

multigene families can arise from beneficial duplications
eg rRNA genes
tandem gene family on same chromosome
dispersed gene family on different chromosome

Question 27

Q

Globin Superfamily

Answer

A

example of gene duplication and divergence
colocation of dispersed clusters of genes
natural selections coopts existing genes to innovate
can create new useful properties
having separate proteins for oxygen storage and transport with different binding affinities is useful

Question 28

Q

Whole Genome Duplication

Answer

A

could be a source of speciation as duplicated genomes can reproduce with parents

Question 29

Q

Polyploidy

Answer

A

multiple sets of chromosomes
widespread in plant groups as having more DNA creates a larger fruit or flower product
can induce genomic shock

Question 30

Q

Autoploidy

Answer

A

multiplication of identical species within a single species
fertilisation by unreduced gametes
meiosis error accidentally producing diploid gametes so a triploid offspring
more viable than allopolyploids
can reproduce with each other permitting speciation

Question 31

Q

Alloploidy

Answer

A

hybridisation between 2 reproductively compatible species
via unreduced gametes from 2 diploid species (one step model)
via hybridisation between haploid gametes and somatic doubling (two step model)

Question 32

Q

Triploid Wheat-Rye Hybrid

Answer

A

yield of wheat with disease tolerance of rye
tetraploid wheat crossed with diploid rye to give triploid triticale seed
can artificially double chromosomes in sterile plant for fertilization with doubled gametes to give fertile offspring

Question 33

Q

Hox Gene Family

Answer

A

WGD in metazoa led to evolution of organisational body plan complexity
homeobox gene clusters in animal development are TF regulating developing with DBD
different organisms have different repeats and clusters

Question 34

Q

Hox Gene Expression

Answer

A

expression in different embryo regions
gene order reflects expression order
spatial and temporal colinearity
each specifies body segment structures

Question 35

Q

2R/3R Theory

Answer

A

Based on this model, two rounds of genome duplication occurred early in the vertebrate evolution [2, 3], but see also [4, 5]. An ancestral genome was duplicated to two copies after the first genome duplication (1R), and then to four copies after the second (2R) duplication [6, 7]. Recent data suggest that an additional whole genome duplication occurred in the fish lineage (3R or fish-specific genome duplication)
Humans have 4 duplications

Question 36

Q

Benefits of WGD

Answer

A

evolutionary diversification
defence against mutation
colonisation of new environments
improved fitness
duplication of regulatory regions generates biological complexity like body plans

Question 37

Q

Exon Shuffling Theory

Answer

A

eukaryotic proteins are mosaics of motifs
each domain has a specific function
primordial exons correspond to domains
duplication/insertion/rearrangements generate novel genes and proteins

Question 38

Q

Illegitimate Nonhomologous Recombination

Answer

A

process by which two unrelated double stranded segments of DNA are joined.
Duplication
unequal crossing over
replication slippage
increases gene length
Shuffling
domains from different genes joined together
retrotransposition or INHR
eg. aA-crystallin gene transfected into mouse
heat shock protein
domain duplication by illegitimate recombination and unequal crossing over
TOPOI nicks DNA and ligates non-homologous ends

Question 39

Q

LINEs and Gene Shuffling

Answer

A

jump around using RNA intermediate and retrotransposition into genome
transposition can take piece of DNA (3’ transduction)
transcription reads through weak terminating signal
domain retrotransposed

Question 40

Q

Transposons and Gene Shuffling

Answer

A

mutator like transposable elements
inverted terminal repeats flanking exons/introns
encode transposase moving themselves and other MULEs

Question 41

Q

Intron Phases

Answer

A

insertions and duplications affect reading frame
need exact multiples of 3 in an exon to prevent this
need intron phases the same
either 2,1/1,2/3,0 insertions on either side
Phase 0: introns lie between 2 codons (3,0)
Phase 1: introns located after first nucleotide (1,2)
Phase 2: introns located after second nucleotide (2,1)

Question 42

Q

Splice Frame Rule

Answer

A

following a successful shuffling event a newly acquired exon will be flanked by 2 introns of the same phase otherwise it will produce a frameshift in resulting coding sequence
- preserving reading frame makes less of a target for purifying selection (no loss of function with frameshift)

Question 43

Q

Evidence of Shuffling

Answer

A

multicellular structure
- adhesion/receptor molecules benefit from swapping domains to tune interactions
Franca et al.
- Phase 0 most common
- excess of symmetric exons compared to asymmetric

Question 44

Q

Exon 1-1 Shuffling

Answer

A

phase 0 introns more ancient and prevalent in non-Metazoan lineages
1/1 shuffling associated with emergence of necessary features of multi-cellularity
1:1 shuffle compatible with glycine codons (interupted)
easily split
increases efficiency of creating mosaic proteins
acquisition of domain must overcome structural limitations
small and flexible domains fold independently
linker sequences have glycine more frequently

Question 45

Q

Collagen Gene Shuffling

Answer

A

a2 type 1 collagen gene
highly repetitive sequence of glycine:x:y
exons very glycine rich and can be split with 1:1 exon phase meaning they are compatible with large and repetitive protein collagen

Question 46

Q

Tissue Plasminogen Activator Shuffling

Answer

A

found in blood clotting in vertebrates
4 exons
growth factor domain and plasminogen gene
upstream exon encodes finger module
ancestors from 3 domains formed the TPA structure
common ancestor formed by duplication and adding exons to genes

Question 47

Q

Exon Phases

Answer

A

These include the symmetric exons 0-0, 1-1, and 2-2 and the asymmetric exons 0-1, 0-2, 1-0, 1-2, 2-0, and 2-1, which can be joined into symmetric combinations. The symmetric exons (or exon sets) are the only ones that can be inserted into introns (of the same phase), undergo tandem duplication, or be deleted without disturbing the reading frame.

Question 48

Q

Protein Protein Interaction

Answer

A

mosaic proteins can interact with several proteins to become hubs of PPI networks
shuffling promotes self interaction capacity
formation of homodimeric proteins with self interacting domains
new connection networks

Question 49

Q

Amyloid Precursor Protein

Answer

A

alternative splicing generates isoforms
APP undergoes protease processing by a-secretase to give soluble APP
processing by B scretase and gamma secretase to also give B-amyloid protein (amyloid plaques in Alzheimers)
processing determined by KPI domain presence inhibiting a-secretase
KPI domain gained by shuffling
domain is flanked by introns
similarity to metazoan protein