TOPIC 1 - nucleic acids Flashcards
what does DNA contain?
genetic information of most organisms required to make proteins
B-form DNA
2 strands of DNA intertwine to form a double helix shape (most common)
what directions do DNA strands run in?
antiparallel (opposite directions)
what are the 2 DNA strands called and which which directions do they run?
Watson: 5’ -> 3’
Crick: 3’ -> 5’
how often does a helix turn?
every 10 bps
what helps maintain/stabilise regular double helix shape?
- hydrogen bonding
- base stacking causes pi-pi interactions as aromatic rings share electrons
major grooves
- backbone far apart
- binding proteins can bind easier e.g. alter structure, regulate transcription/replication
minor grooves
- backbones close together
- proteins cannot bind as easily
how does the sugar-phosphate backbone form?
- phosphate group attached to 5’ carbon and 3’ oxygen of sugar
- forms covalent phosphodiester bond
structure of RNA
- single stranded
- shorter than DNA
- larger grooves in helix
how does a deoxyribonucleotide differ from a ribonucleotide?
ribose - hydroxyl groups in the 2 and 3 position of sugar
deoxyribose - lacks hydroxyl in 2 position of sugar
how many H bonds form between A + T?
2
how many H bonds form between C + G?
3
what are pyrimidine bases + structure?
T, C, U
-single ring structure
what are purine bases + structure?
A, G
-2 ring structure
why can non-complementary bases not bind?
- lack geometry
- not form strong H bonds
- disturb double helix structure
dna helicase
unwinds double helix at replication fork
dna topoisomerase
relax and reintroduce supercoiling in DNA chain
DNA polymerase III
elongates leading DNA strand in 5’ -> 3’ direction
what is the role of polymerase III exonuclease activity?
- works in 3’ -> 5’ direction
- proofreads the strand being formed removing erroneous bases as strand synthesised
RNA primase
creates primer RNA sequence
DNA polymerase I
removes the RNA primer and replaces it with DNA
DNA ligase
seals up gaps in Okazaki fragments
single stranded binding proteins (SSBs)
keep DNA open and prevent it from annealing
how is DNA synthesised on the leading strand?
- RNA primase creates RNA primer sequence
- DNA polymerase III moves along DNA in 5’ -> 3’ direction
- adds nucleotides onto primer sequence
- DNA polymerase I degreades primers and replaces with nucleotides
- synthesised continuously
how is DNA synthesized on the lagging strand?
- RNA primase creates RNA primer
- DNA polymerase I adds short row of DNA nucleotides onto primer in 5’ -> 3’ direction up to previous fragment
- synthesised in Okazaki fragments
- DNA polymerase I degreades primers and replaces with nucleotides
- DNA ligase links fragments together
what effects could replication errors cause in germ and somatic cells?
germ cells - mutation could be heritable
somatic cells - cancer
how are DNA replication errors reduced?
- polymerase III exonuclease activity
- nucleotide excision repair
- base excision repair
- strand directed mismatch repair
base excision repair
- damaged base excised by DNA glycosylase
- gap recognised by AP endonuclease
- missing base resynthesised by DNA polymerase
nucleotide excision repair
- recognises/removes bulky DNA adducts (damage from UV light)
- missing segment resynthesised by DNA polymerase
strand directed mismatch repair
- inspects newly formed DNA (A + C undermethylated)
- enzymes remove mismatched nucleotides
how can DNA be damaged after replication?
- cytosine: deaminated to uracil, recognised as thymine, base pairs with adenine -> mutation
- methyl cytosine: deaminated into thymine -> permanent mutation as thymine not recognised as foreign
how does the body protect replicated DNA from damage?
- uracil DNA glycosylase recognises uracil
- breaks bond between uracil + sugar
- mismatch repair enzymes detect lack of base + add cytosine
human genome
map of all DNA content in human body (included coding DNA and junk DNA)
how many nucleotides make up the human genome?
3x10⁹
how many protein coding genes are in the human genome?
20,000
allosomes
sex chromosomes
autosomes
non-sex chromosomes
other than the nucleus, where can DNA be found?
mitochondria
why is human genome bigger than expected for small number of genes?
- junk DNA
- retroviral integration
- pseudogenes
- VNTRs
synteny
- DNA sequences on chromosome is present across different species
- occurs between closely related species (e.g. chimp/human) and non-closely related species (human/mouse)
example of synteny
human chromosome 2 = chimp chromosome 12 + 13
junk DNA
non-functional sequences of DNA
how do retroviruses affect DNA?
- retroviruses insert DNA copy into cells
- viral DNA integrated into host cell
- mutations can cause non-productive copy to be integrated which remains in DNA and is passed down cell generations
pseudogenes
sequences of DNA that resemble functional genes but are non-functional
how many pseudogenes in the human genome?
20,000
how are pseudogenes formed?
gene duplication - duplicate genes has mutation that renders inactive
reverse transcription - mRNA lacks introns/regulatory sequences, reverse transcribed into DNA, lacks promoter regions/introns so not active gene
VNTRs
variable number tandem repeats - multiple copies of same short sequence of bases
how are VNTRs used for testing?
- amplified through PCR
- DNA fingerprinting for forensic/paternity testing (VNTRs inherited from mother/father different)
what does polymerase slippage cause?
extra/missing copies of bases - creates sections of DNA that can no longer code/function
SNPs
single nucleotide polymorphisms -base substituted in genome and substitution in at least 1% of population
what effects can SNPs have?
- no effect
- subtle effect e.g. phenotypic change
- disease
GWAS
genetic wide association studies - provide sequence/location of SNPs
role of TATA boxes
- short run of T/A bases 25-35bps away from start of transcription
- T/A lowest energy base pairs so easiest to unwind
- transcription factors bind
- RNA polymerase II can bind to transcription factors
what % of mammalian genes has a TATA box?
10-15%
role of CpG islands
- stretches of DNA where multiple points of C-G bases upstream of start site
- if CpG island methylated, gene coding switches off
how does DNA transcription occur?
- transcription factors bind to promoter region causing DNA strand to unwind
- RNA polymerase II binds to transcription complex
- polymerase moves down antisense/template DNA strand in 3’ -> 5’ direction
- polymerase add complementary RNA nucleotides onto 3’ end of mRNA strand in 5’ -> 3’ direction
- mRNA strand is released, undergoes modification and leaves nucleus through nuclear pores
how is eukaryotic mRNA modified after transcription?
methyl capping - methyl g cap added to 5’ end to stablise/protect against degradation
polyadenylation - sequence downstream of mature RNA cut off and polyA tail added to 3’ end to stabilise/protect against degradation
polycistronic
prokaryotic mRNA that can encode for more than one protein
lac operon
gene system whose operator gene and three structural genes control lactose metabolism in E. coli
splicing
removing introns and reconnecting exons in mRNA
introns
noncoding sequences of DNA
exons
coding sequences of DNA
what structure removes introns during splicing?
spliceosome
how can different proteins be produced from same mRNA?
alternative splicing leads to inclusion of different exons
enhancer sequences
sequences far upstream/downstream of genes that regulate level of transcription of genes when proteins being
how do enhancer sequences function over large distances?
- dna between enhancer and promoter region loops out
- allows enhancer proteins to come into contact with RNA polymerase
trans-acting factors
regulatory RNA that act across DNA strand by diffusing from enhancer region
what is the function of trans-acting factors in gene transcription?
regulate transcription activity by enhancing or silencing it
regulatory RNA
non-coding RNA molecules that play a role in controlling gene expression
barr body
inactivated X chromosome in females
why do men not have a barr body?
only occurs in XX cells as have 2 copies of XX (men have XY in cells)
why is the barr body inactivated?
regulated amount of X-linked genes transcribed
what RNA causes the barr body to become inactivated?
Xist
X inactivation specific transcript (regulatory RNA)
how is genetic code degenerate?
more than one triplet codon codes for same amino acid
how is genetic code triplet?
amino acids coded by three codons
what is the start codon?
AUG (methionine)
what are the stop codons?
UAA, UAG, UGA
what is the start codon in bacteria?
N-formyl methionine
structure of tRNA
- tightly packed clover shaped
- anticodon complementary to the mRNA opposite where amino acid esterified
how are tRNAs protected from degradation by RNAses?
- tightly packed structure
- modified bases
what is wobble base-pairing?
- first base of tRNA anticodon can pair with different third bases on mRNA codon
- more economical as reduces number of tRNA molecules required as some can recognise more than one codon
what is the role of aminoacyl tRNA synthetases?
esterify correct amino acid to tRNA
how does translation occur?
- small ribosome subunit binds to 5’ region and moves down mRNA until reaches appropriate AUG codon in good position
- large ribosome subunit binds
- initiator met-tRNA binds to methionine and is recognised by initiation factors on small subunit
- ribosome moves along mRNA sequence, adding correct amino acids to polypeptide chain for mRNA using tRNA
- ribosome dissociates once stop codon reached
polysome
polysome
what is the effect of nonsense mutations?
- mutation results in premature stop codon
- causes ribosome to dissociate early
- mRNA after stop codon not translated
- exposed mRNA region digested by RNAses (nonsense mediated decay)
E site of ribosome
exit site, empty tRNA ejected from ribosome
P site of ribosome
peptidyl tRNA binding site, tRNA attaches when adding amino acid to polypeptide chain
A site of ribosome
aminoacyl tRNA site, tRNA enters ribosome
what are the differences between eukaryotic and prokaryotic ribosomes?
eukaryotic - larger, 80S, attached to outer surface of nucleus/ER
prokaryotic - smaller, 70S, free floating in cytosol, targeted by antibiotics
endosymbiogenesis hypothesis
eukaryotic mitochondria arose from invading bacteria at some point during evolution
how are secretory proteins imported into ER?
- signal sequence (first stretch of 20 hydrophobic amino acids) recognised
- ribosome docks to ER
- protein fed through membrane into lumen
- signal sequence cut off
how are transmembrane proteins imported into ER?
- ribosome docks to ER
- transmembrane domain (first stretch of 20 hydrophobic amino acids) produced and enters lumen
- translocation stops
- c terminus (chan after transmembrane domain) remains in cytosol
how are transmembrane proteins exported from ER?
- transmembrane domain now extends out of ER after modification
- domain anchors protein into lipid bilayer
- tail extends into cytosol
- part of domain extends extracellularly
what modifications are proteins subjected to inside ER?
- folding
- glycosylation
- disulphide bond formation
glycosylation
addition of a carbohydrate group
disulfide bridge formation
cystine contains thiol group -> forms S-S cross link with cystine further down peptide chain to loop it or on separate peptide chain to join together
what happens to proteins in Golgi apparatus?
- some ER sugars are trimmed or replaced
- package proteins into vesicles and direct proteins depending of signals on protein
what are the compartments of the Golgi apparatus?
cis, medial, trans
what is the difference between sugar side chains of proteins in humans and microorganisms?
human - end in salic acid
microorganism - more mannose rich (recognised as foreign in humans)
how does prenylation result in proteins anchored to inner leaflet of membrane?
- G proteins synthesised as cytosolic protein (doesn’t pass through rER of Golgi apparatus)
- at C terminus of protein, cut and lipid tail added
- protein unstable with lipid tail in cytosol so integrated into inner plasma membrane
how is biologically active insulin produced?
- single gene codes for long protein
- signal peptide sent to ER
- signal sequence cleaved in ER
- disulphide bonds form between end + C terminus
- connecting loop cut out by proteases
is insulin glycosylated?
insulin is not glycosylated - no carbon chains
recombinant DNA
DNA produced by combining DNA from different sources
how commercial recombinant insulin produced?
- fusion protein used to trick yeast to produce insulin
- cleavage recognition site engineered into fusion protein allows protease to cut off yeast section and leave insulin region ready to fold
pharming
production of pharmaceutical products from farm animals
what is an example of pharming?
production of insulin from mammary glands of cows/goats
cancer
disease caused by uncontrolled growth/division of cells
neoplasia
new growth of cells not physiologically contained
malignant neoplasia
cancerous growth which is able to metastasise
benign neoplasia
growth of cells localised so cannot metastasise
dysplasia
abnormal cell with some features of cancerous cell but not fully malignant
metastasis
the development of malignant growths away from primary site of cancer (spread of cancer)
8 hallmarks of cancer
- tissue invasion/metastasis
- sustained angiogenesis
- activated growth signalling
- evade cell death/senescence
- limitless replicative potential
- evade immune surveillance
- handle metabolic stress
- DNA damage and genetic instability
how does sustained angiogenesis contribute to cancer?
- triggers growth of new blood vessels
- bring cancer cells nutrients allowing for growth
- help metastasise
how does activated growth signalling contribute to cancer?
constantly signals for cells to divide
senescence
deterioration of cells with age e.g. loss of ability to grow/divide
how do cancer cells have limitless replicative potential and how does contribute to cancer?
- telomeres (short repeating units of DNA at end of chromosome) shorten over cell division until cannot shorten more
- cell recognises short telomeres and stops division
- cancerous cells fuel growth of telomeres so do not shorten
how do cancerous cells evade immune surveillance?
- despite genetic changes, immune system struggles to recognise cancerous cells
- immune system can act on small amounts of malignant cells
in established cancers:
- immune system overwhelmed
- develop protective mechanism
what does the Wahlberg/reverse-Warburg effect allow cancerous cells to do?
metabolise and continue dividing if cells far from blood stream
carcinoma
cancer of epithelial cells/surface tissue
adenocarcinoma
cancer of glandular cells
how are cancers named?
site + histopathology + extent of spread
what does histopathology tell us about cancers?
cellular origin of cancer
schema cell carcinoma (SCC)
cancer of skin cells
haematological cancer
cancer of blood
myeloma
cancer of plasma cells from bone marrow
leukaemia
cancer of white blood cells from bone marrow
lymphoma
cancer of lymphatic glands/nodes
sarcoma
cancer of mesenchymal cells/connective tissue
osteosarcoma
cancer of bone
myosarcoma
cancer of muscle
leiomyosarcoma
cancer of smooth muscle
rhabdomyosarcoma
cancer of skeletal muscle
how does increasing genetic instability contribute to cancer?
if instability reaches certain threshold, cell can become cancerous
how does chromothripsis contribute to cancer?
- shattering of chromosome due to ionising radiation
- cell rearranges in wrong order so incorrect synthesis of proteins
- increases genetic instability
how does translocation contribute to cancer?
- rearrangement of chromosome
- genes shifted cross-chromosomes
- genes in different chromosomes may have different promoters = triggered incorrectly
- increases genetic instability
how does genomic amplification contribute to cancer?
- repeats of same gene = more copies of protein synthesised
- protein can help with development/growth = rapid cell division
how do point mutations contribute to cancer?
- substitution, indels, frameshift
- alter triplet makeup = different amino acids produced = different protein structure/function
- protein can help with development/growth = rapid cell division
how does serial clone expansion contribute to cancer?
- mutationed cell divides to produce clone with same mutation
- new mutations added with each cell division
- increases genetic instability
oncogenes
cancer promoting genes
proto-oncogenes
genes involved in regulating cell growth and differentiation (not cancerous yet)
how do proto-oncogenes/oncogenes contribute to cancer?
- mutation in growth gene means permanently turned on/activated so growth uncontrolled
- only need one mutation (dominant effect)
tumour suppressor genes
- genes which protect against cancer
- control cell cycle e.g. slow cell division, repair DNA errors and signal cell death
how can tumour suppressor genes contribute to cancer?
loss of function (e.g. mutation of both alleles)
Knudson’s two hit hypothesis
both copies of allele have to be defective in same cell to allow tumor to develop
how does Knudson’s two hit hypothesis relate to retinoblastoma?
- RB gene stops cell cycle occuring when inappropriate e.g. cell damaged
- some born with germline mutation in RB gene
- only need one more mutation to inactivate gene = retinoblastoma cancer
how does Knudson’s two hit hypothesis relate to tumour suppressor genes?
both alleles in tumour suppressor gene must be mutated before cancer occurs
how can epigenetics contribute to cancer?
- tumour suppressor gene inactivation
- methylation of promoter factors = gene inactivated as transcription factors cannot bind
epigenetics
changes in gene expression caused by mechanisms other than changes in the DNA sequence
germline mutations
mutation in germ cells (cells give rise to gametes)
how are germline mutations linked to familial adenomatous polyposis coli?
germline mutation inactivates APC (tumour suppressor) gene = FAPC
why are cancer levels higher in older population?
- long period for genetic instability to increase
- more exposure to environmental factors
what environmental factors can contribute to cancer?
- tobacco smoking
- excess body weight
- alcohol
- UV light
- poor diet
- cancer-causing pathogens e.g. HPV
- physical inactivity