VERNON'S MODULE Flashcards
what is positive-sense?
viruses with a positive sense RNA genome means the genome is in the coding sense for protein synthesis and can act as mRNA upon entering the cell
so gets translated straight to protein
what is negative-sense?
a virus with negative-sense genome means the genome is in the non-coding sense for protein synthesis and must be converted to positive-sense before protein translation can commence
so undergoes transcription by RdRp to become positive-sense RNA and then can undergo translation to protein
outline replication of the positive-sense ssRNA virus poliovirus?
poliovirus replicates in the cytoplasm and its genome consists of one piece of +ssRNA with a polyA tail
the RNA is infectious if it was artificially put in a cell i.e. everything the virus needs to replicate is in that RNA
once virus has released genome into cell step 1 is translation to make protein (cause +ve sense)
after translation it has all the enzymes it needs to replicate genome to it replicates it (transcription) making complimentary sense copy called -ve sense replicative intermediate (antigenome)
antigenome is then a template which can undergo transcription to make more +ve sense so you can make more protein (via translation)
so you get build up of viral proteins and when concentrations high enough you get assembly
why must RNA viruses encode their own RNA dependent RNA polymerase (RdRp) and what is it?
because we do not have RdRp in our cells and the virus needs it to replicate its RNA
RdRp is a transcriptase - RNA dependent cause RNA is the template and RNA polymerase cause RNA is the product
RdRp is usually part of the replicase complex
classic antiviral target since essential for viral replication and our cells don’t have it
how does poliovirus get multiple proteins from a single gene?
polyprotein processing
outline poliovirus polyprotein processing?
virus has to make multiple proteins from one piece of RNA - so it makes a polyprotein i.e. the RNA genome has only one gene with start and stop codon and one ORF
polyprotein has protease activity embedded within it so following translation it cleaves itself at cleavage sites to release multiple proteins
cleavage doesn’t occur simultaneously; proteins have different forms which perform different functions e.g. 3CD propol is a protease, 3Cpro is a protease and 3Dpol is a polymerase - protease activity different when attached and when not and the precursors of that process will have their own specific function
this is a mechanism by which poliovirus can maximise coding potential
how does hepacivirus (HepC) of the family flaviviridae infect the host cell and replicate?
HepC associates with lipoproteins which associate with elements on host cell surface. E1 and E2 envelope glycoproteins interact with CD81 (host receptor) and then virus gets endocytosed via CLDNI and clathrin
endosome acidification changes viral glycoprotein and capsid (core protein unknown) structure causing fusion of viral and endoscope membrane –> genome release
genome is one piece of +ssRNA –> first step translation to protein then replicase complex makes -ve sense antigenome which is template for more +ve sense RNA
viral proteins assemble on ER and modify it to membranous web making replication factor
what are the 5’ and 3’ non coding regions (NCR) of hepC genome?
HepC genome single RNA strand with one ORF and one start and stop codon and embedded protease activity for cleavage
but single RNA strands can undergo local basepairing to form double strands and complex 3D dynamic structures like hairpins and loops - this is what occurs at 5’ NCR and 3’ NCR at ends of hepC RNA genome
this is a critical way that so much info can be stored in a single stranded RNA genome
5’NCR - highly conserved, domain I is critical for RNA replication, domain II-IV are IRES, critical for initiating replication and establishing cap-independent translation
3’NCR - includes conserved 98bp X-tail, U(25) essential for replication, 5B structure essential
how does canonical translation initiation occur (like normally in our cells w/o virus)?
met-tRNA forms pre-initiation complex with eIF3 and 40s ribosomal subunit which assembles on cap (methyl-guanosine group on end of mRNA) and scans till it finds a start codon. Once it does it brings in the 60s subunit and begins translating protein
how does HepC initiate translation?
hepC doesn’t have methyl-guanosine cap so cannot use normal canonical translation
has IRES (internal ribosome entry site) in 5’NCR which allows direct interaction with 40s subunit and eIF3
it then brings in other components and scans to find start codon and initiates polyprotein translation
so IRES allows it to bypass cap binding complex components thus suppressing host cell translation and maximising its own production
what kinds of translation initiation do +ssRNA viruses use?
some use IRES initiation e.g. hepC
however most +ssRNA viruses are capped and do not have IRES structures and so canonical translation initiation is normally used
once hepC polyprotein has been translated, how does protein coding and processing occur?
same as poliovirus - polyprotein has proteases in it which cleave it
NS2 and NS3 are different proteases which release themselves and other proteins with different functions from each other to produce a series of protein
extra step (to poliovirus) is that core proteins and envelope glycoproteins are released from polyprotein by host cell proteases
so virus has adapted to also use host proteases
where do HepC proteins go following translation and processing and what functions do they carry out?
they assemble on ER membranes which is where the replication complex assembles (hepC replicates on membranes)
many of these proteins carry out multiple functions e.g. NS4A and NS3 complex are proteases which don’t just cleave viral proteins but also host proteins to antagonise the innate immune response (TRIF and MAVS inactivation)
NS4B is responsible for membranous web formation which allows membrane area to be maximised for replication
what are membrane associated replication complexes and what do they provide?
membrane bound replication sites - these are a hallmark of +ssRNA viruses
they provide compartmentalisation, localised concentration of viral proteins, RNA tethering to replication site, provides lipids for replication (hepC needs these) and also protects from innate dsRNA responses
hepC maturation occurs via budding into ER and cell secretion pathways completing the replication cycle
what is the Sindbis virus?
family - togaviridae
genus - alphavirus
+ssRNA virus w more than one ORF (i.e. more than one gene on its genome)
30-42nm icosahedral virus in envelope
60-65nm diameter spherical virion
10-12,000 base genome
5’ cap and 3’ poly-A tail (canonically translated)
purified RNA is infectious
attachment via Lamina receptor, endocytosis entry then fusion with endosome, E1 and E2 type II fusion proteins on viral surface (in envelope derived from host membrane)
outline expression and polyprotein processing of Sindbis non-structural (NS) proteins (first ORF)?
first ORF; things at 5’ end necessary for viral replication
40s ribosome starts scanning from methyl-guanosine cap and continues until it finds start codon and starts translating protein and continues until it finds stop codon
this polyprotein gets cleaved by viral proteases and produces a bunch of important proteins - NSP1/methyl-transferase (puts cap on mRNA, ours is in nucleus and virus replicates in cytoplasm hence why it must encode its own), NSP2 (protease for ppp and also helices for unwinding genome for replication), NSP3 (rep complex)
but stop codon doesn’t always work in which case ribosome just keeps going (read through) to next stop codon and when read through happens there are all four proteins for replication (incl. NSP4 - RdRp which can then synthesise full length antigenome from + strand)
BUT there is still the other ORF and we haven’t made any structural proteins yet
outline expression of Sindbis virus structural (S) proteins
sub-genomic RNAs (sgRNA) are a common feature in +ssRNA viruses and can encode a polyprotein
-ve sense antigenome has very strong promoter for viral replicase for viral replicase and sometimes get (from an internal promoter part way through -ve sense) production of shorter part of +ve sense which will be the untranslated second ORF of the genome - the subgenome
this will get capped by methyl-transferase and seen as a piece of mRNA so translated into polyprotein until ribosome hits stop codon
protease cleaves it into structural proteins e.g. envelope, capsid proteins, E1 and E2 (surface glycoprotein dimers)
what is polyprotein123 and polyprotein 1234?
P123 - sindbis virus NS proteins (before processing) without read through (Nsp1;methyl-transferase, Nsp2;Hel/Pro, Nsp3;rep complex)
P1234 - sindbis virus NS proteins (before processing) with read through (Nsp1;methyl-transferase, Nsp2;Hel/Pro, Nsp3;rep complex, Nsp4;RdRp)
they are the same polyprotein one is just bigger cause read through occurred
when does RNA copying begin to occur in sindbis virus and what occurs following that?
when ORF1 (P1234) has been translated and processed cause that has all the NS replication proteins required for replication
first step is make antigenome from +ve sense template from which it can make more +ve sense and thus more protein
BUT sometimes instead of making full-length +ve sense it makes shorter one which gets capped and encodes the second ORF (containing structural proteins)
why aren’t all proteins made at the same level (using sindbis virus as example)?
not all proteins needed in the same amount
one RdRp (Nsp4) can move along template many times but capsid needs lots of protein - so virus generally needs lots more structural proteins
RdRp recognises sgRNA promoter strongly so you tend to get more of this and the S proteins it encodes
what are the two severe acute respiratory syndrome coronaviruses ?
SARS-CoV and SARS-CoV-2
SARS-CoV-2 has caused almost 800,000 confirmed cases and almost 7,000,000 deaths
what does COVID-19 affect?
ACE2 receptor present in lots of tissues so lots of different symptoms e.g. neurological, pulmonary, cardiac, GI etc. because it causes systemic infection
what underlying medical conditions make you more susceptible to severe COVID-19 and what does this indicate the importance of?
respiratory disease
heart conditions
immunocompromised
obesity
hypertension
diabetes
liver or kidney disease
hence importance of masks
who else (other than those with underlying medical issues) are more susceptible to COVID-19?
disproportionately impacts ethnic minorities including Māori and Pasifika due to higher rate of chronic health conditions, crowded housing, difficulty accessing healthcare (so due to socioeconomic inequities)
can COVID-19 be detected by PCR?
yes - viral load highest in nasophayrnx hence why you swab here
but once this decreases and can’t detect infection by PCR but symptoms may persist for as long as 12 months (long covid or post-acute symptoms)
this may be cause virus infection may remain systemic - tends to come right and less likely if vaxxed
outline the coronavirus infection cycle?
bind ACE2 receptor via surface glycoproteins, cleavage, entry, endosome membrane fusion and +ssRNA genome release, two poly proteins translated, proteolysis, formation of replication complex which replicates genome and makes sgRNA, translation, structural proteins assemble on ER, assembly, budding
all occurs within cytoplasm
what does the genome of coronavirus include?
includes viral proteases to process polyproteins
includes RNA polymerase to copy the genome which has limited proof-reading ability so makes some errors - ideal for identifying variants and tracking
big compared to other RNA virus; about 30,000bp
also makes a bunch of sgRNAs
what kind of translation does coronavirus undergo?
canonical
despite the replication module polyprotein of coronavirus consisting of ORF1a and ORF1b, why is it actually ORF1a and ORF1ab?
cap at start of ORF1a so ribosome reads along translating but when it gets to where it would usually terminate (between ORF1a and 1b) there is a pseudo knot which is a region of RNA with shitloads of base pairing complementarity and thus folded into a very stable complex structure
ribosome pauses here and either drops off (good cause needs more of the proteins from ORF1a) or shifts one base back and continues (different reading frame now)
so its always ORF1ab as it has a little bit of a at the start of it due to frameshift - produces 1ab protein
how do SARS-CoV-2 polyproteins 1a and 1ab undergo polyprotein processing?
with virus encoded proteases PL2 and 3CL (encoded in each polyprotein)
they recognise and cut specific viral amino acid sequences called recognition sequences - PL2 cuts specifically between a glycine and alanine within its recognition sequences while 3CL (aka Mpro) cleaves after glutamine and before a range of other amino acids depending on what amino acids are flanking
so encoded within both polyproteins there are two proteases that specifically process polyproteins into proteins necessary for replication
how does the subgenomic RNA get made for SARS-CoV-2 to produce its structural proteins?
SARS-CoV-2 makes a shitload of sgRNAs and they all have identical 3’ ends and same leader sequence at 5’ end and only the first ORF (gene) of each gets translated
so as RdRp synthesising antigenome it comes to transcription regulatory sequences (TRS) and it either carries on making -ve sense of next gene or shifts/skips to leader and then reinitiates RNA synthesis of neg sense of leader sequence
this results in a bunch of -ve sense sgRNA aka anti-mRNA and RdRp can use this as template to make +ve sense sgRNA
so coronavirus sgRNA are made by a series of antisense intermediates
and then that first ORF of each of these sgRNAs gets translated to protein
when sgRNA is being made for SARS-CoV-2, what defines the skip point (where RdRp skips to leader)?
the transcription regulatory sequence (TRS) varies between each gene but all have same core sequence of 5’ACGAAC
the varying sequences determine how efficiently it either continues to make longer negative sense RNA or skips to leader
so virus regulates how much of each sgRNA it makes in accordance with how much it needs i.e. it makes lots of nucleocapsid protein
i.e. variation of the TRS regulates the skip
how big is the coronavirus TRS?
ranges from 60-70bp for different coronaviruses
what are some the families of -ssRNA viruses, how many segments in their genome and what similarities to they share?
orthomyxoviridae (8), bunyaviridae (3), arenaviridae (2), rhabdoviridae, paramyxoviridae, bornaviridae, filoviridae (all 1)
all of these hail from a common ancestor and share:
- one major capsid protein
- helical capsids
- enveloped with viral glycoprotein spikes
- 10-15kb genomes
- genome cannot serve for translation as no cap or polyA tail
you need RdRp to make +ssRNA but you need +ssRNA to make RdRp, so how the fuck do -ssRNA viruses express their genome?
they carry RdRp in the virion so RNA alone isn’t infectious but nucleocapsid is cause it carries RdRp
so the first step for -ssRNA viruses is transcription to positive sense RNA
what are the three things RdRp must do?
must synthesise mRNAs (necessary to make proteins from negative sense RNA)
must synthesise full length antigenome (otherwise cannot replicate genome)
must synthesise new full-length genomic RNA from antigenome (in order to replicate genome)
what is vesicular stomatitis virus (VSV)?
of the family rhabdoviridae and a classic example of transcriptional control of gene expression (-ssRNA virus; transcription initiates replication NOT translation)
genome associated with many copies of nucleo(capsid)protein (N) which is wrapped up in a layer of matrix protein (M) which is surrounded by envelope which has trimer of viral glycoprotein (G) embedded in it. Associated with the polymerase (L) is phosphoprotein (P) both of which are critical to initiate replication
has terminal repeats (tr) on its genome
outline the lifecycle of vesicular stomatitis virus (VSV)?
interaction between glycoprotein and receptor induces endocytosis of virus (receptor binding, endocytosis)
endosome acidifies and H ion influx causes VSV glycoprotein fusion protein to fuse w endosome membrane releasing genome into cytoplasm
-ve sense genome (template) so first step is transcription to +ve sense mRNAs (one for each of its proteins N, P, M, G, L) - these proteins have targeting sequences so things like Matrix protein and Glycoprotein get sent out to plasma membrane - also more production of P and L which are needed for genome replication
P and L allow for production of +ve sense antigenome which is template for more genome and more mRNAs
switch between making mRNAs and genome from +ve sense antigenome is determined by presence of Nucleocapsid protein (less N = more mRNAs and more N = more full length)
how does VSV express multiple proteins when it has one piece of RNA?
transcription is sequential (N then P then M, G, L)
initiation at start of first gene (t) and it gets capped then polymerase makes copy till it gets to junction between N and P where it puts on polyA tail and releases mRNA
does the same for P, M etc. (each capped and tailed)
approx 2/3 of previous gene each time i.e. more N than P, more P than M etc.
amount of mRNA determines how much protein made so you get lots more N protein than L protein and this makes sense cause each genome copy needs lots more N than L
what is controlling VSV protein expression?
PL complex (P and L) initiates at 3’ end at transcription initiation sequence and starts transcribing N mRNA and caps it so ribosome can assemble
reaches termination sequence between two genes and adds polyA tail and releases mRNA for translation
2/3 of the time PL complex then restarts transcription at initiation sequence (reinitiates) and starts transcribing the next gene
1/3 of the time is dissociates and reinitiates at the 3’ end of genome
how does VSV add the polyA tail to its mRNA and what determines reinitiation?
termination of upstream gene occurs when RdRp reads through UUUUUU section of template and starts chattering and synthesising the polyA tail - this is called reiterative transcription
reinitiation for downstream gene transcription occurs if RdRp recognises sequence at 3’ end of downstream gene - this occurs 2/3 times and it then caps the 5’ end
1/3 of the time it releases RNP complex and goes back to start
how does VSV replicate its genome if it terminates at junctions?
after making proteins it needs to make genome - to make genome it needs to make antigenome - how the fuck if it keeps terminating at junctions
L, P and N needed to make antigenome
as N (nucleocapsid protein) accumulates this improves replication processivity as N binds nascent RNA as its being synthesised stopping chattering and preventing transcript termination
this means gene junctions get ignored and you get full length antigenome
so as N increases it becomes more efficient at making full length
what is sendai virus genome structure?
of the family paramyxoviridae
-ssRNA genome; similar five genes to those found in rhabdoviruses; (3’) NP (this is the N), P, M, HN, L (5’); also has nonstructural C gene and Fo between M and HN and also variants from P mRNA in diff ORF but not as important I think
3’ and 5’ ends are complimentary, makes new mRNAs the same as VSV
how does sendai virus get alternate proteins from the P ORF - the C proteins?
after NP (nucleocapsid) turned into mRNA RdRp reinitiates, caps and makes mRNA for P, chattering –> puts polyA tail on
ribosome starts scanning through but sometimes doesn’t pick up and initiate translation at start codon at 104 (AUG), in which case it continues till it hits another which is at 114 (first alternate AUG) but if it misses that then 183 (2nd AUG) then 201 (3rd). Can also initiate upstream of 104 at 81 (ACG) - called leaky scanning
this shifts the reading frame so it no longer in same ORF as P and encodes a completely different protein
C protein is in different ORF to P and has a completely different amino acid sequence - C’ (initiation at 81), Y1 (183) and Y2 (201) are variants of C (114)
so through leaky scanning sendai virus can carry more than one coding sequence on one piece of RNA
what is leaky scanning an example of?
TRANSLATIONAL control of protein expression
(even though it -ssRNA, occurs during translation step)
how does sendai virus exhibit TRANSCRIPTIONAL control of protein expression through making alternate proteins from the P ORF?
30% of the time when RdRp is making P mRNA it sticks in a G at 1183 causing reading frame to shift by one frame
results in a protein that looks like P but with a C terminus
10% of the time it chucks in two Gs at 1183 meaning there’s a different frame shift to the other fucking frame shift making a P protein with a third C terminus
so there are three possible reading frames for P protein all resulting in proteins with different tails
how many proteins can sendai virus make from P gene and why is this important ?
P gene makes P mRNA which can encode for P, C’, C, Y1, Y2 and sometimes V and W
so seven proteins from one sequence in the viral genome and all these proteins have a differing functions
this allows it to carry a lot of information in their genome
what is sendai virus post-translational cleavage?
viral protein processing can include host cell proteases and viral proteins can also undergo a maturation processing event
Fo (fusion protein) is processed to F1 and F2
this involves two cleavage events carried out by host cell proteases; leader sequence removed during secretion and then cleavage into F1 and F2 in the Golgi
what are the sendai virus protein coding strategies?
transcription initiation
non-templated editing
translation initiation (leaky scanning)
post-translational modification
what is lassavirus?
of the family arenaviridae which have ambisense genome i.e two pieces of RNA with mixed sense on them
virions contain host ribosomes
how do arena viruses code for multiple mRNAs?
genome has two ambisense genome segments
there’s no cap or polyA tail so these cannot act as mRNA even though there’s +ve sense gene at 5’ end - so because no way to initiate translation we call it -ve sense
but RdRp can recognise 3’ end and initiate transcription to make +ve sense copy and when it does this it will cap and tail it making sgRNA in +ve sense
genome can get replicated to full length - no cap or polyA tail on full length but RdRp can recognise UTR/NCR at 3’ end and make capped and tailed +ve sense sgRNA
S segment encodes two proteins: N and GPC
L segment encodes two proteins: replicase and Z
and each gene is transcribed as subgenomic mRNA
what are some generic features of orthomyxoviridae?
enveloped
spikes of Haemagglutanin and Neuraminidase (H and N refer to antigenic type e.g. H1N1)
13,500 base, segmented negative-sense RNA
complementary 3’ and 5’ ends (so the genome ends base pair)
polymerase associated with homologous ends (i.e. carries polymerase in association w genome)
Influenza A (humans, birds, swine), Influenza B and C (both humans) and thogoto virus (mammals)
all these transmission by air except the last which is ticks
what must the RNA segments of influenza virus do and what are the segments?
get into the nucleus (all 8) to initiate replication cycle
the segments are vRNAs with 5’ and 3’ UTRs which are transcribed to mRNAs
segments can be referred to as vRNPs - viral RNA wrapped up in nucleocapsid and replicase complex (PA, PB1 and PB2) associated with homologous 3’/5’ RNA ends
these 8 vRNPs get packaged in virus particle in 1 + 7 structure
how does viral nucleoprotein get into the nucleus?
it is imported by our cells standard nuclear import machinery - importins and RAS
there’s a peptide sequence in nucleocapsid protein which targets the vRNP segments to nucleus
what are the key proteins of influenza and their function?
polymerase/replicase (PB1, PB2 and PA complex) - replication of genome
haemagglutinin (H or HA) - viral attachment protein; bind sialic acid (host receptor)
nucleoprotein (NP) - binds RNA at 1 per 20 bases (genome packaging) and also facilitates full length RNA synthesis, part of vRNP complex
neuraminidase (N or NA) - release from host cell (cleaves receptor from cell to allow release after budding)
matrix protein (M1) - matrix protein (export assembly and packaging), targeting of vRNP complex to assembly site, bridges vRNP and envelope
what are some other important proteins in influenza (but change between strains)?
M2 - tetramer in membrane of infected cells (present in low amounts), ion channel protein; essential for genome release
non-structural 1 (NS1) - range of activities
non-structural 2 (NS2)
PB1-F2 - apoptosis control and replication
PA-X - host shutoff activity
what are the three types of RNA synthesised during replication of influenza?
negative genome
positive (anti)genome
positive mRNA
so three RNAs produced per segment - virion RNA (the genome), mRNA (for protein translation) and antigenome RNA (for replication)
outline genome replication in influenza?
occurs in nucleus
viral RNP = template for PB1, PB2 and PA (replicase)
full length antigenome synthesis
RdRp promoter in 3’ region of template strand (remember polymerases synthesise 5’ to 3’)
outline cap stealing and polyadenylation in influenza?
steals methyltransferase (cap) from host mRNAs as it leaves nucleus so it can interact w ribosome
uses these to prime mRNA synthesis (cap for mRNA)
poly-U tract for poly-A tail (needed for translation i.e. makes polyA tail via chattering)
8 segments encode >11 proteins
what does the virion RNA (genome) of influenza look like?
sequences at 3’ and 5’ end will be identical between all segments - these flank the gene
negative sense (3’-5’)
what does the mRNA (for protein translation) look like in influenza virus?
capped and truncated i.e. different at each end
cause 3’ end got polyA tail after RNApol chattered and dissociated 17-22 nucleotides before template actually finished
at 5’ end has got cap which was stolen from host along with hosts 10-13 first nucleotides
positive sense (5’-3’)
so this how influenza exhibits transcription control of mRNA production (cap stealing from host mRNA)
what does influenza virus antigenome RNA (for replication) look like?
an exact complement of the genome/virion RNA in full length (unless there’s some genetic drift)
positive sense (5’-3’)