Proteomics Flashcards
Which is capable of identifying a higher proportion of the proteome accurately? Peptide-centric or protein-centric approaches? Why?
• Peptide-centric approaches are capable of identifying a higher proportion of the proteome than protein-centric
o Many more protein identifications and parameters (e.g. peptides) per protein
o Every peptide generated becomes an individual parameter that represents a protein- important for quantitation
There is a lot of statistical power of using peptides to represent their parent proteins
What is the difference between relative and absolute quantitation?
o Relative vs absolute quantitation
Relative quantitation- how much there is in sample 1 vs in sample 2
Absolute quantitation- how many micrograms of each protein are present within a sample
What is the aim of comparative quantitative shotgun proteomics?
• Measures relative abundance across two different samples (one control and one test)
What are 3 methods to perform comparative quantitative shotgun proteomics?
- Method A-metabolic stable isotope labelling
- Method B- isotope tagging by chemical reaction
- Method C- stable-isotope incorporation via enzyme reaction
Describe the process of metabolic stable isotope labelling for quantitative shotgun proteomics and when it can be used
• Method A-metabolic stable isotope labelling
o Cells growing in culture or living tissues
o During the growth of the cell, incorporating a stable isotope
Stable isotopes used to quantify changes in protein abundance e.g. 2H or 13C
o Isotope is added to growth medium and cells take it up and incorporate it when they undergo protein translation
In this approach, an isotope labelled amino acid e.g. (leu or lys- light and heavy) must be incorporated during protein synthesis
• Cell will make proteins with light (or normal) isotope and proteins with heavy isotope
o Isotopes have identical chemical properties
o The proteins are then digested-should get a light version of the peptide and a heavy version of the peptide, which creates a mass difference
o Do chromatography and mass spectrometry-> the heavy and light versions of the peptides will appear as pairs: one from the control and one from the test
o Peptides tagged with differing isotopes will appear adjacent on a mass spectrum
Separated by known mass difference of the light and heavy amino acids as it is incorporated
o Area under each peak equate to relative abundance of the two peptides
Can use spectral intensity of the peaks in the mass spectrometer as a means of measuring the abundance difference (relative abundance)
What is an example of a tag/method used for metabolic stable isotope labelling
- Stable isotope labelling by amino acids in cell culture (SILAC)
What is the process of stable isotope labelling by amino acids in cell culture (SILAC) and what are its requirements?
• Process-
o Labelled peptides-> optional protein purification-> combine control and test samples and digest with trypsin-> quantitation by MS
• Grow cells in presence of a labelled amino acid for which the cell cannot biosynthesise
o Incorporation of SILAC label over time
• No labelling step as label incorporated during synthesis
o E.g. deuterated leucine (Leu)
o 100% Leu labelled, approx 50% peptides
Leucine is the most common amino acid, and lysines are extremely common in tryptic peptides
• Allows differentiation of leucine and isoleucine
• Needs cells grown in culture
Describe an example of proteomics of B cell differentiation using SILAC quantitative proteomics
- Define B cells
- Differentiation of B cells
- Study details
• Example- proteomics of B cell differentiation using SILAC quantitative proteomics
o B cells are immune cells (lymphocytes)
o Differentiate into plasma cells that secrete antibodies against non-self antigens (eg. bacteria)
o Stimulate differentiation by treating with bacterial lipopolysaccharide (LPS)
o 234 protein expression changes, including a cluster involved in antibody production
o Looked at how protein expression changed over time (samples were collected over several days)
o Performed MS-based quantitation based on unlabelled vs 13C6-Leu
Dynamic quantification shown at the single peptide level during B cell differentiation
Control- day0/day0: this should be identical
o Cluster analysis and cellular location of 234 differentially abundant proteins
Enables for formation of hypotheses
What is cluster analysis?
Cluster analysis-task of grouping a set of objects in such a way that objects in the same group are more similar to each other than to those in other clusters
What are the advantages of stable isotope labelling by amino acids in cell culture (SILAC)?
- Very reproducible
- Generally high labelling efficiency
- Multiple peptides per protein- therefore good statistical confidence
- Compatible with PTMs
- Compatible with most cell-based systems
What are the disadvantages of stable isotope labelling by amino acids in cell culture (SILAC)?
- Need cells growing in culture
- Some cells are fussy- poorly incorporate label
- Human based Tissue studies
- Blood/plasma don’t contain metabolism cells
- Also results in 2 ions or more per peptide in MS scans
Describe the SILAC mouse, how it is produced and why it is so useful
• The SILAC mouse
o For tissue studies
o Take a mouse-> give it a lysine free diet supplemented with either a light or heavy version of the amino acid-> make a SILAC mouse (with either light or heavy amino acid) depending on supplied amino acid
Good incorporation over a period of time for many proteins except for haemoglobin (red blood cells)
o SILAC mouse can reveal information about protein turnover
What is a disadvantage of the SILAC mouse and how is it overcome? What is a potential problem with the solution?
Extended labelling times however do not result in 100% labelling efficiency- most likely due to more complex recycling of amino acids
• This can be overcome by feeding mice over several generations
o Newly born mice only ever know the light/heavy diet
o Results in close to 100% incorporation
o However, sometimes the mothers can eat their young so have to be careful
What is the SILAC fly useful for and how is it produced?
• The SILAC fly
o For tissue studies
o Make yeast in culture that incorporates light or heavy version of amino acids and exclusively breed one set to eat heavy or light yeast food
What is the SILAC worm useful for and how is it produced?
• The SILAC worm
o For tissue studies
o Grow SILAC worm on E.Coli.agar plate that either has heavy or light amino acid
Worms eat the E.Coli and either become labelled with heavy or light amino acid
o Compare the proteomes of the worms
What is the procedure for isotope tagging by chemical reaction in comparative shotgun proteomics and when can it be used?
o Can be used on all cell and tissue types as incorporated after protein synthesis
o Addition of a chemical tag specific for a handle in peptide sequences
Use a specific chemical handle to add a mass tag to peptides
Handles most commonly the primary amines of peptide N-termini or Lys residues or the -SH (thiol) group of Cys
Isotopes/chemical tags have identical chemical properties but are different in mass or are isobaric
o Area under each peak equates to relative abundance of the two peptides
What are isobaric tags in isotope tagging by chemical reaction in comparative shotgun proteomics and why are they useful?
• Isobaric- has the same mass
o Enables for quantitation at tandem mass spectrometry level
o Peptides tagged with differing isotopes will appear adjacent on a mass spectrum
What are tags used in isotope tagging by chemical reaction in comparative shotgun proteomics?
- Tags- Isotope coded affinity tags (ICAT)
- Tags- Isobaric affinity tags (iTRAQ-Isobaric tags for relative and absolute quantitation)
- Tags- Tandem Mass tags
Describe the process of isotope coded affinity tags for isotope tagging by chemical reaction in comparative shotgun proteomics and the quantification process
o Process:
Sample-> lysis-> label mix samples and digest-> 2-DLC separation -> MS scan-> selected for MS/MS
o Label proteins via cysteine residues
o Two tags therefore pair-wise comparisons
o Quantification procedure
Mix label sample 1 and label sample B in equal amounts-> combine and proteolyze-> avidin affinity enrichment-> MS1 scan -> get sequence-> protein identification and quantification
1-3 peptides per protein are used to accurately quantify the relative abundance of each protein
• Limited by amounts of cysteines
Non-cys containing peptides discarded > reduces sample complexity, but also loses information such as phosphorylation
Describe the structure of the isotope coded affinity tags for isotope tagging by chemical reaction in comparative shotgun proteomics and the purpose of this structure
o 3 components-
Thiol-specific group enables protein/peptide binding (functional group)
Linker of differing mass to differentiate control and test groups
• Chemically inert
Affinity tag for purification (biotin)
o Light tag (H) or heavy (D) tag used: differ by 8 Da
o Has a biotin molecule for affinity purification
What are the advantages of ICAT (isotope coded affinity tags) for isotope tagging by chemical reaction in comparative shotgun proteomics
- Reduces complexity
- Easy analysis
- Reproducible
What are the disadvantages of ICAT (isotope coded affinity tags) for isotope tagging by chemical reaction in comparative shotgun proteomics
-Not all proteins contain Cys (20-30% cannot be analysed)
Cysteines are the second least common coding amino acid (only makes up 1.1% or 1.3% of all known coding amino acids in the protein database)
Hence, this tag doesn’t bind many peptides (drawback)
-Those that do contain Cys may have only 1 or 2, hence replicates are needed for statistical analysis
-8 Da ICAT tag difference may complicate MS spectra- by doubling the number of ions in the MS scan
Describe the process of using isobaric affinity tags (iTRAO) for isotope tagging by chemical reaction in comparative shotgun proteomics and the process of quantification using these tags
• Tags- Isobaric affinity tags (iTRAQ-Isobaric tags for relative and absolute quantitation)
o Can do many samples (8 or more)
o Process:
Samples-> lysis-> digest-> label-> mix samples and 2-DLC-> MS scan shows several peptides from same protein-> MS/MS fragmentation cleaves reporter allowing quantitative comparison and identification
o Exploits primary amines
o Only does relative quantitation
o Differentiation at mass spectrometry level-
All peptides are tagged-> no affinity purification
Perform multi-dimensional liquid chromatogaphy
The 4 tags have the same mass so only one peak seen in precursor/MS scan
During MS/MS, tag fragments revealing 4 reporter ions
Reporter mass beneath those seen for y-1 ion
• Intensity of each peak is relative to total abundance of the protein
Allows four way comparison
When undertake fragmentation, remove peptide reactive group, get neutral loss at balance region and release reporter ions-
• Allow for relative quantification at MS:MS level and peptide sequence
Describe the components of isobaric affinity tags (iTRAO) for isotope tagging by chemical reaction in comparative shotgun proteomics and their uses
o 3 components to tag- Peptide reactive group • Amine specific (Lys and N terminal) Isobaric tag (total mass= 145) • Balance (mass 31 to 28) o Neutral loss in MS:MS o Balances the mass change of reporter to maintain a total mass of 145 • Reporter (mass 114 to 117) o Charged o Gives strong signature ion in MS:MS o Gives good b- and y-ion series o Maintains charge state o Maintains ionization efficiency of peptide o Signature ion masses lie in quiet low mass region
Describe an example of using isobaric affinity tags (iTRAO) for isotope tagging by chemical reaction in comparative shotgun proteomics for characterisation of head and neck squamous cell carcinoma. Describe:
- The cancer
- The method
- The result
o Example- head and neck cancer biomarkers
Head and neck squamous cell carcinoma (HNSCC) (includes tongue, larynx, pharynx etc.)
6th most common cause of cancer-related deaths
Lack of early biomarkers (50% diagnosed at advanced disease)
Compared 15 patient samples (cancerous and non-cancerous) and a pooled non-cancerous control
• Aligned samples to pooled non-cancerous control to create heat maps
o Aligned mean fold change found in cancer patients relative to normalized control
• Pooled non-cancerous control enables measurement of individual differences in proteins that may lead to false positives
Several proteins induced/repressed in cancerous tissue
Validation using immunohistochemistry, RT-PCR (transcriptomics) and Western blots
Tested marker specificity to different kinds of cancer (YWHAZ, Stratifin, S100A7, B-actin)
What are the advantages of using isobaric affinity tags (iTRAO) for isotope tagging by chemical reaction in comparative shotgun proteomics
- Many peptides identified per protein so statistical confidence is high
- Requires fewer replicates
- All labels have same mass so improved analysis time (MS scan)
- Suitable for all biological systems
- Suitable for post-translational modification analysis
What are the disadvantages of using isobaric affinity tags (iTRAO) for isotope tagging by chemical reaction in comparative shotgun proteomics
- iTRAQ tends to underestimate n-fold changes
- iTRAQ labels may induce super-charging of peptides [particularly 8-plex iTRAQ] resulting in reduced protein identifications
- -True with high numbers of reagents
How do tandem mass tags for isotope tagging by chemical reaction in comparative shotgun proteomics work?
• Tags- Tandem Mass tags
o Work like ITRAQ tags: isobaric tags where quantification is performed at mass spectrometry level
Describe how stable-isotope incorporation via enzyme reaction works in comparative shotgun proteomics
o Can be used on all cell and tissue types as incorporated after protein synthesis
o Exploits the enzyme reaction (e.g. trypsin digest) to incorporate a stable isotope
Tags- dimethylation
o Trypsin digest adds water to the C-terminus of the newly created peptide (Lys/Arg C-terminus). Perform trypsin digest in light or heavy (deuterated) water to incorporate a mass difference
Isotopes have identical chemical properties
o Peptides tagged with differing isotopes will appear adjacent on a mass spectrum
o Area under each peak equates to relative abundance of the two peptides
What is the surfaceome and why is it useful to understand?
• Surfaceomes- proteins (or parts thereof) located on the surface of a cell
o Understanding what is on the surface of organisms can be very important in an applied way
Vaccines
How two cells interact with each other
o Membrane and surface associated proteins are important in vaccine design and we can look specifically at those surface proteins
• Surfaceomes are important for vaccine development
• Looks at proteins located on surface of cell or on the membrane
• Look at which protein/peptide epitopes are surface-exposed
What is the secretome?
• Secretomes- proteins secreted from a cell into the extracellular space
What are organellar proteomics?
• Organellar proteomics- nuclear proteins, ribosomal proteins, etc.
How is a good vaccine made/ what are requirements of a good vaccine?
• A good vaccine is made through:
o Exposed to the immune system
o Can use whole attenuated (non-pathogenic) live or dead organisms (e.g. Mycobacterium bovis BCG for tuberculosis)
Most vaccines these days are designed against specific biomolecules
• Liposaccharides are highly immunogenic but aren’t specific
• Proteins are generally specific
o Elicit an immune response (antigenic/immunogenic)
o Ideally an immune response that persists
o Then it needs to capture the whole organism upon infection so that it cannot proliferate
Who pioneered pathogen genomics for reverse vaccinology?
o Pioneered by Dr. Rino Rappuoli
What are the phases of pathogen genomics for reverse vaccinology?
Exploit genome sequencing
Sequencing of many strains/serovars/subtypes etc. to create a representative pan genome with genes conserved across all representatives
• Pan genome- the entire set of genes for all strains within a clade
Perform in silico predictions to identify potential antigens (and/or proteomics to determine their expression under different conditions)
Synthesize best candidates [can be many!] as recombinant proteins
• High copy plasmid replicated in E.Coli
Determine protection in animal models
• Take antigens and test them
Human trials
What are the symptoms of meningitis and sepsis?
o Inflammation of the lining of the brain (the meninges) or the blood (sepsis)
o Flu-like symptoms but can cause death in hours in infants and young adults
What is neisseria meningitidis?
o Neisseria meningitidis- Gram negative bacterium with five major serotypes (antibody binding): A, B,C,Y and W135
Why was serotype B of meningitidis difficult to make a vaccine of?
o No current vaccine for serotype B which causes 45-80% of N.meningitidis cases (driven in part by successful vaccination campaigns against the other serotypes)
Serotype B doesn’t work with other vaccines because the outer capsule (polysaccharide) is an autoantigen
• Serotype B is not recognised as a foreign organism in humans
How was reverse vaccinology used to make a vaccine for serotype B of meningitidis? Describe the difficulties in the process and the end result
o Genome (sequenced through whole-genome sequencing)
Sequenced a strain of the meningococcus genome
o 350 proteins successfully over-expressed into E.Coli
Not all successfully over-expressed in E.Coli because some might be toxic to E.Coli, some will form inclusion bodies and not be purifiable
o Raised antibodies in mice and screen for those that bind to the bacteria (7 proteins)
o The strongest binders were chosen for clinical trials
o Most did poorly due to:
Antigenic variability across strains
Some antibodies were missing in hypervirulent strains
Phase variability
Low copy number
o Current strategy is to use a multicomponent vaccine based on 4 proteins
Open reading frames put together and expressed as a single construct
Ensures that at least some reading frames are expressed across strains
o Completed phase 4 trials- and is effective across many B strains
o Became Bexsero- a multicomponent vaccine for prevention of meningococcal disease (available in Australia in 2015)
What elements are contained in the pan genome of serotype B N.meningitidis? Describe their significance
• 2 pathogenicity islands
• Highest number of phase variable genes
o Phase variable genes- genes that can be switched on and off- not good vaccine candidates
• Polysaccharide synthesis pathway
• 570 ORFs encode potential surface-exposed or secreted proteins (that is vaccine candidates)
What are pathogenicity islands?
o Pathogenicity islands- little bits of DNA acquired from other microorganisms or from the environemtn that are not shared with all the strains
What is the process of reverse vaccinology and the time frame?
o Process of reverse vaccinology-
Start with genome sequence-> predict candidates with bioinformatics-> prepare high copy number plasmid and recombinant protein-> test in animal models
Takes 1 to 2 years
What are the essential components for reverse vaccinology to be successful?
o Reliance on:
Genome sequence
Predictive tools to identify antigens
Guarantees that those antigens do not have human homologs
• Due to autoantigens
Proteins must be amenable to recombinant expression (often difficult with membrane proteins)
Requires scale ($$$/person years)
What is the process of conventional vaccinology and at what scale is it? What is its time frame?
o Conventional vaccinology Generally small scale Tests patient response first and works from that Purification of antigenic components Expression of those proteins o Process- Look at organism and host response to organism (small scale)->remove blood cells and plasmids-> take preparation of proteins/biomolecules and test to see whether there are antibodies against these using convalescent patient serum-> anything that human response is positive for, purify proteins and start testing in animals Takes 5-15 years
What is western blotting based on and how can it be used in vaccine development?
o Western blotting
Based on the simple principle of antibody-antigen reactivity where in this case bacterial protein lysates (or sub-cellular fractions) are the potential antigens
• One assay per gel/per blot
Patient sera will contain a series of primary antibodies against specific antigens- the secondary will be a labelled anti-human antibody
• Secondary antibody used for visualisation
What is pre-fractionation of the proteome and why is it useful? What cells/tissues can be pre-fractionated?
• Selecting specific subset of proteins to look at
• Used to answer a specific question about the organism
o Allows for focusing on specific question and better/more specific manner in which to answer it/gain data for it
o Allows for increased specificity and “zooming” on a particular part of the proteome to gain more detailed data
• More complex cell/tissues can also be fractionated:
o Surfaceome
o Mitochondria
o Nucleus
o Vacuoles
o Endoplasmic reticulum
o Golgi body
o Myofilaments
What are the advantages of pre-fractionation of the proteome for study?
- Specific analysis of a particular set of proteins (e.g. no need to look at cytoplasmic proteins if you are interested in adhesion)
- Reduces the complexity of the proteome and allows more lower abundance proteins to be visualised
What are the disadvantages of pre-fractionation of the proteome for study?
- Technically more challenging
- –Very fast ultra-centrifugation approaches
- Difficulty in achieving purity of samples
- –Enrichment only- no fractionation is ever 100% pure (abundant cytoplasmic or other proteins will always contaminate)
- Great number of samples to view entire proteome
What are the two main modes of study of the surfaceome and how good are they at studying the surfaceome?
o 2-DE gels are poor for looking at hydrophobic proteins, especially those with more than 3 transmembrane domains
o 2-DLC/MS-MS (shotgun proteomics) can also be used- relies on peptides and hence hydrophobicity is not an issue
What is the process of using 2-DE gels to study the surfaceome and what is a problem with this approach?
Process: bacterial cells-> sodium carbonate precipitation and ultracentrifugation-> membrane protein-enriched fraction-> Solubilization in 2DE specific buffer+ ASB-14-> excise spots, digest and identify by PMM MALDI-MS+ MS/MS
2-DE approach under-represents hydrophobic proteins with many TMR
• This is because the total protein is hydrophobic-not soluble
What is the process of using shotgun proteomics to study the surfaceome and why does this approach work so well with the surfaceome?
o 2-DLC/MS-MS (shotgun proteomics) can also be used- relies on peptides and hence hydrophobicity is not an issue
Hydrophobicity not an issue as trypsin only cleaves in soluble regions and since it relies on peptide analysis, that’s ok
Process: bacterial cells-> sodium carbonate precipitation and ultracentrifugation->membrane protein-enriched fraction-> digest complex membrane protein-enriched fraction with trypsin-> separate by SCX+RP and identify by MS/MS (shotgun approach)
Advantages of 2-DLC approach:
• Improved protein/proteome coverage
o More proteins identified
o Minimized dynamic range effects
o High mass and/or basic proteins
o Hydrophobic proteins and those with many TMR
• Rapid process
• Compatible with quantitation (iCAT, iTRAQ, etc.)
What are epitopes?
o Epitopes- regions of proteins that can trigger a cellular immune response mediated by T or B cells
What is a way of studying surfaceome epitopes and why would we do so?
o Mode of study-
Cell shaving-
• Surface-exposed epitopes are those most likely to interact with pathogens or conversely, with the immune system
o Characterisation and identification of vaccine candidate proteins
Can attempt to use surface-exposed peptides as protective antigens (essentially means that humans make antibodies to either the protein or the exposed epitope that can then ‘protect’ against subsequent infection with the organism
Used cell shaving approach
Describe the process of cell shaving for surfaceome epitope study
• Proteinase-K can be used: these peptides/proteins are generally in their native forms and not all amino acids of the protein will be exposed-get nice short peptides
o Trypsin can also be used
• Process A:
o Whole cells are incubated in the presence of proteinase-K/trypsin
o Proteinases shave off surface-exposed parts of the protein
Anything that is within the membrane and cytoplasm, as well as any non-protein components of the surface, remain untouched
o Peptides subjected to MS/MS
o Issue with this process- cells do not like being in incubation solution and they will lyse. To avoid this, can employ process B for false positive identification
• Process B: False positive identification
o Whole cells are incubated without proteinase-K/trypsin
o Cells removed by centrifugation and trypsin inserted in the supernatant for digest to occur
o Peptides subjected to MS/MS
Anything seen in the MS/MS result must be a cytoplasmic protein (contamination from cell lysis) due to the lack of trypsin in the incubation solution
What is the secretome?
• Secretome- all the proteins that are released into the extracellular environment
Why is it important to look at the secretome and what is the proteomic advantage/disadvantage of looking at this -ome
- Amenable to proteome analysis as obviously highly soluble
- Tend to be lower in mass as they are secreted from a secretion system -easier to deal with through proteomics
- Important virulence factors- toxins, proteases, siderophores, as well as non-protein factors
- Cell-cell communication
- However, they are technically challenging to collect
Describe the process of examining nuclear proteins/how they are collected and visualised
- Isolate the nucleus using differential centrifugation (organelle separation)
- Examine purity using microscopy/staining
- Solubilize proteins and perform 2-DE or digest/2-DLC
- Second round of enrichment using DNA-affinity chromatography- enriches for proteins that bind DNA (e.g. transcription factors)
Why is it important to enrich transcription factors when looking at nuclear proteins?
• Nuclear proteins+ DNA-binding proteins (should be enriched for low abundance transcription factors etc.)
o Transcription factors are amongst the most least abundant proteins found in a eukaryotic cell (5-20 copies per cell)- hence this is important
What types of environments do cells, tissues and organisms respond to and how?
• Cells, tissues and organisms respond to changes in their internal (genetic) and external environment by molecular adaptation
What is molecular adaptation?
o Molecular adaptation is a process involving changes in the expression of genes (and thus proteins) needed for survival under those altered environmental conditions
What are regulons?
- Regulons (regulome)- those genes/proteins that respond to a change in internal environment/genetic conditions
- Regulon- a group of genes/proteins that are controlled (promoted or repressed) by a single regulatory gene/protein
What are stimulons?
- Stimulons (stimulome)-those genes/proteins that respond to a change in external environmental conditions (e.g. temperature, nutrient availability, oxidative stress)
- Stimulon- a cluster of genes/proteins that respond to a change in the external environment (or a stimulus)
What information is shown on genomic heat maps?
• Shows how genome responds:
o Some genes are rapidly switched off and then gradually return to near normal (reduced) levels
o Some genes are rapidly switched on and then gradually return to near-normal (elevated) levels
o Some genes are gradually repressed over time
o Some genes are gradually induced over time
• Shows on a temporal time scale for dynamic visualisation
What does transcriptomics measure?
o Transcriptomics measures the relative levels of mRNAs in a given sample
Describe and explain the components influencing the correlation between transcript and protein
o An increase in transcription (mRNA) does not necessarily lead to an increase in translation (protein)
mRNAs have varying half-lives- a small amount of a long-lived mRNA can lead to much translation, while a large amount of short-lived mRNA can do the opposite
Proteins have varying half-lives
o Poor correlation between transcriptomics and proteomics when observed at a single time point
mRNA, protein and metabolites are temporally dissociated-
• mRNA responds first, followed by protein, followed by the product of that protein
• A single time-point is not representative of the temporal nature of genomic response
Transcriptomics fold changes have larger magnitude than proteomics
What influences the protein abundance?
Stability of mRNA/protein, protein abundance relates to degradation as much as synthesis [energy required to synthesise a protein]
What are problems with examining transcriptomics alone to solve a protein problem?
o Problems with looking at transcriptomics alone
Body fluids have no mRNA
Can’t examine subcellular fractions
Transcriptomics also does not account for protein post-translational modifications
How can the regulon/regulome be analysed/studied?
• Transcriptomics and proteomics can be applied to determine the regulome controlled by known regulators
o Members of such regulons change in their expression (up- or down-regulated)
• Analyse by
o i) gene knock-out- proteins that increase in abundance are repressed by the regulator, those that decrease in abundance are promoted or positively regulated
Can also silence/repress it if complete knockout is lethal
o ii) gene over-expression- proteins that increase in abundance are promoted, those that decrease are repressed
• Look for global regulators (in bacteria two-component systems, in mammalian systems transcription factors)
• Use of transcriptomics and/or proteomics
What is the role of regulatory proteins
• Role of regulatory proteins-
o If a certain repressor is mutated (deleted) or a transcription factor (activator) is over-expressed, proteins that are part of their regulons should change in expression
What is Tup1?
• Responsible for the repression of glucose-repressed genes
What is Yap1?
• Activator of environmental stress genes
What symbol represents deletions?
• Deletions have a Δ in front of them
What symbol represents overexpressions?
• Overexpressions have a +++ in front of them
What is sarA in S.aureus and what is its role?
Regulons- sarA in S.aureus
• SarA- staphylococcal accessory regulator
• Global regulator of virulence factor gene expression in S.aureus
o Secreted proteins- high in virulence factors
• Either induces (positive regulation) or represses (negative regulation) target gene expression
How is sarA studied for possible interventional targets and describe the success of this/ the reason for this level of success
• Genetic ‘knock-out’ of gene [sarA] encoding regulatory protein [SarA] enables detection of SarA-regulon using proteomics
• Lip1 is not a good interventional target for S.Aureus treatment-
o Lip1- a lipase that breaks down host cell lipids (appears in WT and SarA mutant)
o Lip2- backup for Lip1 if SarA is turned off
Describe hypoxia-inducible factor 1 (HIF-1) under normoxia (normal oxygen conditions)
• Normoxia-under normal oxygen conditions
o HIF-1 alpha is sensing oxygen and is hydroxylated by proline hydroxylase that adds the hydroxyl groups to two prolines
o Von Hippel-Lindau protein (VHL) binds to HIF-1
o Ubiquitination of the HIF-1 protein
o Allows targeting to proteasome (cellular garbage compactor) and subsequent proteasomal degradation, which makes peptides that can be recycled by the cell
Describe hypoxia-inducible factor 1 (HIF-1) under hypoxia (not enough oxygen conditions)
• Hypoxia- not enough oxygen
o Proline hydroxylation doesn’t occur to HIF-1 as there is no oxygen for hydroxylation
o Instead HIF-1 alpha goes to the nucleus and combines with its nuclear cofactor HIF-1 beta
o Protein complex binds as transcription factor to target DNA
Target DNA is referred to as a hypoxia response element (HRE)
How does HIF-1 stimulate glycolysis in hypoxia and why is this important?
o In hypoxia, HIF-1 stimulates glycolysis (which is important in tumourigenesis)
Cells sense a change in oxygen availability and increase transcription of HIF-1 (in hypoxic conditions, HIF prolyl-hydroxylase is inhibited, since it uses oxygen as a co-substrate)
HIF-1 binds to the promoter regions of glycolytic genes, increasing their expression to maintain production of ATP
Similarly PDH kinase is increased, phosphorylates and inactivates PDH and flux into the TCA cycle (slows TCA cycle as don’t want production of reactive oxygen species-> can lead to damage DNA), while stimulating LDH expression and production of lactate (decreasing pH)
Increased protease degrades mitochondrial cytochrome oxidase and both decreased TCA and COX reduce ROS (reactive oxygen species) accumulation
How does HIF-1 allow tumour cells to survive?
• HF1 and cancer
o Tumour cells divide rapidly and therefore need fuel in the form of glucose
o At the same time, growing tumours are poorly vascularized and therefore are poorly oxygenated
o HIF-1 responds to this hypoxia and increases expression of GLUT glucose transporters and glycolytic enzymes
o Tumours are also tolerant to low pH (lactate production)
o HIF-1 positively regulates expression of the hormone VEGF (vascular endothelial growth factor) which stimulates angiogenesis-> this can mean that the tumour is vascularised to keep it alive
Angiogenesis- making blood vessels
o In reality, tumour HIF-1 expression is a gradient based on oxygen availability
Describe 3 manners of visualising proteomics data and how these were used to visualise the HIF-1 regulome?
• Proteomics demonstrates the HIF-1 regulome
o Volcano plots
Fold change in x axis
P value in y axis
o Functional cluster
Which proteins does HIF regulate that we know are in the same functional pathway
o Pseudo-Western validation of known and putative HIF-1 targets
How can the stimulome be studied?
• Omics approaches can also be used to study how cells, tissues and organisms respond to a change in the external environment
What is the stimulome required for and how does it behave?
• Genes/proteins that respond to such a change (again up- or down- regulation) are part of the stimulome required to adapt to such an environmental change
• The individual components of the stimulon may change differently or identically (up-or down-regulation)
• Environmental stress (heat shock, oxidative stress, hypoxia, temperature, nutrient limitation, etc.) are all potential stimulomes
• Environmental stress-
o Over time, in order to survive, the organism must adapt to environmental changes
Can be sudden or subtle environmental changes
Describe the yeast stimulome and how it was found, and why the stimulome is such
o In yeast, one group looked at 142 different stresses/environmental conditions and used transcript-based microarrays to determine there were 900 genes in the yeast environmental stress response (about 15% of the genome)
At least in yeast, response to a variety of stresses is somewhat conserved (that is each gene responds in a near identical way irrespective of the type of environmental change)
Only 1/3rd of genes show a response specific to a given environmental change
• This is the environmental stress response
Yeast conserves how it responds to stresses by switching on/off essential and non-essential components respectively
Environmental stress response allows yeast to adapt to a change in environment
What is the magnitude of the stimuloe response dependent on?
• The greater the shock that is provided to the cell, the greater the magnitude of the genomic response
Does the stimulome adapt to environmental conditions? What happens when it does so and what is this change regulated by?
• Over time, the organism reaches a new steady state where certain genes remain elevated as part of the adaptation process
o The new steady state is a function of the size of the original environmental change
What is the role of peroxiredoxin?
• Peroxiredoxin: uses the free thiols on its cysteines as oxidant mops
o Proteins most likely mopping free radicals
Does a single gene give rise to only a single protein? Why/why not?
• A single gene may give rise to more than one functional protein due to the influence of chemical or physical modifications to proteins
o 6 different PTMs possible at different sites in the protein- 64 possible combinations of gene-products=64 different possible functions
However, most proteins encoded within the human genome have more than 6 PTM sites
Will probably never know how many proteins the human genome can encode
o It may be possible that a site in the protein can be modified by more than 1 PTM
o The capacity for an organism to post-translationally modify its proteins is commensurate with its genome size
o PTMs alter protein/peptide chemical composition and thus change the properties used to separate these biomolecules in proteomics
What are two different proteins that can result in the Nox1 gene and how? What are the purposes of each of these variants?
o Example- Nox1
Full-length nox1 gene is transcribed to produce a full-length mRNA, which in turn is used to produce the full-length integral membrane protein NADPH oxidase (NOX-1L)
• Is part of the electron transport chain in mitochondria
• Has 6 transmembrane domains
Produced by alternative splicing, NOX-1S does not contain the NADPH binding site, but is able to transport H+ ions
• Has 4 transmembrane domains
• Can act as proton transporter