Protein networks identification and analysis Flashcards
4 lectures Identifying protein localisation and interactions in cellular systems Analysing cellular systems using quantitative proteomics Analysing cellular systems using quantitative proteomics Understanding cellular-level system data using graph theory 1:
Predicting protein sequences limitations
•Reading frames, splice variants and annotated function must be correct.
•Predicting localization depends on recognisable signal peptides or motifs. In the cell many determinants of membrane association are transient.
•Post translational factors increase complexity
-These factors include localisation, modifications, interactions, abundance and turnover at the protein level.
Alternative splicing
another source of variation
“A homologue of the Downs Syndrome Cell Adhesion Molecule (Dscam) protein in Drosophila melanogaster has 38,016 isoforms
Note the entire Drosophila melanogaster genome only has 15,016 genes!”
changes of location
-During production/breakdown
-During Function
We cannot predict all the complexity and information content of the functioning cell from genomics/transcriptomics alone (or by any single technique).
Proteomics
the study of the ‘entire’ protein content of an organism, tissue or cell.
Proteomics seeks to provide information on the identity, amount, modifications, and subcellular location of proteins.
Challenges
• Proteins are dynamic in their location, abundance, splicing, modifications and interactions. – Often cannot distinguish splice variants or closely related proteins using peptide-based methods • Huge range of abundance. Low-abundance proteins may be the most important class of proteins but are the most difficult to study with current methodologies. • Rapid temporal changes. Proteins can interact, move and be modified very quickly (seconds to minutes) • Tools: Proteins cannot be studied with the scale, speed, sensitivity and reliability that is currently achievable with nucleic acids. We cannot yet do ‘protein PCR!’: no amplification. Possible new methods with RNA aptamers. An aptamer is a sequence of single strand nucleic acid with a variable region of around 40 bases.
How can proteins be measured?
- Immunological approaches
- Biochemical approaches
- Mass spectrometry-based methods
- Fluorescent or other imaging methods
- Emerging technologies
Immunological approaches
-Use antibodies to detect (i.e. localise) or to physically isolate protein
e.g. immunohistology ’Human Atlas’ project.
– Enzyme Linked Immunosorbent Assay (ELISA)
– Western blotting
immunohistology ’Human Atlas’ project.
importance in human histology- the human protein atlas. Measurements of protein location. Changes of location indicates disease. 11,274 antibodies corresponding to 8,489 protein-coding genes.
Human atlas examples
CD44
a cell-surface glycoprotein involved in cell-cell interactions, cell adhesion and migration.
Alternative splicing is the basis for the structural and functional diversity of this protein and may be related to tumour metastasis. However, the full-length nature of some of these variants has not been determined. CD44 has 30 splice variants in humans.
DSCAM
a member of the immunoglobulin superfamily of cell adhesion molecules involved in human central and peripheral nervous system development. This gene is a candidate for Down syndrome and congenital heart disease (DSCHD). DSCAM is very highly spliced in Drosophila but has only 2 isoforms in humans
Limitations of immunological approaches
•need one antibody per protein = laborious and expensive.
• The binding specificity depends on the epitope (binding sites)
– could be non-specific
– inhibited by modifications/interactions/(un)folding
– not all antibodies are suitable for all tasks
uses of antibodies in biochemistry
–Detection: histology (confocal and electron microscopy), enzyme-linked immunosorbent assay (ELISA), Western blotting.
–Isolation of protein and interacting partners by co- immunoprecipitation
Overcoming the limitation of a specific antibody for each protein
•Introduce a well characterized ‘TAG’ to the protein of interest using genetic modification. This relieves one limitation and some of the expense
Tags include:
•Small proteins: GST,GFP
•Peptides: FLAG.Myc,6xHis
•Tandem affinity tags (TAP)
-larance paper: specific ab or tagged protein immunoprecipitation advantage- high sensitivity can be achieved for proteins in low abundance.
Fusion proteins
- These fusion proteins help to visualise where a protein is located (with various limitations depending on the tag and organism or cell type)
- Fusion can be used to enrich the modified (tagged) protein and interacting partners using co-immunoprecipitation or by fractionation of cell extracts and testing for presence of tag.
Biochemical approaches
Use the physical properties of different proteins as purification tools: size, charge, affinity binding, hydrophobicity.
- You can find out where your protein is during the purification:
• by following protein’s activity (if known/traceable)
• by following presence of protein (fluorescence, epitope tag) Epitope tagging is a technique in which a known epitope is fused to a recombinant protein using genetic engineering. Epitope tags make it possible to detect proteins when no antibody is available.
Emerging tech
RNA aptamers and nanopores
Detecting proteins in complicated mixtures
- Polyacrylamide gels (denaturing, SDS-PAGE) (native no SDS)
- Western blotting (always denaturing)
- Enzyme Linked Immunosorbent Assay (ELISA)
- Fuse fluorescent tag
- Mass spectrometry to identify proteins
Mass spec
4 stages: sample, ionisation source, flight path, detector
Proteins can be prepared for MS from gel slices or from solution digests e.g total cellular proteomics. Then we digest the thousands of proteins into 20-50 peptides from each protein
LC/MS-MS
- protein
- tyrptic digest
- nanoflow LC of peptides
- MS to select intact peptide and fragment peptide
- peptide mass, retention time, fragmentation, intensity
- assign peptides through database search (larance reading database for raw ms available)