Protein networks identification and analysis Flashcards

4 lectures Identifying protein localisation and interactions in cellular systems Analysing cellular systems using quantitative proteomics Analysing cellular systems using quantitative proteomics Understanding cellular-level system data using graph theory 1:

1
Q

Predicting protein sequences limitations

A

•Reading frames, splice variants and annotated function must be correct.
•Predicting localization depends on recognisable signal peptides or motifs. In the cell many determinants of membrane association are transient.
•Post translational factors increase complexity
-These factors include localisation, modifications, interactions, abundance and turnover at the protein level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Alternative splicing

A

another source of variation
“A homologue of the Downs Syndrome Cell Adhesion Molecule (Dscam) protein in Drosophila melanogaster has 38,016 isoforms
Note the entire Drosophila melanogaster genome only has 15,016 genes!”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

changes of location

A

-During production/breakdown
-During Function
We cannot predict all the complexity and information content of the functioning cell from genomics/transcriptomics alone (or by any single technique).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Proteomics

A

the study of the ‘entire’ protein content of an organism, tissue or cell.
Proteomics seeks to provide information on the identity, amount, modifications, and subcellular location of proteins.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Challenges

A
•	Proteins are dynamic in their location, abundance, splicing, modifications and interactions.
–	Often cannot distinguish splice variants or closely related proteins using peptide-based methods
•	Huge range of abundance. Low-abundance proteins may be the most important class of proteins but are the most difficult to study with current methodologies.
•	Rapid temporal changes. Proteins can interact, move and be modified very quickly (seconds to minutes)
•	Tools: Proteins cannot be studied with the scale, speed, sensitivity and reliability that is currently achievable with nucleic acids. We cannot yet do ‘protein PCR!’: no amplification. Possible new methods with RNA aptamers. An aptamer is a sequence of single strand nucleic acid with a variable region of around 40 bases.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can proteins be measured?

A
  • Immunological approaches
  • Biochemical approaches
  • Mass spectrometry-based methods
  • Fluorescent or other imaging methods
  • Emerging technologies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Immunological approaches

A

-Use antibodies to detect (i.e. localise) or to physically isolate protein
e.g. immunohistology ’Human Atlas’ project.
– Enzyme Linked Immunosorbent Assay (ELISA)
– Western blotting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

immunohistology ’Human Atlas’ project.

A

importance in human histology- the human protein atlas. Measurements of protein location. Changes of location indicates disease. 11,274 antibodies corresponding to 8,489 protein-coding genes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Human atlas examples

A

CD44
a cell-surface glycoprotein involved in cell-cell interactions, cell adhesion and migration.
Alternative splicing is the basis for the structural and functional diversity of this protein and may be related to tumour metastasis. However, the full-length nature of some of these variants has not been determined. CD44 has 30 splice variants in humans.
DSCAM
a member of the immunoglobulin superfamily of cell adhesion molecules involved in human central and peripheral nervous system development. This gene is a candidate for Down syndrome and congenital heart disease (DSCHD). DSCAM is very highly spliced in Drosophila but has only 2 isoforms in humans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Limitations of immunological approaches

A

•need one antibody per protein = laborious and expensive.
• The binding specificity depends on the epitope (binding sites)
– could be non-specific
– inhibited by modifications/interactions/(un)folding
– not all antibodies are suitable for all tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

uses of antibodies in biochemistry

A

–Detection: histology (confocal and electron microscopy), enzyme-linked immunosorbent assay (ELISA), Western blotting.
–Isolation of protein and interacting partners by co- immunoprecipitation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Overcoming the limitation of a specific antibody for each protein

A

•Introduce a well characterized ‘TAG’ to the protein of interest using genetic modification. This relieves one limitation and some of the expense
Tags include:
•Small proteins: GST,GFP
•Peptides: FLAG.Myc,6xHis
•Tandem affinity tags (TAP)
-larance paper: specific ab or tagged protein immunoprecipitation advantage- high sensitivity can be achieved for proteins in low abundance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Fusion proteins

A
  • These fusion proteins help to visualise where a protein is located (with various limitations depending on the tag and organism or cell type)
  • Fusion can be used to enrich the modified (tagged) protein and interacting partners using co-immunoprecipitation or by fractionation of cell extracts and testing for presence of tag.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Biochemical approaches

A

Use the physical properties of different proteins as purification tools: size, charge, affinity binding, hydrophobicity.
- You can find out where your protein is during the purification:
• by following protein’s activity (if known/traceable)
• by following presence of protein (fluorescence, epitope tag) Epitope tagging is a technique in which a known epitope is fused to a recombinant protein using genetic engineering. Epitope tags make it possible to detect proteins when no antibody is available.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Emerging tech

A

RNA aptamers and nanopores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Detecting proteins in complicated mixtures

A
  • Polyacrylamide gels (denaturing, SDS-PAGE) (native no SDS)
  • Western blotting (always denaturing)
  • Enzyme Linked Immunosorbent Assay (ELISA)
  • Fuse fluorescent tag
  • Mass spectrometry to identify proteins
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Mass spec

A

4 stages: sample, ionisation source, flight path, detector
Proteins can be prepared for MS from gel slices or from solution digests e.g total cellular proteomics. Then we digest the thousands of proteins into 20-50 peptides from each protein

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

LC/MS-MS

A
  1. protein
  2. tyrptic digest
  3. nanoflow LC of peptides
  4. MS to select intact peptide and fragment peptide
  5. peptide mass, retention time, fragmentation, intensity
  6. assign peptides through database search (larance reading database for raw ms available)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

LC

A

To deal with very complicated mixtures of peptides. LC separate a mixture of molecules by a physical property.
1.Temporarily bind the mixture of peptides to the column
2. Wash peptides off by applying solvent gradient
3. Peptides elute off column on basis of their hydrophobicity.
The beads are covered with long carbon chains (C18). The amino acid sequence of a peptide determines its hydrophobicity (but hydrophobicity is not enough to identify a peptide).
• Liquid chromatography (LC) gives a retention time for the peptide.

20
Q

Concept C18(reverse phase) chromatography

A

Organic modifiers such as acetonitrile is used to elute the hydrophobic peptides. A gradient of the organic solvent will release the peptides in the order of their hydrophobic interaction strengths.

21
Q

Time of Flight MS

A

Ions mass to charge ratio (m/z) is determined. as ions traverse the analyser, they separate in space. A detector is positioned at the end of the analyser to measure the arrival time of ions. Ions of lesser m/z arrive first, followed by ions of greater m/z.

22
Q

Quad MS

A

The quadrupole scans a range of frequencies that lets ions of a specific mass to charge ratio reach the detector. all the other ions go to waste

23
Q

Ion trap- collision induced dissociation

A

MS- scans ions out over full mass range
CID-selected ion is retained, excited to fragment
MS2-fragment ions ejected

24
Q

Fragment to sequence

A

Can use predicted protein sequences where you match experimental fragment mass spectra to predicted fragment masses from genome. -confidence score.

25
Q

Mass spec proteomics main stages

A
  • Sample preparation with separations(LC)/fractionation(SDS page)
  • Ionisation (ESI,MALDI)
  • Measure intact peptides and fragment ions (MS/MS)
  • Data analysis and protein identification
26
Q

MS quantitative-fragments and peptides

A

– Can make standards using stable isotopes of nitrogen (N15) and carbon (C14) to create mass difference without altering biochemical properties of the peptides.
– quantification at precursor ion level (e.g. stable isotope labelling of amino acids in cell culture SILAC)
– Or quantification at fragment ion level (e.g selected reaction monitoring SRM)

27
Q

Protein identities are inferred from peptides.

A

It is important to have the correct sequence in the database.
Protein inference problem-because peptide sequences are inferred from MS, conserved peptides are a problem
Western blotting may be necessary to resolve unique protein identities

28
Q

MS not enough

A

SERK problem- one experiment tried to find what interacts with FLS2 used GFP coIP to find interactors found that a SERK protein interacts with FLS2. But not sure which one. In second experiment each of the 5 SERKs were expressed with 2 tagged proteins, immunoprecipiated then blotted to detect the other

29
Q

Current status of proteomics

A

– Integration of genetic methods, imaging, mass spectrometry and biochemical approaches.
– Identification of proteins by MS widely ‘routine’
• Total numbers comparable to transcriptomics
• Identification of protein from gel bands, affinity purification, cell lysates
– Quantification more common and reliable; not quite routine
• spectrum counting, peak areas, selected reaction monitoring
• Stable isotopes can be used to greatly improve accuracy (depends on cell types)

30
Q

Generating and navigating proteome maps using Mass Spec by Ahrens

A
  • there has been improvement in proteome coverage
  • ms used to generate partial proteome maps
  • 2 complimentary approaches taken
  • 1 systemic fractionation
  • ms data generated from a particular species
  • difficulty in knowing completeness
  • assess completeness if protein product for all annotated ORFs located in the genome of the species is conclusively identified
  • tight control over protein false discovery needed
31
Q

Protein turnover

A

The length of time that a protein remains available to perform its function is significantly influenced by its turnover rate.
Protein turnover is the balance between protein synthesising and protein degradation

32
Q

Isotopes

A

Pulse chase labelling used to be measured with radionucleotides 14C,35S,32P limited sensitivity of scintillation counter or photographic film
Now we tend to use stable isotopes instead Or fluorescent labels

33
Q

Stable C13

A

We use naturally occurring stable C13 isotopes to determine charge state

34
Q

Stable 15N

A

Unlabelled light peptide = all 14N

Fully labelled “heavy peptide” = All 15N

35
Q

Stable isotope labelling of amino acids in cell culture (SILAC)

A

Separately metabolically label haploid or diploid yeast with stable isotopes in cell culture (SILAC).
After treatment mix light and heavy cultures. Proteins and peptides are chemically identical, so they stay together throughout the fractionation and identification steps in LC- Ms/MS.
But their mass (m/z) differs and the peak area of each is quantitative

36
Q

Yeast protein turnover (degradation rate) by Claydon & Beynon

A

made estimations in exponentially growing cells

37
Q

Global analysis of protein localisation in yeasts Huh et al.

A

Systematically GFP-tag each (6234) yeast ORF in its chromosomal location
– by homologous recombination
– GFP added at C-terminus
Transform yeast → 6,029 strains with chromosomally GFP-tagged ORFs
Analyse yeast with fluorescence microscope → 4,156 (i.e. 75% of yeast proteome) showed GFP fluorescence
Strains initially classified into 12 localisation categories by eye. Refined by co-localising with known markers
Good correlation between localisation and transcriptional co-regulation
Good correlation between protein localisation and protein-protein interaction data
Chromosomal integration (of GFP-fusion proteins) solves the problems related to overexpression artefacts
• The resulting database (http://yeastgfp.yeastgenome.org/ ) is available

38
Q

usual problem

A

The C-terminal fusion of GFP causes mis-localisation of proteins in a number of compartments

39
Q

organelle proteomics

A

by density fractionation
• Physical separation of organelles by their density
• Identification of proteins by mass spectrometry
• Label-free quantitation of peptides by mass spectrometry.(applies two quantification strategies: (1) spectral counting or (2) spectrometric signal intensity to measure the protein expression. )
• Correlation of protein abundance profiles with known proteins established as markers for an organelle.

40
Q

protein correlation profiles

A

Protein correlation profiles translate directly into fluorescent staining patterns
Function and predictive results from Foster
• Mapped 1404 proteins to 10 subcellular locations
–39% of organelle proteins had multiple locations
• Comparison of localisation with Huh 2003 results 74% agreement between yeast and mouse cell lines
• Identified assumed trans-acting transcription factors and cis- promoter motifs of organelle related genes.

41
Q

Examples of databases and resources for protein localisation:

A

UNIPROT, DAVID, illuminated plant cell

42
Q

Methods to identify protein-protein interactions

A

• In vivo imaging methods: FRET, BiFC, FRAP
• Yeast 2-hybrid (Y2H) and derivatives
• Phage display
• Protein chips/peptide arrays/substrate arrays
• Affinity methods
– Co-sedimentation, size exclusion filtration
– Affinity chromatography
– Co-immunoprecipitation
– Proximity labelling
– Interactions can be stabilized by cross-linking

43
Q

membrane yeast two hybrid(MYTH)

A

does not require that interactions occur in the nucleus, thereby allowing the use of full length membrane proteins, in their natural context of the cellular membrane
In MYTH, a membrane bait protein is fused, at a cytosolic terminus, to a tag consisting of Cub linked to an artificial transcription factor.
However, a major drawback is that some interactions of proteins from other organisms may not occur in the yeast milieu because of the possible lack of associating factors or protein modifications

44
Q

LUMIER

A

LUMIER provides a powerful method to study mammalian PPIs in their natural environment in an automated

45
Q

mammalian protein-protein interaction trap

A

PPIs take place in the cytosol of intact mammalian cells.

a novel screening method for anti-HIV compounds in intact human cells.

46
Q

Phage display

A

identify and optimize polypeptides with novel functions
Phage libraries, which can consist of artificial or naturally existing peptides, proteins and protein variants presented on phages
particularly suited for rapid screening and optimization of molecular interactions, making it of special interest in drug development.