Expression of Recombinant Proteins in E.coli Flashcards
Why study proteins?
- Proteins are the workhorses of the cell: structural, transport, catalysis etc
- Knowledge of protein function and action can help with e.g.: drug design, engineering enzyme for use in e.g. biocatalysis
How do we study proteins?
- Purification: biochemical/biophysical characterization of isolated proteins, structural studies (X-ray, NMR, cryo-EM)
- In vivo studies: protein labelling techniques
- Cloning and overexpression of proteins: higher yield of purified proteins, possibility to mutate amino acid residues specifically (engineering function, studying function)
Protein-based therapeutics
- Protein-based therapeutics include: hormones, cytokines, vaccines, monoclonal antibodies
- 1/3rd are produced in E. coli
- Currently there are: 140 protein-based therapeutics have been approved and
500 are in clinical trials
Protein Purification
What is it?
- Purify a single protein from a mixture of proteins
Why is it needed?
- Separation from proteins with similar function to measure activity/function of one particular protein
- Comparison of mutant proteins
- Structural studies (X-ray, NMR)
Protein Purification – pre-DNA Technology Era
- Proteins were purified from the native organism/tissue
- No overexpression, most proteins typically occur at <1% of total protein: purifying Cytochrome C from ca. 1 kg Horse Heart yields 3-4 mg pure protein
- With rise of recombinant DNA technology and next generation sequencing these methods are rapidly disappearing from modern research
- Other developments: new overexpression systems, Tag-based protein purification, cell free expression systems
- ‘old-fashioned’ methods are still used for proteins that require extensive and very specific post-translational modifications
Protein Production & Purification - DNA Technology Era
- Developments in High throughput genome sequencing: coding sequence of our protein of interest is typically known
- Gene of interest can be amplified by PCR and cloned into an protein overexpression vector: inducible overexpression results in high levels of protein in dense cultures, possibility to manipulate the gene, facilitation of the introduction of purification tags
Protein Production In a Nutshell
- DNA of the protein-encoding gene (geneAB) is transcribed into mRNA
- mRNA is translated into protein (ProteinAB) by the ribosome
- Several regions upstream of geneAB are crucial for both transcription and translation
Elements Required for Transcription/Translation
- 16S rRNA 3’-end complements the Shine-Dalgarno sequence, also known as ribosome binding site (RBS)
- Insures that the Ribosomal complex binds in the correct location of the mRNA
- Both elements need to be precisely oriented relative to the ATG start codon
Restriction endonuclease–based cloning
- Create PCR product with restriction sites on either end
- Digest PCR product and plasmid vector
- Ligate digested PCR product and digested plasmid vector
Recombinant Protein Production in E. coli
- PCR (geneAB)
- restriction endonuclease-based cloning
- geneAB and purification tag join plasmid vector suitable for overexpression in E. coli
- transformation
- induce enzyme production in E.coli
Inducing enzyme production in E.coli
- Inoculate with E. coli containing overexpression construct and grow at 37oC to mid-exponential phase (OD600 ≈ 0.5)
- Induce expression of protein of interest by adding inducer followed by growth at xx oC (typically between 20-37 oC) for 4 hours to overnight
Requirements for production of a protein starting from a gene (i.e. DNA)
- Transcription: RNA polymerase must be able to bind to promotor and transcribe gene into mRNA
- Translation: mRNA has to be translated
- How is it achieved?: Insert gene of interest in between upstream regions needed for starting transcription (Promotor) and translation (Shine-Dalgarno, RBS) and terminating transcription (Terminator)
Plasmid Vectors for Protein Production in E. coli
- Plasmid vectors for protein production typically contain a multiple cloning site (MCS) at the correct location relative to those 3 elements
- Regulatory elements for transcription are exploited to switch protein production on/off by addition of a specific small molecule to growth medium
Promotor Systems for Protein Production in E. coli
- Integrated into a specific protein expression plasmid
- based on proteins that prevent RNA-polymerase binding to their associated promoters when inducer molecule is absent
- Addition of specific small molecules to growth medium results in repression (small molecule=repressor) or induction (small molecule=inducer) of protein overexpression
- If repressor can be consumed by bacterium in which the expression plasmid propagates so-called autoinduction can be used
Examples of promotor systems
- lac-repressor: based on the regulatory sequences and proteins of the lac-operon
- PBAD promotor: based on the arabinose inducible arabad system
- tet-repressor: based on the tetracycline repressor