McPherson Lectures 1-3 Flashcards

1
Q

what is directed evolution

A

the creation of new protein activities through the principles of natural selection to chang: pH and thermal stability, optimum pH and temp, binding properties (i.e. km for substrates) and catalytic properties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

although the ultimate goal is to be able o design a protein de novo which performs a certain function, why do most directed evolution experiments explore a sample of proteins with activity

A

most random sequences won’t fold into a functional protein so existing proteins already have stable folds, display functional properties and may have minor activities which can be enhanced by sequence changes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the methods for introducing variability

A

error prone per, DNA shuffling, family shuffling or random mutagenesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

why is sparse sampling used in directed evolution experiments

A

possible sequence space = 1.3x10^55kg of protein = 2x mass of earth so sparse sampling just uses enough diversity to select interesting proteins

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how does error prone PCR work

A

reduce the PCR fidelity e.g. using non-proof reading Taq polymerase, high conc of mgcl2, increased extension time or altered relative nucleotide conc to introduce variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how many mutation per codon are produced in error prone pcr

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

why is recombination usually performed after error prone pcr

A

it is unlikely to lead to concerted accumulation of beneficial mutations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

describe the process of DNA shuffling

A
  1. fragment DNA by physical method e.g. dnase 1
  2. heat denaturing allows random annealing which self prime upon addition of DNA polymerase. repeat cycle to build up size of whole gene
  3. resulting genes can be amplified by PCR to form a variant library which can be cloned into a vector
  4. screen for desired activity then carry out saturation mutagenesis to improve
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

why can error prone pcr/dna shuffling not be used to explore complete sequence space

A

limited by codon usage- with 1 mutation/codon tryptophan, for e.g., can only be mutated to 5 aa

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how are protein coding sequences isolated once the desired protein is selected for

A

a physical association between the expressed protein and its coding protein has to be created. in vivo it can then be transfected into a cell then cells are grown and DNA coding sequence is recovered as plasmid DNA or by PCR. in vitro can be done via ribosome display and reverse transcription, mRNA display or micro fluids encapsulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are 3 possible methods for selection

A

survival phenotypes, fluorescence selection, chromogenic enzyme activities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how can we overcome the problem of not being able to select for “better” mutants if you find something that allows the cell to grow

A

downregulate protein expression or introduce a degradation signal to select for mutants which allow growth under these pressures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

describe screening

A

pick individual colonies, put them in a 96 well plate, grow, lyse and assay for activity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the problem with screening

A

low throughput, inefficient and expensive if most variants are inactive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

which substrate does Cyt p450 usually use

A

camphor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

which substrate did they try to change cyt p450 to use

A

naphthalene

17
Q

describe the first stage of the directed evolution experiments performed on cytochrome p450

A

error prone PCR was used to generate 200,000 clones of which 32000 were screened using fluorescence and found 3 variants

18
Q

which 3 variants were further tested following error prone pcr and fluorescence screening of cyt p450

A

M7-6H which had 11 fold higher activity and contained the mutations R280L and E331K, M7-4H 5 fold higher activity and E331K mutation and M7-8H with 5 fold higher activity and mutations E331K and C242F

19
Q

where was the E331K mutation discovered by directed evolution situated

A

not near the active site- would not have been an obvious choice

20
Q

what is APEX

A

a haem peroxidase

21
Q

why was directed evolution carried out on APEX

A

it can be used for intracellular specific protein imaging by EM and spatially resolved proteomic mapping but it isn’t as efficient as one might like

22
Q

describe the experiment to produce APEX2 via directed evolution

A

first a base enzyme was generated which can be expressed in cytosol. they then generated a library of variants using error prone pcr, cloned them into ecoli then transformed into yeast. treated with substrate to label the variants then used FACS to select for improved activity. in 2nd and 3rd round increased selection pressure was applied to test for efficient haem incorporation. found APEX2 and VPGAPEX and in both most mutations were away from the active site

23
Q

what was the major mutation in APEX2

A

A134P

24
Q

what are the applications of APEX2

A

as reporter to generate contrast for EM/VM, mass spec analysis

25
Q

describe the process of family shuffling

A

it mimics natural evolution, recombining reated sequences from different organisms to create new “functional variant” sequence combinations

26
Q

what is the benefit of family shuffling

A

recombining to create new variants should creat functional proteins so few colonies can be screened and the vast majority will be functional. in addition family shuffling can maintain one quality whilst enhancing another

27
Q

describe the experiment used to create Savinase, the leading commercial subtilisin

A

segments of subtilisins were cloned by PCR from 25 natural bacilli. they retained the N and C coding ends but swapped the majority of the coding region in between using shuffling between aa 60-224. these variants were then screened using a skimmed milk screening assay and found most were inactive but selected for the most active. repeated but with a casein screening assay. casein is quenched upon cleavage by a protease. this screen produced 654 clones which were then screened for activity at 23c, thermostability, solvent stability and pH dependence. the mutant which was most thermostable was suaveness which contained 32 mutations compared to most thermostable parent.

28
Q

what are the 3 possible rational approaches

A

saturation mutagenesis, designed enzyme and computational structure based design

29
Q

describe the trinuclrotide approach in saturation mutagenesis

A

you only add 1 codon per amino acid so you get a small library with relatively equal numbers of each amino acid

30
Q

what is the CAST approach

A

combinatorial active site saturation test

31
Q

how does CASTing work

A

you look for combinations of mutations in the active site which give an improvement

32
Q

describe how the enzyme design approach was used to create HG3, a kemp eliminase

A

began with xylanase scaffold and computationally designed 11 point mutations, the most successful of which was HG13.17 which was created after 17 rounds of mutagenesis and screening. it has high shape complementarity between the TS and active site and 3 groups which stabilise the developing negative charge in TS

33
Q

what is the basis for computational structure based design

A

computationally identify and predict residues that are important in catalysis

34
Q

describe the use of computational structure based design to generate a diels-alder enzyme

A

it decided on a catalytic mechanism and ideal active site including a binding pocket for the 2 substrates in the correct relative orientation with suitable hydrogen bond donors and acceptors. rosetta methodology was used to design in silicon enzyme models with active sites with the desired properties, searching 207 stable protein scaffold for backbone geometrics which allow the 2 catalytic residues and substrates. a total of 84 designs were selected for experimental validation then genes for these were synthesised with a his tag and expressed in e.coli. 50 were soluble and purified and Diels-alder activity was measured by LC-tandem MS. 2 designs were found to have activity then crown sourcing and further mutagenesis experiments was used to generate improved enzymes