Protein Folding Flashcards
Folding landscape
1) Representation of different conformational states
-All have an associated free energy
2) In folding process -> decrease in nº of conformations, free energy, and nº of contacts (native or non-native)
Full native conformation
-Only native contacts
-Protein reaches minimal free energy conformation
Intrinsically disordered proteins (IDPs)
1) Don’t reach folded conformation
2) no hydrophobic core as w/globular proteins
3) More abundant than thought; up to ~30% of eukaryotic proteins
4) Many related to diseases
5) Have v. important functions
6) Complement functional repertoire of ordered proteins
7) Remarkable binding properties
8) Examples:
-Cancer: p53
-Alzheimer’s: ß -amyloid protein
-Parkinson’s: α-synuclein
-Taupathies: Tau
-Prion diseases: prion protein
Protein folding problem
1) Why does it take that particular structure? Why does it fold at all? Why does it fold so fast?
2) help predict process that proteins go through to get native conf
3) Understanding this will:
-Combat misfolding related to human diseases
-Produce new proteins with novel functions
-Predict 3D structure from primary sequence
Afinsen experiment
1) Native proteins contain conformation w/ lowest minimal energy
2) All info to reach native structure is in 1º seq
Method:
1) Ribonuclease (14 kDa) w/well-defined sequence pattern of disulphides
2) Denature in 8M urea + ß-mercaptoethanol (BME) to unfold protein
3) Protein had no catalytic activity = completely unfolded
4) Remove BME led to formation of mostly non-native disulphides
5) Remove urea gives completely unfolded protein
6) Adding trace amounts BME = protein recovers catalytic activity
7) Conclusion: BME breaks diS bonds = allows protein to explore other conformations w/smaller energy to achieve native structure
Protein stability
1) Native state is more stable than denatured state
2) Protein stability = the conformational stability of the N is given by the difference in free energy between D and N: ∆GD-N
Chemical stability
1) Ability of maintaining chemical structure of the native states (e.g. intact covalent bonds, oxidation states, metal coordination, etc)
Processes include:
1) Deamination of Asn/ Gln to Asp/Glu
2) Hydrolysis of peptide bond of Asp at low pH
3) ox. of Met at high temperature to met sulfoxide
4) Elimination of diS bonds
5) Thiol-catalyzed diS interchange at neutral pH
6) ox of Cys residues to thyol radicals, Cys-S* -happens when thiol in Cys loses protons but keeps ē -> v. reactive species that can form non-specific bonds (dangerous)
Conformational stability
1) Ability to adopt well-defined conformation rather than random coil (D)
-Associated with 2º/ 3º structure
2) Local structure adopted by backbone atoms described in phi and psi angles
-Phi and psi rotate allowing various conformations
3) Refers only to formation of non-covalent bonds
Determinants of protein folding (1)
1) ∆G= ∆H-T∆S
2) 2 parameters: change in entropy vs enthalpy
3) Change in entropy = changes in conformations = reduction in entropy when folding protein to N = unfavourable
4) Enthalpy contribution from intra-molecular side-chain interactions (e.g. hydrophobic, diS, etc) = favourable to stability of protein
5) Major favourable energetic contribution to protein folding/stability of N comes from expulsion of water from hydrophobic core
6) All interactions have to be broken to form new interactions in N
Parameters influencing protein folding
1) Stability of protein depends on pH, temp, pressure, ionic strength, crowding, amount of salt, etc
2) max activity of protein correlated to protein stability
3) Crowding = putting protein in solution w/co-solvents that mimic cellular environment
-Apoflavodoxin in normal buffer has thermal stability of 48ºC; but in 400mg/ml of Ficoll, temperature increases -> diff for each protein
-Protein inside cell is more stable just because it’s surrounded by proteins at high concentrations that act as solvent
4) Non-covalent interactions
-Average stability of monomeric small protein ~5-15 kcal/mol
-For a protein to fold, need thousands of these interactions
5) If too stable = v. rigid = cannot perform function i.e. to perform catalytic function = need conformational changes to interact w/other proteins to form complexes
Determinants of protein folding (2)
1) Covalent interactions
-diS bond formation = reversible process
2) Compaction
-a-helices and b-sheets individually are v. compact
3) Folding is directed by internal residues related to hydrophobic core
-Most surface mutations do not affect folding; may affect solubility, stability
-Balance between compactness and flexibility; surface residues may have greater flexibility but have hydrophobic core that is rigid
4) Hierarchy
-Domain = sequence that can be folded on its own; soluble in solution
-Subdomains = form independently from domains; form 3º or 4º structure
5) Adaptability
-Hydrophobic cores are adaptable to changes
-Mutations can be accommodated with local shifts without affecting fold of backbone
6) Sequence versatility
-Conservation of sequence and folding
-20% amino acid sequence identity = same overall fold = but have huge variability
-Certain residues are key to maintain conformation of shape
Techniques to measure protein stability
1) Methods distinguish betw. D and N state
2) Absorbance (v.small diff betw. D&N)
3) Fluorescence
-Most common chromophore used
- greatest change in signal between N and D
-greatest S:N
-Rely on aromatic side chains (Trp, Tyr, Phe)
4) Circular Dichroism (CD)
5) NMR
6) Differential scanning calorimetry (DSC)
7) Catalytic activity (does not give thermodynamic parameter)
Protein denaturation
1) unfold proteins w/heat or cold
-just before 0º, small fraction will unfold
- more common heat denaturation = stronger signal
2) pH extremes
3) Organic solvents
4) Chaotropic agents
-Most common after heating
-E.g. urea, guanidium hydrochloride
-Sigmoidal curve related to cooperativity
-Refer to individual Tms to check protein stability
-D50 parameter = concentration at which 50% of protein is denatured
Circular dichroism (CD)
1) Measures the molar absorption difference (∆ε) between left and right circularly polarized light
2) Circular polarized light = shift phase by quarter cycle (rotating circularly; forms helical shape)
-For opp direction, shift by 45º
3) In CD have sample and 2 circularly polarized lights; sample only absorbs one of them in only one direction; measure how one was absorbed preferentially to other
4) Spectropolarimeter
-At different wavelengths get specific pattern
-a-helix min = 208 and 222nm
-b-sheet peak = 208nm
-Most cannot read below 200nm
5) To measure stability of a protein, take frequencies, measure CD only at particular wavelength (at region w/greatest signal between random coil and folded protein)
6) CD monitors protein folding and biological activity
-Can tell you proportion of a-helix or b-sheet in protein
7) Disadvantage: spectra does not give you AA level information
Planar vs circular polarized light
1) Planar = horizontal and vertical components in phase
2) Circular = horizontal and vertical components out of phase
-Direction of rotation depends on relative phase
-Chiral molecules (e.g. L-amino acids) are optically active = have different refractive indices and different extinction coefficients for L and R circularly polarized light
-Optical activity also affected by environment i.e. location in secondary structure elements
3) Mix = elliptical
4) Combining 2 circularly polarised lights in different directions = linearly polarized light in particular direction (just a single plane)
5) Only circularly polarized light in one direction = protein absorbs = circularly polarized light with smaller amplitude/signal
-In 2 directions = add contributions of two circles (big circle for light that was not absorbed + small circle for light that was absorbed) = ellipse
Ellipticity
1) Difference in absorbance does not tell you which of the two lights was absorbed preferentially
2) magnitude of θ tells you the difference in absorption between the two lights
The Levinthal Paradox
1) Folding of protein is not result of random search; must always follow defined pathways
2) Impossible in timescale to randomly sample every conformation until lowest E one is found
What makes a protein fold so fast?
1) Timescale: ms-s
2) Not random; direction toward native structure is funneled
3) Energy landscape has local minima
4) Too many different minima= non-productive folding = decelerate folding = can prevent protein from reaching native state
5) Minima produce intermediates partially structured w/no optimal stability and exposed hydrophobic regions = interaction w/other components in cell = aggregation = toxic
6) Smooth folding pathways avoid minima (make as small as possible) =efficient folding
2-state folding: single molecules
1) Folding directly from D to N state w/o stable intermediates
2) Discovered by forster resonance energy transfer
i) Green donor and red acceptor dyes attached to N and C of two-state protein
ii) N = high transfer efficiency = termini are close
iii) D = larger separation between dyes = transfer efficiency is small
iv) Folding/unfolding events correspond to jumps in FRET efficiency
v) Distance dependent = less than 60nm apart
2-state folding: protein ensemble
1) Stop flow device
2) syringe 1: protein + denaturant
3) Syringe 2: buffer + no protein & denaturant
4) Piston pushes both solutions; mix; travel through capillary tube; detector; change in fluorescence over time
5) Cannot use fluorometer; folding is too quick
6) Region w/ no data = time taken by device to mix samples; fit data to single exponential; predict where it goes
2-state vs 3-state folding intermediates
1) Residual plot = subtract experimental data from ideal curve
-Tells us how good is the fit to our data
2) Single exponential behaviour = protein follows 2-state mechanism of protein folding (no stable intermediates)
3) Curve is combination of 1+ exponentials (some faster than others) = signalling presence of folding intermediates only observable by kinetic experiments
4) Intermediates need to fold faster to give N = faster exponential
5) N = slower exponential
6) Still no good fit = add another exponential = protein has 2-folding intermediates
7) No intermediates in eq. conditions; they are formed and absorbed by N too quickly
Transition state theory
1) TS v. short lived = cannot be observed directly = v. difficult to characterise structurally (no NMR, crystallisation, etc)
2) TS is betw. D and N; has much higher free energy
3) TS has critical contacts that decide if protein can go from N to D (vice-versa)
4) TS theory: direct correlation between folding rates and energy difference between D and TS
5) More stable TS = faster folding rate
6) K ∝ exp(-∆G(TS-D))
7) 2-state folding kinetics is explored via mutational analysis
Mutational analysis
1) Avoid mutations that include new groups in AA side chain
2) Use conservative deletion mutants = remove methyl group, charge, or ring (do not add extra groups)
3) Assumptions:
i) Mutants decrease stability but do not affect structure of native state
ii) Mutants do not create new interactions
4) Default mutation to Ala = remove most of side chain but left w/b-carbon; if mutate b-carbon, then affect protein conformation
5) Compare differences in thermodynamic stability and folding rates between wild type and mutants
Protein folding in vivo
1) Experience molecular crowding
2) Av. MW of proteins in human cell: 50kDa
2) AV. [protein] per cell: 20-30%