Lecture 4: Protein Sequence and Structure Determination Flashcards

1
Q

Uses of determining amino acid sequence in a protein?

A
  • compare with all other known sequences (including DNA) to determine whether similarities exist
  • sequence comparison of the same protein in different species can yield cluse about evolutionary pathways
  • sequence comparison of the same protein in the same species (ie humans) can reveal the molecular mechanisms of (genetic) disease

* conserved amino acids (between very different species) suggest that a pathway is very significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

HCl - Determination of sequencing amino acid composition

A
  • hydrolyze polypeptide in HCl at 110 c for 24 hours
  • amino acids can then be separated by ion exchange chromatography
  • ninhydrin can be used to ID amino acids
  • modifies asparagines and glutamines to aspartic acid and glutamic acid
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does ninhydrin do?

A
  • chemically modifies asparagines and glutamines to aspartic acid and glutamic acid
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Elution profile of amino acids

A
  • Peaks are in a ratio
  • double height means double of that amino acid compared to another
  • if asparagine is present is degrades it into aspartic acid and NH3 (see those peaks)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Purpose of cleavage and reduction of polypeptides

A
  • accuracy of amino acid sequencing generally declines as the length of the polypeptide increases
  • must be enzymaticaly (proteases) or chemically fragmented to be sequenced efficiently
  • if disulfide bridges are present, they must be broken (reduced) and the resulting cysteine-sulfhydryl groups prevented from reformation of disulfide bonds (modified) [add reductant and then modify sulfhydryl groups]
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Edman degradation?

A
  • sequentially removing one residue at a time from the amino end of a peptide (fragment)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Steps of Edman degradation

A
  1. phenyl isothiocyanate reacts with the uncharged terminal amino group of the peptide to form a phenylthiocarbamoyl derivative
  2. the cyclic form of the derivative is liberated and can be identified by chromatographic methods
  3. in multiple labeling-release rounds, the amino acid sequence of a peptide can be determined

* keeps attackign N-terminus over and over to remove successive amino acids

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Overlapping and Edman degradation

A
  • divide the same polypeptide chain by different segments
  • arrange the segments so the two kinds overlap
  • this way you can tel how to connect the segments
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is mass spectrometry?

A
  • analytical technique that measures the mass to charge ratio (m/z) of charged particles in a gas phase
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Steps of mass spectrometry

A
  1. molecules to be analyzed (anylate) are first ionized in a vacuum
  2. when the newly charged molecuels are introduced into an electric and or magnetic field, their paths through the firls are a function of theis m/z ration (mass/charge)
  3. measure property of the ionized species can be used to deduce the mass (M) of the anylate with high precision
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Three essential components of mass spectrometer

A
  • ion source
  • mass analyzer
  • detector
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Conversion of macromolecules into gas phase ions required?

A

conversion of macromolecular anylates such as proteins and nucleic acids into gas-phase ions (ionization) could not be achieved efficiently until the development of

–> electrospray ionization mass spectrometry (ESI MS)

–> matrix assisted laser desorption/ionization mass spectrometry (MALDI MS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Mass to charge ration of ions and molecular mass of proteins

A
  • gas phase macromolecules acquire a variable number of protons and thus positive charges, from the solvent, which creates a spectrum of species with different mass to charge ratios
  • each successive peak corresponds to a species that differs from that of its neighboring peak by a charge difference of 1 and a mass difference of 1.
  • the molecular mass M can be determined from any two neighboring peaks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Mass Spectrometry Equations

A

compare two peaks

p1 = (M + z1) / Z1

p2 = (M + z1 - 1) / (z1 - 1)

  • just subtract everything by 1
  • adjacent peaks have difference on 1 in protons and in mass

can solve for M and z1

  1. M = z1 (p1 - 1)
  2. z1 = (p2 - 1) /(p2 - p1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

ESI MS process

A
  • analyte solution is passed through an electricaly charged nozzle into a chamber of low pressure, evaporating the solvent and ultimately yielding the ionized analyte
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How fast do different sized particles move in ESI MS

A

more charge/more mass –> slower

less charge/less mass –> faster, detected first

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

MALDI MS process

A
  • analyte solution is evaporated to dryness in the presence of a volatile, aromatic compound (the matrix) that can absorb light at specific wavelengths
  • laser pulse tuned to one of these wavelengths excites and vaporates the matrix, converting some of the analyte into gas phase
  • subsequent gaseous collisions enable the intermolecular transfer of charge, ionizing the analyte

* matrix is important to keep proteins from getting burned, makes sure they end up in gas form

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

MALDI TOF analyzer

A
  1. protein sample is ionized
  2. electrical field accelerates ions
  3. lightest ions arrive at the detector first
  4. laser triggers a clock
19
Q

TOF and mass to charge ratio

A

mi/zi = 2eEl/(ti/ld)2

* mass to charge ratio is dependent on time

plug in time and get out the mass/charge ratio

–> first particles to arrive have the largest mass/charge ratio

20
Q

Tandem mass spectrometry

A
  • alternative to edman degradation as a means of sequencing proteins
  • ionized proteins are analyzed by a first mass spectrometer and then broken down into smaller peptide chains
21
Q

Steps of tandem MS

A

ion source –> MS1 –> collision cell –> MS 2 –> detector

  1. catch protein as it arrives after a certain time
  2. analyze it further

* pretty difficult to do

22
Q

From protein mixtures to peptide mass identification

A

protein mixture (separation/2D electrophoresis) –> individual proteins (fragmentation/spot excision, site specific cleavage) –>

peptide fragments (fingerprinting/ mass spec) –>

peptide mass spectrum (database search/MSfit, MASCOT, Proetinlynx) –>

ID

23
Q

Unique peptide sequences

A
  • most of at least 6 amino acids are unique in the proteome of an organism and map to single gene products
24
Q

Effect of sig figs in m/z searches

A

no decimals could lead to MANY hits (478)

going up to 4 decimals could lead to only 2 hits

25
Q

Determination of fidelity of “known” sequences

A
  • recombinant proteins
  • synthetic peptides
26
Q

detection of natural or biosynthetic mutations

A

isoforms

in vitro mutated proteins - random or site specific

27
Q

identification of endogenous postranslational modifications

A

phosphorylation - regulatory or catalytic

disulfide bonds

28
Q

identification of experimental chemical modifications

A

affinity or group secific labels

29
Q

X ray diffraction general steps

A

crystal –> diffraction pattern –> electron density map –> atomic model

(refinement)

30
Q

Componnts in an X ray crystallographic analysis

A
  • an xray source generates a beam, which is diffracted (scattered) by a crystal
  • the resulting diffraction pattern is collected on a detector
  • want a lot of crystals for it ro work
  • to get crystals get close to the ppt point and crystals will sometimes form
31
Q

Physical principles of x ray crystallography

A
  1. electrons scatter x rays –> the amplitude of the wave sattered by an atom is proportional to its number of electrons
  2. the scattered waves recombine
  3. the way in which the scattered waves recombine depends only on the atomic arrangement
32
Q

Resolution and electron density

A
  • the better the crystal the better the resolution of th eimage
33
Q

NMR

A
  • carried out on macromolecule sin solution (xray crystalloggraphy is limited to molecules that can be crystallized)
  • illuminate the dynamic side of protein structure, including conformational changes, protein folding and interactions with other moelcules
  • depends on the fact that certain atomic nuclei are intrinsically magnetic due to a nuclear spin angular momentum, which can take either of two orientations or spin states calle a and b when a magnetic field is applied
34
Q

spin states

A
  • the enrgies of the two orientations of a spin 1/2 (such as 1H) depend on the strength of the applied magnetic field
  • absorption of electromagnetic radiation of appropriate frequency induces a transition from the lower to the upper level (and resonance will be obtained)
  • change of energy of one H will be transferred only to others in close proximity

* see chemical shifts of Hs on the neighboring C

35
Q

2D NMR - overhauser effect

A
  • identifies pairs of rpoteins that ar ein close proximity
  • whatever is on straight line has no interactions
  • dots off the line interact
  • when proteins fold Hs that are far away int he chain still come into contact and interact
36
Q

homologs

A
  • biochemical entities related by common ancestry
  • detectable by significant similarity in nucleotide or amino acid sequence and is (alomst always) manifested in three-dimensional structure
37
Q

paralogs

A

homologs that are present within one species

  • usually a rsult of gene duplication
  • often differ in their detailed biological function
  • different versions of the same protein for different tissues

* evolution –> develop from existing gene “tamplate” and are altered from there –> every gene begins from another

38
Q

orthologs

A
  • homologs that are present within different sepcies and have very similar or identical functions
39
Q

statistical analysis of sequence alignments

A
  • significant sequence similarity between two DNA, RNA or protein molecules implies that they are homologs and hence have same evolutionary origin
  • as proteins are composed of a larger number of building blocks (20 aas) than DNA or RNA (4 nucleotides) random sequence agreements are less likely
40
Q

alignment algorithms to compare sequences

A
  • identities
  • gap insertion
  • conservative sumstitution
41
Q

conservation of 3d structures

A

3d structures of proteins or RNA relate directly to their functions and hence are more evolutionarily conserved than primary structures or amino acid sequences

  • similarities in structures:

–> can be detected without significant similarities in sequence aliggnments

–> usually indicate common functional mechanisms

42
Q

similar structures but different functions

A

example:

  • in primates a-lactalbumin expression is upregulated in response to the hormone prolactin and increases the production of lactose
  • lysozymes are generally enzymes that damage bacterial cell walls by hydrolyzing their petidoglycan component
43
Q

Convergent evolution

A

some proteins are structureally and funcitonally similar in many important ways but do not have a common ancestor

–> very different evolutionary pathways can elad to the same biochemical solution

chymotrypsin vs subtilisin