Mid-term 1 Flashcards
What is a Ribozyme?
an RNA molecule that acts like a protein enzyme (in catalyzing biochemical reactions).
a catalytic RNA molecule with a well defined tertiary structure that enables it to catalyze a chemical reaction.
What is an intrinsically disordered protein (IDP)?
a protein that lacks a fixed or ordered three-dimensional structure.[2][3][4] Meaning that the SECONDARY SRUCTURE IS NOT FIXED IDPs cover a spectrum of states from fully unstructured to partially structured and include random coils, (pre-)molten globules, and large multi-domain proteins connected by flexible linkers
What is an “Open Reading Frame” ORF?
an open reading frame is the part of a reading frame that has the ability to be translated. An ORF is a continuous stretch of codons that begins with a start codon and ends at a stop codon. An ATG codon within the ORF may indicate where translation starts.
What does it mean to say that the genetic code is “degenerate”?
meaning it includes redundancies (the codons of 4 nucleic bases translate not into 4x4x4=64 amino acids but only into 20). This redundancy is an important safeguard in biology.
What does it mean to say that Protein Coding regions of DNA evolve as a result of protein function?
In other words, the function of the protein is selected based on the environment and its physiochemical viability. Meaning: Evolution is a change in DNA based on the function of proteins.
What are Homologous genes?
genes that arise from a common ancestral DNA sequence (vertically RELATED genes).
- Orthologous genes diverged after a speciation event, while paralogous genes diverge from one another within a species.
- ANALOGOUS is opposed to HOMOLOGOUS. Sequences may seem very similar, so it may be assumed that they come from a common ancestor, but analogous sequences arise similar but independently, not being passed down from a common ancestors
Why is it easier to detect homology in proteins rather than the coding DNA?
Function matters more in evolution than the sequence.
• Redundancy in the genetic code makes it more difficult for insightful alignments.
• Therefore, if you are studying a known protein coding regions, it is more insightful to align protein sequences rather than the corresponding DNA, but not always…
What is the relationship between sequence similarity and homology?
similarity is not the same as homology (coming up). Similarity only implies homology
Methods for Pairwise Sequence Alignment
- Dot-matrix • Display all possible sequence alignments as diagonals on the matrix • Find short patterns and detect inverted repeats • Does not show the actual alignment. • Computationally efficient.
- Dynamic programming • Most used alignment method. Needleman-Wunsch for global alignment / Smith-Waterman as local alignment. Smith-Waterman was used for human genome• Became necessary for dealing with gapped alignments • Dynamic programming is guaranteed to get the optimal alignment • Can be very computationally expensive
- Word / k-tuple method • Search for identical short stretches of sequences (words/k-tuples) • Join these words into an alignment by dynamic programming • FASTA/BLAST are also heuristic
What is the difference between a local and a global alignments ?
in a local alignment, you try to match your query with a substring (a portion) of your subject (reference). Whereas in a global alignment you perform an end to end alignment with the subject (and therefore as von mises said, you may end up with a lot of gaps in global alignment if the sizes of query and subject are dissimilar
What is the difference between Pyrimidines and purines?
Pyrimidines: thymine and cytosine -ONE RING PYRIMIDINES
purines (Adenine and Guanine) – TWO RING PURINES
What are the functional classes of RNA?
- Viral genomic RNA
- Messenger RNA (mRNA)
- Transfer-RNA (tRNA)
- Ribosomal-RNA (rRNA)
- Catalytic-RNAs (ribozymes)
- Regulatory or non-coding RNAs (ncRNAs)
- Small nuclear RNAs (snRNAs)
- Long noncoding RNAs
- Ribonucleoproteins
- Riboswitches
Describe the RNA world hypothesis.
proposes that life based on RNA pre-dates the current world of life based on DNA, RNA and proteins. RNA is able both to store genetic information, like DNA, and to catalyze chemical reactions, like a protein (the “functional RNA” or “Ribozymes” do). It may therefore have supported pre-cellular life and been a major step in the evolution of cellular life.
The RNA world evolved into a world of RNP enzymes (ribonucleoprotein enzymes), such as the ribosome, before giving rise to the DNA, RNA and protein world of today.
What is a spliceosome?
a RNP complex containing snRNA and protein subunits that removes introns from a transcribed pre-mRNA segment
Describe RNA structure.
RNA molecules usually come as single strands but left in their environment they fold themselves in their secondary structure because of the same hydrogen bonding mechanism. (They don’t look as ordered as DNA, more messy).
Stems (Helices) are formed intra-molecularly.
Loops
Jucntions
What forces stabilize protein structure?
Protein structures are stabilized by covalent bonds such as disulfide bond and non-covalent contacts such as van der Waals, electrostatic, hydrogen bond and hydrophobic interactions, the last of which is the driving force for protein folding
What is a zwitterion?
dipole molecule
Mostly Amino Acids exist in In the dipolar form, also called zwitterion form. It means that it has one functional group that has a positive charge and one functional group that has a negative electrical charge. making the net charge of the entire molecule zero. Amino acids are the best-known examples of zwitterions.
In the dipolar form the amino group is protonated (-NH3+) and the carboxyl group is deprotonated (-COO-).
Which AA is most active?
cystine
Types of bonds (in order of strength, strongest to weakest)
- Covalent: they stabilize proteins
- Disulfide: tertiary structure of insulin has disulfide bonds
- Salt bridge:
- Hydrogen bond (with donors and acceptors) – one of the strongest intermolecular attractions, but weaker than a covalent or an ionic bond. Hydrogen bonds are responsible for holding together DNA, proteins, and other macromolecules.
- Long range) electrostatic interaction
- Van der Waals interaction: dipol/dipol interaction.they turn so that the negative sides are away from each other. driven by induced electrical interactions between two or more atoms or molecules that are very close to each other. Van der Waals interaction is the weakest of all intermolecular attractions between molecules.
- Pi-bonds/pi-stacking: noncovalent interactions between aromatic rings. Pi stacking can be “sandwiched” or in T-shape or parallel displaced. Pi elices (and 3/10 helices) are never long
What is a clathrate cage?
Water will form a cage around a hydrophobic molecule. If there is another hydrophobic molecule those two cages will be combined in order to minimize the surface area of the cage (because he water rather wants to bind with itself than build more cages).When 2 cages combine water molecules will be released into the free floating water, thus increasing the entropy.
A sphere the smallest amount of surface area in relation to its volume. Therefore oil builds spheres in water, so that the biggest amount of water molecules can get “released”,. can become free floating.
Classes of Protein Structure
Class all-α: the secondary structure is composed entirely of α-helices, with the possible exception of a few isolated β-sheets on the periphery. still 30% of amino acids are (unbound?) in loops and therefore ready for interaction
Class all-β
Class α/β: the secondary structure is composed of alternating α-helices and β-strands along the backbone. The β-strands are therefore mostly parallel
Class α+β: α-helices and β-strands that occur separately along the backbone. The β-strands are therefore mostly antiparallel
Membrane/cell surface: interact with biological membranes either by inserting into it, or being tethered via a covalently attached lipid. 20–30% of all genes in most genomes encode membrane proteins.
Multiple domains: Protein Domains: Compact regions of tertiary structure form domains. A given protein can have several domains which serve different functions.
Protein domains are:
independently foldable
have a specific function
are evolutionary preserved
Small proteins (longest protein: approx. 36,000 residues; has some folded regions but too large to fold completely. proteins with repeats are often not folded)
Intrinsically disordered proteins
Coiled-coil proteins
Designed proteins
What is an advantage of quarternary structure?
multiple proteins combined as a quarternary structure: they can combine freely, increasing the number of functions greatly (1 protein can have different structures and different functions. But still they are always together)
Why is Folding is considered the “second half of the genetic code” ?
because: genetic code - protein (1st half) protein - function (2nd half) folding determines function
Why do proteins fold?
to become functional. An amino acid CHAIN cannot function. Thermodynamic forces are responsible for the folding. The protein goes into an energy minimum state. Hydrophobia and entropy are driving elements of protein folding where the hydrophobic protein domains are being buried inside of a fold. The solvent will gain entropy from the burial of hydrophobic groups (i.e., elimination of water clathrates), and there is enthalpy gain of favorable intra-chain charged, polar, and van der Waals interactions
How do proteins fold?
Intermediates are needed.
They fold quickly because they have funnel-shaped energy landscapes that progressively direct the protein towards increasingly low energies as it folds.
Folding is limited by a finding of an optimal pathway throughout the rugged landscape. The structure of the native state is shown at the bottom of the surface. There are ‘key residues’ in the structure; when on the way down these residues have formed their native-like contacts the overall topology of the native fold is established.
What are other models of protein folding?
- *Framework model: In the intermediate state there is plenty of (secondary) structure but the intermediate state is not compact Sequential mechanism of protein folding: 3 major stages
- *Hydrophobic collapse model: In the intermediate state there is no (secondary) structure but the intermediate state is compact. What makes sense in this model is that when collapsing the hydrophobic side chains are being buried inside
- *Nucleation condensation: with a collapsing nucleus
Structural features of Molten Globule” States
- Some folding intermediates have molten globule structure
- Substantial 2° structure, little 3° structure
- Collapsed state but disordered core accessible to solvent
- Obtained at mild denaturing conditions (low/high pH, high temperature, low/moderate concentrations of strong denaturants)
- Often binds fluorescent dye ANS