DNA & Proteins Flashcards
The 4 Hypotheses on the Origin of Life on Earth
- Organic chemical synthesis in a reducing atmosphere.
- Carriage by meteorites.
- Organic chemical synthesis in deep ocean vents.
- RNA world.
- Organic Chemical Synthesis in a Reducing Atmosphere
It was thought that early Earth had a reducing atmosphere, rich in hydrogen and methane. The methane, ammonia and hydrogen mix to electrical discharges (lightning) in the presence of water. This resulted in a prebiotic soup (amino acids and nucleotides). There is no data on how soup forms organic networks (lipids and macromolecules) encompassed by a membrane. The question is whether the atmosphere at the time reducing and the current consensus says no.
- Carriage by Meteorites/Comets
Panspermia is the attractive theory due to the sudden appearance of life on Earth and its amazing uniformity (no actual data). Organic compounds are common in space e.g. amino acid (glycine) found on a comet 2009. Based on studies complex oranic chemicals could arise of Titan (moon of Saturn). This theory only moves the question how did life originate in space. The blast impact of meteors and comets also make it very difficult for organic matter to survive so that is also disputed.
- Synthesis on Metal Sulphides in Deep Sea Vents
Vents are sites of abundant biological activity which is independent of solar energy. Chemical energy sources lead to another prebiotic soup theory. Prebiotic soup self-organises into life-supporting networks on metal sulphide surfaces. Networks must incorporate membranes however there is no data on this.
- RNA World
The question as to whether the first self-replicating entity simpler than a cell. Short RNA molecules were discovered that can store information and catalyse chemical reactions (ribozymes). RNA molecules have been synthesised that are capable of self-replication. The question of how lipid membranes formed is still in question and unexplained.
Representations of DNA
Chromatin, chromosomes, double-helix, uncondensed.
Discovering DNA Structure
This was done by Watson and Crick 1953. This discovery was facilitated by many other people. Watson sees X-ray diffraction image of DNA and began to work with Crick who was working on helical diffraction in proteins in the same lab. The idea was then made to use X-ray diffraction to find the structure of DNA. This was assisted by Franklin who developed a better X-ray image of DNA. The first model by Watson and Crick produced a 3-stranded DNA model which was inconsistent with the images that Franklin had gathered. Chargaff found that there was a unity of the A/T and G/C ratios which pointed more inconsistencies in Watson and Cricks model. With Franklins work they discovered that DNA had a double helix structure.
Helical Diffraction
A technique by which to identify the structure and then assume and identify the function of proteins.
DNA
This is the genetic material. It is base-paired, anti-parallel, right-handed double helix. The code is cracked (triplets of A,T,C and G) code for individual amino acids. Amino acids are the building blocks of proteins. Gene to protein relationships established, the control of gene expression partly elucidated and large scale sequencing of genomes is now common (cost and speed).
How Genetic Code Works
Peptides (met-enkephalin) are present in humans made up of an amino acid sequence (Met, Tyr, Gly, Gly, Phe, Met) which has a DNA code (ATG, TAT, GGT, GGT, TTT, ATG). The 3 bases code for a single amino acid e.g. TTT = Phe. There are 64 possible combinations of genetic code that are possible which codes for 22 amino acids and ‘stop’ codes. The difference in the number of codes compared to amino acids means that some amino acids can be coded by multiple different base codes.
Parts of an Amino Acid Sequence
The start of this is known as an ‘N terminus’ while the end is known as a ‘C terminus’. In between these are where the different amino acids are placed. The ‘N terminus’ is adjacent to the 5’ end of DNA while the ‘C terminus is adjacent to the 3’ end.
DNA Code in Context
Control regions on DNA have specific sequences of G, A, T and C but not organised in triplets. These control regions also have certain characteristics as the upstream (before) controls has identifiers for the RNA to replicate and downstream (after) controls has stop codons. These also control the time and expression ability of the code for regulation. Code is composed of triplets of any of the 4 bases with each triplet being used for a particular specification. This is comparable to the 1/0 in binary code.
The Central Dogma
DNA self replicates -> The DNA is transcribed by RNA -> the RNA is translated into proteins. In RNA there can be reverse transcription where it becomes double stranded and can enter your DNA and replicate. This is done by certain viruses e.g. HIV and COVID19.
Nucleotides
The basic structural units of both RNA and DNA. This consists of a sugar, a nitrogenous base and a phosphate group.
Nucleosides
These are also structural units of both RNA and DNA. This consists of a sugar and a nitrogenous base only.
Naming
If a sugar is deoxyribose the prefix for the name should be ‘deoxy’ e.g. deoxyadenosine, deoxyadenosine monophosphate (dAMP). If the sugar is ribose the prefix should be ‘ribo’ e.g. riboadenosine triphosphate (rATP). DNA (deoxyribonucleic acid) contain deoxyribose sugar whereas RNA (ribonucleic acid) contains ribose sugar.
How the DNA/RNA Chain is Linked
The 5’ phosphate can form a phosphodiester bond by reacting with the 3’ hydroxyl group of another nucleotide.
Base Pairing
Between nitrogenous bases there is hydrogen bonding. A only combines with T (vis versa) in DNA whereas G only combines with C (vis versa) which is known as complementary bonding. Between A and T groups there is only 2 hydrogen bonds whereas with C and G groups there are 3 hydrogen bonds meaning C and G bonds are harder to break.
Antiparallel
The strands of DNA are in opposite directions e.g. if 1 strand is 5’-3’ then the opposite stand is 3’-5’.
Information Encoding
information is encoded by the order of bases 5’-3’. One strand is the coding strand while the other is the non-coding strand. The bases are read in triplets known as codons with 1 codon coding for 1 amino acid.
Right-Handed Double Helix
Looking down the double helix follows a clockwise pattern. The DNA forms major and minor grooves due to the right-handed double helix shapes. The major grooves help proteins to interact with the DNA molecule to control the expression of certain genes.
Modern Day DNA
Transgenics and gene knock out/in occurs in order to identify the purposes of genes to test organisms (typically bacteria) with lost/inactive genes (knock-out mutants) and those with additional genes (knock-in mutants). Genetic screening is used for personalised medicine. Viruses and living cells are created from synthetic DNA constructs. ‘Bioinformatics’ is a new scientific field now available.
Modern DNA Research
‘Junk DNA’ is not junk trying to find other purposes. DNA can change to other forms (A, Z, G). Chromosomal position and movement within the nucleus is preserved across species and affects gene expression.
General Features of Chromosome Replication
The complementary base-airing enables semiconservative DNA replication, DNA synthesis initiates at origins, synthesis usually moves bidirectionally away from an origin via 2 replication forks which creates a replication bubble, this moves in a 5’-3’ direction and the synthesis of new DNA always requires a primer.
Complementary Base Pairing
Each strand of a double stranded DNA molecules serves as a template for synthesis of a new complementary strand. A binds only with T while G only binds with C. The 5’ strand is called the S strand while the 3’ strand is called the S’ strand. The reason that DNA replication is accurate and identical is because it uses each of the old strands as a template in order to gain new bases and form 2 identical strands.
Semiconservative DNA Replication
Each strand of a DNA molecule serves as a template for synthesis of a new complementary strand. Each daughter molecule has parental strand plus a new strand. The accuracy and speed of replication is 1000 nucleotides per second without error. The group of proteins meet to operate as a protein machine moving along a replication fork. DNA polymerase adds nucleotides to the 3’ end of the new strand. DNA polymerase has proofreading property to reduce the error rate.
Origins
Double stranded DNA are pried apart at this point by helicase at an identified position with a particular DNA sequence. Shorter strands will only have one of these but larger DNA strands will have multiple. Strands with multiple of these will combine in order to synthesise even more DNA.
Bidirectional
From the origins the DNA will replicate in both directions.
Replication Forks
These structures are the points at which the DNA is not yet separated and seems to converge. There are 2 of these at every origin. All of the new DNA is synthesised at these points in a 5’-3’ direction. Due to the directionality (3’-5’) of one of the lagging strands Okazaki fragments are formed wheras on the leading strand the polymerisation process occurs continuously.
Okazaki Fragments
The the discontinued fragments of DNA added to the lagging strand which it is forced to do so as the direction of polymerisation is 5’-3’. Because one strand isn’t moving in this direction polymerisation must occur in small segments as the DNA continues to be separated.
Replication Bubble
This is created by the separation of the 2 strands of DNA during replication which leaves a space in the middle which looks similar to a bubble.
DNA Polymerase
This enzyme adds each deoxyribonucleotide to the 3’ end of a primer strand attached to the template strand.
DNA Primase
This is an enzyme that synthesises a short strand of RNA on a DNA template. On the leading strand in creates 1 RNA primer in order to synthesise the entire strand. On the lagging strand each Okazaki fragment requires an RNA primer meaning it makes jumps to place primers further down the strand to ensure there is a place for DNA polymerase to attach and continue polymerisation.
DNA Ligase
This enzyme joins the Okazaki fragments by their sugar-phosphate backbones and removes the primer once the polymerisation process is finished.
Single-Strand DNA Binding Proteins
These help to stabilise single stranded DNA and aid the helicase.
Helicase
This enzyme pries apart (unwinds) the double helix in order to form 2 single stranded DNA molecules for replication.
Sliding Clamp
This protein maintains the attachment of DNA polymerase to DNA strand and pushes it along making it more efficient at polymerisation.
Clamp Loader
This protein assembles the full clamp on the DNA is association with the sliding clamp and DNA polymerase using ATP energy. It also can assist in the efficiency of polymerisation.
DNA Polymerase I (E. Coli)
This form of DNA Polymerase is seen in numbers of 400 per cell where it functions in DNA repair as well as the maturation of Okazaki fragments (removes RNA primer and fills in gaps), It is slow moving adding 20 nucleotides per second and has a low affinity for nucleotides.
DNA Polymerase II (E. Coli)
This form of DNA Polymerase is seen in number of 100 per cell where it functions in DNA repair it is the slowest form adding 5 nucleotides per second and has a low affinity for nucleotides.
DNA Polymerase III (E. Coli)
This form of DNA Polymerase is seen in numbers of 10-20 per cell with it being the main DNA replication enzyme (for both strands), it is the fastest form adding 1000 nucleotides per second and has a high affinity for the nucleotides.
DNA Polymerase in Mammals
There are 5 types of these in mammals consisting of alpha, beta, gamma, delta and epsilon. Alpha is found in the nucleus and is involved with primase and elongates the primer with a short length of DNA. Beta is found in the nucleus and is involved in DNA repair. Gamma is found in the mitochondria and involved in the replication of mitochondrial DNA. Both delta and epsilon are found in the nucleus and their functions are highly debated and controversial however they both seem to be involved in the synthesis of the lagging and leading strands respectively.