Synthetic Genome/ Metabolism Flashcards

1
Q

What is the history of genome sequencing?

A

Phi X174 was the first virus genome to be sequenced in 1977

H. Influenzae was the first bacterial genome to be sequenced in 1995

Yeast was the first eukaryote genome sequenced in 1996

Arabidopsis was the first plant genome sequenced in 2000

The Human Genome Project took 15 years and cost $2.7 billion in 2001

MinIon from Nanopore can sequence a human genome in less than a day for less then $1000

Sequencing has become faster, cheaper and more accessible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Describe Phi X174 and what were the challenges in sequencing?

A

A bacteriophage that invades E. coli

Technical challenge -> being 5-6 kb in size was overcome using gel purification to collect the correct size DNA

Technical challenge -> difficult to assemble large constructs accurately and quickly was overcome using ligation under stringent annealing conditions

Technical challenge -> difficulty avoiding incorrect length oligonucleotides and proofreading was overcome by using assembly PCR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe Syn 1.0

A

1 kbp from M. mycoides synthetic genome was implanted into an M. capricolum cell, which then took on the implanted genome’s phenotype.

This created almost an exact copy of the M. mycoides genome.

This was technically reverse engineering.

Comparative genomics was used to get a hypothetical minimum number of genes necessary for life.

Oligonucleotides were created and annealed into 1.4 kbp dsDNA fragments which were inserted into E. coli plasmid vector (7 kbp in each vector) with 15 vectors then being assembled in yeast to make 1 large plasmid.

The plasmid was then extracted from yeast using rolling circle amplification in vitro. This created 1/8 of the needed genome thus had to be repeated 7 more times.

The genome was then either methylated or placed into a host that didn’t have a restriction system.

The whole process took 3 weeks and ended up not working which showed that we didn’t know enough to build a genome.

Syn 1.0 was planktonic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What were the problems in the methods with creating Syn 1.0?

A

Methods for transferring genome treatments into M. capricolum

Methods for extracting genome fragments from yeast

Methods for stitching large fragments together (Gibson assembly)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How was it decided what improvements to make to Syn 1.0?

A

Tn5 puromycin resistance transposon mutagenesis screen was performed on Syn 1.0 and 80,000 colonies were pooled together

PCR and sequencing was used to identify where in the genome the mutations were and were categorised as follows;

Genes with no mutations -> essential

Genes with frequent hits -> non-essential

Genes with hits but growth impairments -> quasi-essential

However paralog genes also exist with the same function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe Syn 2.0

A

Analysis identified 26 genes to add back in to the genome after Tn5 puromycin resistance transposon mutagenesis screen to create 2.0 which is considered to be the first bacteria with a functional reduced genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Describe Syn 3.0

A

Further analysis identified 42 genes for removal using Tn5 puromycin resistance transposon mutagenesis screening.

3.0 has 438 protein coding genes and 35 RNA coding genes, 79 of which cannot be assigned a function

Retained all genes required for the synthesis and processing of macromolecules but loss of genes involved in biosynthesis pathways of small molecules. All transport proteins were also retained.

It was found that rearranging the genes by type had no impact on the phenotype or functionality.

Slow growing filamentous mats were created thus is polymorphic to 1.0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe Sc 2.0

A

Yeast S. cerevisae synthetic genome project

~ 6000 genes across 16 chromosomes

Unlike Syn, this approach utilised the natural capacity of yeast for homologous recombination

30-60 kb sections were replaced at a time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe HGP 2.0

A

First HGP was about reading and the second is about writing ie. Whole genome engineering of human cell lines

So much remains unknown after HGP 1.0

Also allows the testing of controversial ideas such as recoding human codon usage to make us virus resistant

There are also many ethical concerns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe malaria and artemisinin

A

Malaria caused half a million deaths a year and A. annua (sweet wormwood) contains both artemisinin and dihydroartemisinin which can be biosynthesised (Tu Youyou Nobel Prize, 2015)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the problems with getting artemisinin from sweet wormwood?

A

Sweet wormwood takes between 6-8 months to harvest

Yields are variable due to weather as drought can affect seedlings and damp conditions can cause lodging

Price varies wildly which is unfair in places where malaria is prevalent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe the first iteration of the artemisinin biosynthesis pathway

A

The synthetic pathway is put into yeast and amplified using fermentation.

The mevalonate pathway begins with acetyl coA and ends with farnesyl pp. The farnesyl pp is converted to amorphadiene using amorphadiene synthase amorphadiene is then converted to artemisinic acid using cytp450 and cytp450 reductase. The artemisinic acid is then transported out of the yeast cell.

The artemisinic acid is then chemically converted to artemisinin.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe how the second iteration improved the artemisinin pathway

A

Increased the carbon flux to artemisinic acid by increasing the cellular activity of the key enzymes and restricting/removing unwanted enzymes (HMGR was x2 in the mevalonate pathway and farnesyl pyrophosphate synthetase was upregulated leading to a decrease in squalane (carbon competition) production.

Pathway was completed in a UPC2-1 semi dominant mutant background

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How much products were produced after the 2nd iteration artemisinin biosynthesis?

A

150 mg/l amorphadiene in shake flasks

100 mg/l artemisinic acid in yeast

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Describe the 3rd iteration improvements artemisinin pathway

A

3x HMGR

Use of strong Gal promoters

CENPK.2 industrial yeast strain which increased the amorphadiene production but not the artemisinic acid production so fermentation conditions were changed

Copper repression of squalane which was cheaper than methionine repression

CPR1 copy number decreased as it positively interacted with p450s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe how amorphadiene becomes artemisinic acid

A

6 genes

CYB5 amorphadiene -> artemisinic alcohol

ADH1 artemisinic alcohol -> artemisinic aldehyde

ALDH1 artemisinic aldehyde -> artemisinic acid

17
Q

After the 3rd iteration artemisinin pathway, how many products were there?

A

25 g/l artemisinic acid

This is enough for commercialisation and was reconstructed in a host suitable for commercial fermentation from inexpensive carbon sources

In 2015, 10% of artemisinin was semi-synthetic

18
Q

Why is there a need for renewable fuels?

A

Atmospheric carbon dioxide levels are increasing

There is an increasing demand for fuels however there is only a finite supply of fossil fuels

Geopolitical concerns i.e. Ukraine war

The transport sector is the 2nd biggest consumer of fossil fuels and accounts for more then 60% of global oil consumption

19
Q

Describe established biofuels

A

Biofuels are seen as a practical solution but there are problems to consider (food vs fuel/ first-generation feedstock such as sugar or palm oil)

Alcohols (primarily ethanols)

Biodiesels (BAMES, FAEES) but these don’t have the diversity of fuels needed for blends (blended with petroleum distillate at 10-15%)

The hygroscopic and gelling properties are not ideal as temperatures, branched alkanes and oxygen attracts water which leads to rust

20
Q

What are some problems with biogenic alkane sources?

A

Limited range of chain lengths

No branched chains (which is needed to stop stacking)

Not easily scalable from some sources

21
Q

Describe how fatty acids of different chain lengths can be produced in E. coli to produce different length alkanes

A

Alter their fatty acid profile by expressing luxCED, NpAD, fatB1, bFabH2 and branched chain ketone dehydrogenase complex

9 genes added to E. coli

Barely works but makes alkanes needed for diesel blending

22
Q

Describe the fatty acid pathway already present in E.coli

A

A condensation reaction producing free fatty acids from acetyl coA

23
Q

Describe how the free fatty acids in E. coli are made into alkanes

A

LuxCED (a fatty acid reductase) from a Cyanobacteria (P. Luminescens) and fatB1 (plant thioesterase) encodes a fatty acid reductase complex that produces fatty acyl-ACP from free fatty acids in E. coli

Aldehyde deformylating oxygenate (NpAD) from N. Punctiforne makes alkanes from the fatty acyl-ACPs instead of the original pathway of fatty acid to fatty aldehyde

Co-expression of 5 genes (luxCED, NpAD and fatB1

This alters the fatty acids thus alter as the alkane chain lengths

24
Q

Describe how branched alkanes are made in the fatty acid pathway

A

Branched chain ketone dehydrogenase complex made of 4 subunits and bFabH2 (bacillus enzyme) produce branched alkanes

25
Q

What is the CETCH pathway?

A

An entirely novel, in silico designed metabolic pathway for carbon dioxide capture

26
Q

How was the CETCH pathway conceptualised?

A

Using protein evolution, in silico protein design and enzymes from many sources

27
Q

What enzyme was used instead of Rubisco and why?

A

Enoyl-coA carboxylate reductase

ECRs are a diverse enzyme class from alpha-proteobacteria that are involved in many areas of metabolism (but not autotrophic carbon capture), span a wide range of substrates, are oxygen insensitive, use NADPH and have a higher catalytic turnover than Rubisco

28
Q

How was the CETCH pathway designed?

A

A design restraint was that the cycle must be autocatylitic and must have a dedicated output chemical (malate)

Weren’t limited to known enzyme catalysed reactions and instead used EC defined reactions which suggested a number of pathways

Pathways were assessed for thermodynamically feasible reactions and their energy consumption and the synthetic pathway chosen was found to be more energy efficient than the Calvin cycle

Contains genes encoding for every step and was expressed, purified and characterised

Iterative improvements were made in carbon dioxide conversion to malate

Includes a structure guided design to alter co-factor requirements

18 core reactions, 17 enzymes from 9 organisms across 3 domains of life

29
Q

What does CETCH stand for?

A

Crotonyl coA/ ethyl malonyl coA/ hydroxybutyrl coA