Lecture 9 - Molecular breeding of medicinal crops Flashcards

1
Q

Summarise the different approaches in molecular breeding

A

Must begin with genetic variation

  1. Natural variation from wild ancestors
  2. Induced variation from mutagenesis
  3. Engineered variation from transgenics

Then use next generation sequencing for DNA markers and gene discovery

  1. Parent selection MAS
  2. Forward (trait) screens
  3. Reverse (genetic) screens/TILLING/NGS
  4. Transgenic event selection MAS

Identify indiviual with improved genetics/traits

  • Quality control using DNA markers
  • Stacking of traits from the same or different surces of variation using MAS

Field trial potential new varieties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the process of 454 pyrosequencing?

A
  1. A pool of small DNA fragments are generated from genomic or cDNA sources
  2. A single strand is bound to a bead, which is then amplified in a water-in-oil microreactor resulting in 10 million copies of a unique DNA template per bead
  3. A single clonally amplified bead is depositied into a well in the PicoTiterPlate device with 400,000 wells
  4. Bases (TACG) are flowed sequentially across the picotiterplate device during a sequencing run
  5. when a nucleotide is incorperated, pyrophosphotase is released and this is linked to a luciferase based light emission which is recorded by the CCD camera
  6. The signal strength is proportional to the numeber of nucleotides incorperated
  7. The current FLX system generates 400 000 quality reads with an average read length of 230 bases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Give a disadvantage of 454 pyrosequencing

A

Hard to quantify the number of repeated bases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the time period for 454 pyrosequencing?

A

DNA library preparation and titration - between 4.5 and 10.5 hours

emPCR - 8 hours

sequencing - 7.5 hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why is 454 pyrosequencing important?

A

Important in the manner which it is used

  • Interactions of soil microorganisms
  • Gut microbiome
  • Plant natural variation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What can RNA sequencing be used for?

A
  • Gene discovery
  • Establishing expression levels of an expressed gene
  • Identifying polymorphisms within genes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the method of RNA sequencing to establish the levels of expression within genes?

A
  1. Isolate mRNA
  2. Convert to cope DNA (cDNA) using reverse transcriptase
  3. Sequence cDNA (electronic northern blot - hybrididsation based method for quantifying mRNA instead of radioactively labelling probe which gives a signal when it binds to mRNA)
  4. If the sequence occurs at a higher proportion to others (e.g. the small subunit of RuBISCO would expect a high band of mRNA) whereas low levels of expression e.g. TF not see bright band
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the process of marker assisted breeding?

A
  1. Identify molecular markers based on natural/induced variation as a result of the polyplodism of DNA
    • Single nucleotide polymorphisms (SNPs)
    • Short sequence repeats (SSRs)
    • Amplified FRagment Length polymorphisms (often come from using specific primers to amplify DNA, if there is a polymorphism where the primer normally binds it won’t work)
  2. Construct a linkage (genetic) map based on the segregation between hundreds of markers
  3. Identify quantitative trait loci (QTLs) based on the cosegragation of traits with molecular markers. This can be achieved by genotyping and phenotyping F1 or F2 mapping populations
  4. Use markers that associate with positive QTLs for tracking traits and selecting individuals with good genetic background for plant breeding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How can SNPs and SSRs be identified from 454 sequence data?>

A

If take individuals from populations and sequence data the majority are going to be identical but some polymorphisms. Mix up DNA and sequence it in ‘pool’. Computer algorith will spot where there are consistantly gene alterations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define forward genetics

A

Screen for mutatns with the phenotype of your dream, characterise the mutant and backcross to the recurring parent

Finding out which gene you have mutated to give the phenotype of interest is challenging. The manner by which the gene is identified depends on how the mutant was created.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the pros of forward genetics?

A

No prior knowledge of relevent genes required

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the cons of forward genetics?

A

Screens can be labourious/impractical

Small gene families/gene homologues can mean single gene mutations do not have a phenotype

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define reverse genetics

A

Start with the gene sequence, identify a mutation in that gene and see what the effect is on the plant

Loss-of-function mutants are normally the most informative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can loss of function mutants be produced?

A

Can be produced by antisense technology, insetional mutatgenesis or by heteroduplex mapping. (HDM = TILLING) in a mutagenised population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the pros of reverse genetics?

A

For HDM/TILLING mutations can be detected in the heterozygous state and crosses and selfing be used to produce homozygous single and double mutants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the cons of reverse genetics?

A
  • Extensive knowledge of the candidate genes required
  • Predominantly find loss of function mutations
  • Not very good for increasing the activity of an enzyme
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the CNAP fast track breeding system?

A
  1. Take seeds
  2. Treat with a chemical (EMS)
  3. Caues mutations in germline cells that go onto form seed
  4. Grow up plants and collect seed
  5. Grow lots of M2 individuals and isolate DNA (from around 5000 individuals)
  6. Use TILLING method to identify mutations in individual genes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the TILLING method?

A
  1. Isolate DNA from lots of individuals on gene A
  2. Want to find mutations in gene A so have lots of copies of gene A
  3. Use primers to amplify gene A (already know sequence)
  4. Put into pools of individuals and amplify with PCR
  5. During melting and reanealing if the first individual carrys a SNP mutation get an area of heteroduplex mismatch
  6. Enxyme Cel1 nicks areas with a heteroduplex in DNA
  7. When this is ran on polyacrylamide gel, can spot samples that have been cut as labelled the DNA
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does EMS mutatgensis result in ?

A

High mutation frequency in each individual

A backcross breeding strategy is used to remove mutation load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the mode of action of EMS?

A

Guanine to O-6-ethylguanine which can pair with thymine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the mutation frequncy?

A

Mutation frequency = total length of DNA screened (kbp) / number of mutations

(must times by two if there are x number of diploid plants)

22
Q

What is the arabidopsis genome size?

A

137Mbp

23
Q

What is the process of backcrossing following EMS mutagenesis that is used to remove high mutation load?

A
  1. Design molecular markers and backcross to recurrent parent (elite line for crops)
  2. Once do one backcross 50% progeny will be elite parent (recurrent parent) and 50% will be mutant
  3. If backcross F2 generation again this will be further halved
  4. Go through multiple backcrosses all the time selecting for individuals carrying genes of interest
    • Using molecular markers to identify scewed individuals so can get away with 1 or 2 backcrosses
24
Q

What is the latin name of the opium poppy and why was it named so?

A

Papaver somniferum L.

The milky juice (latex) was seen to be sleep inducing which inspired linnaeus

Morpheus greek god of sleep - morphine

25
Q

What were the features of morphine when it was first isolated?

A
  • Isolated by Fredrich Serturner, 21 yr old pharmacists apprentice
  • 1804
  • first plant derived drug to be isolated and prepared in pure form
  • commercially sold by merk in 1827
  • can’t be produced industrially because of the centre structure
26
Q

What are the main five alkoids produced in opium poppy?

A
  • Morphine (analgesic)
  • Codeine (analgesic, cough supressant)
  • Oripavine
  • Thebaine
  • Noscapine (cough supressant, potential anticancer agent)
27
Q

Through what pathway are Sanguinarine, Noscapine, Codeine and Morphine thought to be produced?

A

Benzylisoquinoline alkaloid metabolism in opium poppy

Looking to use TILLING on these steps to improve C flux through morphine production

28
Q

How were ten genes identified that were thought to be involved in noscapine synthesis?

A
  • Through transcriptome analysis in P. somniferum cultivar
  • Had a high morphine variety, high thebaine variety and a high noscopine variety. Sequences and isolated DNA from capsules (where most of the drug is produced) and identified a block of genes encoding enzymes that were only expressed in the variety making noscopine
  • Hypothesised that these genes were responsible for production of noscopine
29
Q

What allowed the development of PCR based dominant moleulcar markers for genes related to noscapine?

A

The ten genes exclusively expressed in HN1 are absent in HM1 and HT1

-identified by PCR on genomic DNA

30
Q

What did the development of PCR based dominant moleulcar markers for genes related to noscapine allow?

A

Follow the segregation of each of the genes in F2 mapping populations and investigate their link with the noscapine trait

31
Q

What evidence suggested that the 10 genes related to noscapine existed as a gene cluster?

A

Tight linkage in F2 (when HN1 crossed when HM1 and F1 plants selfed)

Co-segregation with the noscapine trait

32
Q

What level of noscapine is produced when the gene cluster is in a heterozygous state?

A

Produced at a low level

33
Q

How was it discovered that all of the 10 noscapine genes have evolved together and become clustered in the genome?

A

Made a library of genominc DNA from HN plants, cloned DNA into bacterial articificial chromosomes (BACs)

Allows the cloning of massive fragments (around 200kbs) into a single plasmid and express it in E.coli

Make a BACV library and sequenced

34
Q

How was the pathway of noscapine expression identified?

A

Used virus induced gene silencing to intoroduce antisense versions of the genes into plants

Knock down genes and then able to construct the pathway

(functional genetics)

35
Q

What did sequencing reveal about the noscapine gene cluster and how>?

A
  • screening of an HN1 BAC library using part of a CYP82X2 as screening probe yeilded 6 overlapping clones
  • sequencing relvealed that the ten genes span a region of 221 kb
  • for some genes duplication and neofunctions appear to have occured at the cluster site
  • For others gene dupliation appears to have been at a remote site followed by relocation
36
Q

What was the impact on breeding opum poppy varieties with novel alkoid profiles?

A
  • accumulation of nomally low abundance intermediates in BIGS-plants (e.g. narcotoline)
    • raises the prospect of obtaining novel varieties with stable high levels of these compounds through mutation breeding
  • the complex biochemical trait of noscapine biosynthesis segregating like a single medelian genetic locus mean that it can be introgressed with relative ease into opium poppy cultivars optimised for other traits (e.g. agricultural performance, specific alkoid profiles)
37
Q

What does artemenisin do?

A

Kills the malaria parasite in the blood

38
Q

What is the time line for artemisinin discovery?

A

1964 - malaria causes huge numbers of casulties

1967 - initiaition of progect 523 in may 1967

1971-2 - discovery of atriminsin extracts effective at curing malaria, inspired by an ancient recipe from traditional chinese medicene. Arteminisin identified as the active ingredient

1977 - first publication of the checmical structure of arteminisin

2001 - WHO recommended the use of artemenisin based combination therapies (ACTs) in global fight against malaria

39
Q

What are the targets for the development of new varieties of A annua with increased yield of arteminsin per hectare?

A
  • Increase amount of artemenisin in the trichome metabolite pathways
  • Increase the number of tichome/lead: trichome density/laf area
  • Increase the number of leaves on a plant: braching, total biomass, leaf:stem ratio, delayed flowering
40
Q

How was it discovered that arteminisin is made in the trichomes?

A
  1. Isolated RNA from trichomes
  2. Sequenced RNA
  3. Discvovered genes for arteminisin production
  4. Used a number of strategies to try and make plants that made more ateminisisin
41
Q

How were individuals with high arteminisin yield produced?

A

Induced variation -> reverse genetic screens

Natural variation -> marker assisted breeding

  • > identfied individuals with high artemininsin yield
  • > developed robust new varieties
42
Q

What traits were looked at for high levels of arteminisin production and how were these measured?

A

Flowering times

  • Time to flower
  • number of leaves at flowering
  • number of branches at flowering

Biomass

  • Number of branches
  • Height
  • Average branch length
  • Conformation by wet and dry weight

Trichome density

  • Trichome reflection
  • Haririness
  • confirming through microscopy

Arteminisin content

  • High-throughput ELISA screening
  • Medium-throughput TLC screening
  • Confirmation by HPLC
43
Q

What were the features of the forward (trait) screen for high arteminisin levels?

A

High through-put screen of M2 (selfed) plants for arteminisin yield

  • chloroform dip
  • UPLC MS with 2.5 min run time

Identify high yielding individuals and confirm trait in the field

1000 plants screened every two weeks

44
Q

What were the results from the forward screen on arteminisin levels?

A
  • Screened 21000/25000 plants
  • identified 230 high yeilding individuals
  • (between 1.5 and 3 times higher)
45
Q

What were the actions taken following the forward screen of arteminisin levels?

A
  • SNP genotyping in artemis
  • Linkage map of Artemisia annua (mapping - matebolites, archtechure, leaf traits, tichomes)
    • 1536 SNPs were genotyped for 1152 samples using illumina
      • 275 (breeding population) F1 artimis mapping population (to produce genetic linkage map)
      • 2 artemis parental lines (C3 and C4) per plate as controls
    • segregation data on each SNP provided the genotype data for mapping
  • Screened thousands of markers against locus of interest
    *
46
Q

What QTLs were identified for atemenisin based on co segragagation of traits with molecular markers?

A
  • Metabolites
    • arteminism
  • Trichome
    • number
    • density
  • Leaves
    • leaf area
    • leaf to stem ration
  • Biomass
    • leaf dry weight
    • whole plant dry weight
  • Flowering time
  • Plant architechure
    • height
    • branching
    • number of nodes
47
Q

What was found of the hybrids generated from selected parents for high artemisin levels?

A

Number of positive QTLs in best hybrids better than the number of positive QTLs in lower hybrids

48
Q

What is Hyb8001 “Zenith”?

A
  • A fast growing, high biomass hybrid suited to china uganda and madagascar
    • Trialled at 13 independent sites across chine, india, madagascar and uganda
  • Artemisinin yeild: Maximum concentration 1.44% (madagascar); Maxiumu leaf dry weight 4488kg/ha (china); Maxiumum yield; 54.6kg/ha (madagascar)
49
Q

What have been the impacts of the artemisinin study?

A
  • Molecular breeding platform - linkage and QTL maps
  • Growing seed sales of CNAP hybrids into Africa over the last 3 years sufficient to provide up to 200 million ACT treatments for malaria sufferes in the developing world
  • registration of a lead hybrid Hyb8001r in china in 2015
50
Q
A