Manipulating Genomes Flashcards
What is Sanger sequencing ? (Chain termination method)
Uses modified nucleotides called DIDEOXYNUCLEOTIDES
- they pair with the complementary nucleotides on template strand during DNA replication
- when DNA polymerase encounters dideoxynucleotide on developing strand it stops replicating - hence CHAIN TERMINATING metho
Chain termination method ?
https://youtu.be/KRjfIXh_u2w?feature=shared
- 4 test tubes that contain the singe strand template DNA to be sequenced, DNA polymerase, DNA primers, free nucleotides A, C, T, and G, and 1/4 types of dideoxynucleotide; either A, C, T, or G
- Test tubes incubated at temp that allows DNA polymerase to function
- Primer anneals to the start (3’ end) of single stranded template - produce short section of double stranded DNA at start of sequence
- DNA polymerase attaches to double stranded section/begins DNA replication using free nucleotides in test tube
H bonds form between complementary bases on nucleotides - DNA polymerase can insert one of the dideoxynucleotides (ddNTP) by chance -> termination of DNA replication
- Complementary DNA chains produced vary in length , as when the ddNTP are inserted varies
- Once incubation period is over, the new complementary DNA chains separated from TEMPLATE DNA
- Single stranded DNA separated according to length using GEL ELECTROPHORESIS - smaller fragments travel further
What is high throughput sequencing ?
Describes multiple DNA sequencing technologies that allow simultaneous sequencing of multiple DNA strands
- faster /cheaper way to sequence genomes
Most methods are now AUTOMATED instead of manual interpretation - faster
E.g : capillary gel electrophoresis
Nanopore sequencing /pyrosequencing - next generation sequencing methods (newer)
What are high throughput sequencing methods useful for?
Fields fo study in:
Evolutionary biology /classfiication
Personalised med
Disease diagnosis
What is DNA Sequencing ?
Technique that allows the nucleotide base sequence an organism’s genetic material to be identified
What is a genome?
The complete set of genetic material present in a cell/organism
What is bioinformatics ? How has it allowed for comparison between genomes ?
Field of biology that involves the storage, retrieval, and analysis of data from biological studies
- studies may generate data on DNA /RNA /proetin sequences/relationship between genotype and phenotype
- large databases contain info about organisms gene sequences/ amino acid and protein sequences
- when genome is sequenced , bioinformatics allows scientists to make comparisons with the genomes of other organisms using the databases
- see how closely related organisms are
How is genetic variation and evolutionary relationships investigated by comparison of genomes?
Species with high level of genetic variation will have larger no. Differences in base sequences between individuals
Species with small no. Differences between their genomes likely to share more recent common ancestor than species with large no. Differences
- cytochrome C proetin can be used for comparison
How can genome sequencing aid the understanding of genotype-phenotype relationships?
Explored by stopping the expression of different genes /observing its effect on the phenotype
- when organisms genome sequence is known , scientist can target specific base sequences to ‘knock out’
Can also: predict an organism’s phenotype by analysing its genotype
Bioinformatics has allowed the comparison of all data and identified genotype-phenotype correlations - help predict what health problems people are at risk of
How can genome sequencing help in epidemiology?
Genomes of pathogens sequenced /analysed to aid research and disease control
- highly infectious strains can be identified
- The ability of a pathogen to infect multiple species can be investigated
- most appropriate control measures can be implemented/its spread
- Potential antigens for vaccine production can be identified
Computerised comparisons between genomes of those with and without disease used to detect any mutations that could pose increased risk of disease
How can gene sequencing be used to predict protein tertiary structure?
genetic code used to predict the amino acid sequence within a protein (order of bases codes for sequence of amino acids)
- can then predict how the new protein will fold into its tertiary structure
This information can be used in synthetic biology
What was the human genome project?
Began in 1990 - international,collaborative research programme
—> publicly funded so no commercial interests/influence
DNA samples taken from multiple people around the world, sequenced, and used to create a reference genome
Labs around the globe sequenced different sections of specific chromosomes
By 2003 the human genome sequenced to 99.9%accuracy
- finished genome was over 3 billion base pairs long but only about 25,000 genes (quite low)
Applications of the human genome project ?
- info from project used to tackle HUMAN HEALTH ISSUES - nd goal of finding CURES
- Scientists found a correlation between changes in specific genes and the likelihood of developing certain inherited diseases (cancers, Alzheimer’s )
What is a proteome and why is it difficult to determine the proteome for humans?
Proteome: full range of proteins produced by the genome
Large amounts of NON CODING DNA present in genomes
Regulatory genes/alternative splicing effect synthesis of proteins/gene expression
Why is the proteome larger than the genome?
Alternative splicing - allows for single gene to produce multiple proteins
Post translational modification of proteins