Molecular 7-13 Flashcards

1
Q

In non-homologous sequences how do you find local alignments ?

A

Add gaps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When comparing homologous sequences what does similarity not tell you ?

A

The order of descent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is comparing amino acid sequences less prone to error than comparing DNA sequences ?

A

DNA sequences contain indels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Is a single long gap more likely then multiple short gaps during sequence alignment ?

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

If different alignments generate a similar score what is likely to be the best alignment ?

A

The one with the fewest gaps

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What receives the largest scoring penalty during automised alignment ?

A

Gap penalties > mismatch penalties

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why does the COI gene have no indels with closely related species ?

A

Slow rate of evolution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is needed for the construction of phylogentic trees with regards to sequence quality ?

A

> 20% amino acid identity for protein coding sequences

>66% for non-coding DNA sequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an orthologous gene ?

A

A gene that is in 2 separate species due to a speciation event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a paralogous gene ?

A

homologous gene that occurred due to a duplication event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a xenolog ?

A

A gene that is acquired by HGT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What type of genes are phylogentic trees based on ?

A

Orthologous genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Other than trees being based on base or amino acid similarity what else can trees be based on ?

A

Character states such as specific bases in a specific site,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the benefits and limitations of 3rd generation sequencing ?

A
Benefits:
Avoid PCR thus reduces error and amplification bias
Real time measures 
No DNA fragmentation required
Low price per Mbp generated 
Cons: 
High error rate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the 3 steps in 2nd generation sequencing ?

A
  1. DNA fragmentation and adapter ligation
  2. Clonal amplification by emulsion/bridge PCR
  3. Cyclic array sequencing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What defines ancient DNA and what are some of its properties?

A

Ancient DNA (aDNA) is defined as that recovered from biological samples that were not preserved specifically for later DNA analysis. It is typically broken into short fragments and damaged by exposure to high temperatures, moisture, ultraviolet radiation, oxygen.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are some of the uses of aDNA?

A

It is used for the analysis of the phylogeny of extinct species and to estimate timing and impact of events, such as migrations, hybridisation and extinctions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Which is the name of the science that uses ancient DNA?

A

Paleontology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are they key steps in Sanger sequencing ?

A
  1. Attain ssDNA and add radioactive primers
  2. Seperate into 4 tubes and add ddNTP, polymerase and dNTPs.
  3. Run through a denaturing gel
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are 2 disadvantages of non automised sanger sequencing ?

A

Cant run everything in one lane

Cant read sequences at top of gel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How was sanger sequencing made automised ?

A

ddNTPs were fluorescently labelled which can then be read by a laser

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are the 4 applications of protein sequencing ?

A

Identification of the protein family
Prediction of the sequence of the gene encoding the particular protein
Discovery of the structure and function of the protein
Evolution history of protein family

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the steps in sangers protein sequencing method ?

A
  1. Add sangers reagent to protein
  2. Gives a derivative protein which form a yellow DNP on the terminal amino
  3. Apply acid hydrolysis which cleaves the protein and yellow DNP which is attached to the terminal amino acid.
  4. Identification of terminal amino acid by chromatography
  5. Repeat method with different fragmentations of the protein
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are the steps in Edman degradation protein sequencing ?

A
  1. Add PITC to protein which reacts with terminal amino group
  2. Apply acid hydrolysis which cleaves the protein
  3. Repeat steps 1-2 until all terminal amino groups have been cleaved
  4. Analyse PTH residues by TLC and HPLC
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is mass spectrometry ?

A

Analytical technique that ionises chemical species and sorts them ions based on size and mass.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What does mass spectrometry allow ?

A

Allows the precise identification of a peptide mass and thus the amino sequence.

27
Q

What are the two methods of mass spectrometry ?

A

Peptide mass fingerprinting

Tandem mass spectrometry

28
Q

What are the steps in mass spectrometry ?

A
  1. Digest protein
  2. Sort protein by chromatography and charge added in a MS machine
  3. This allows mass identification of protein fragments
  4. Additional fragmentation by collision
  5. Compare fragments peaks to database
29
Q

What is the difference between tandem mass spectrometry and peptide mass fingerprinting ?

A

Additional fragmentation step done by HPLC

30
Q

What are the 2 theories regarding mt and cp origins ?

A

Autogenous: descended from nuclear genome
Endosymbiotic: Descended from free living prokaryotic cells

31
Q

What are the 2 theories within the endsymbiotic theory ?

And what do they state ?

A

The gradual origin: Aerobic bacteria + Anaerobic prokaryote = aerobic eukaryote with mt.
Primitive aerobic eukaryote + photosynthetic bacteria= Photosynthetic eukaryote with mt and cp

The fateful encounter: same events by chance

32
Q

What evidence is their for the Endosymbionts theory ?

A
mt and cp have :
Developed cell membranes 
Can self replicate 
Produce proteins 
Have own DNA
33
Q

What are conserved in land plant chloroplasts ?

A

introns

34
Q

What is genomics ?

A

defined as the study of the genome—the complete set of genes and DNA information from complete organisms, genes and their functions.

35
Q

What is the difference between genomics and genetics ?

A

Genetics scrutinises the function and composition of a single gene where as genomics addresses all genes and their relationships in order to identify combined influences on the organism.

36
Q

What is transcriptomics ?

A

Is the study of the transcriptome—the complete set of RNA transcripts that are produced by the genome, under specific circumstances or in a specific cell—using high-throughput methods, such as microarray analysis.

37
Q

What does the comparison transcriptomics allow ?

A

Comparison of transcriptomes allows the identification of genes that are differentially expressed in distinct cell populations, or in response to different treatments.

38
Q

What does proteomics allow ?

A

analyse the structure, function, and interactions of the proteins produced by the genes of a particular cell, tissue, or organism.

39
Q

What is proteomics ?

A

is the study of the proteome,—the complete set of proteins expressed in a cell, tissue or organisms by a genome

40
Q

What is metabolomics ?

A

is the study of the metabolome—the collection of all metabolites in a biological cell, tissue, organ or organism, which are the end products of cellular processes

41
Q

What is name of the science that combines information generated from omics ?

A

Systems biology

42
Q

What is systems biology ?

A

the computational and mathematical modelling of complex biological systems. It is a holistic approach to deciphering the complexity of biological systems that starts from the understanding that the networks that form the whole of living organisms are more than the sum of their parts.

43
Q

What is metagenomics trying to answer ?

A

trying to identify who is in the community through genomic data and what are they doing through transcriptomics, proteomics and metabolomics.

44
Q

What is the difference between genomics and metagenomics ?

A

Genetic material is recovered from the environment and not pure cultures.
Metagenomics is the study of a community

45
Q

What are the 2 sequencing methods used in metagenomics ?

A

Amplicon sequencing- metabarcoding eg COI barcode for animals
Whole metagenome sequencing

46
Q

What does metatranscriptomes show ?

A

Shows how the community respond to environmental changes . Show which genes are expressed

47
Q

What does metaproteomics shows ?

A

Proteins produced by community

48
Q

What is an application of metagenomics and transcriptomics ?

A

Bioremediation- Oil spill in gulf of mexico. Microbes involved in break down of hydrocarbons

49
Q

What is an application of meta proteomics ?

A

Gives significant insight into how crohns disease occured through comparisons of healthy proteomes and disease proteomes

50
Q

What are the limitations of metagenomics ?

A

DNA extraction is difficult
Assembly of DNA is difficult
Many important phenomena that occur at strain level

51
Q

What is needed for the calculation of rate of point mutations ?

A
Frequency of mutant (q)
Selection coefficient (s)
52
Q

How do you calculate neutral mutation rate ?

A

Use non-functional homologous DNA of 2 species with known divergence and generation times and compare.
Around 100 new point mutations per gen

53
Q

Why can synonymous mutations be used to calculate divergence ?

A

Because they do not effect fitness

54
Q

Why do pseudogenes undergo a high rate of nucleotide substitution ?

A

Less selection pressures as they are non functional

55
Q

What is meant by substitution rate ?

A

The rate it takes for all individuals of a species to have a particular mutation

56
Q

What does correlation on the molecular clock allow you to calculate ?

A

Divergence for other species pairs

57
Q

What controversies are there about the molecular clock ?

A

Rate of amino acid replacement per site is higher during speciation event
Dating may be wrong
Test developed independent of fossil records

58
Q

How do you test a molecular clocks accuracy ? Describe it

A

Relative rate test :
Using 2 species with a common ancestor and an outgroup calculate number of substitutions between species.
If values are not significantly different from 0 the species evolved at the same rate

59
Q

What does dN/ds>1 and <1 mean ?

A

dN= Number of non synonymous mutations per non synonymous site
ds= Number of synonymous mutations per synonymous site
>1 means positive selection
<1 means purifying selection

60
Q

What are the 3 hypothesis that explain why rats substitution rates are higher than humans ?

A

Generation time effect = Short gen= faster substitution rate
DNA repair hypothesis= better DNA repair= low substitution rate
Metabolic rate hypothesis = High metabolism = high substitution rate

61
Q

What is the relationship found in bird with regards to molecular clock ?

A

Strong negative relationships between body mass and substitution rates of birds. eg large bird = low rate

62
Q

What is the difference between phylogenetic gradualism and punctuated equilibrium ?

A

PG says evolution and speciation is regular and slow

PE says evolution and speciation is calm periods interrupted by rapid evolutionary change and intense speciation

63
Q

Why is mutation rate higher in mt protein coding genes than nuclear protein coding genes ?

A

Low fidelity of replication
Inefficient repair
High conc of mutagens
Smaller population size