1. Intro Flashcards
Terms and concepts
* prerequisites: molecular genetics; principles of Sanger sequencing and next-generation sequencing
* sequence assembly
* Margaret Dayhoff & her contributions
* examples of applications of sequencing data in biology
* Linux: connect to the cluster using ssh
Explain Sanger sequencing
Explain Illumina sequencing
Explain PacBio sequencing
Protein Sequencing?
Deducing (assembling) the target sequence
data: protein sequence fragments (e.g. breakdown P and breakdown Q of the same protein)
approach: test all possible combinations for overlap between P and Q fragments
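A minimal sketch of the overlap test described above, assuming fragments are plain strings of one-letter amino acid codes. The function names, the `min_len` threshold, and the example fragments are hypothetical and chosen only for illustration; this is not the historical procedure, just one way to try every P/Q pair in both orientations.

```python
from itertools import product


def suffix_prefix_overlap(left, right, min_len=1):
    """Return the length of the longest suffix of `left` that equals a prefix of `right`."""
    max_len = min(len(left), len(right))
    for k in range(max_len, min_len - 1, -1):
        if left[-k:] == right[:k]:
            return k
    return 0


def all_overlaps(p_fragments, q_fragments, min_len=2):
    """Test every P/Q pair in both orders; return (left, right, overlap_length) tuples."""
    hits = []
    for p, q in product(p_fragments, q_fragments):
        for left, right in ((p, q), (q, p)):
            k = suffix_prefix_overlap(left, right, min_len)
            if k:
                hits.append((left, right, k))
    return hits


# Made-up fragments from two different breakdowns of one short peptide.
breakdown_p = ["MKTAY", "IAKQR"]
breakdown_q = ["MKT", "AYIAK", "QR"]

for left, right, k in all_overlaps(breakdown_p, breakdown_q):
    print(f"{left} -> {right} (overlap {k})")
```

Because the two breakdowns cut the protein at different positions, a Q fragment that bridges two P fragments reveals their order. Note that this simple check also reports a fragment fully contained at the start of another as an overlap.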
Name two early Bioinformaticians and their background
Margaret Dayhoff (physical chemist) and Walter Goad (theoretical physicist)
- became interested in biological data
- imported tools & problems & practices into biology
biological sequences: fundamental in biology
* became increasingly available in the 1950s
* one-dimensional patterns of symbols: amenable to quantitative & computational tools
Early sequence bioinformatics?
- data storage and management
* centralizing of data
* sequence database
- sequence analysis ➟ mathematical problems:
* pattern matching and detecting
- assemble sequences, analyze conserved sequence patterns, identify homologs
* statistical problems
- codon bias
* “paleogenetics”: use sequences to document the evolutionary history of sequences & organisms
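As an illustration of the "codon bias" item in the list above, here is a small sketch that counts codons in a coding sequence and compares usage among synonymous codons. The function names and the example sequence are hypothetical; the six leucine codons are taken from the standard genetic code.

```python
from collections import Counter


def codon_counts(cds):
    """Count codons in a coding sequence, reading in frame from the first base."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return Counter(cds[i:i + 3] for i in range(0, usable, 3))


def relative_usage(counts, synonymous_codons):
    """Fraction of each codon among a set of codons encoding the same amino acid."""
    total = sum(counts[c] for c in synonymous_codons)
    if total == 0:
        return {c: 0.0 for c in synonymous_codons}
    return {c: counts[c] / total for c in synonymous_codons}


# Leucine codons in the standard genetic code.
LEUCINE = ["TTA", "TTG", "CTT", "CTC", "CTA", "CTG"]

# Made-up sequence used only to exercise the functions.
counts = codon_counts("CTGCTGCTGTTACTG")
usage = relative_usage(counts, LEUCINE)
print(usage)
```

A strongly uneven distribution among synonymous codons, as in this toy sequence, is what is meant by codon bias; real analyses would use many genes and test whether the deviation is statistically meaningful.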