Tutorials for final Flashcards
Describe microsatellites
short simple sequence repeat frequent in the human genome (~every 30000bp) 1-5bp long tandem repeats of 15-100 copies mutate by replication error highly polymorphic
What do you know in there are multiple independently-derived mutants with the same phenotype?
has to be a mutation in the SAME gene
can be different mutations in the same gene
Name a type of next generation sequencing
pyrosequencing
What is the average read length of pyrosequencing? How many reads per run?
700 bp
1 million reads per run
What is the accuracy of pyrosequencing?
99.9
How long does it take to complete one run of pyrosequencing?
24 hours
What are TAIR and MGI?
specialized databases
TAIR is for Arabidopsis
MGI is for mice
What is blastx?
nucleotide query translated into all 6 reading frames against a protein database
What is tblastn?
protein query, searched against a nucleotide database in all 6 reading frames
What is tblastx?
nucelotide query translated to protein in all 6 reading frame searched against a nucleotide database translated in all 6 reading frames
What is tblastx?
nucelotide query translated to protein in all 6 reading frame searched against a nucleotide database translated in all 6 reading frames
What does the score tell you?
how many hits there were
the longer the sequence the higher the potential score
ie 800/1000= 800 score
but 100/100= 100 score (more accurate)
When a high score a better match?
only when comparing fragments of the same size
How are scores given?
positive score for each matching pair, negative value given for each gap needed to match
How many nucleotides are currently needed to have a unique sequence?
19