How do You Sequence a Genome Flashcards
what states can DNA/RNA/Protein strands exist?
DNA, two: single stranded like viral genomes or double stranded
RNA, one: single stranded
Proteins, one: single stranded
what does the information of DNA sequence used for?
gives the ability to align sequences (usually in 11 bases)
some sequence runs overlap with information, what does it give us?
the alignment of theses sequence runs allows us to create a summary sequence called contain
how long does a sequence have to be for us to expect it to be unique
how many distinct permutations in sequence can a stretch of a sequence n bases long have?
4^n = the number of permutations
what is a contiguous sequence
summary sequence of aligned sequences
some sequence runs overlap with information, what does it give us?
the alignment of theses sequence runs allows us to create a summary sequence called contain
he alignment of theses sequence runs allows us to create a summary sequence called contain
what is a contiguous sequence
when is a sequence unique
when the number of permutations is greater than the number of sequences of n bases in the genome
how many would you expect to have the sequence? say you have 9 genomes made of random sequence of 3 x 10^9
4^17=17x10^9>6x10^9
6x10^9/17x10^9=1/3
therefore, 1/3x9=3
you would expect about 3 of these random genomes to have this sequence
what are the total bits of a sequence n?
it is the addition of the bits at each position
A=00
G=01
C=10
T=11
what are sequence logos
they are a representation of the consensus sequence that DNA binding proteins search for in genomes
what does DNA binding proteins do
recognizes a site that has a TAAT
what does information theory provide?