Genomes Flashcards
Long repeated sequence
several thousand nucleotides long
Tandem LRS
next to each other
Dispersed LRS
spread throughout genome
Short repeating sequences
difficult to sequence b/c the strand can fold back on itself to form a double-stranded structure
can make secondary structure
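Worked example (a minimal sketch, not a structure predictor): a strand can fold back on itself when a stretch of bases is followed, further along, by its own reverse complement. The function name `find_inverted_repeats` and the parameters `stem` and `min_loop` are hypothetical, chosen only for this illustration; real secondary structure prediction also weighs base-pairing energies.

```python
# Map each base to its Watson-Crick partner (DNA alphabet assumed).
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def find_inverted_repeats(seq, stem=4, min_loop=3):
    """Return (left_start, right_start) pairs where a stem-length stretch
    is followed, at least min_loop bases later, by its reverse complement.
    Such a pair could base-pair with itself and form a hairpin."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - stem + 1):
        target = reverse_complement(seq[i:i + stem])
        # Only look downstream of the stem plus a minimal loop.
        j = seq.find(target, i + stem + min_loop)
        while j != -1:
            hits.append((i, j))
            j = seq.find(target, j + 1)
    return hits
```

For example, in `GGGCAAAAGCCC` the first four bases can pair with the last four, leaving `AAAA` as the loop.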
human genome similarity
99.7-99.9%
Individual’s genome can indicate
susceptibility to disease
drug sensitivity
personalized medicine
Genomes contain sequence types:
- protein coding regions
- noncoding regions
- regions that are transcribed into RNA but never translated into protein
Genome annotation
process by which researchers ID the various types of sequence present in a genome and where they are located
Sequence motif
signature of a protein-coding gene
found by looking for very long stretches of codons in a reading frame that make up open reading frames
Protein coding region
contains open reading frame
Open reading frame motif
long stretch of codons for amino acids with no stop codons
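Worked example (a minimal sketch under stated assumptions): scanning for this motif amounts to walking each reading frame codon by codon and reporting long runs with no stop codon. The sketch assumes the standard stop codons (TAA, TAG, TGA), requires an ATG start, and checks the forward strand only; `find_orfs` and `min_codons` are hypothetical names, and the cutoff of 100 codons is an arbitrary illustration, not a standard.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code assumed

def find_orfs(seq, min_codons=100):
    """Return (start, end, frame) for each ATG-initiated, stop-free run of
    at least min_codons codons on the forward strand. `end` is the index
    just past the stop codon."""
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None:
                if codon == "ATG":
                    start = i  # open a candidate reading frame
            elif codon in STOP_CODONS:
                # Count codons from ATG up to (not including) the stop.
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None
    return orfs
```

A real annotation pipeline would also scan the reverse complement and score candidates statistically; this only shows the "long stretch with no stop codon" idea.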
transfer RNAs
forms hairpin structure, can fold back on itself
Transcription factor motifs
hard to identify b/c short
6-8 nucleotide long sequences
close to each other, upstream of long ORF
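Worked example (a rough sketch): one way to see why short motifs are hard to annotate is to estimate how often a sequence of that length turns up by chance. The calculation assumes all four bases are equally likely and independent, which real genomes do not satisfy; `expected_chance_hits` and `genome_len` are hypothetical names for this illustration.

```python
def expected_chance_hits(motif_len, genome_len):
    """Expected number of exact matches to one fixed motif on a single
    strand of random sequence with equal base frequencies."""
    positions = genome_len - motif_len + 1  # possible start positions
    return positions / 4 ** motif_len

# A 6-nucleotide motif is expected about once every 4**6 = 4096 positions,
# an 8-nucleotide motif about once every 4**8 = 65536 positions.
for k in (6, 8):
    print(k, expected_chance_hits(k, 1_000_000))
```

Because chance matches are this frequent, position helps: matches clustered together just upstream of a long ORF are more likely to be real.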
Notes about Genome annotation
imperfect
hypothetical protein
Hypothetical protein
common annotation
found in a large ORF, but we don't know what it is
conserved regions
Comparative Genomics
analyzes differences and similarities in protein-coding genes in the genomes of different species