Bioinformatic Tools Flashcards
1
Q
How many genome sequences had been completed as of May 2008?
A
- More than 2000 viruses, defective viruses, and viroids
- 1325 plasmids
- 1373 mitochondria
- 131 chloroplasts
- 109 archaea
- 687 true bacteria
- 23 eukaryotes
2
Q
How many genome sequences had been completed as of August 2015?
A
- 4860 viruses, defective viruses, and viroids
- 5714 plasmids
- 7015 organelles
- 46,988 prokaryotes
- 2384 eukaryotes
- Additional eukaryotic genome sequences are underway
3
Q
What happens to the genome sequences after they are completed?
A
- The sequences are incorporated into various databases, e.g. GenBank and the EMBL Data Library
- Bioinformatics tools are essential for analysing, understanding and mining the sequencing data
4
Q
What is GenBank?
A
- A bioinformatics tool: a database of DNA sequences
- Maintained by the National Center for Biotechnology Information (NCBI) in the USA; it is the largest publicly available database of DNA sequences
- Contains about 100 billion bases of sequence and doubles in size roughly every 14 months
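
To get a feel for what a 14-month doubling time implies, the growth can be sketched as simple exponential arithmetic. This is only an idealisation: the function name `projected_bases` is hypothetical, the 100 billion starting figure and 14-month doubling time are taken from the card, and real database growth is not this regular.

```python
def projected_bases(start_bases: float, months: float,
                    doubling_months: float = 14.0) -> float:
    """Project database size assuming steady exponential doubling.

    Assumption: growth is perfectly regular, which real data is not.
    """
    return start_bases * 2 ** (months / doubling_months)


if __name__ == "__main__":
    start = 100e9  # about 100 billion bases, the figure quoted on the card
    for months in (14, 28, 60):
        size = projected_bases(start, months)
        print(f"after {months} months: {size:.3e} bases")
```

Under this assumption the size doubles after 14 months and quadruples after 28.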
5
Q
What is GenBank used for?
A
- Permits identification of genes after a genome has been sequenced; this process is called annotation
- Compares known sequences in the database with newly sequenced genomic DNA using BLAST (Basic Local Alignment Search Tool)
- As individual genes are sequenced, identified, and named, they are deposited in GenBank and given an accession number that can be used for retrieval
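
The idea of comparing a new sequence against known database sequences can be illustrated with a small local alignment sketch. This is not BLAST itself: BLAST uses a faster seeded heuristic, whereas the code below uses plain Smith-Waterman dynamic programming, the exact local alignment approach that BLAST approximates. The scoring values, the function names, and the toy database entries are all hypothetical and chosen only for illustration.

```python
def local_alignment_score(query: str, subject: str,
                          match: int = 2, mismatch: int = -1,
                          gap: int = -2) -> int:
    """Return the best Smith-Waterman local alignment score.

    Scoring parameters are illustrative assumptions, not BLAST defaults.
    """
    prev = [0] * (len(subject) + 1)  # previous row of the score matrix
    best = 0
    for q in query:
        curr = [0]  # first column is always zero
        for j, s in enumerate(subject, start=1):
            diag = prev[j - 1] + (match if q == s else mismatch)
            up = prev[j] + gap        # gap in the subject
            left = curr[j - 1] + gap  # gap in the query
            cell = max(0, diag, up, left)  # zero floor makes it local
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best


def best_hit(query: str, database: dict[str, str]) -> tuple[str, int]:
    """Return the (name, score) of the highest-scoring database entry."""
    scores = {name: local_alignment_score(query, seq)
              for name, seq in database.items()}
    name = max(scores, key=scores.get)
    return name, scores[name]


if __name__ == "__main__":
    # Toy stand-in for a sequence database; names and sequences are made up.
    toy_db = {"seqA": "TTTTACGTTTTT", "seqB": "GGGGGGGG"}
    print(best_hit("ACGT", toy_db))
```

Scanning every entry exhaustively like this is far too slow for a database the size of GenBank, which is why heuristic tools such as BLAST are used in practice.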