Bioinformatics Flashcards
What kind of data is used for translational bioinformatics?
Sequences
Structures
Networks
What is the FASTA format?
It’s used for proteins and nucleic acids
Most common and standard sequence format
How is FASTA structured.
Comment line identified by > symbol
Sequence
Optional end sequence * symbol
What is FASTQ?
This format derived from FASTA and it is applied for the sequences generated from massive parallel sequencing
What does FASTQ contain?
Four lines:
- Starts with @ symbol, contain identifier and description
- raw sequence letters
- optional starts with + symbol and may contain the same description as 1
- encodes the quality values for each letter in line 2, must have an equal number of characters
What catalogues are used for bioinformatics?
NAR - nucleic acids research
Give integrated search engine that can be used
NCBI search
EBI
What are genome browsers?
System to navigate and visualise genomes and their annotations
Complex views using great amounts of information
Integrates phenotypical and molecular information
Allows customised views of different tracks
What tools are used during bioinformatics?
BLAST
basic local alignment search tool
What characteristics can BLAST look for?
Local alignments
Alignments with gaps
Rapid heuristic