Bioinformatics Flashcards
What are the most common bioinformatics formats, and what are their primary uses?
FASTA: Stores nucleotide or protein sequences. Used for sequence alignment and similarity searching.
FASTQ: Includes sequence data and quality scores from sequencing technologies. Essential for NGS analysis.
PDB (Protein Data Bank format): Represents 3D structures of biomolecules. Used in structural biology and molecular modeling.
GFF/GTF (General Feature Format/General Transfer Format): Annotates genome features. Used in genomics for visualizing genes, exons, and regulatory elements.
VCF (Variant Call Format): Describes genomic variants. Crucial for genetic studies, especially in identifying SNPs.
What are some essential bioinformatics databases
NCBI GenBank: Repository for genetic sequences.
UniProt: Comprehensive protein sequence and annotation resource.
Ensembl: Genome browser for vertebrates.
KEGG: Pathway database linking genes and molecules to biological functions.
What are essential bioinformatics tools ?
BLAST (Basic Local Alignment Search Tool): Identifies regions of similarity between sequences.
Clustal Omega: Performs multiple sequence alignments.
MAFFT: Efficient tool for multiple sequence alignment.
PhyML: Constructs phylogenetic trees.
How do you retrieve and analyze biomedical information using bioinformatics resources ?
Retrieving Sequences -
Use NCBI GenBank or Ensembl to download genetic or protein sequences.
Performing Sequence Alignments -
Input sequences into BLAST or Clustal Omega to find similarities or alignments.
Annotation:
Use InterProScan or Pfam to annotate functional regions in protein sequences.
Variant Analysis:
Import data into VCFtools or ANNOVAR for genetic variant analysis
What are the main applications of bioinformatics in biomedical sciences ?
Drug Discovery:
Predicting drug targets using protein-ligand docking and analyzing biomolecular interactions.
Personalized Medicine:
Identifying genetic variants linked to disease for tailored treatment strategies.
Gene Expression Analysis:
Studying transcriptomic data using tools like RNA-Seq.
Evolutionary Studies:
Understanding phylogenetics and evolutionary relationships through sequence data.
Disease Diagnosis:
Using machine learning models on omics data to predict disease states.