20.02.20 External databases / Bioinformatic resources Flashcards

1
Q

What are external bioinformatic databases?

A

Databases that store biological data.

2
Q

Types of bioinformatic databases

A
  • Genome/sequence
  • Gene expression
  • Transcriptomics
  • Proteomics
  • Epigenetic
3
Q

What two types of DNA databases are there?

A
  • Primary: contain experimentally derived data
  • Secondary: contain data produced from the analysis of primary data
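
As a revision aid, the primary/secondary split can be sketched as a small lookup table. This is only an illustration: the entries follow how this deck files each database, and the names `DATABASE_KIND` and `names_of_kind` are hypothetical helpers, not part of any real library.

```python
# Illustrative lookup table, assuming the classification used in these cards.
# "DDBJ" stands for the DNA Data Bank of Japan.
DATABASE_KIND = {
    "GenBank": "primary",    # experimentally derived sequence submissions
    "DDBJ": "primary",
    "OMIM": "secondary",     # derived from analysis of primary data
    "RefSeq": "secondary",
    "HapMap": "secondary",
}


def names_of_kind(kind):
    """Return the database names filed under the given kind, sorted A-Z."""
    return sorted(name for name, k in DATABASE_KIND.items() if k == kind)


if __name__ == "__main__":
    print("Primary:", names_of_kind("primary"))
    print("Secondary:", names_of_kind("secondary"))
```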

4
Q

Examples of Primary DNA databases

A
  • EMBL-EBI (European Molecular Biology Laboratory, European Bioinformatics Institute): Ensembl = genome browser; BLAST/BLAT = sequence search.
  • GenBank (NCBI, National Center for Biotechnology Information): contains DNA sequences from a group of sources (GenBank, RefSeq) for 300,000 organisms.
  • DNA Data Bank of Japan (DDBJ): nucleotide sequence and evolution data.
5
Q

Examples of secondary DNA databases

A
  • OMIM (Online Mendelian Inheritance in Man): genotype-phenotype information on Mendelian disorders.
  • RefSeq: annotated reference sequences for genomic, transcript and protein data.
  • 1000 Genomes Project: data are available on the Ensembl platform.
  • HapMap: map of haplotype regions and the SNPs within them.
6
Q

What are phastCons and phyloP scores?

A

Scores used to predict phylogenetic and evolutionary conservation. They are displayed in the UCSC Genome Browser and used by variant classification software such as Alamut.
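
A minimal sketch of how the two scores are usually read, offered as a memory aid only. It assumes the standard conventions: phastCons is a probability between 0 and 1 that a base lies in a conserved element, and phyloP is a signed score where positive values suggest conservation and negative values suggest faster-than-expected evolution. The function names are hypothetical and no significance threshold is implied.

```python
def is_valid_phastcons(score):
    """phastCons is a probability, so it must lie in the range 0 to 1."""
    return 0.0 <= score <= 1.0


def describe_phylop(score):
    """Interpret only the sign of a phyloP score.

    Positive: slower change than expected under neutral drift (conserved).
    Negative: faster change than expected (accelerated).
    """
    if score > 0:
        return "conserved"
    if score < 0:
        return "accelerated"
    return "neutral"


if __name__ == "__main__":
    # The values below are made-up inputs for illustration.
    print(is_valid_phastcons(0.98))
    print(describe_phylop(4.2))
    print(describe_phylop(-1.3))
```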

7
Q

What is Human Splicing Finder?

A

A tool that predicts the effects of mutations on splicing signals and identifies splicing motifs in any human sequence.

8
Q

What is gnomAD?

A
  • Genome Aggregation Database
  • Large population dataset from unrelated individuals: 125,748 exomes and 15,708 genomes (v2.1 release).
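
The cohort figures on this card can be checked with a little arithmetic. The sketch below sums the two counts and, assuming a diploid autosomal site genotyped in every individual (an idealisation; real sites have lower coverage), gives the maximum number of alleles that could be observed. `allele_frequency` is a hypothetical helper showing the usual count-over-total calculation, not a gnomAD API.

```python
EXOMES = 125_748   # figure from the card
GENOMES = 15_708   # figure from the card

total_individuals = EXOMES + GENOMES

# Assumption: autosomal site, two alleles per person, everyone genotyped.
max_allele_number = 2 * total_individuals


def allele_frequency(allele_count, allele_number):
    """Allele frequency = observed alternate alleles / alleles genotyped."""
    if allele_number <= 0:
        raise ValueError("allele_number must be positive")
    return allele_count / allele_number


if __name__ == "__main__":
    print("Individuals:", total_individuals)
    print("Maximum allele number:", max_allele_number)
    # Made-up example: a variant seen 5 times among 1,000 genotyped alleles.
    print("Example frequency:", allele_frequency(5, 1000))
```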

9
Q

What is Alamut Visual?

A

Software that integrates multiple datasets from different sources to allow user-friendly and efficient variant classification and genome interrogation.

10
Q

Examples of gene expression databases

A
  • ArrayExpress: archive of functional genomics data from microarray and sequencing platforms.
  • Human Protein Atlas: expression profiles of human protein-coding genes at the mRNA and protein level across multiple tissues.
11
Q

Examples of transcriptomic databases

A
  • miRBase (microRNA database), from the University of Manchester: published miRNA sequences and annotation.
  • Rfam: collection of RNA families, each represented by multiple sequence alignments and consensus secondary structures.
12
Q

Examples of protein sequence databases

A
  • DisProt: database of manually curated experimental evidence of protein disorder.
  • InterPro: provides functional analysis of proteins.
  • Pfam (EMBL-EBI): collection of protein families represented as multiple sequence alignments and hidden Markov models. Enables identification of protein domains.
  • UniProt and Swiss-Prot (EMBL-EBI): curated protein sequence information.
  • NCBI (National Center for Biotechnology Information): databases of nucleotides, genomes, SNPs and proteins.
  • Protein Data Bank (PDB): archive of macromolecular structure data (X-ray, NMR).
13
Q

Protein interaction databases

A
  • BioGRID (Biological General Repository for Interaction Datasets): archive of protein interaction data from model organisms and human studies.
  • RBPDB: database of RNA-binding protein specificities.
14
Q

What is The Cancer Genome Atlas (TCGA)?

A

Genomic, epigenomic, transcriptomic and proteomic data from over 20,000 primary cancer and matched normal samples spanning 33 cancer types.

15
Q

What is MethBase?

A

Reference methylomes from different organisms.
