Bioinformatics Lecture 1 Flashcards
DNA sequencing machine
DNA comes out in random order
meta genomics
data recovered directly form environmental sources
includes microbiome
approach / goals of bioinformatics
store, process, analyse, model , predict
biomarkers
measurable characteristics informative about a biological state
can be e.g. genes or metabolites
Kaplan Meier curve
for progressive diseases
for incomplete observations
estimates / predicts survival
sources of data in bioinformatics
clinical data
imaging
non-high throughput data
high throughput data
high throughput profiling
automated process
outputs many different types of biological data
Alzheimer’s progression
present one or two decades before symptom onset
treatment only effective if early
FTD
frontal temporal dementia
more rare
little known about etiology
TAU and TDP proteins play a role
mass spectronomy
identifies mass to charge ratio of ions
used to identify proteins
gene networks
co-regulated / co-expressed genes
entire pathway goes up or down
individual differences in protein expression
some are noise
some are natural variation unrelated to disease
some influenced by e.g. what people ate before
gene set enrichment analysis
method to analyse genes or proteins that are overrepresented in a large dataset
identifying genes that are regulated together
often related to disease phenotypes
finding out functions of proteins
either wikipedia
or a biobank
or BLAST
BLAST
extremely widely used
output: homologous protein sequnces
aligned to the query (input) sequence