HC 8 - Multi-omics: Integration & Interaction Networks Flashcards

hoorcollege 8

1
Q

Why multi-omics studies?

A

Because omics levels interact

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Challenge of multi-omics

A

Integration of multiple omics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why integration of multiple omics datasets?

A

> Describe parts of unknown system and effect after perturbation (alteration in function/ disruption)
find a good marker of system state or change
pinpoint a certain mechanism: which molecules are impotant for development or treatment of a disease in a mechnism
Systems biomedicine: holistic measures for preventive medicine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which kinds of measurements are important in systems biomedicine?

A

-Omics measurements
-diet, cardiovascular data
-Coaching sessions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

P4 participatory medicine

A

Systems medicine, big data and patient involvement leads to predictive, preventive, personalized and participatory medicine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Characteristics of systems medicine

A

-Pro-active patient involvement
-Earlier diagnose, earlier treatment
-Effective stratification: personal treatment (more effective)
-reduction of time, costs, and error margins

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Integration of datasets in systems biomedicine

A

Integration of personalized omics and clinical phenotyping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Synergy

A

Wet-lab experiments, bioinformatics and computational modeling
> communication between life scientists, medical doctors and computational scientist

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Challenges multi-omics integration

A
  • There are differences in the data
    > Different technical limitations
    > Different dynamic ranges
    > Different number of analytes
  • There are differences in time scales
    -Incongruent analytes
    -Unknown analytes
    -Missing links
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The differences of dynamic ranges between transcriptomics, and metabolomics in volcano plot

A

Much more significance for the RNA, because of higher dynamic range in sequencing than in massspec

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Differences in time scales: metabolomics, proteomics, transcriptomics and phosphoproteomics life spans

A

-Phosphoproteomics: 15-200 s
-Metabolomics: <1 min
Transcriptomics: 10 hr
Proteomics: 1 day

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Differences in time scale: order of pace/resp. time for metabolomics, proteomics, transcriptomics and phosphoproteomics

A

Phosphoproteomics: 15-200 s
Metabolomics 0.1 micros - 10 s
Transcriptomics: 10-100 nt / s
Proteomics: 10 aa/s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Problem with different time scales

A

When are you performing the measurement: when it is important for metabolomics bc of predicted difference? on that moment: is there interesting information about other omics?
> if you measure different time points, can you compare those

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Problem of different data

A

if you measure more transcripts than metabolites due to different technical limitations, dynamic ranges and amounts of analytes > problems with isoforms and difficult in statistic tests > which data are comparable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Incongruent analytes

A

If you got a protein, is this connectable to a specific isoform of a transcript?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Missing links

A

Which metabolite belongs to which enzyme?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Approaches systems biology

A

-Bottom up
-Top down

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Bottom up systems biology

A

-You use prior knowledge about the system and make a mathematical model based on this to learn something about the whole system
> e.g. metabolite reaction of the glycolysis enzymatic reaction: what happens to X,Y and Z when the input (glucose) is lower
> first less X then Y then Z until steady states: formulate differential equations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Bottom up with non-linear systems: the problem

A

> even simple systems may not allow us to make reliable predictions regarding their responses to stimuli > no obvious responses to changes in input
we cannot longer rely on intuition for predicting response of such systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Emergent properties

A

Systems properties that differ from the properties of the systems parts
> result from interactions
> self-organisation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Top-down systems biology

A

-Measuring omics
-Statistical method to find out what parts you measured for process of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Can top-down systems biology detect the non-linear properties and model them?

A

Yes, but that is easier to see based on bottom-up approach. In top-down, it is more complicated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Approaches multi-omics integration

A

-Using sequence information (top-down)
-Using purely (semi-) quantitative omics data (top-down)
-Using knowledge and omics data (bottom-up)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Multi-omics integration: using sequence information

A

-Based on steps of central dogma
-e.g. measure mRNA ratios and gene functions based on clusters with protein and transcript stability > correlation protein and mRNA amounts depend on function for the cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Integration based on quantitative data: what is quantitative data?

A

Something like an amount of molecules in a certain volume or per cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Semiquantitative

A

You know that there is more/less phosphorylation but not exact for example

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

A small proportion of the dataset from the quantitative omics approach is categorial data like:

A

-Conditions
-Experimental treatments
-State of health

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Integration based on quantitative data: common dimensionality reduction

A

Reduce features across samples for multiple omics to a single matrix by using factors like conditions, experimental treatments, health, age, organisation, tissue, time points and give weights for contribution of the features to the integrated matrix of factors against samples
> factors: the reduced dimensions from different dimensions
> integrate as samples against factors (like with PCA)

28
Q

The position of every dot in a factor dotplot of the samples is determined by

A

the interaction/connection of multiple omics levels
> 1 set value per sample
> which genes were important for placing dots: dependent on the weights on different genes

29
Q

Integration based on quantitative data: predictability

A

How good is 1 omics level in predicting the other
> e.g. effect microbiome on metabolome of host
> can the microbiome predict metabolites in the blood?
> microbiome ko mice in germ free environment
> measuring all microbes and analysis of every metabolite > use different regression for every metabolite with use of microbiome

30
Q

Why integration based on quantitative data?

A

-Pinpoint mechanism (what is the reason for disease) > not possible, a causal relationship cannot be measured
-Describe parts of unknown system (what happens after perturbation)
-Fund good marker for system state or change (what is specific for disease)

31
Q

What do you do with metabolites in blood which are well predictable by microbiome?

A

Check the function since they might be related to microbiome > bile acid metabolism

32
Q

What is a biological model?

A

A (mathematical) description or representation of a biological system
> falsifiable simplification, usually of some predictive value
> is not a statistical model

33
Q

Biological systems characteristics

A

-Non-linear
-Dynamic
-Have emergent properties
-Span scales (size, space, complexity, time)

34
Q

Bottom up systems biology uses

A

mechanistic models

35
Q

Integration based in quantitative data: association networks: what is association?

A

-Observed together
-Similar change (correlation)
-Other dependence/predictability (anti-correlation, mutual information)

36
Q

How to make an association network with different omics levels

A

-Make nodes of different omics variables (metabolites, genes etc)
-Make edges between nodes with association
-Label edges red or blue for negative / positive correlation

37
Q

What is mutual information

A

How much information of variable B rests in variable A

38
Q

Correlation graph and associations

A

Positive correlation: positive slope
Negative correlation: negative slope

39
Q

In association molecules, different molecules from different omics levels are compared …

A

separately

40
Q

Correlation can be unjustly. How to test for this?

A

Permutation test
> watch out for false positives

41
Q

What is a network?

A

-Consisting of objects with pairwise relationship
-Objects: nodes (NL: knopen)
-Connections: edges (NL: kanten)
-Nodes and edges can have attributes
-Mathematicans call a network a graph

42
Q

Why use networks

A

-Show interaction and test hypotheses
-Find out which objects show same patterns of behavior (delineate components of same interaction patterns)
-identify components with special interaction patterns
-maths needed, but less knowledge needed than for differential equations
-Visual
-intuition of a connected world (with hubs and centrality)

43
Q

Sorts of networks

A

-Static vs dynamic
> which nodes are close to each other and which distances change?
-Directed vs undirected
> do the edges have a direction like kinase for protein
> could be both ways
-Uni-, bi- or multipartite
> which types of nodes are in network (levels) and how are they connected?
-Weighted vs unweighted
> weights, labels

44
Q

Bipartite

A

For example TFs and DNA
> DNA can interact with TFs but not with other DNA
> TFs can interact with DNA but not with other TFs
-Only from other levels

45
Q

Unipartite

A

All levels can interact > all from the same group

46
Q

Weights/labels in networks

A

-Thicker edges for stronger interactions
-Labels: positive or negative interactions
> mathematically needed otherwise all edges are understood as equal importance

47
Q

Which network to choose?

A

Dependent on question: is direction important, or is that not possible (when measuring correlation: no direction)

48
Q

Ways of presenting a network?

A

-Edge list: table which shows one edge per row
> two columns of source node and sink node
-Adjacency matrix: table which shows pairwise interactions as binomial values.
> if directed, the rows are source nodes and the columns are sink nodes
> can be weighted: numbers larger than 1 for interaction

49
Q

What does a network tell us?

A

-Topological characteristics
> of network: connectedness, path length, how large is the number compared to whole number
> of node: centrality, degree (amount of edges), clustering
> of edge centrality, load
> of sets of nodes/edges: participation in paths, circles, modules, motif: amount of nodes in a structure

50
Q

Networks to search subnetworks containing …

A

-Many nodes of interest
-Relevant paths

51
Q

Correlation network can be used to compare ..

A

groups with disease or healthy for example

52
Q

Integration of multi-omics: use prior knowledge and omics data (bottom-up)

A

Knowledge-based integration
> show interaction between two omics levels with use of a model (of metabolism f.e.)
> you know that the mechanism of interaction exists but not how it changes

53
Q

How is the knowledge for knowledge-based interaction derived?

A

F.e. protein-protein interaction from small studies with specific interest of a specific TF and TFs with which it interacts

54
Q

Detection protein interaction with interactomics: how is the method called and how does it work?

A

Yeast-to-hybrid screen
> test if protein A interacts with protein B
> split a TF of a reporter gene in two parts (DB, AD), each bound by A and B (DB-A, B-AD)
> if interaction > normal function of the DB-AD transcription factor > expression reporter gene
-Performed in yeast
-Measure for 17,500 proteins as A and 17,500 as B

55
Q

How is the knowledge of transcription-factor targets derived?

A

-ChIP-seq
-promotor sequence
-small scale studies

56
Q

Where is he knowledge about phosphorylation and metabolism derived from

A

Small-scale studies > databases
+ text-mining + real reading

57
Q

Integration approach (bottom-up) first step

A

Analyses each dataset on its own and look for similar patterns
> identify problems: difference in technical limitations, dynamic ranges and amounts of analytes

58
Q

Integration approach bottom-up second step

A

Enrich analysis with prior information from ChIP-seq for transcription factor targets f.e.
> kinase and phosphatase targets
-Then: activity calculation

59
Q

Activity calculation based on connecting transcripts and protein phosphorylation

A

Which transcripts/kinase or phosphatase have which TFs connected
> activity difference between tumor and healthy > f.e. a kinase is more active in tumor sample
-Then build multi-omics networks of transcripts, protein phosphorylation and metabolites based on model of the signalling and metabolism.

60
Q

COSMOS network

A

multi-omics integrated network by COSMOS software
> some nodes are measured and some are from databases

61
Q

Analysing a network

A

-Smart algorithm: which possible paths have the largest possible effect or agree best with measurements
-Highest scoring subnetwork: subnetwork with best agreement
> like tumor suppressors, or purine metabolism
> forming of new hypothesis: are these nodes good markers for tumors f.e.?

62
Q

Validation of hypothesis based on network

A

Different multi-omics dataset needed
> do you find the same subnetworks?

63
Q

When the follow-up experiment is the investigation of a specific response, what needs to happen with the amount of replicates in the experimental design?

A

-Possibly less
> no need to adjust FDR (FDR is lower)
> less needed for statistical power
> targeted assay is more precise
-Possibly more
> because measurement is simpler, cheapre, quicker
> we may want to reach more certainty or be more general

64
Q

How can a study of microbiome as predictors for metabolites be validated? Microbe to metabolite

A

By growing microbes in a medium with heavy isotopes of carbon
> metabolomics with TOF
> identify labeled metabolites
> calculate ratio labelled/unlabelled
> compare colonized and control
- vice versa: inject mouse with isotope labelled threonine and analyse gut content
> you know the direction, the metabolites are derived from microbes

65
Q

Validation: microbial metabolite to effect

A

> find candidate gene in metagenome
express enzyme for microbial metabolite biosynthesis in lab-bacteria
extract metabolites from bacteria
check for presence of metabolite
check for effect

66
Q

Validation: screening for signalling

A

-Extract metabolites from microbiome
> fractionate
> determine metabolites
> incubate cell lines expressing receptor-reporters with metabolite fractions
> one cell line per receptor
> retrieve receptor-metabolite pairs

67
Q

Challenges for multi-omics integration: , which approaches?

A

-Differences in timng
> Sequence information
> Quantitative data
-Incongruent analytes
> Sequence information
> Knowledge-based
-Different technical limitations and biases, dynamic range, number of analytes
> Quantitative data
-Unknown analytes or missing links
> Knowledge-based