W7L2 system genomic Flashcards

Question 1

Q

breaking biological systems into single components

Answer

A

Biological systems are complex, consisting of different componenets that are individually more simple and similar to or share properties with each other
Complex systems dynamics is not easily understood by humans who need to “single out” the effect of each independent feature
-They exhibit emergent properties, whose attributes cannot always be understood through decomposition and reductionist approach
-

Question 2

Q

Modelling the biologicals properties from complex system

Answer

A

-Biological systems exhibit non-linear dynamics
Non-linear dynamics can emerge from the interaction of the multiple elements
Multiscale structure combining molecular, cellular, physiological and behavioural components
- not alway understood by the reductionist approach

Question 3

Q

basic information that one should consider for system genomic

Answer

A

complexity is not proportional to size: us highway can be easily understood but structure of a spider web is still unknown
-mutliscale response in a biological system and emergent properties

Question 4

Q

Properties of biological network: Patern of gene expression

Answer

A

Analysis of transcriptomic data, differential expression using RNA-seq
-identify differential expressed gene over a period of time
-separate into gene cluster that have similar expression
track how similar the expression between two gene is, building a network of co-expression

Question 5

Q

Properties of biological network: analysis of network, graph theory

Answer

A

-Pairwise differences/similarity in gene expression can be formally represented as adjacency matrices. (can by repesented visually in a network graph)
- Some core topology of the network include: the degree of a network: the number of edge connected to a node. Centrality: how important nodes and edges are for the connectivity of the network

Question 6

Q

different type of graph network

Answer

A

scale free network: lots of hub but low number of edge per node
-transivity: node are more internally connected into cluster

Question 7

Q

Properties of biological networks: multi omics data

Answer

A

-Framework can be integrative of multiple analytical technology metabolomics and proteomics, transcriptomic and genomics

Question 8

Q

descriptive analysis

Answer

A

using resources from public database: homogenised annotations for a great number of genes can be retrieved. These are gene ontology terms structured as trees
-these GO terms have been slimmed to GO-slim terms for a more comprehensive assessment

Question 9

Q

descriptive analyses: dimensionality reduction

Answer

A

-each molecule can be characterized by a large number of factor or features, potentially with complex effects.
-looking if we can represent the dataset on a low-dimensional space with a single metric, identify which variable drove the flattening (important gene)

Question 10

Q

Machine learning in system biology decision tree

Answer

A

-predictive modelling in systems biology uses a collection of supervised learning approaches to connect complex input data to a simple output
-mostly use decision trees to identify what class does a gene belong in. the final model is the average number of trees built using a dataset and drop in a subset of predictor.
-not sensitive to overfitting (too many info which lead to low statical power)

Question 11

Q

Machine learning model: deep learning

Answer

A

using artifical neural network, have an input layer with many hidden layer which transform the input. after the hidden layer is the output layer
-use to predict the state of output layer which is flexible and robust
-lacking biologicals specificity and perimeter interpretability

Question 12

Q

machine learning feature: developing feature

Answer

A

-development of very large number of feature that are used as predictors, desceiptive of the system
-overfitting is well handeled
-but correlation of features is a problem, ML work better with independent feature

Question 13

Q

Problem of ML

Answer

A

-No established framework for parameter tuning
-Ad hoc design (number of layer, convolution, pooling)
-multiple Optimisation criteria (leading to accuracy change, loss)

W7L2 system genomic Flashcards

(13 cards)