QBIO2001 Flashcards
Big data
What are biomarkers?
Measurable data signatures (for example molecular or clinical measurements) that are diagnostic of a person's biological state, such as the presence of a disease
What is GWAS?
Genome Wide Association Studies
A genome-wide association study is an approach that involves rapidly scanning markers across the complete sets of DNA, or genomes, of many people to find genetic variations associated with a particular disease
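As a rough illustration of "finding variations associated with a disease", the sketch below tests one marker at a time by comparing allele counts in cases and controls with a chi-square statistic. The SNP names and counts are made up, and the single-marker chi-square test is an assumed simplification, not the method used by any particular study.

```python
# Toy single-SNP association test: compare allele counts in cases vs controls.
# All names and counts are invented illustration values, not real data.

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic (no continuity correction) for the
    2x2 table [[a, b], [c, d]]; it has 1 degree of freedom."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one count")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)

# Hypothetical allele counts per SNP: (risk allele, other allele).
snps = {
    "snp_A": {"cases": (60, 40), "controls": (40, 60)},
    "snp_B": {"cases": (50, 50), "controls": (50, 50)},
}

CRITICAL_1DF_P05 = 3.841  # chi-square critical value, 1 d.f., p = 0.05

results = {}
for name, counts in snps.items():
    a, b = counts["cases"]
    c, d = counts["controls"]
    results[name] = chi_square_2x2(a, b, c, d)

for name, stat in results.items():
    flag = "associated" if stat > CRITICAL_1DF_P05 else "not associated"
    print(f"{name}: chi2 = {stat:.2f} ({flag} at p < 0.05, uncorrected)")
```

Because a real scan tests very many markers, a much stricter significance threshold than p < 0.05 is needed to correct for multiple testing.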
Why are some diseases not detectable by SNPs?
• Many diseases result from an interaction between genes and the environment, so SNPs alone cannot detect them
What is something in gene analysis that should be done in the future?
• Current gene analysis has no temporal component; genes, diseases and the environment need to be studied together, comprehensively and dynamically (over time)
• Studying the system when it is perturbed is useful
o Weaknesses of the system are exposed
o Connections between system and environment are figured out
Why are humans unreliable test subjects? What is the solution to this?
They can lie and may not follow the treatment properly
• This is why mice and other animals are used
What is an example of an experiment that had to be done with mice, and not humans due to the flaws of human experiments? What were problems with this study?
• For example, diet studies
o Mice on high fat western diet had:
Increased anxiety and laziness, and impaired short-term memory
Weak bone structure
High blood glucose
o Mice with calorie restriction
Lived longer
• However, the diet study is unreliable as all mice are genetically identical and belong to the same mouse strain
o Doesn’t take genetic diversity into account
o Calorie restriction can be good but can also be bad depending on genetics, which is why the previous experiment doesn’t work
o Fat, glucose and insulin response also depends on genes
What are the three major sub-species of laboratory mice?
o Laboratory mice are derived from three major sub-species
Mus musculus domesticus
Mus musculus musculus
Mus musculus castaneus
How is genetic diversity introduced in lab mice and why?
o The original (founder) populations of mice are genetically very different from each other
o A collection of strains is obtained by crossbreeding these populations
o This introduces genetic diversity into environmental experiments
o The combined gene + environment result can then be observed
What is the difference in what has to be studied in monogenic vs polygenic diseases?
Monogenic vs polygenic diseases
• Monogenic: a single SNP can predict the disease with high probability
• Polygenic: many genes contribute, so both the genes and the environment have to be studied
What are the different types of networks and a bit about them?
• Cell signaling networks
o Phosphorylation
o Kinase has to recognize substrate and bind to substrate
• Transcriptional networks
o Which gene is regulating expression of which gene
o Genes regulate each other and themselves
o All different transcripts change expression of other transcripts
o Transcription factors have to interact with each other to form protein complexes
o Gene regulatory networks
o Gene regulatory circuitry
• Protein-protein interactions networks
o Proteins interact together to function
• Metabolic networks
o Looking at metabolites
• And more
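All of the network types above can be written down as a graph of nodes and directed edges. The sketch below does this for a tiny transcriptional network in which genes regulate each other and themselves. The gene names and edges are hypothetical and only show the data structure.

```python
# A transcriptional network as a directed graph: regulator -> set of targets.
# Gene names and edges are hypothetical, for illustration only.
regulates = {
    "geneA": {"geneA", "geneB"},  # geneA regulates itself and geneB
    "geneB": {"geneC"},
    "geneC": {"geneA"},
}

def regulators_of(target, network):
    """Return the sorted list of genes that regulate `target`."""
    return sorted(src for src, targets in network.items() if target in targets)

# Genes that appear among their own targets regulate themselves.
self_regulating = sorted(g for g, targets in regulates.items() if g in targets)

print("Regulators of geneA:", regulators_of("geneA", regulates))
print("Self-regulating genes:", self_regulating)
```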
Talk about the insulin cell signaling network
• Cell signaling network- insulin
o Insulin receptor
Recognize and allow insulin to bind, which triggers signaling cascade
o IRS-1 is phosphorylated by the activated insulin receptor and relays the signal to downstream kinases (e.g. MEK1, MEK2, ERK)
o The kinases recognize their substrates by sequence motifs and phosphorylate them
o Kinases eventually control expression of the genes
o GLUT4: glucose transporter stored in vesicles that translocate to the cell membrane, where it brings glucose from the surface into the cell
If that pathway is broken, the cell is insulin insensitive (insulin resistant)
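The idea that a broken pathway makes the cell insulin insensitive can be sketched as a reachability question on a directed graph. The wiring below is a heavily simplified assumption (the kinases are collapsed into a single "kinase_cascade" node) and is not a complete model of insulin signalling.

```python
from collections import deque

# Simplified, illustrative wiring only; node names are placeholders.
pathway = {
    "insulin": ["insulin_receptor"],
    "insulin_receptor": ["IRS-1"],
    "IRS-1": ["kinase_cascade"],
    "kinase_cascade": ["gene_expression", "GLUT4_translocation"],
    "GLUT4_translocation": ["glucose_uptake"],
}

def reachable(graph, start, broken=()):
    """Breadth-first search: nodes the signal can reach from `start`,
    skipping any node listed in `broken`."""
    broken = set(broken)
    if start in broken:
        return set()
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, []):
            if nxt not in seen and nxt not in broken:
                seen.add(nxt)
                queue.append(nxt)
    return seen

uptake_when_intact = "glucose_uptake" in reachable(pathway, "insulin")
uptake_when_broken = "glucose_uptake" in reachable(
    pathway, "insulin", broken={"IRS-1"}
)

print("Intact pathway reaches glucose uptake:", uptake_when_intact)
print("Pathway with IRS-1 broken reaches glucose uptake:", uptake_when_broken)
```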
What are transcription factors and what can they do?
o Transcription factors are proteins that recognize and bind to specific DNA sequences called motifs
They allow the cell to differentiate
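A minimal sketch of motif recognition is to scan a DNA sequence for every occurrence of a short pattern. The promoter sequence is invented, and exact string matching is an assumed simplification: real binding motifs tolerate mismatches and are usually described with weighted models rather than a single string.

```python
def find_motif(sequence, motif):
    """Return 0-based start positions of every exact (possibly overlapping)
    occurrence of `motif` in `sequence`."""
    sequence, motif = sequence.upper(), motif.upper()
    width = len(motif)
    return [
        i
        for i in range(len(sequence) - width + 1)
        if sequence[i:i + width] == motif
    ]

# Invented promoter sequence; TATAAA is used only as a familiar example motif.
promoter = "GGTATAAAGCCTATAAATT"
hits = find_motif(promoter, "TATAAA")
print("Motif found at positions:", hits)
```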
What could transcription factors be used for in medicine?
Embryonic stem cells can differentiate into all kinds of cell types
• Could be used to generate tissues
• Can be studied using ChIP sequencing (ChIP-seq)
How are protein-protein interaction networks looked at?
o Physical interaction networks
Multiple proteins come together and physically attach to each other
o Cross-link different proteins
o Measured by using mass spectrometry
How are metabolic networks organised?
o Organized around the functions of cells and the metabolites that contribute to each function
What is DNA sequencing?
Determining the order of the bases (A, C, G, T) in the DNA of an organism, e.g. an animal or plant
What is transcriptome/RNA sequencing?
Read the sequence (letters) of the mRNA to find out which mRNAs are floating in the cell and how many copies of each there are; the copy number indicates how highly the gene is expressed
o Can measure transcriptomes: the transcription level of different genes
o Shows what combinations of genes are expressed in different cell types
o Genes can be expressed at different levels (cell-specific genes) or at similar levels (housekeeping genes: genes expressed in all cell types) in different cell types
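To make "how many copies there are" comparable between samples, read counts are usually scaled by sequencing depth. The sketch below uses counts per million (CPM) and then labels each gene as expressed at a similar level or as cell-type specific. The counts, gene names and the 2-fold cutoff are all assumptions chosen for illustration.

```python
# Made-up read counts per gene in two hypothetical cell types.
counts = {
    "liver": {"gene_hk": 500, "gene_liver": 400, "gene_neuron": 100},
    "neuron": {"gene_hk": 1000, "gene_liver": 0, "gene_neuron": 1000},
}

def counts_per_million(sample):
    """Scale raw counts so that each sample sums to one million."""
    total = sum(sample.values())
    return {gene: n * 1_000_000 / total for gene, n in sample.items()}

cpm = {cell: counts_per_million(sample) for cell, sample in counts.items()}

def classify(gene, cpm_table, max_fold=2.0):
    """Label a gene by how much its CPM differs between cell types.
    `max_fold` is an arbitrary illustrative cutoff."""
    levels = [cpm_table[cell][gene] for cell in cpm_table]
    low, high = min(levels), max(levels)
    if low > 0 and high / low <= max_fold:
        return "housekeeping-like"
    return "cell-specific"

labels = {gene: classify(gene, cpm) for gene in counts["liver"]}
for gene, label in labels.items():
    print(gene, "->", label)
```

Note that gene_hk has twice as many raw reads in the neuron sample, but the same CPM in both samples, because the neuron sample was sequenced twice as deeply.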
What is ChIP sequencing and how is it done?
o Measures the DNA sequences that a transcription factor binds to
o Shows where the transcription factor is binding and what DNA motifs it binds to
o The transcription factor regulates the gene to which it binds
Transcription factors are proteins
o If the sequences are aligned back to the genome, one can see exactly where the transcription factor binds
o There will be background noise in sequencing experiments; the real signal comes from very sharp peaks
o There is
Histogram of gene expression
Accessible Chromatin
Histone modifications
Transcription factor binding
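The "sharp peaks over background noise" idea can be sketched by piling aligned reads onto the genome and keeping positions where the read depth is high. The read coordinates and the fixed depth cutoff are assumptions for illustration; real peak callers use statistical models rather than a fixed cutoff.

```python
from collections import Counter

# Hypothetical aligned reads as half-open (start, end) intervals on one chromosome.
reads = [(0, 5), (10, 15), (11, 16), (12, 17), (12, 17), (13, 18), (25, 30)]

def coverage(read_list):
    """Count how many reads overlap each genomic position."""
    depth = Counter()
    for start, end in read_list:
        for pos in range(start, end):
            depth[pos] += 1
    return depth

def call_peaks(depth, min_depth):
    """Merge consecutive positions with depth >= min_depth into
    half-open (start, end) peak intervals."""
    positions = sorted(p for p, d in depth.items() if d >= min_depth)
    peaks = []
    for p in positions:
        if peaks and p == peaks[-1][1]:
            peaks[-1][1] = p + 1  # extend the current peak by one base
        else:
            peaks.append([p, p + 1])  # start a new peak
    return [tuple(peak) for peak in peaks]

depth = coverage(reads)
peaks = call_peaks(depth, min_depth=3)
print("Peaks:", peaks)
```

The isolated reads at 0 to 5 and 25 to 30 play the role of background noise and never reach the cutoff.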
What is a pathway database?
o Pathway Database: Computerize current knowledge of molecular and cellular biology in terms of the pathway of interacting molecules or genes
What is a genes database?
o Genes Database: Maintain gene catalogs of all sequenced organisms and link each gene product to a pathway component
What is a ligand database?
o Ligand Database: Organize a database of all chemical compounds in living cells and link each compound to a pathway component
What are pathway tools?
o Pathway Tools: Develop new bioinformatics technologies for functional genomics, such as pathway comparison, pathway reconstruction and pathway design
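The way the pathway, genes and ligand databases link to each other can be pictured as cross-referenced tables. The sketch below is a toy in-memory version with invented pathway contents. It is not the schema or API of any real database.

```python
# Toy cross-referenced tables: each pathway lists its genes and compounds.
# The contents are simplified illustration values.
pathways = {
    "insulin_signaling": {"genes": {"INSR", "IRS1"}, "compounds": {"glucose"}},
    "glycolysis": {"genes": {"HK1"}, "compounds": {"glucose", "pyruvate"}},
}

def pathways_for(item, kind):
    """Return the sorted names of pathways that contain `item`.
    `kind` is either "genes" or "compounds"."""
    return sorted(name for name, p in pathways.items() if item in p[kind])

def shared_compounds(name_a, name_b):
    """A very simple form of pathway comparison: compounds in both pathways."""
    return pathways[name_a]["compounds"] & pathways[name_b]["compounds"]

print(pathways_for("IRS1", "genes"))
print(pathways_for("glucose", "compounds"))
print(shared_compounds("insulin_signaling", "glycolysis"))
```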
What is clustering a typical procedure for?
Creating regulatory networks
How can you study protein phosphorylation?
Have control cells and test cells
Mix lysates 1:1
Enzymatic digestion
• Break proteins up into peptides so they can be passed through the mass spectrometer
Enrichment of phosphorylated peptides
nanoLC-MS/MS analysis
Lots of computation
Signalling dynamics graph
Do clustering
Once all phosphorylation sites have been measured, partition them into different patterns
K-means clustering is used to partition the phosphorylation sites into clusters that have different patterns
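The clustering step can be sketched with a small K-means implementation. Each phosphorylation site is represented by its profile over time points, and sites with similar profiles end up in the same cluster. The site names, the values and the choice of the first k points as starting centroids are assumptions for illustration; practical analyses normally use a library implementation with random restarts.

```python
import math

def kmeans(points, k, n_iter=20):
    """Plain K-means. The first k points are used as starting centroids,
    a deterministic simplification chosen for this sketch."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(v) / len(members) for v in zip(*members)]
    return labels, centroids

# Hypothetical phosphorylation profiles (test vs control) at four time points.
profiles = {
    "site_1": [0.0, 1.0, 2.0, 2.0],     # rising
    "site_2": [0.0, -1.0, -2.0, -2.0],  # falling
    "site_3": [0.1, 1.1, 1.9, 2.1],     # rising
    "site_4": [-0.1, -0.9, -2.1, -1.9], # falling
}

site_names = list(profiles)
labels, centroids = kmeans([profiles[s] for s in site_names], k=2)
for name, label in zip(site_names, labels):
    print(name, "-> cluster", label)
```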