Genome Diversity/Dynamics in Time - After Midterm Flashcards
Expression
- Expression is just the process of getting from the gene itself to the functional product
- This means expression involves transcription, translation, and protein folding
- Expression is dynamic, and changes over the lifetime of a cell
- Phenotype depends (in part) on what parts of the genome are expressed
Very Brief overview of Transcription in Prokaryotes
- There are sequences upstream of the gene called the -35 box and the -10 box that make up the promoter
- A protein? called the sigma factor binds to the boxes and recruits RNA polymerase, which then carries out transcription
Very Brief Overview of Transcription in Eukaryotes
- Eukaryotic promoters are very different, as they can be made-up of different recognition elements (but the TATA box is usually one of them)
- The recognition elements recruit transcription factors
- These transcription factors then help recruit additional transcription factors, which then recruit the RNA polymerase
RNA Dynamics
- The expressed part of the genome is in constant flux, so RNA will keep being produced as long as the machinery is present
- RNA will also degrade at a particular rate
- If the rates of RNA transcription and RNA degradation are perfectly balanced, there will be a steady amount of RNA in the cell
- the strength of the promoter will affect RNA production rates — different promoters with different sequences can have different binding affinities, with promoters with higher binding affinities being able to produce RNA at a faster rate
- regulatory elements either upstream or downstream of the binding sites can recruit other proteins that either increase or decrease transcription rates
Regulatory Elements
- “regulatory elements” refers to the full collection of transcription factors and their DNA binding sites
- proteins that increase transcription are called ‘activators,’ and proteins that decrease transcription are called “repressors”
- DNA sequences that promote transcription are called “enhancers,” and DNA sequences that decrease or prevent transcription are called “silencers”
Degradation of mRNA
- miRNA can do targeted degradation of mRNA
- miRNA “silences” these genes by cutting up their mRNA
- There are a variety of other components (transcription factors, miRNA, etc) that act together to influence the level of mRNA in the cell
Transcriptomics
- The study of RNA expression levels across the genome, making comparisons both within genome (between genes) and between samples (different phenotypes)
- Is a “holistic” approach, like all other -omics approaches
Combinatorial control
- the integration of many inputs to give an output
- a combination of genes, transcription factors, etc go into transcription rates
- transcriptomics is a good technique to use to measure combinatorial control since it offers a whole-genome perspective
Types of Plastic Responses to the Environment
Explain them and draw them out
Direct response: When the cell’s phenotype changes as a direct response to an environmental trigger
- An example is mechanosensitive channels in cells
Gene mediated response: cell phenotype changes after an environmental trigger turns (a) gene(s) off or on
Histone modification: a signal in the environment triggers histone modification to occur, which can change gene expression, and thus change protein levels
- a well known example is when acetylation of the lysine residue on the histones causes the DNA to be more loose (euchromatin)
Methylation: an environmental signal triggers the methylation of specific DNA residues to occur, which leads to a change in gene expression and thus a change in protein levels
- The methylation of cytosine promotes heterochromatin formation, and thus decreases DNA transcription
- Histone modification and methylation are considered to be epigenomic modifications
Siamese Cat Example: What type of plastic response is at play?
- In Siamese cats, their extremities tend to be black/darkly colored, and their bellies/undersides tend to be lighter
- The molecule responsible for the darker color is called melanin
- The pathway to produce melanin involves tyrosinase taking various chemical precursors and assembling them into melanin
- In siamese cats, their tyrosinase has a mutation in which the tyrosinase is temperature sensitive and can’t work at higher temperatures, and therefore can’t produce melanin in the warmer parts of the cat
This phenotype is due to a direct response, because tyrosinase responsible for the phenotype (melanin or no melanin) is responding directly to the temperature
Seasonal Coat Color Example: What type of plastic response is at play?
- Seasonal coat color change is due to a gene mediated response
- We can look at how different genes’ expression patterns differ when the fur is one color vs when the fur is another color to try and see which genes are responsible for this change, and what environmental cues they may be responding to
- One way we can measure gene expression levels is by doing RNA-seq
Seasonal Coat Color: RNA-seq
- RNA-seq has a few basic steps. First, we synthesize cDNA from RNA so we can sequence it later on. We then fragment the cDNA, attach illumina adapters to the fragments and sequence them, then align (match up) the fragments to the genome and look at how much RNA is present (after normalizing, of course).
- RPK and RPKM by themselves for a gene don’t tell us much, because we have to have something to compare to
- One comparison that can be done is to compare to a housekeeping gene
- A housekeeping gene is one that is responsible for cell maintenance and won’t change much if at all between different organisms(might have to be in same species though) or different time-points in a single organism’s life
- A more popular way to compare genes is to do a direct comparison
- A direct comparison involves using mean-centered expression levels
- mean-centered expression levels involve taking the average for that same gene at different time points in the same organism (or maybe for the same gene in different organisms?), calculating the mean, and the calculating the fold above or below for each sample (see 11/8 lecture for examples)
- This type of gene expression profiling is useful for comparing samples with identical genomes and with mutated genomes!
RPK
- Reads per kilobase
- RPK is how we normalize for gene length when looking at how much RNA is present in RNA-seq
- to calculate RPK, we use the following equation: # reads gene A/(length gene A/1000), which is the same as (# reads gene A / length gene A) x 1000
Things I should do practice problems on:
- Calculating RPK (11/6 - 11/8)
- Calculating RPKM (11/8)
- Heat map examples (11/8)
- Mean-Centered expression levels (11/8)
Done
RPKM
- Reads per kilobase per million reads sequenced
- RPKM is how we normalize for RNA sample size when doing RNA-seq
- The calculation is: RPK/total reads x 1,000,000
GO
- Gene Ontology
- It is a formal naming scheme that assigns functions to genes and clusters them together
Combinatorial Control vs Simple Control
- Simple control is when only one gene is responsible for a phenotype (I think?), and combinatorial control is when multiple things are working together/happening at once to give a particular phenotype
Coregulation
- When gene expression of different genes go up or down together
- Co-regulation is a hallmark of genes that are turned on (or off) by the same transcription factor
ChIP-seq
- Chromatin Immune Precipitation Sequencing
- Is used to tell where in the genome a protein of interest binds
- Is particularly useful for seeing where certain transcription factors bind, since the location of their binding can give insight into what genes they are regulating
ChIP-seq works in the following way: - You take your genome and cross-link the proteins to the DNA so they are covalently bound
- The genome is fragmented
- The fragments are then purified, usually be running over a column of some sort that has antibodies specifically for your protein of interest, so the DNA-protein complex of your protein of interest will stick to the beads while you wash everything out, and then you’ll elute your DNA-protein complex off
- The cross-linkages between the DNA and protein is then broken, and the remaining fragments can be sequenced and mapped to the genome
- Straight-up ChIP-seq data can’t really tell you if a gene is unregulated or not, because 1) it doesn’t tell you if the transcription factor is an activator or repressor, and 2) transcription factors don’t always bind directly in front of the gene they are acting on
- To overcome the first problem, we usually use ChIP-seq data with RNA-seq data to determine if up regulation or down-regulation is occurring
Post-It Note: Why are some potential reasons individuals/species evolved to even bother with gene mediated responses in the first place?
- To set a threshold for response
- To be able to conserve resources
- To be able to adjust the level of response
- To be able to integrate multiple signals and affect multiple genes (combinatorial response)
- To control timing
Histone Modification
- An epigenetic plastic response
- Typically has a longer-lasting affect than gene-mediated responses
- There are 8 histones that form a nucleosome, and each histone has a “tail;” these tails control how tightly the histones bind to the DNA
- When the tails are unmodified, they are positively charged and thus wrap tightly around the DNA, putting us in the heterochromatin state
- When the tails are acetylated, however, they become uncharged and they “let go”
- It should be noted that acetylation isn’t the only modification that can lead to this chromatin change, and that the acetylation that is being discussed isn’t the only acetylation that can cause this change
- In this example, it is the lysine residues in the tails that get acetylated, causing them to lose their positive charge
- Acetylation occurs by histone acetyltransferases (HATs)
- De-acetylation occurs by histone deacetylaces (HDACs_
- Since it is acetylation that leads to the DNA being more accessible, HAT promotes transcription and gene expression
- We can use ChIP-seq to target different histones to see which parts of the genome have histone modifications; we can even target specifically modified histones and not include un-modified histones
Types of chromatin
- Euchromatin: open DNA in which the DNA isn’t tightly bound to the histones and is available for transcription
- Heterochromatin: “closed” DNA in which the DNA is wound tightly around the histones and is not accessible for transcription
- Heterochromatin can be converted into euchromatin via histone acetyltransferases (HATs). The HATs acetylate the lysine residues on the tails of the histones, which causes them to become neutral and not have a tight of a hold on the DNA
- Euchromatin can be converted back to heterochromatin via histone deacetylases (HDACs). The HDACs remove the acetyl groups from the lysines on the histone tails, causing them to bind more tightly to the DNA since they have regained their positive charge
Methylation
- An epigenetic plastic response
- Methylation is longer-lasting than gene-mediated responses (and histone-modification?)
- Methylation is normally stable (unchanging) throughout and organism’s lifetime
- The methylation of C’s in the CpG islands that are adjacently upstream of 70% of genes represses the expression of those genes (these CpG islands are located in the promoters)
- ## The methylated cytosines in the CpG islands recruit HDACs, which promote heterochromatin formation and repress gene transcription
Bisulfite sequencing
- A technique used to locate the methylated cytosines in a genome
- Bisulfite sequencing takes advantage of bisulfite conversion: un-methylated (normal) cytosine can be converted into uracil via bisulfite conversion
- If a cytosine is methylated, however, this conversion doesn’t happen
- Bisulfite sequencing has the following steps:
- First, the DNA from the cell type of interest is subjected to bisulfite conversion
- Next, Illumina sequencing is done on the DNA
- Lastly, we compare our generated sequence to a reference, and look for locations in our sequenced genome that are C’s in the same locations that they are C’s in the reference genome. These spots are where the methylated cytosines are
- Un-methylated cytosines will show up as T’s in our Illumina sequencing, because they will have been converted into U’s via bisulfite conversion
development
- Changes in phenotype and the mechanism that carry them during the reproduction and growth of an organism
- these mechanisms often include change in gene expression that can be measured overtime and in space
division of labor
- labor is divided in two different ways in a developing organism, in the following order:
1) regional specification: setting up unique areas within the embryo that will eventually become different body parts
2) differentiation: the process of cells within particular regions becoming more specialized
The “-derms”
- ectoderm, endoderm, and mesoderm
- These are the “seed” cell types from which all specialized cell types arise
- ectoderm cells become things like skin cells, nerves, and pigment cells
- endoderm cells become things like lung cells, pancreas cells, and GI tract cells
- mesoderm cells become things like heart muscle cells and blood cells
- They are pluripotent stem cells
stem cells
Def: cells from which multiple cell types could “stem”
- Stem cells have different levels of “potency”, meaning different levels of the ability of which they can turn into there cell types.
- The highest degree of potency is totipotent embryonic stem cells
- The second highest degree of potency is pluripotent embryonic stem cells
- The third level of potency are multipotent stem cells
- The last level, which are the most differentiated, are the differentiated cells
- Over an organism’s development, their cells go from originally being totipotent to fully differentiated
- de-differentiation is the process of going back up the scale to a more potent cell type to produce induced stem cells
iPSCs
- induced pluripotent stem cells
- They are actually totipotent stem cells, meaning they can become any cell type (pluripotent stem cells are one of the three “-derm” cell types and are thus more limited in what they can be differentiated into)
- Scientists have been able to make iPSCs by taking a person’s skin cells and using either a chemical cocktail or specific gene transcription factors that can turn the cells back into totipotent embryonic stem cells
- The chemical cocktails target known regulators of differentiation
- Creating iPSCs is basically reprogramming your cells by “tricking” them into thinking they’re young again
- The obvious potential therapeutic usage of iPSCs is that they can be differentiated into different cell types, maybe even regenerate an entire new organ to be able to do an organ transplant
- To make sure the reprogramming worked to turn the cells into iPSCs, we can compare the epigenome of normal vs our artificial stem cells
- There are two reprogramming “failure types” that we can get:
1) retained “memory” of the differentiated cell, in which the expression pattern of our attempted iPSCs is similar to that of the somatic cells, or at least in some regions
2) induction specific aberrations (problems) - researchers want to ensure that the epigenome is normal before using the iPSCs for therapeutic use, since they could potentially be cancerous or cause other problems if not checked out first