Block 1 - genetics Flashcards
what factors regulate gene expression
TFs
chromatin and histone modification
nuclear organisation
what do ES differentiate into in the presence of low retinoic acid
cardiomyocytes
what do ES differentiate into in the presence of high retinoic acid `
neuronal cells
what are the marker genes for ES cells, cardiomyocytes and neurons and what are their functions
ES - OCT4 - TF for self renewal
cardiomyocytes - cardiac troponin C (TNNC) - binds calcium ions to activate muscle contraction
neurons - MAP2 - stabilises microtubules in the dendrite
what is quantitative/real time PCR used for
it is used to amplify and quantify the amount of a specific RNA in a sample
what are the steps in RT qPCR
- culture cells and treat them (e.g. retinoic acid)
- purify RNA from cells
- use RT to synthesise cDNA from RNA (RAN can’t be used in PCR). mRNA has poly A tail so we use oligo (dT) or random hexamer primers. Primers anneal and RT fills in cDNA which is then separated from RNA using alkali.
- PCR using cDNA
- fluorescent dye binds to DAN and fluoresces - fluorescence is measured and doubles with every PCR cycle
what material are required for RT qPCR
RT primers gene and reference gene dNTPs buffer high quality RNA PCR mix dye - SYBR fluorescence measuring machinery and computer for analysis Taq pol
what is semi-quantitative PCR
after a set number of PCR cycles an agarose gel electrophoresis is carried out and DNA is detected by staining
how does the fluorescence work in RT q-PCR
SYBR green fluorescent dye intercalates into dsDNA
geometric phase data is collected for accurate quantification of starting material
what is the Ct value
PCR cycle number where the fluorescence becomes greater than the threshold
how do we normalise quantification calculations
they are normalised to a reference gene (don’t change in the experiment) which is used to control for any experimental variability e.g. sample preparation, RNA isolation, RT efficiency, PCR set up and efficiency
give 3 examples of reference genes
B-actin
gadph
BIII-tubulin
describe the 3 main stages in transcription
- initiation - polymerase binds to the promoter, forming a TF bubble near the start site. RAN polymerase catalyses phosphodiester linkage of 2 initial rNTPs
- elongation - RNA pol advances 3’ –> 5’, melting the DNA duplex and adding rNTPs to the forming RNA (forming in 5’ –>3’ direction)
- termination - at the stop site the RAN pol releases completed RNA and dissociates form DNA
what are the 3 types of RNA, what is their function, what are they made by and what is their proportion
rRNA - made by RNA pol I - ensures translation accuracy - 80%
mRNA - made by RNA pol II - translated into peptide - 5%
tRNA - made by RNA pol III - brings amino acids to the ribosome - 15%
what is RNA pol II transcription initiation regulated by
TFs that bind to the promoter region in DNA and recruit coactivators and RNA pol II
name and describe 3 gene regulatory elements
promoter proximal elements - activators/repressors that bind to the promoter
enhancer elements - loop over and interact with the factors bound at the promoter
TATA box - DNA sequence that recruits basal/general TFs found at most pol II genes, they recruit and activate
TFs are modular and often act as ………..
They have ….. domains called …….
dimers
2
activation domain and DNA binding domain
what is the serum response factor
it is a TF that binds to the serum response element in the promoter
the core binding sites for the SRF are 4-12bp and often palindromic due to the TFs often being dimeric
what morphological change can happen to DNA when TFs bind
it can bend
how are TFs recruited
the TATA box recruits TFIID which recruits other basal or general TFs
what does TFIID consist of
TATA binding protein (TBP) and TBP associated factors (TAF)
what is the role of the pre-initiation complex
it positions RNA pol at the transcription start site and allows for phosphorylation of the polymerase c-terminal domain
what is the pre-initiation complex made of
TFs
mediator complex (coactivator)
mRNA
how is the pre-initiation complex formed
- sequence specific and basal TFs recruit coactivators e.g. the mediator complex which is required for nearly all RNA pol II transcribed genes
- the mediator complex acts as a bridge between TFs, basal TFs and the c-terminal domain of RNA pol II
describe 3 ways that TF activity can be regulated
- conformation alteration through ligand binding/phosphorylation (alter ability to bind DNA, interact with proteins, nuclear localisation)
- expression level alteration
- alter DNA binding site accessibility
how is the ELK1 TF activated
phosphorylation
how is the CREB TF activated
ligand activation
what are MAPK pathways and what do they drive
mitogen activated protein kinase pathways
- they drive proliferation or stress responses
what are the 3 basic steps of an MAPK pathway
- growth factors bind to cell surface receptors
- receptor activation –> phosphorylation cascade –> amplification of signal
- at the bottom of the cascade TFs get phosphorylated that drive transcription
what happens when MAPK are constitutively activated and where might this be seen
transcription is constitutively active so growth is continuous. this is often seen in oncogenic cancer mutations
what is mitogen
growth factor
EGF signalling leads to ……. phosphorylation and …. ……..…. ultimately leading to the formation of the PIC and transcription
ELK1
DNA binding
how do we produce antibody
we inject an adjuvant bound antigen into host animal and harvest antibodies from blood after immune response has been generated
what is SDS
an anionic detergent used to denature protein samples and coat them in a negative charge
on what basis are proteins separated in sds page
based on size
the sds page gel is vertical/horizontal
vertical
in a Coomassie stain of an sds gel highly expressed proteins have thicker/narrower bands
thicker
describe the process of western blotting
- transfer proteins from sds page gel onto membrane using electric current (blotting)
- incubate membrane with Ab specific for protein being studied
- secondary Ab bound to HRP enzyme is added which recognises the primary Ab and amplifies the signal
- HRP substrate added and light emitted is detected by CCD camera
what do western blots show
size of the band recognised
Ab specificity
relative amounts of proteins in samples
quality of protein in sample
what is GADPH
a loading control
what is immunohistochemistry and how quantitative is it
it provides spatial information and uses Abs to detect proteins in cultured cells or tissues often visualised using secondary Abs with fluorescent tags
it is semi quantitative
describe the steps in DNA folding
double helix beads on a string (nucleosomes) chromatin fibre of packed nucleosomes topologically associated domains interphase chromosome metaphase chromosome
what is a nucleosome
DAN wrapped around 8 histone proteins
there are 4 histones and there are 2 of each in each nucleosome
146bp of DNA is wrapped around the octamer twice
what are histone tails
they contact linker DNA and other nucleosomes and are important in chromatin folding and cofactor recruitment
in what 2 ways does chromatin regulate gene expression
folding controlled - (access to DNA)
platform for a range of post translational modifications - (accessibility and TF/RNApol recruitment)
what are the 3 core histone tail post translational modifications
acetylation
phosphorylation
methylation
why is histone tail modification of significance
the modifications bind cofactors which cause activation/folding/repression
describe histone tail acetylation
promotes transcription
occurs on lysine residues by HAT (using acetyl coA) which are recruited by TFs. we see hyperacetylation of histone n-terminal tails
describe histone tail deacetylation
inhibits transcription
occurs on lysine residues by HDAC which are recruited by repressive TFs and we see deacetylation in the TF vicinity
what are acetylated residues bound by (what is the reader)
bromodomain containing proteins (high affinity where multiple acetylated sites exist in proximity). coactivators with bromodomains promote binding of other TFs and the mediator complex leading to RNA pol II recruitment and pic formation
what is constitutive heterochromatin
chromatin that is always highly condensed into heterochromatin
where is constitutive heterochromatin found
centromeres and telomeres
what is DNA methylation
works with repressive histone modifications to condense and silence chromatin
describe histone tail methylation
lysine residues can be mono, di or tri methylated by HMT (uses s-adenosyl methionine) which can promote or suppress gene expression depending on the location
describe histone tail demethylation
KDM removes methyl groups from lysine residues
what reader binds methylated lysine residues
chromodomain containing proteins - can be associated with activation or repression
describe euchromatin
less condensed, active genes, gene promoters have active histone modifications, gene promoters are not methylated, replication occurs throughout the S phase, acetylated histones
describe constitutive heterochromatin
highly condensed, repressive histone modifications, methylated DNA, no meiotic recombination, replicated in late S phase, non-coding RNA important for centromeric chromatin
describe facultative heterochromatin
condensed, inactive genes, repressive histone modifications, methylated DNA (where silenced), replicated in later S phase, non-coding RNA may be involved in repressive regions
if genes are active is the chromatin in the form of euchromatin or heterochromatin
euchromatin
where are rRNA genes transcribed
nucleolus
what are lamin associated domains
heterochromatic regions of chromosomes associated with lamins of the nuclear membrane. most of the genes are silenced. genes move in and out of LADs as they are activated/repressed. LADs are replicated in late S phase
where are active domains located in the chromosome territory
towards the surface and enriched in interchromosomal contacts
where do LADs locate in the chromosome territory
towards the surface but they don’t have interchromosomal contacts but are enriched in long range intrachromosomal contacts
where do silent domains locate in the chromosome territory
internal positions and are enriched in intrachromosomal contacts
in what phase of the cell cycle do chromosomes organise into distinct territories
interphase
describe the process of sanger sequencing
- uses fluorescently labelled ddNTPs which don’t have a free 3’OH, mixed with dNTPs
- wherever DNA pol incorporates a ddNTP it wont be able to add any other nucleotides
- the DNA mix is run on a gel and each base is read as they separate according to size. DNA molecules are sequenced one at a time
what kind of sequencing was used in the human genome project
sanger sequencing
short regions cloned into plasmids and sequenced
describe the process of NGS and how it is different to sanger sequencing
- DNA molecules are amplified by PCR not by cloning into individual bacteria
- the product is spatially separated e.g. on beads, in an emulsion or on a slide
- the DNA templates are sequenced simultaneously in a massively parallel fashion
in NGS how is the incorporation of specific bases detected
fluorescent tags
release of H ions using semiconductor chip
blockage of ions through nanopore
describe the solexa approach of NGS
bind single DNA molecules to surface
amplify
visualise clusters that form with a camera
describe the sequencing by synthesis solexa/illumina ngs approach
bind single DNA molecules to surface, amplify, visualise clusters with camera
- add nucleotides and pol
- image array, remove label and terminator
- add fresh nucleotides and pol
- sequence clusters in parallel
- different colours –> different bases
how do we sequence when there is no reference genome available
de novo genome assembly - 35-100bp leads aligned with each other to produce a consensus genome
for this we need at least 10x coverage
what regions in particular are difficult to sequence
repetitive sequences are difficult to sequence
centromeres and telomeres are very difficult to sequence
how much coverage is required when sequencing to identify SNPs
10-30x
give 3 applications of NGS
- identify driver mutations for cancer and personalise treatment
- look for SNPs to study chronic disease therapy
- RNAseq - transcriptome analysis, epigenetic analysis
describe the process of RNA seq
- RNA isolation from sample
- cDNA amplification
- library preparation and sequencing
- data analysis and alignment to reference genome
what is higher through put RNA seq or RT qPCR
RNA seq - samples processed quicker
which sequencing technique would be most appropriate to quantify gene expression
RNA seq
give 3 applications for RNA seq
- identify up or down regulated genes using RPKM score and validate results using RT-qPCR or western blotting
- gene ontology - are altered gens associated with particular functions
- ingenuity pathway analysis - are altered genes associated with particular pathways
what is RNAi
regulatory mechanism of translation by small RNAs in eukaryotes
what is the difference between micron RNAs and small interfering RNAs
miRNA - inhibit mRNA translation
siRNA - induce mRNA degradation and inhibit mRNA translation
describe the process of RNAi function
- ds region of pre-miRNA or pre-siRNA is cut by dicer which recognises the stem structure and cleaves off the loop, releasing a 22bp RNA
- A ss miRNA or siRNA associates with proteins to form RISC
- RISC binds to cellular mRNA due to complementarity with the miRNA or siRNA within RISC
- siRNA has high complementarity –> mRNA degradation. miRNA has low complementarity –> no cleavage, translation inhibited
are miRNAs restricted to binding one mRNA
no - because mismatches are permitted
give 2 examples in biology where miRNAs are particularly important
development and differentiation
how are siRNA useful in research
they are used to KO proteins
what are some practical issues with RNAi
specificity - is it only targeting the gene of interest
interferon response - non specific cellular response to dsRNA
incomplete breakdown - reversible
delivery to humans - stability and quantity
describe the application of RNAi in Huntington’s disease
using an miRNA expression construct contained in an AAV vector and virus injected, mutant Htt RNA levels decreased with increasing time post infection as seen from RT qPCR normaqlised to PPIA
less Htt protein and less aggregates resulted
motor coordination and depressive phenotype improvements
miRNA bind mRNA and affects its stability and translation efficiency
what is Huntington’s disease and what causes it
it is caused by a single dominant allele
aggregation of mutant Htt results in damage to brain cells leading to gradual loss of coordination, mental ability decline and personality changes
polyglutamine tracts are toxic in some neurons and we get aggregates
describe the ASO clinical trial for Huntington’s
ASO injected intrathecally and targets an intron in the pre-mRNA of the Huntington transcript - treatment well tolerated - further trials to see if progression slows
what is a key difference between siRNAs and ASO
ASO can cross the membrane and enter the nucleus easily but siRNA can’t
how does the AAV virus work
infects dividing and non dividing cells
doesn’t replicate in humans
doesn’t integrate into genome (decrease cancer risk)
no pathogenic activity
limitation - only small DNA construct possible
what is CRISPR cas9
system of adaptive bacterial immunity used to defend against bacteriophage
what is cas9
programmable RNA guided DNA endonuclease
how does CRISPR cas9 work
we have a cas9 and a 98bp sgRNA. the protospacer RNA guides the cas9 to the corresponding sequence I the genome. cas9 cuts both strands of the genome. changing the protospacer redirects the cas9. the cleavage by cas9 leaves a ds break. a sequence of interest can then be inserted or the break is repaired naturally which can induce mutation.
what are the 2 mechanisms for fixing ds break from CRISPR cas9
non homologous end joining (NHEJ)
homology dependent repair (HDR)
what is the mechanism of NHEJ
stick ends back together, sometimes inducing mutations that result in a frameshift –> functional KO
can occur at any stage of the cell cycle
1. trimming of overhanging nucleotides to make blunt ends
2. ligase joins strands together
why might the HHEJ mutations be useful
in the lab for studying gene function but they are not useful for normal cells
what is the mechanism of HDR
can only occur in the G2 phase of the cell cycle
no errors are introduced
1. nucleolytic processing with nucleoprotein filaments attached at break to make overhang
2. search for the homologous region on the other chromosomal copy (overhang invades other copy of chromosome
3. joint molecule formation
4. strand elongation
5. base pairing with other damaged strand
6. gap filling and ligation
compare the stage of the cell cycle that NHEJ and HDR occur
NHEJ can occur at any stage of the cell cycle whereas HDR can only occur in the G2 phase
state 4 practical issues with CRISPRcas9
- off target cleavage
- delivery method - ex vivo vs in vivo
- NHEJ mutations are variable
- HDR deficient in non dividing cells and absent in terminally differentiated/non-dividing cells
describe the CRISPR cas9 application in Duchenne muscular dystrophy
exon skipping is a viable therapeutic approach as loss of some exon in the rod domain doesn’t affect protein function
they used CRISPR cas9 to cleave before and after the stop codon so its not read
delivered by AAV vector
RT-qPCR revealed edited alleles and some dystrophin was restored, improving the DMD phenotype
what is Duchenne muscular dystrophy
most severe and common type of muscular dystrophy
affects mainly males and is characterised by wasting away of muscle
caused by a range of mutations in the X linked dystrophin gene (severe phenotypes result from frameshift mutations)
death usually occurs by congestive heart failure