Genetics Flashcards
What regulate gene expression?
- Transcription Factors
- Chromatin and Histone modicifications
- Nuclear organisation and Heterochromatin
How can gene expression be regulated at the DNA, RNA and protein levels?
What do Embryonic Stem cells differentiate into in the presence of low retinoic acid?
Embryonic Stem cells differentiate into cardiomyocytes in the presence of low retinoic acid.
What do Embryonic Stem cells differentiate into in the presence of high retinoic acid?
Embryonic Stem cells differentiate into Neuronal cells in the presence of high retinoic acid.
What are the marker genes for Embryonic Stem cells, Cardiomyocytes and Neurons and what are their functions?
Embryonic Stem Cell Genetic Marker
OCT4 - A transcription factor for self-renewal.
Cardiomyocytes
Cardiac troponin C (TNNC1) binds Ca2+ to activate muscle contraction.
Neurons
MAP2 - stabalises microtubules in dendrites.
What is quantitative/real time PCR used for?
quantitative RT-PCR / real time RT-PCR
A technique for amplifying and quantifying the amount of a specific RNA present in your sample
- q = quantitative / real time
- RT = reverse transcriptase
- PCR = polymerase chain reaction
What are the steps to quantifying mRNA expression?
- Culture cells and treat them according to your experimental protocol
- Purify RNA from the cells
- Reverse transcriptase reaction to synthesise cDNA from RNA
- PCR: Polymerase Chain Reaction
- Fluorescent dye binds to DNA and fluoresces - fluorescence is measured and doubles with every cycle of PCR.
How is RNA converted into cDNA for PCR?
Reverse Transcriptase is used to synthesise cDNA from RNA as RNA cannot be used in PCR. mRNA has poly A tail so oligo(dT) or random hexamer primers are used. The primers anneal to the poly A tail and Reverse Transcriptase forms the cDNA which is then separated from the template RNA using alkali.
What are the steps involved in PCR?
What materials are required for RT qPCR?
- Reverse Transcriptase
- Oligo(dT) or random hexamer primers
- Gene and reference gene
- dNTPs
- buffer
- High quality RNA
- PCR mix
- Dye - SYBR
- Fluorescence measuring machinery and computer for analysis
- Taq polymerase
What is semi-quantitative PCR?
PCR is said to be Semi-quantitative when Agarose gel electrophoresis is conducted after a set number of cycles. Detected by DNA stain.
How does the fluorescence work in RT q-PCR?
SYBR green fluorescent dye intercalates into double stranded DNA products.
What is the Ct value?
The Ct value is the PCR cycle number at which fluorescence becomes greater than the threshold.
How do we normalise quantification calculations?
They are normalised to a reference gene (don’t change in the experiment) which is used to control for any experimental variability e.g. sample preparation, RNA isolation, RT efficiency, PCR set up and efficiency.
Give 3 examples of reference genes
- β-actin
- Gapdh
- βIII-tubulin
Describe the main stages of transcription
Initiation
- Polymerase binds to promoter sequence in duplex DNA. “Closed complex”.
- Polymerase melts duplex DNA near transcription start site, forming a trasncription bubble. “Open complex”.
- Polymerase catalyses phosphodiester linkage of two initial rNTPs.
Elongation
- Polymerase advances 3’ → 5’ down template strand, melting duplex DNA and adding rNTPs to growing RNA.
Termination
- At trascription stop site, polymerase releases completed RNA and dissociates from DNA.
What are the 3 types of RNA?
- ribosomal RNAs (rRNA), RNA polymerase I 80% of RNA
- messenger RNAs (mRNA), RNA polymerase II 5% of RNA
- transfer RNAs (tRNA), RNA polymerase III 15% of RNA
What is RNA pol II transcription initiation regulated by?
Transcription Factors that bind to specific DNA sequences and recruit coactivators and RNA polymerase II.
Name and describe 3 Gene regulatory elements
- Promoter proximal element - activators/repressors that bind to the promoter
- Enhancer elements - loop over and interact with the factors bound at the promoter
- TATA box - DNA sequence that recruits basal/general TFs found at most pol II genes, they recruit and activate
Transcription Factors are modular and often act as ……
They have ….. domains called …….
Transcription Factors are modular and often act as dimers
They have 2 domains called activation domain (AD) and DNA binding domain.
What is the serum response factor?
It is a Transcription Factor that binds to the serum response element in the promoter.
The core binding sites for the SRF are 4-12bp and often palindromic due to the TFs often being dimeric.
What morphological change can happen to DNA when Transcription Factors bind?
The morphological changes that can happen to DNA is that it can bend.
How are Transcription Factors recruited?
The TATA box recruits TFIID, which consists of TATA-binding protein (TBP) and TBP- associated factors (TAFs).
What does TFIID consist of?
TATA binding protein (TBP) and TBP associated factors (TAFs).
What is the role of the pre-initiation complex?
It positions RNA polymerase at the transcription start site and allows for phosphorylation of the polymerase C-terminal domain of RNA polymerase II.
What is the pre-initiation complex made of?
- Transcription Factors
- Mediator complex (coactivator)
- mRNA
How is the pre-initiation complex formed?
- Sequence specific and basal Transcription Factors recruit coactivators e.g. the mediator complex which is required for nearly all RNA polymerase II transcribed genes.
- The mediator complex acts as a bridge between Transcription Factors, basal Transcription Factors and the C-terminal domain of RNA polymerase II.
Describe 3 ways that Transcription Factor activity can be regulated.
1.
How is the ELK1 TF activated?
Phosphorylation-activated DNA binding.
How is the CREB Transcription Factor activated?
Ligand-activated DNA binding
What is the MAPK pathway?
- Growth factors bind to cell surface receptors
- Receptor activation sets up cascades of protein phosphorylation inside the cell, amplifying the signal.
- At the bottom of the cascade, transcription factors get phosphorylated and activated, driving transcription.
- MAPK signalling can drive proliferation, or stress responses
- Oncogenic mutations that constitutively activate MAPK signalling are common in cancer
What is Mitogen?
Mitogen is a growth hormone.
EGF signalling leads to ……. phosphorylation and ..…. ultimately leading to the formation of the PIC and transcription
EGF signalling leads to Elk1 phosphorylation and DNA binding, ultimately leading to the formation of the PIC and transcription.
How are antibodies produced?
- Antigen: purified protein or short peptide
- Antigen is fused to an adjuvant protein, which stumulates an immune reaction, and is injected into host animal (e.g. rabbit, mouse, goat)
- Host generates an immune response to the protein, and blood samples are taken a regular intervals (e.g. once a month) to see if the serum now contains antibodies to the protein of interest.
What is SDS-PAGE and how does it work?
SDS-PAGE is Sodium dodecyl sulphate - polyacrylamide gel electrophoresis
- Protein samples denatured with the anionic detergent
- SDS Proteins migrate towards the positive anode
How are proteins differentiated using SDS-PAGE?
Proteins are differentiated between on the basis of their respective molecular weights (kD).
In a Coomassie stain of a SDS-PAGE gel, highly expressed proteins have thicker/narrower bands
Select the correct description
In a Coomassie stain of a SDS-PAGE gel, highly expressed proteins have thicker bands.
What are the steps involved in Western Blotting?
Western Blotting
- Conduct SDS-PAGE on proteins.
- Transfer proteins to membrane using electric current (blotting)
- Incubate membrane with Antibody specific for protein being studied (usually raised in rabbit or mouse)
- Secondary Antibody bound to Horse Radish Peroxide enzyme is added which recognises the primary Antibody and amplifies the signal.
- HRP substrate added and light emitted is detected by CCD camera
What does Western Blotting reveal?
Western Blots can show:
- Size of band recognised by antibody
- Specificity of antibody
- Relative amounts of protein in samples
- Quality of protein sample
What is GAPDH?
GADPH is a loading control that shows the same amount of protein in each lane (every band will have the same thickness).
What is immunohistochemistry and how quantitative is it?
It provides spatial information and uses Antibodies to detect proteins in cultured cells or tissues. Often visualised using secondary Antibodies with fluorescent tags.
It is semi quantitative.
Describe the steps in DNA folding
- double helix
- beads on a string (nucleosomes)
- chromatin fibre of packed nucleosomes
- topologically associated domains
- interphase chromosome
- metaphase chromosome
What is the nucleosome?
The nucleosome is an octamer of core histone proteins: H2A, H2B, H3, H4 (2 of each) 146 bp of DNA is wrapped twice around the octamer.
What is the role of Histone tails?
- Contact linker DNA and other nucleosomes
- Are important for chromatin folding and co-factor recruitment
What are the two oppurtunities chromatin provide to regulate gene expression?
- The folding of the chromatin can be regulated, thus controlling the access of proteins to the DNA.
- The chromatin provides a platform for a range of posttranslational modifications that control DNA accessibility and transcription factor and RNA polymerase recruitment.
What are some Post-translational modification of core histone ‘tails’?
- Acetylation
- Phosphorylation
- Methylation
Why is histone tail modification of significance?
The modifications bind cofactors which cause activation/folding/repression.
Describe histone tail acetylation
- Promotes transcription
- Occurs on lysine residues by Histone Acetyl Transferase (using acetyl coA) which are recruited by Transcription Factors. We see hyperacetylation of histone n-terminal tails.
Describe histone tail deacetylation
Inhibits transcription
Occurs on lysine residues by Histone Deacetylase which are recruited by repressive Transcription Factors and we see deacetylation in the Transcription Factor vicinity.
What are acetylated residues bound by?
Bromodomain-containing proteins (high affinity where multiple acetylated sites exist in proximity). Coactivators with bromodomains promote binding of other Trancription Factors and the mediator complex leading to RNA polymerase II recruitment and pre-initiation complex formation.
What is constitutive heterochromatin?
Chromatin that is always highly condensed into heterochromatin
Where is constitutive heterochromatin found?
Centromeres and telomeres
What is DNA methylation?
Works with repressive histone modifications to condense and silence chromatin.
Describe histone tail methylation
Lysine residues can be mono, di or tri methylated by HMT (uses s-adenosyl methionine) which can promote or suppress gene expression depending on the location.
Describe histone tail demethylation.
KDM removes methyl groups from lysine residues and turn them into lysine once again.
What reader binds methylated lysine residues?
Chromodomain containing proteins can be associated with activation or repression
Where is Constitutive heterochromatin found?
Constitutive heterochromatin is found at centromeres and telomeres.
What are characteritistics of Constitutive Heterochromatin?
Highly condensed, repressive histone modifications, methylated DNA, no meiotic recombination, replicated in late S phase, non-coding RNA important for centromeric chromatin.
Describe characteriatics of facultative heterochromatin
Condensed, inactive genes, repressive histone modifications, methylated DNA (where silenced), replicated in later S phase, non-coding RNA may be involved in repressive regions.
If genes are active is the chromatin in the form of euchromatin or heterochromatin?
If genes are active the chromatin is in the form of euchromatin.
What are centromeres?
Region where spindle fibres attach and pull apart chromatids during mitosis. They are repetitive sequences; their chromatin contains specialised histones. The chromatin are also always very highly condensed into heterochromatin.
What are telomeres?
They are essentially chromasome ends. They consist of DNA repeats, maintained during replication by telomerase. The chromatin structure is always very highly condensed into heterochromatin. Telomere length decreases as an organism ages, except in stem cells.
Where are rRNA genes transcribed?
rRNA genes are transcribed in the nucleolus.
Where are Lamin Associated Domains located?
Lamin-associated domains (LAD) are located at the nuclear periphery.
What are Lamin-associated Domains?
- Heterochromatic regions of chromosomes
- Associated with lamins of the nuclear membrane
- Most genes in LADs are silenced
- LADs are replicated late in S phase
- Genes can move in and out of LADs as they are activated or repres
Where are active domains located in the chromosome territory?
Towards the surface and enriched in interchromosomal contacts.
Where do LADs locate in the chromosome territory?
Lamin - associated domains locate towards the surface but they don’t have interchromosomal contacts but are enriched in long range intrachromosomal contacts.
Where do silent domains locate in the chromosome territory?
Internal positions and are enriched in intrachromosomal contacts
Describe euchromatin
Less condensed, active genes, gene promoters have active histone modifications, gene promoters are not methylated, replication occurs throughout the S phase, acetylated histones
In what phase of the cell cycle do chromosomes organise into distinct territories?
Chromosomes organise into distinct territories in interphase.
Describe the process of sanger sequencing
- Uses fluorescently labelled dNTPs which don’t have a free 3’OH, mixed with dNTPs
- Wherever DNA polymerase incorporates a dNTP it wont be able to add any other nucleotides
- The DNA mix is run on a gel and each base is read as they separate according to size. DNA molecules are sequenced one at a time
What kind of sequencing was used in the human genome project?
Sanger sequencing was used in the human genome project. Short DNA regions were cloned into plasmids and sequenced accordingly.
Describe the process of Next Generation Sequencing/High throughput sequencing and how it is different to sanger sequencing
- DNA molecules within the library amplified by PCR, not by cloning individually into bacteria
- Amplified DNA templates are spatially segregated: eg on beads, in an emulsion, or on a slide.
- DNA templates are sequenced simultaneously in a massively parallel fashion
In Next Generation Sequencing how is the incorporation of specific bases detected?
Incorporation of specific bases can be detected in various ways, eg:
- By fluorescent tags using a camera
- By release of H ions using a semiconductor chip
- By how they block the flow of ions through a nanopore
Describe the solexa approach of Next Generation Sequencing
Describe the sequencing by synthesis Solexa/Ilumina Next Generation Sequencing approach
- Bind single DNA molecules to surface, amplify, visualise clusters with camera
- Add nucleotides and DNA poylmerase
- Image array, remove label and terminator
- Add fresh nucleotides and DNA polymerase
- Sequence clusters in parallel
Different colours → Different bases
How do we sequence when there is no reference genome available?
- 35 – 100 bp reads are aligned with each other to produce a consensus genome sequence.
- Need at least 10x coverage. Repetitive regions are harder to sequence.
What regions in particular are difficult to sequence in De novo genome assembly?
Repetitive sequences are difficult to sequence. Thus, centromeres and telomeres are very difficult to sequence.
How much coverage is required when sequencing to identify SNPs?
10-30x coverage is needed to identify SNPs.
Give 3 applications of Next Generation Sequencing
- Identify driver mutations for cancer and personalise treatment
- Look for SNPs to study chronic disease therapy
- RNAseq - transcriptome analysis, epigenetic analysis
Describe the process of RNA seq
- RNA isolation from sample
- cDNA amplification
- Library preparation and sequencing
- Data analysis and alignment to reference genome
What sequencing method has a higher throughput? RNA-seq or RT-qPCR?
RNA seq has a higher throughput as the samples are processed quicker.
Which sequencing technique would be most appropriate to quantify gene expression?
RNA seq is the most appropriate DNA sequencing method to quantify gene expression.
Give 3 applications of RNA-seq data
- Identify up or down regulated genes using RPKM score and validate results using RT-qPCR or western blotting
- Gene ontology - are altered gens associated with particular functions
- Ingenuity pathway analysis - are altered genes associated with particular pathways e.g. MAPK signalling, Insulin signalling etc.
What is the RPKM score?
RPKM score: Reads Per Kilobase of transcript per Million mapped reads
What is RNAi?
Regulatory mechanism of translation by small RNAs in eukaryotes.
What is the difference between micron RNAs and small interfering RNAs?
- miRNA inhibit mRNA translation
- siRNA induce mRNA degradation and inhibit mRNA translation
Describe the process of RNAi function
- double stranded region of pre-miRNA or pre-siRNA is cut by dicer which recognises the stem structure and cleaves off the loop, releasing a 22bp RNA
- A single stranded miRNA or siRNA associates with proteins to form RISC
- RISC binds to cellular mRNA due to complementarity with the miRNA or siRNA within RISC
- siRNA has high complementarity → mRNA degradation. miRNA has low complementarity → no cleavage, translation inhibited
Are miRNAs restricted to binding one mRNA?
What are miRNAs important for?
miRNAs are important for development and differentiation.
How are siRNA useful in research?
siRNA are used to knockout proteins in research.
What are some practical issues with RNA interference?
- Specificity – is siRNA only targeting the gene of interest?
- Interferon response – a non-specific cellular response to dsRNA Need to use negative control siRNA, and repeat with different siRNA triggers.
- Incomplete knockdown; Reversible
- Delivery in humans – how to stably get the siRNA to the correct location?
Describe the application of RNAi in Huntington’s disease
- Using an miRNA expression construct contained in an Adeno-Aociated Virus vector and virus injected, mutant Htt RNA levels decreased with increasing time post infection as seen from RT-qPCR normalised to PPIA
- Less Htt protein and less aggregates resulted
- Motor coordination and depressive phenotype improvements
- miRNA bind mRNA and affects its stability and translation efficiency
What is Huntington’s disease and what causes it?
- It is caused by a single dominant allele
- Aggregation of mutant Htt results in damage to brain cells leading to gradual loss of coordination, mental ability decline and personality changes
- Polyglutamine tracts are toxic in some neurons and we get aggregates
Decribe the Phase I Clinical trial of IONIS-HTTRx
- Antisense DNA oligonucleotide (ASO) targeting an intron in the pre-mRNA of the Huntingtin transcript. DNA-RNA hybrid digested by Rnase H
- Injected intrathecally (as ASOs do not cross the blood-brain barrier), once every 4 weeks, on 4 occasions
- The treatment was well tolerated, further trials are needed to see if progression slows.
What is a key difference between siRNAs and ASO?
Unlike siRNA molecules, ASOs can cross cell membranes and enter the nucleus.
How does the AAV virus work?
The Adeno-Associated Virus
- Infects dividing and non-dividing cells.
- Doesn’t replicate in human cells
- Version used for gene therapy doesn’t integrate into the genome (reduced cancer risk)
- No apparent pathogenic activity
- Only small DNA constructs possible.
What is CRISPR Cas 9?
- CRISPR: clustered regularly interspaced short palindromic repeat
- Cas: CRISPR associated genes
- It is a system of adaptive bacterial immunity used to defend against bacteriophage
What is cas9?
Programmable RNA guided DNA endonuclease.
How does CRISPR cas9 work?
We have a Cas9 and a 98bp sgRNA. the protospacer RNA guides the Cas9 to the corresponding sequence in the genome. Cas9 cuts both strands of the genome. Changing the protospacer redirects the Cas9. The cleavage by Cas9 leaves a double stranded break. A sequence of interest can then be inserted or the break is repaired naturally which can induce mutation.
What are the 2 mechanisms for fixing a double stranded break from CRISPR Cas9?
- Non homologous end joining (NHEJ)
- Homology dependent repair (HDR)
What is the mechanism of Non Homologous End Joining?
- DNA-PK bound to KU80/KU70 heterodimers. The heterodimers bind to the sites of the double stranded DNA break. In doing so, trim thwe overhanging nucleotides to make blunt ends.
- The ligase then joins the seperated DNA strands together and remove the DNA-PK and KU80/KU70 heterodimers.
Why might the NHEJ mutations be useful?
In the lab for studying gene function but they are not useful for normal cells.
What is the mechanism of Homology Directed Repair?
Can only occur in the G2 phase of the cell cycle
- No errors are introduced
- Nucleolytic processing with nucleoprotein filaments attached at break to make overhang
- Search for the homologous region on the other chromosomal copy (overhang invades other copy of chromosome
- Joint molecule formation
- Strand elongation
- Base pairing with other damaged strand
- Gap filling and ligation
Compare the stages of the cell cycle that Non Homologous End Joining and Homology Directed Repair can occur in.
NHEJ can occur at any stage of the cell cycle whereas HDR can only occur in the G2 phase.
State 4 practical issues with CRISPR Cas9
- Off target cleavage
- Delivery method; ex vivo versus in vivo approaches
- NHEJ mutations variable
- HDR inefficient in dividing cells, absent (?) in terminally differentiated, non-dividing cells