lecture 3 - nuclear organisation and next generation sequencing Flashcards

1
Q

where is Constitutive heterochromatin found ?

A

at centromeres and telomeres

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what are centromeres?

A

Region where spindle fibres attach and pull apart chromatids during mitosis.
Repetitive sequences
Chromatin contains specialised histones
Chromatin is always very highly condensed into heterochromatin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are telomeres?

A

Chromosome ends
DNA repeats, maintained during replication by telomerase
Telomere length decreases as an organism ages, except in stem cells
Chromatin structure is always very highly condensed into heterochromatin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is Constitutive Heterochromatin?

A

Highly Condensed chromatin
Centromeric and telomeric repeats
Repressive histone modifications
Methylated DNA
No meiotic recombination
Replicated late in S phase
Non-coding RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is Facultative Heterochromatin?

A

Condensed chromatin
Inactive genes
Repressive histone modifications
Methylated DNA
Replicated later in S phase
Non-coding RNA?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is euchromatin?

A

Less condensed
Active genes
Gene promoters have active histone modifications
Gene promoters not DNA methylated
Replicated throughout S phase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is DNA methylation?

A

DNA methylation works with repressive histone modifications to condense and silence chromatin.

Approximately 80% of CpG dinucleotides in human somatic cells are subject to cytosine methylation.

CG dinucleotides are methylated on both strands.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

where are Lamin-associated domains (LAD) located and what are they?

A

located at the nuclear periphery
LADs are associated with lamins of the nuclear membrane
LADs are heterochromatin
Contain few genes, and these are silenced
LADs are replicated late in S phase
Some LADs vary depending on cell type

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

when do Chromosomes localise?

A

to distinct territories in interphase
Each chromosome is labelled with a different fluorescent dye

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what tends to be true for euchromatin?

A

-Decondensed
-Acetylated histones
-Active genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are types of sequencing technology?

A

Individual clones of DNA molecules: Sanger sequencing

Whole genome: Next generation sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is Sanger sequencing?

A

PCR containing fluorescent, chain termination dideoxynucleotide triphosphates

Current Sanger sequencing sequencing technology uses fluorescently-labelled ddNTPs (dideoxynucleotide triphosphosphates) which do not have a free 3’ OH, mixed in with dNTPS. Whenever the DNA polymerase incorporates a ddNTP it won’t be able to add any other nucleotides.

Each sequencing reaction uses one PCR sample
Sequence length is 600-1300 bp
Sequence is typically obtained within 2 days, analysis is easy
One reaction costs around £6
Each sequencing reaction uses one PCR sample
Sequence length is 600-1300 bp
Sequence is typically obtained within 2 days, analysis is easy
One reaction costs around £6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is the human genome project?

A

Thousands of short regions of the genome cloned into plasmids
Sanger sequencing of each cloned sequence
Many labs around the world combined their data
First draft of the human genome released in 2001
Took 10 years
Cost $3,000,000,000

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is Next generation sequencing / High throughout sequencing / Massively parallel sequencing?

A

Latest technology (Illumina HiSeqX)

Sequence of a whole genome or transcriptome obtained in one reaction
Sample prep, sequencing and bioinformatic analysis can take less than one week, but typically takes 2-3 months
Costs can be as low as £1000 for full sequencing of a mammalian genome
Costs for RNAseq ~£400 per sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is High throughput sequencing ? -
several technologies, but some principles in common

A

DNA molecules within the library amplified by PCR, not by cloning individually into bacteria

Amplified DNA templates are spatially segregated: eg on beads, in an emulsion, or on a slide.

DNA templates are sequenced simultaneously in a massively parallel fashion

Incorporation of specific bases can be detected in various ways, eg:
by fluorescent tags using a camera
by release of H ions using a semiconductor chip
by how they block the flow of ions through a nanopore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is De novo genome assembly?

A

35 – 100 bp reads are aligned with each other to produce a consensus genome sequence.
Need at least 10x coverage. Repetitive regions are harder to sequence.
Centromeres and telomeres extremely hard to sequence.

17
Q

how do we Identify sequence variants

A

mutations or SNPs – single nucleotide polymorphism

18
Q

Which of these questions would you use NGS (next generation sequencing) for?

A

What is the genome sequence of the organism I’ve working with?
What mutations are present in this patient’s cancer?

19
Q

What can we do with RNAseq data?

A

What was your hypothesis?

Which genes are upregulated or downregulated?

Gene ontology - are the altered genes associated with particular functions?
eg. Neuronal function; glucose metabolism, gametogenesis

Pathway analysis – are the altered genes associated with particular pathways?
eg. MAPK signalling, insulin signalling

20
Q

what is gene ontology used for?

A

Here, GO was used to identify gene classes containing an overrepresentation of genes that were differentially expressed in ES cells induced to undergo neurogenesis.

21
Q

what is the Ingenuity Pathway Analysis?

A

In this example, IPA was used on RNAseq data to elucidate a network of transcription factors that mediate neurogenesis

22
Q

what questions would you use RNAseq for? What other techniques might be better?

A

What genes are expressed when ES cells differentiate into cardiomyocytes?

Neural stem cells die when treated with tazemetostat (inhibits lysine methylation). Which genes are affected by this treatment?

23
Q

what are the applications of NGS?

A

Mutations, Translocations duplications –> cancer

Somatic variants- single nucleotide polymorphisms (SNPs) –>
Genetic diseases - Genome-wide association studies (GWAS) for chronic disease

RNAseq – transcriptome analysis - model organisms –> epigenetic analysis

24
Q

what is RNAseq analysis?

A

expressed exons are revealed by alignment of NGS reads to the reference genome

25
Q

how do we Identify genes that are significantly up- or down-regulated?

A

Validate results from RNAseq using conventional methods such as qRT-PCR and western blotting

26
Q

what is the Solexa technical approach?

A

1 - bind single DNA molecules to surface
2 - amplify
3 - DNA clusters about 500 copies per cluster and each about 1 micron in diameter

27
Q

what is RNA-SEQ?

A

RNA isolation (Experimental or patient samples) –> cDNA amplification –> library prep –> sequencing (Need around 40 million reads for human RNAseq)–> data analysis (Align to reference genome. Compare results between samples, or with previously generated data)