Genome Sequencing Technologies Flashcards

1
Q

How does sanger dideoxy sequencing show the position of bases on a gel?

A

dATPs can do chain extension and ddATPs are terminator nucleotides as there is an OH group missing. This cuts DNA strands short and can be used to show positions of bases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How does dye-terminator sequencing show the positions of bases?

A

Fluorescently-labelled ddNTPs incorporated into DNA strands. This is done in a capillary tube

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How were DNA fragments broken up for sanger sequencing? How large were the fragments before putting into bacteria?

A

Sonication

100-220kb to be cloned in bacterial artificial chromosomes (BACs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How are DNA fragments in BACs mapped?

A

800bp fragments sequenced and put into the library

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What sort of sequencing did IHGSC use?

A

Clone-by-clone sequencing (less complex assembly than shotgun)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What sort of sequencing did Celera use

A

Shotgun sequencing with an automated ABI 3700

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How was massively parallel sequencing a step up from sanger sequencing?

A

Multiple molecules could be sequenced at the same time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What sort of sequencing does Illumina sequencing use?

A

Massively Parallel Sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How many reads can Illumina sequencing produce in 1 run?

A

8 billion
125bp max read length

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What steps are involved in Illumina sequncing?

A
  1. Sample Step
  2. Cluster Generation
  3. Sequencing
  4. Data Analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Name what is added to ends of DNA in the sample step of Illumina Sequencing?

A

Adaptor regions
Motifs added through reduced cycle amplification, e.g sequence binding site, indices, oligonucleotide complimentary regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What happens during cluster generation in Illumina sequencing?

A

DNA fragments isothermally amplified
oligos added and hybridise with the lawn adaptors
Polymerase synthesises a complimentary strand, double strand denatures, the original strand is washed away. The complimentary strand is amplified through bridge amplification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Outline bridge amplification

A

The complimentary strand folds over and hybridises a to the second type of oligo. Polymerase generates a complimentary strand which denatures and forms separate strands. Reverse strands are washed off and prime ends are blocked.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does sequencing with Illumina work?

A

Reversible terminator nucleotides prevent chain extension and have fluorescent nucleotides which correspond to a nucleotide.
500bp fragments are selected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What types of third generation sequencing are there?

A

PacBioSMRT
oxford nanopore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Outline the developments of third generation sequencing over Illumina

A

Single molecule sequencing- no amplification needed
Real time sequencing
Ultra-long read lengths
Can identify base modifications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Describe Zero-Mode Waveguides

A

Flow cell surface with nanowells where fluorescence can be detected from a single base.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How are nucleotides detected in PacBio?

A

The phosphate group of nucleotides are fluorescently labelled. The intensity gives a pulse allocated to their nuceotide, e.g “G” pulse in a zero wave guide

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How are circular consensus sequences produced in PacBio?

A

SMRT-bell adaptors are a single DNA loop and a primer attaches to the adaptor and goes around the entire loop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What can be used to sequence hard-to sequencing areas of a genome

A

Oxford Nanopore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the motor proteins used in Oxford nanopore?

A

Polymerase or Helicase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Give a definition of bioinformatics

A

The science of collecting and analysing complex biological data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is it called when two genetic sequences are matching, e.g 70%

A

70% identify

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How do homology and identify differ

A

Homology is having the same relation and evolutionary origin, but not function. This is absolute. E.g, mammals having the same bones in limbs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

When aligning base sequences, which considerations can maximize a match?

A

Deletions
Insertions
Inversions
Reverse strands

26
Q

How can nucleotide alignments be scored?

A

Scoring matrices, such as PAM70
Matches- positive
Gaps- negative
Extending gaps- lower penalty
Mismatches - Negative

27
Q

How do global and local alignments differ?

A

Global alignments assume the sequences are homologous and every position is conserved
Local alignments just match parts that match very well

28
Q

Why is genome assembly De Novo?

A

You don’t know whether what is being sequenced is correct

29
Q

Outline 5 challenges of genome assembly (will allow comparisons to jigsaws)

A

De Novo assembly
Repeats
Coverage bias (missing pieces)
Contamination
Replication meaning multiple genomes may be sequenced at once
Circular chromosomes
Sequencing errors (broken pieces)

30
Q

What do De Bruijn graphs do?

A

Line up overlapping sequences to find consensus sequences

31
Q

How can repeats be dealt with when sequencing a genome?

A

Resolving bubbles
Read pairs
Increasing chamer repeats to make de Bruijn graphs smaller, making sequences less likely to repeat

32
Q

What are the longest open reading frames in genomes likely to be?

A

Genes

33
Q

How is gene finding made easier?

A

Neural networks

34
Q

What can IGV (integrative genomics viewer) be used for?

A

Visualising map reads
Showing SNPs
Showing Homozygous and Heterozygous map reads

35
Q

What type of SNPs are there?

A

Intergenic (within genes)
Intronic
Synonymous
Regulatory
Non-synonymous
Nonsense

36
Q

What can changes in nucleotide sequences result in?

A

SNPs
Larger chromosomal arrangements
Insertions
Deletions
Ect

37
Q

How can structural variants be genomically detected?

A

Coverage depth caused by duplication

38
Q

What can studying transcriptomics tell us?

A

Which genes are expressed
When they are expressed
How much they are expressed

39
Q

After de novo assembly of a genome, how are consensus sequences made?

A

Using de Brujn graphs to find repeats between samples

40
Q

How can coverage depth of variants of a genome increase?

A

Duplications or deletions being on multiple strands

41
Q

Why might read pairs be mapped further apart than they are?

A

Deletions. These split reads show structural variation

42
Q

How can RNA transcripts be identified?

A

Northern blotting

43
Q

How can transcript expression be quantified?

A

reverse transcriptase quantitative PCR

44
Q

How are probes on microarrays generated?

A

Short oligonucleotides generated by photolithography to find certain genes

45
Q

How is expression on microarrays shown on graphs?

A

the log is taken to make the base quantities symmetrical

46
Q

What are some limitations of microarrays?

A

low resolution
Microarrays are predesigned, we don’t know if probes may be missing
If there is a signal, a similar DNA fragment may be in the sample, not the same
Not direct sequencing

47
Q

Why might RNA sequencing be used?

A

Direct sequencing method
larger dynamic range than microarrays
Allows differences in the reference genome to be identified

48
Q

List some silencing genes

A

X inactivation
germline genes
repeat regions

49
Q

How do organisms utilise gene methylation other than upregulation?

A

Distinguishing parental and maternal alleles.
Bacteria differentiating it’s DNA from phage DNA

50
Q

How can methylation context determine where methylation occurs?

A

CpGs (Cytosine polyGuanine) are 60-80% methlated in mammals so can be found with bisulfate sequencing?

51
Q

How does bisulfate conversion work?

A

The genome is treated with
sodium bisulfate so unmethylated cytosines become uracil. This comes up as T when sequenced and can be compared to the original sequence.

52
Q

How does reduced representation bisulfate sequencing work?

A

DNA is digested with MSP1 which recogneses CCG. Fragments are repaired with adaptors, then selected by size. Bitsulfate conversion and PCR amplification is then done
Unmethylated come up in the sequence as Ts and can be compared to the original sequence

53
Q

How is direct methylation sequencing done with PacBio?

A

SMRT DNA sequencing is dine as gaps in fluorescent intensity are analysed for polymerase kinetics.

54
Q

How does chromatin immunoprecipitation work?

A
  1. Proteins are covalently crosslinked to DNA after treatment with formaldehyde
  2. Chromatin is sheared by sonication or endonucleases. Exonucleases allow bound DNA to be trimmed to the binding sites.
  3. Immunoprecipitaion and purification of bound DNA with antibodies
55
Q

How does chromatin immunoprecipitation work?

A
  1. Proteins are covalently crosslinked to DNA after treatment with formaldehyde
  2. Chromatin is sheared by sonication or endonucleases. Exonucleases allow bound DNA to be trimmed to the binding sites.
  3. Immunoprecipitaion and purification of bound DNA with antibodies
56
Q

How do genes interact with enhancers in 3C

A

One to one. PCR is done with primers specific to both interaction regions.

57
Q

How do genes interact in 4C, 5C and Hi-C?

A

4C- one to all
5C- many to many
Hi-C- all to all

58
Q

What does the ENCODE project do?

A

Identify all the function regions of the human genome
Uses 5C and CIhIA-PET to look at regulatory expression and protein interactions

59
Q

What did ENCODE show the genome as?

A

80% functional
20% regulating 2% encoding proteins

60
Q

What does TraDIS use to sequence

A

Transposons are replaced with selective markers with antibiotic resistance. Genes disrupted by transposons are inactivated. The mutant doesn’t survive if the gene is essential. Transposons can be inserted into non-essential genes

61
Q

What sort of sequencing did ECODE use?

A

Illumina