Sequencing Methods Flashcards

1
Q

what is PCR?

A

Polymerase Chain Reaction
a.k.a. a reaction to amplify DNA in a machine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

why do we use PCR?

A

amplifying DNA is useful so you get more DNA material to work with. Can be very useful in a forensic context (since there is not much material to work with) to diagnose disease (infectiouse disease and mutiations) and just generally for the study of DNA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Materials for PCR are…

A

1) DNA template (what you want to amplify)
2) DNA primers
3) DNA polymerase
4) DNTP’s (the building blocks)
5) Buffer solutions
6) Thermal cycling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

DNA primers are…

A

short oligonucleotides (short sequence of nucleotides, usually fewer than 50 nucleotides) for the flanking of target region. they can bind to their complementary sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

DNA polymerase is…

A

an enzyme that is essential for the synthesizing of new DNA strands

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Buffer solutions are used for…

A

for the optimal conditions of the enzymatic reactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Thermal cycling entails…

A

temperature cycles to create the necessary conditions for the reactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Method for PCR is:

A

1) Denature (95C) : DNA seperates into 2 strands because the hydrogen bonds breaks due to the heat
2) Annealing (50-65C): DNA primers can bind to their complementary strand
3) Extension (72C): the temperature is raised for the optimal function of the DNA polymerase
Repeat: usually 25-35 repeats

Note: temperatures can differ per primer or enzyme used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Advanteges of PCR

A

+ high sensitivity: can amplify really small portions of DNA
+ High specificity: primers can make the amplification really specific
+ Usually quick

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Limitations of PCR

A
  • Easy contaminated due to it’s sensitivity
  • poorly designed primers can have huge effects
  • It just amplifies, it does not sequence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Sanger Sequencing

A

it is a chain termination method to sequence DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why do we use Sanger Sequencing?

A

It is used to sequence DNA. It is still used to validate the results of next generation sequencing technologies and can still be cost-effective for smaller projects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Materials needed for Sanger Sequencing are:

A

1) DNA template
2) Primers
3) DNA polymerase
4) dNTPs to syntesyse DNA
5) ddNTPs with fluorescent dye to stop synthesis and find out what the sequence is
6) buffer for optimal conditions of reaction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Sanger Sequencing Method:

A

1) The DNA that needs to be sequenced is denatured into single strands

2) Primers will anneal to the DNA fragments

3) DNA syntesis –> shorter and longer fragments will be formed because of the dNTPs which wull synthesise the strand and ddNTPs (that have the dye) will terminate the synthesis of the strand, leading to multiple strands with varying lengths

4) electrophoresis –> because longer strands are stuck at the top of the gel while short strands will end up at the bottom. The ddNTPs with the dye will indicate the full sequence if you look from the bottom to the top.

5) Data analysis –> using a chromatogram you can analyse the peaks associated with the corresponding nucleotides

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Advantages to Sanger Sequencing

A

+ high accuracy
+ Usefull and small scale projets
+ validated and reliable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Limitations of Sanger Sequencing

A
  • Lower throughput
  • Not as useful for de novo (NGS has more depth coverage)
  • Difficulties in high repeat area
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Next Generation Sequencing

A

Next-generation sequencing (NGS) is a high-throughput technology that allows for the rapid sequencing of entire genomes or targeted DNA regions, enabling comprehensive genetic analysis and advancements in personalized medicine, research, and diagnostics.

(think of Illumina or Oxford Nanopore)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

advantages of NGS vs. Sanger Sequencing

A

+ higher throughput (can sequence millions of DNA fragments simultaneously
+ More rcomprehensive coverage, such as entire genome or large targeted areas.
+ Cost effective per base and it is easier to scale up
+ Very fast dor how much data it can provide
+ high coverage depth, better at finding rare variants
+ Versatillity: WGS, WES, RNAseq, Epigenetic, Metagenomics
+ better at detecting single nucleotide variations
+ can be used for de novo sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Limitation NGS

A

It may not always be as accurate as Sanger Sequencing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

WGS (what is it)?

A

Whole Genome Sequencing: a method to Determine the complete DNA sequence. (incorporating coding and non-coding , extra chromosomal areas and mitochondrial DNA)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Why do we use WGS?

A

WGS has comprehensive coverage and thus potentially gices a lot of information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Method WGS:

A

1) DNA isolation: blood, tissue and cultured cells
2) Library preparation: adaptors are ligated to ends of DNA fragments
3) Sequencing
4) Data analysis
5) Interpretation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Advantages for WGS

A

+ Comprehensive coverage, entire genome, regulatory elements, repetitive regions
+ Rather accurate
+ Useful for novel genetic variants not captured by targeted sequence approaches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Limitations WGS

A
  • Need a lot of DNA and analysis resources
  • High cost
  • Difficult to interpret, in WGS we see a lot of variations, but there are a lot of VUS (variants of unknown significance) and the effects of non-coding regions are not always well understood.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

WES (what is it?)

A

Whole exome Sequencing: focusses on the protein-coding regions of the genome (exons) which is about 1-2% of the human genome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Why use WES?

A

This type of sequencing looks at parts of the genome which tend to be more clearly disease causing and better understood in the context of disease compard to non-coding elements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

How does WES work?

A

1) targeted enrichment: regions of interest are selectively captured using oligonucleotides (in this case sequences to flank targeted regions)
2) Library preparation
3) Sequencing
4) Data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Advantages WES

A

+ more efficient because the sequenced regions are more likely to be disease causing
+ Is especially useful for detecting disease causing variants
+ Can also facilitate gene discovery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Limitations WES

A
  • Incomplete coverage compared to WGS
  • Limited detection of structural variants such as chromosomal rearrangements or inversions
  • There are still many variants of unknown significance or variants that are significant but in non-coding areas
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

RNA Sequencing (what is it?)

A

Instead of sequencing the DNA we sequence the RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Why use RNA sequencing?

A

1) gives insights on the actual gene expression, so what is actually transcribed
2) insights in alternative splicing effects
3) it can detect fusion genes
4) It gives insights in the functional consequences
5) it allows us to detect imprinting effects by seeing which genes ar active, and which are not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Limitation RNA sequencing

A

RNA is not as stable as DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

sc-RNA Sequencing (What is it?)

A

Single Cell RNA Sequencing: RNA sequencing of a single cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Why use sc-RNA Sequencing

A

To find out what is being transcribed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

How does sc-RNA Sequencing work?

A

1) Cell isolation
2) Library preparation –> extraction of RNA and the cDNA (coding DNA) because it tends to be more stable
3) PCR
4) Sequencing
5) Data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Advantages sc-RNA sequencing

A

+ Insights how cell types differ in expression
+ It allows for detection of rare cell types within cell populations
+ Allows for the study of dynamic changes such as the response of environmental stimuli on a cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Limitations sc-RNA Sequencing

A
  • Limited coverage
  • More difficult to replicate
  • More noise and amplification bias compared to bulp RNA sequencing
  • Dissociating cells from tissue can lead to artifacts and stress response from cells
  • More difficult to maintain cell integrity and viability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Bulk-RNA Sequencing (what is it)

A

B-RNA sequencing: takes multiple cell types in its analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Why use Bulk-RNA sequencing?

A

Because it has some good advantages

(coste effective, population level analysis, easy to replicate, global expression of genes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

How does bulk-RNA sequencing work?

A

1) RNA extraction through cells and tissue
2) Library preparation
3) Sequencing
4) Data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Advantages bulk-RNA sequencing

A

+ cost effective
+ population level analysis
+ Easier to replicate: because its a larger sample to average out compared to one cell sample
+ Global expression of genes: including upregulation and downreagulation of genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Limitation of bulk-RNA sequencing

A
  • Cannot distinguish cell to cell variability, such as subpopulation
  • Limited detection of rare cell transcriptions
  • It is more of a snapshot and less able to detect dynamics
  • Limited resolution for cell-to-cell interactions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Targeted Gene Panels

A

focusses on specific mutations in a given sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Limitation Targeted gene panels

A
  • Limited information and other mutations might actually be causative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

Advantage Targeted gene panel

A

+ It is great if you have an idea of what gene is involved and you want to confirm it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

SNP array (what is it?)

A

Single Nucleotide Polymorphism array is an array to detect SNP in the DNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

SNP

A

occur when a single nucleotide (building block of DNA) is replaced with another

48
Q

Why use SNP array’s?

A

Some SNP are associated with disease

49
Q

Advantages SNP array

A

+ High throughput
+ Cost effective
+ Highly reproducible
+ Well-established technology

50
Q

Limitations SNP-array

A
  • Limited variant detection
  • Often allele specific
  • Cannot be easily modified
  • No structural variants
51
Q

Short Read WGS (what is it?)

A

Sequencing using short fragments (typically 50-300 bp). Technologies such as illumina and pyrosequencing use this method .

52
Q

Why use Short Read WGS?

A

has nice advantages –> 1) cost effective
2) high accuracy
3) robust protocols

53
Q

Limitations Short Read WGS

A
  • Limited for large structural variations and complex genomic rearrangements
  • Limited information compared to long read
54
Q

Illumina (Short read) what is it?

A

Sequencing by synthesis technology

55
Q

Why use Illumina/Short read?

A

for Sequencing

56
Q

How does Illumina/Short read work?

A

1) Library preparation
2) cluster generation through bridge amplification
3) Sequencing: nucleotides have fluorescence which is captured by camera
4) data analysis

57
Q

Advantages Illumina / short read

A

+ High throughput
+ High accuracy
+ Flexibility: can be used for WGS, WES and RNA-sequencing

58
Q

Limitations Illumina/ short read

A
  • GC bias (dependence between fragment count (read coverage) and GC (guanine and cytosine) content found in Illumina sequencing data.)
  • Read length is not as long as other metods (50-300 bases per fragment)
  • General short read sequencing limitations
59
Q

Pyrosequencing

A

Is a short-read sequencing method

60
Q

Long read WGS (what is it?)

A

Sequencing of long fragments which can be multiple kilpbases (1kb = 1000 bases)

61
Q

Why use Long Read WGS?

A

advantages:
+ detection of larger structural variations
+ Better for complex regions

62
Q

Limitations Long read WGS

A
  • Higher error rate
  • Lower throughput
  • Higher quality and quantity of DNA is necessary
63
Q

PacBio (what is it?)

A

a single molecule, real time Long-read sequencing method

64
Q

How does PacBio work?

A

1) template preparation of circular DNA molecules
2) Put these molecules in SMRT cells which contains wells that capture fluorescence signals
3) Real time sequencing
4) Data analysis

65
Q

Why use PacBio?

A

Allows for sequencing of DNA and RNA molecules in real time

66
Q

Advantages PacBio

A

+ Long-reads in the kilobases
+ High accuracy because of the use of circular DNA molecules (it is repeatedly sequenced instead of in clusters)
+ Direct detection
+ Can be used for de novo assembly

67
Q

Limitations PacBio

A
  • Slightly higher error rate comared to short-read sequencing
  • High cost
  • Need for high quality DNA
68
Q

Oxford Nanopore (what is it)

A

Real time long read sequencing method through the measurement of electrical currents of the DNA and RNA molecules

69
Q

Why use Oxford Nanopore?

A

It’s portable and can do direct RNA sequencing

70
Q

How does Oxford Nanopore work?

A

1) Library preparation
2) Loading into flow cells
3) Translocation through nanopores
4) Signal detection and base calling, because the electrical currents are different for the basepairs

71
Q

Advantages Oxford Nanopore

A

+ Long-reads
+ Real time sequencing
+ portability
+ Direct sequencing (which can be used for methylation detection)

72
Q

Limitations Oxford Nanopore

A
  • Relatively high error rate
  • Throughput is relatively low
  • Flow-cell life span is not that high
73
Q

Karyotyping (what is it?)

A

A cytogenetic technique to visualize the chromosomes under a microscope

74
Q

why use Karyotyping?

A

it can be used to assess the number, size, shape, binding patterns or other clear chromosomal abnormalities

75
Q

How does karyotyping work?

A

1) sample collection: tissue biopsy of actively dividing cells

2) Cell culture and mitotic arrest, this is done by simulation of cell division and arresting the cells with colchicine in the metaphase

3) Chromosome harvesting: add solution to swell cells and spread chromosomes

4) staining and banding: add dyes so banding patterns become visible

5) microscopic analysis

76
Q

Advantages Karyotyping

A

+ detection of chromosomal abnormalities
+ helpful for diagnosis for example trisomy

77
Q

Limitations Karyotyping

A
  • difficult or impossible for small abnormalities
  • requires living cells
  • time consuming
  • labor intensive
  • interpretation is difficult and requires knowledge in cytogenetics
  • Limited information, does not say something on the molecular level
78
Q

FISH (what is it)

A

Fluorescence in situ hybridization = a molecular cytogenetic technique

79
Q

why use FISH?

A

Meant to visualize and map the location of specific DNA sequences in chromosomes, cells or tissue samles. If you suspect an abnormality, you can design a probe to confirm the abnormality

80
Q

How does FISH work?

A

1) Probe design: you need a specific target sequence in the genome
2) sample preparation: immobilize cells and chromosomes
3) Denaturation and hybridization: Probe binds to complementary DNA strand (hybridization)
4) wash away the excess or unbound probes
5) Image analysis

81
Q

Advantages FISH

A

+ Visual detection of DNA sequences
+ high specificity (ability to correctly identify lack of abnormalities in sample) and sensitivity (ability to correctly identify abnormalities in sample)
+ Multiplexing is possible

82
Q

Limitations FISH

A
  • Limited resolution: It is based on visual detection and thus on the quality of the microscope and researcher. There can be subjectivity in interpretation.
  • Probe design is difficult and time consuming
83
Q

CHIP-seq (what is it?)

A

Chromatin immunoprecipitation followed by sequencing:
Is a sequence method meant to investigate protein-DNA interactions and chromatin (histones and proteins that regulate DNA)

84
Q

Why do we use CHIP-Seq?

A

To study transcription factors, histone modification and chromatin accessibility (histones accessibility can affect DNA transcription)

85
Q

How does CHIP-seq work?

A

1) cross linking: link proteins to DNA
2) Chromatin fragmentation: fragmentation of 200-500 base pairs
3) Immunoprecipitation: antibodies will bind to the transcriptional factors and histone modification. The complexes that form through this are captured
4) Purify DNA by washing away other proteins and antibodies
5) sequencing using NGS
6) Data analysis

86
Q

Advantages CHIP-seq

A

+ genome wide coverage
+ high sensitivity and specificity
+ gives quantitative data meaning that it gives information on the strength of the protein-DNA relations
+ Multiplexing is possible

87
Q

Limitations CHIP-Seq

A
  • Dependent on antibodies
  • Cross linking efficiency can be variable
  • Require high complexity bioinformatic tools
88
Q

Cut & Run (what is it?)

A

Cleavage Under Targets & Release Using Nuclease:
Is supposed to be a refinement of CHIP-seq and overcome its limitations

89
Q

Why use Cut&Run?

A

Because it is better than CHIP-Seq

90
Q

How does Cut & Run work?

A

very similar to CHIP-seq but with a more specific cleavage mechanism

91
Q

Advantages Cut & Run

A

+ Reduced background noise
+ Easier to use
+ Increased sensitivity
+ Improved resolution (more precise localization)

92
Q

Limitations Cut & Run

A
  • still dependend on antibodies
  • Not suitable for all tissue types
93
Q

ATAC-Seq (What is it?)

A

Transposase-accessible chromatin using sequencing:
A method to stufy gene regulation

94
Q

why use ATAC-Seq

A

Assessing chromatin accessibility on genome-wide scale, to study gene regulation. This is done by measuring the accessibility of DNA to Tn5 transposase enzyme which inserts itself into chromatin regions

95
Q

How does ATAC-seq work?

A

1) Tn5 binds to accessible DNA
2) Accessible DNA is fragmented by Tn5
3) Sequencing adaptors are inserted into these regions
4) DNA fragments are amplified
5) Next generation sequencing (NGS)

96
Q

Advantages ATAC-seq

A

+ high sensitivity: can detect subtle changes
+ does not require a lot of input material
+ relatively simple and fast
+ does not require antibodies

97
Q

Limitations ATAC-Seq

A
  • Bias in some types of regions
  • High computational demands
  • Less sensitive in regions near nuclear membrane
98
Q

Nascent RNA-seq (what is it?)

A

a technique to study transcribed RNA, specifcally Nascent RNA. Which is RNA that has not undergone modification yet

99
Q

Why use Nascent RNA-seq?

A

Unlike normal RNA sequencing it can onlu sequence RNA that have recently been made. It can be used to sequence the RNA that comes from regulatory elements

100
Q

How does Nascent RNA-seq work?

A

1) isolation of Nascent RNA
2) RNA extraction and library preperation
3) Sequencing
4) Data analysis

101
Q

Advantages Nascent RNA-seq

A

+ selective analysis and gives snapshot of transcriptional activities and dynamics
+ regulatory elements do produce nascent RNA so they can be measured using this method
+ High sensitivity

102
Q

Limitations Nascent RNA-seq

A
  • Cell state and stability can effect quality
  • Complex
103
Q

Chromosome Conformation Capture = 3C (what is it?)

A

Molecular biology technique for regulatory elements

104
Q

Why use 3C

A

Investigate the 3D structure of the chromosome, it can be used to study the folding and loooping of the chromosome. 1vs.1 so you look at the interaction between 2 genomic loci

105
Q

How dos 3C work?

A

1) cross linking
2) Restriction digestion
3) Ligation
4) Cross link reversal and purification
5) detection and quantification

106
Q

Advantages 3C

A

+ detection of long-range interactions
+ genome-wide analysis
+ regulatory elements can be studied

107
Q

Limitations 3C

A
  • complexity
  • requirements of antibodies
108
Q

4C, 5C and HIC (what is it?)

A

4C: one vs. all: look at the interactions between 1 genomic loci and all other genomic loci

5C: all vs all, chromatin interaction of multiple loci at the same time

Hi-C: Genome wide captures all possoble interactions

109
Q

GWAS (what is it?)

A

Genome wide association studies:
to identify genomic variants that are statistically associated with a risk for disease or particular trait. This often includes SNP’s, insertions and deletions (INDELs) and copy number variants (CNV) which are larger structural variations.

110
Q

Why use GWAS?

A

to statistically associate genomic variations with diseases or traits

111
Q

How does GWAS work?

A

1) start with a population
2) Genotype the population (SNP array or WGS)
3) Meta analysis
4) Statistical association

112
Q

Advantages GWAS

A

+ can be used in the discovery of genetic variants
+ Hypothesis free approach
+ population level insight
+ Biological insight
+ Can be used for polygenic risk scores

113
Q

Limitations GWAS

A
  • Often have limited predictive power, especially for individuals within the population
  • Limited in explaining complex traits and diseases
  • Limited causality
  • Difficult for rare variants (statistical power)
  • Population stratification (some subgroups in populations might have a significant value while the full population does not or other way around. Can lead to false positices or false negatives).
114
Q

PRS (what is it?)

A

Polygenic Risk Scores:
risk scores obtained from the GWAS studies. Such as the chance of a trait or odds ratio of a disease

115
Q

Why do we use PRS?

A

can be used for predicting disease, prevention strategies and personalized medicine

116
Q

Advantages PRS

A

+ Improved risk prediction
+ Can lead to early detection and prevention
+ Informative for common diseases

117
Q

Limitations PRS

A
  • can only explain small proportion of variance in complex diseases
  • can still be quite specific and non-generalizable for population
  • utility might be low for some diseases
  • ethics: pricacy, informed consent, health insurance
  • Mostly based on white European samples
  • Can be difficult to communicate that it is about risk and this may have negative consequences for mood, behavior and stress.