Sequencing Methods Flashcards

Question

WES (what is it?)

Answer 1

Whole exome Sequencing: focusses on the protein-coding regions of the genome (exons) which is about 1-2% of the human genome

Answer 2

This type of sequencing looks at parts of the genome which tend to be more clearly disease causing and better understood in the context of disease compard to non-coding elements.

Answer 3

1) targeted enrichment: regions of interest are selectively captured using oligonucleotides (in this case sequences to flank targeted regions) 2) Library preparation 3) Sequencing 4) Data analysis

Answer 4

+ more efficient because the sequenced regions are more likely to be disease causing + Is especially useful for detecting disease causing variants + Can also facilitate gene discovery

Answer 5

- Incomplete coverage compared to WGS - Limited detection of structural variants such as chromosomal rearrangements or inversions - There are still many variants of unknown significance or variants that are significant but in non-coding areas

Answer 6

Instead of sequencing the DNA we sequence the RNA

Answer 7

1) gives insights on the actual gene expression, so what is actually transcribed 2) insights in alternative splicing effects 3) it can detect fusion genes 4) It gives insights in the functional consequences 5) it allows us to detect imprinting effects by seeing which genes ar active, and which are not

Answer 8

RNA is not as stable as DNA

Answer 9

Single Cell RNA Sequencing: RNA sequencing of a single cell

Answer 10

To find out what is being transcribed

Answer 11

1) Cell isolation 2) Library preparation --> extraction of RNA and the cDNA (coding DNA) because it tends to be more stable 3) PCR 4) Sequencing 5) Data analysis

Answer 12

+ Insights how cell types differ in expression + It allows for detection of rare cell types within cell populations + Allows for the study of dynamic changes such as the response of environmental stimuli on a cell

Answer 13

- Limited coverage - More difficult to replicate - More noise and amplification bias compared to bulp RNA sequencing - Dissociating cells from tissue can lead to artifacts and stress response from cells - More difficult to maintain cell integrity and viability

Answer 14

B-RNA sequencing: takes multiple cell types in its analysis

Answer 15

Because it has some good advantages (coste effective, population level analysis, easy to replicate, global expression of genes)

Answer 16

1) RNA extraction through cells and tissue 2) Library preparation 3) Sequencing 4) Data analysis

Answer 17

+ cost effective + population level analysis + Easier to replicate: because its a larger sample to average out compared to one cell sample + Global expression of genes: including upregulation and downreagulation of genes

Answer 18

- Cannot distinguish cell to cell variability, such as subpopulation - Limited detection of rare cell transcriptions - It is more of a snapshot and less able to detect dynamics - Limited resolution for cell-to-cell interactions

Answer 19

focusses on specific mutations in a given sample.

Answer 20

- Limited information and other mutations might actually be causative

Answer 21

+ It is great if you have an idea of what gene is involved and you want to confirm it.

Answer 22

Single Nucleotide Polymorphism array is an array to detect SNP in the DNA

Answer 23

occur when a single nucleotide (building block of DNA) is replaced with another

Answer 24

Some SNP are associated with disease

Answer 25

+ High throughput + Cost effective + Highly reproducible + Well-established technology

Answer 26

- Limited variant detection - Often allele specific - Cannot be easily modified - No structural variants

Answer 27

Sequencing using short fragments (typically 50-300 bp). Technologies such as illumina and pyrosequencing use this method .

Answer 28

has nice advantages --> 1) cost effective 2) high accuracy 3) robust protocols

Answer 29

- Limited for large structural variations and complex genomic rearrangements - Limited information compared to long read

Answer 30

Sequencing by synthesis technology

Answer 31

for Sequencing

Answer 32

1) Library preparation 2) cluster generation through bridge amplification 3) Sequencing: nucleotides have fluorescence which is captured by camera 4) data analysis

Answer 33

+ High throughput + High accuracy + Flexibility: can be used for WGS, WES and RNA-sequencing

Answer 34

- GC bias (dependence between fragment count (read coverage) and GC (guanine and cytosine) content found in Illumina sequencing data.) - Read length is not as long as other metods (50-300 bases per fragment) - General short read sequencing limitations

Answer 35

Is a short-read sequencing method

Answer 36

Sequencing of long fragments which can be multiple kilobases (1kb = 1000 bases)

Answer 37

advantages: + detection of larger structural variations + Better for complex regions

Answer 38

- Higher error rate - Lower throughput - Higher quality and quantity of DNA is necessary

Answer 39

a single molecule, real time Long-read sequencing method

Answer 40

1) template preparation of circular DNA molecules 2) Put these molecules in SMRT cells which contains wells that capture fluorescence signals 3) Real time sequencing 4) Data analysis

Answer 41

Allows for sequencing of DNA and RNA molecules in real time

Answer 42

+ Long-reads in the kilobases + High accuracy because of the use of circular DNA molecules (it is repeatedly sequenced instead of in clusters) + Direct detection + Can be used for de novo assembly

Answer 43

- Slightly higher error rate comared to short-read sequencing - High cost - Need for high quality DNA

Answer 44

Real time long read sequencing method through the measurement of electrical currents of the DNA and RNA molecules

Answer 45

It's portable and can do direct RNA sequencing

Answer 46

1) Library preparation 2) Loading into flow cells 3) Translocation through nanopores 4) Signal detection and base calling, because the electrical currents are different for the basepairs

Answer 47

+ Long-reads + Real time sequencing + portability + Direct sequencing (which can be used for methylation detection)

Answer 48

- Relatively high error rate - Throughput is relatively low - Flow-cell life span is not that high

Answer 49

A cytogenetic technique to visualize the chromosomes under a microscope

Answer 50

it can be used to assess the number, size, shape, binding patterns or other clear chromosomal abnormalities

Answer 51

1) sample collection: tissue biopsy of actively dividing cells 2) Cell culture and mitotic arrest, this is done by simulation of cell division and arresting the cells with colchicine in the metaphase 3) Chromosome harvesting: add solution to swell cells and spread chromosomes 4) staining and banding: add dyes so banding patterns become visible 5) microscopic analysis

Answer 52

+ detection of chromosomal abnormalities + helpful for diagnosis for example trisomy

Answer 53

- difficult or impossible for small abnormalities - requires living cells - time consuming - labor intensive - interpretation is difficult and requires knowledge in cytogenetics - Limited information, does not say something on the molecular level

Answer 54

Fluorescence in situ hybridization = a molecular cytogenetic technique

Answer 55

Meant to visualize and map the location of specific DNA sequences in chromosomes, cells or tissue samles. If you suspect an abnormality, you can design a probe to confirm the abnormality

Answer 56

1) Probe design: you need a specific target sequence in the genome 2) sample preparation: immobilize cells and chromosomes 3) Denaturation and hybridization: Probe binds to complementary DNA strand (hybridization) 4) wash away the excess or unbound probes 5) Image analysis

Answer 57

+ Visual detection of DNA sequences + high specificity (ability to correctly identify lack of abnormalities in sample) and sensitivity (ability to correctly identify abnormalities in sample) + Multiplexing is possible

Answer 58

- Limited resolution: It is based on visual detection and thus on the quality of the microscope and researcher. There can be subjectivity in interpretation. - Probe design is difficult and time consuming

Answer 59

Chromatin immunoprecipitation followed by sequencing: Is a sequence method meant to investigate protein-DNA interactions and chromatin (histones and proteins that regulate DNA)

Answer 60

To study transcription factors, histone modification and chromatin accessibility (histones accessibility can affect DNA transcription)

Answer 61

1) cross linking: link proteins to DNA 2) Chromatin fragmentation: fragmentation of 200-500 base pairs 3) Immunoprecipitation: antibodies will bind to the transcriptional factors and histone modification. The complexes that form through this are captured 4) Purify DNA by washing away other proteins and antibodies 5) sequencing using NGS 6) Data analysis

Answer 62

+ genome wide coverage + high sensitivity and specificity + gives quantitative data, meaning that it gives information on the strength of the protein-DNA relations + Multiplexing is possible

Answer 63

- Dependent on antibodies - Cross linking efficiency can be variable - Require high complexity bioinformatic tools

Answer 64

Cleavage Under Targets & Release Using Nuclease: Is supposed to be a refinement of CHIP-seq and overcome its limitations

Answer 65

Because it is better than CHIP-Seq

Answer 66

very similar to CHIP-seq but with a more specific cleavage mechanism

Answer 67

+ Reduced background noise + Easier to use + Increased sensitivity + Improved resolution (more precise localization)

Answer 68

- still dependend on antibodies - Not suitable for all tissue types

Answer 69

Transposase-accessible chromatin using sequencing: A method to study gene regulation [ATAC-seq (Assay for Transposase-Accessible Chromatin using sequencing) is a technique used in molecular biology to assess genome-wide chromatin accessibility]

Answer 70

Assessing chromatin accessibility on genome-wide scale, to study gene regulation. This is done by measuring the accessibility of DNA to Tn5 transposase enzyme which inserts itself into chromatin regions

Answer 71

1) Tn5 binds to accessible DNA 2) Accessible DNA is fragmented by Tn5 3) Sequencing adaptors are inserted into these regions 4) DNA fragments are amplified 5) Next generation sequencing (NGS)

Answer 72

+ high sensitivity: can detect subtle changes + does not require a lot of input material + relatively simple and fast + does not require antibodies

Answer 73

- Bias in some types of regions - High computational demands - Less sensitive in regions near nuclear membrane

Answer 74

a technique to study transcribed RNA, specifcally Nascent RNA. Which is RNA that has not undergone modification yet

Answer 75

Unlike normal RNA sequencing it can only sequence RNA that have recently been made. It can be used to sequence the RNA that comes from regulatory elements

Answer 76

1) isolation of Nascent RNA 2) RNA extraction and library preperation 3) Sequencing 4) Data analysis

Answer 77

+ selective analysis and gives snapshot of transcriptional activities and dynamics + regulatory elements do produce nascent RNA so they can be measured using this method + High sensitivity

Answer 78

- Cell state and stability can effect quality - Complex

Answer 79

Molecular biology technique for regulatory elements

Answer 80

Investigate the 3D structure of the chromosome, it can be used to study the folding and loooping of the chromosome. 1vs.1 so you look at the interaction between 2 genomic loci

Answer 81

1) cross linking 2) Restriction digestion 3) Ligation 4) Cross link reversal and purification 5) detection and quantification

Answer 82

+ detection of long-range interactions + genome-wide analysis + regulatory elements can be studied

Answer 83

- complexity - requirements of antibodies

Answer 84

4C: one vs. all: look at the interactions between 1 genomic loci and all other genomic loci 5C: all vs all, chromatin interaction of multiple loci at the same time Hi-C: Genome wide captures all possoble interactions

Answer 85

Genome wide association studies: to identify genomic variants that are statistically associated with a risk for disease or particular trait. This often includes SNP's, insertions and deletions (INDELs) and copy number variants (CNV) which are larger structural variations.

Answer 86

to statistically associate genomic variations with diseases or traits

Answer 87

1) start with a population 2) Genotype the population (SNP array or WGS) 3) Meta analysis 4) Statistical association

Answer 88

+ can be used in the discovery of genetic variants + Hypothesis free approach + population level insight + Biological insight + Can be used for polygenic risk scores

Answer 89

- Often have limited predictive power, especially for individuals within the population - Limited in explaining complex traits and diseases - Limited causality - Difficult for rare variants (statistical power) - Population stratification (some subgroups in populations might have a significant value while the full population does not or other way around. Can lead to false positices or false negatives).

Answer 90

Polygenic Risk Scores: risk scores obtained from the GWAS studies. Such as the chance of a trait or odds ratio of a disease

Answer 91

can be used for predicting disease, prevention strategies and personalized medicine

Answer 92

+ Improved risk prediction + Can lead to early detection and prevention + Informative for common diseases

Answer 93

- can only explain small proportion of variance in complex diseases - can still be quite specific and non-generalizable for population - utility might be low for some diseases - ethics: pricacy, informed consent, health insurance - Mostly based on white European samples - Can be difficult to communicate that it is about risk and this may have negative consequences for mood, behavior and stress.