Transcriptome Analysis Flashcards

Question 1

Q

Why do we care about transcription?

Answer

A

It is the primary means of interpreting info in the genome
it plays a central role in evolution
Often misrelated in disease

Question 2

Q

Complex traits

Answer

A

> 85% of GWAS associations lie in non-coding regions

- enriched for eQTLs, overlap with regulatory elements

Question 3

Q

Basic principles of gene regulation

Answer

A

Gene expression varies in quantity, space, time, and in response to stimuli
We typically measure steady state RNA
RNA is regulated at the level of transcription, promoter usage, splicing, poly A site usage, stability, and localization

Question 4

Q

Perspectives to study RNA

Answer

A

spatial localization
abundance quantification
transcript isoforms and structure
emphasis on response to stimuli

Question 5

Q

Spatial localization of RNA

Answer

A

Techniques: In situ hybridization, immuno histochemistry, gene fusions

Can provide very precise (sub)cellular resolution

Often on fixed tissues, but live imaging becoming more common

often difficult to quantify due to technical variations

Question 6

Q

immuno histochemistry

Answer

A

treat tissue with antibodies

Question 7

Q

Quantifying RNA abundance

Answer

A

Technqiues: Northern blots, qPCR, microarrays, nano string, RNAseq
Isolate cells, extract RNA, measure steady state RNA
Isolating cells can be difficult

Question 8

Q

transcript isoform usage and structure

Answer

A

techniques: qPCR, nanostring, Long read or paired end RNA seq
microarrays were not particularly good for this
short read RNAseq data has inherent limitations

Question 9

Q

Response to stimulus

Answer

A

Peturb system, measure gene expression

-knock down TF + measure RNA

Question 10

Q

Knock down TF and measure RNA

how do you know if change is direct?

Answer

A

pulse chase experiment

Question 11

Q

method based on pulse chase experiment

Answer

A

nascent transcription quantification (GROseq)

measuring RNA stability

Question 12

Q

EST Sanger sequencing

Answer

A

which is great for gene identification and characterization, long reads enabled isoform reconstruction, too expensive to accurate quantification

Question 13

Q

SAGE

Answer

A

Serial Analysis of Gene Expression

cDNAs cleaved into short <20bp fragments, concatamerized, and sequenced

Question 14

Q

RNAseq molecular biology

Answer

A

extract RNA
purify RNAs of interest (mRNA, miRNA)
fragment, prime
convert to cDNA
attach adapters
sequence single or paired end reads

Question 15

Q

RNAseq analysis outline

Answer

A

In some applications, reads are aligned to transcriptome (some align to transcriptome)
Assemble and quantify transcript abundance
test for differential expression(data are count based)

Question 16

Q

RNAseq complications

Answer

Study These Flashcards

A

Alignment –>short reads, large gaps, 1% error

using annoyed gene models helps
paired end and longer reads help

Experimental design - replicates –>bc the experiment is fairly expensive and complicated, many people do not perform (enough) replicates

Confounding variables:
-randomization is critical in experimental design

small n, large p
empirical bayes approaches

Question 17

Q

Computation cost

Answer

Study These Flashcards

A

Tophat ~1 hr/ 1 M reads on standard workstation

Question 18

Q

Confounding variables:

Answer

Study These Flashcards

A

difficult to control for variables can have large effects on RNAseq data
RNA extraction data, person performing library construction, kit batch, sequencing run, temperature, time day…

Question 19

Q

hidden variables

Answer

Study These Flashcards

A

latent variable techniques: PCA, factor analysis, PEER, SVA

Question 20

Q

FPKM

Answer

Study These Flashcards

A

Fragments per kilobase per million mapped reads
standard unit of measurement for RNA abundance from RNAseq
normalizes by transcript length and read depth

relative measure- depends upon the abundance of all transcripts,

Question 21

Q

Surrogate variable analysis

Answer

Study These Flashcards

A

ranking features of association accounting for hidden variables that are unmeasured

Transcriptome Analysis Flashcards

(21 cards)