Lecture 3 Flashcards

Question 1

Q

What are features of models?

Answer

A

Never exactly like reality
As detailed as necessary
As simple as possible

Question 2

Q

What types of computational models exist

Answer

A

Mathematical and Statistical Models

Question 3

Q

What are the 4 assumptions LINE?

Answer

A

xp is a linear function
Errors are Independent
Errors are normally distributed
Errors have equal variance

Question 4

Q

How does RNA-seq work in steps?

Answer

A

Fragment RNA
Sequence Fragments
Aligne back to genome
Count reads mapping to gene

Question 5

Q

How do you prepare a library with Illumina?

Answer

A

Poly-A + RNA capture
RNA fragments primed
cDNA synthesized
3`ends adenlyated
adapter ligation
aplification of fragemnts

Question 6

Q

What is the Phred Score?

Answer

A

P = 10 ^(-Q/10) P = Propability base call incorrect Q = -10log(10)P

Question 7

Q

What is read trimming?

Answer

A

Chopping of ends of Read Quality Graphs

Question 8

Q

What is Read mapping?

Answer

A

Alignment of read to base gene sequence

Question 9

Q

What determines read count per base?

Answer

A

Expression level
Sequencing depth
Mapability
Noise

Question 10

Q

What determines read count per transcript?

Answer

A

Expression level
Sequencing depth
Mapability
Noise
Transcript length
Splicing

Question 11

Q

What are two methods for RNA enrichment?

Answer

A

poly A enrichment
purify transcripts with poly-A tail
enrich for mRNA
deplete non coding RNA

ribo-minus
remove rRNA
keep non coding tho

Question 12

Q

What is FPKM and how do you calculate it?

Answer

A

FPKM = n(i) / (l(i) * N) // n(i) = number of fragments for transcript i
l(i) = length of transcript i
N = 1 / 1000000 * Summe n(i)

Question 13

Q

What is TPM and how do you calculate it?

Answer

A

n(i2) = n(i) / l(i)
S = 1 / 1000000 * Summe n(i2)
TPM = n(i) / S

Question 14

Q

Whats the difference between TPM and FPKM

Answer

A

TPM is the same for each sample while FPKM may vary since it depends on the length of the fragment

Question 15

Q

How to determine differentially expressed genes?

Answer

A

linear model:
𝑌 = 𝜇 + 𝛽1 * 𝑥𝑡reat +𝛽2 * 𝑥𝑏atch +𝛽3 * 𝑥𝑠ex

Y = read counts in a given sample
µ = average read counts
xtreat = treatment condition (e.g. knock-out or drug treatment)
xbatch = batch membership (account for batch effects)
xsex = sex of the animal (account for sex effects)

Question 16

Q

How would 𝛽1 in the linear mopdel change if the treatment was succesfull?

Answer

Study These Flashcards

A

large and different from zero

Question 17

Q

Why are Generalzed Linear Models better ?

Answer

Study These Flashcards

A

𝐾𝑖j = 𝑓( Summer X(jr) beta(ir))
gene i
sample j
factor r

It does not Follow Gaussian distribution
Variance does not depend on the mean

Question 18

Q

How is a generalized Linear Model made up?

Answer

Study These Flashcards

A

lineqar predictor
link function
varaince function with constant dispersion parameter

Question 19

Q

What is overdispersion?

Answer

Study These Flashcards

A

Variance is larger than the mean

Question 20

Q

How does DeSeq2 work in 4 steps?

Answer

Study These Flashcards

A

Normalization
Variance estimation
Fold changes
Significance

Question 21

Q

Why do you normalize?

Answer

Study These Flashcards

A

Different number of reads
Contamination
Variation in machine

Question 22

Q

Why should you not correct very large values via shrinkage estimation?

Answer

Study These Flashcards

A

To avoid false positives

Lecture 3 Flashcards

(22 cards)