Tonon: Lecture II Flashcards

RNA-Seq and Experiment Design

1
Q

What is RNA-Seq?

A

it is the sequencing of RNA using technology

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the advantages of using Oxford Nanopore and Pacific Biosciences vs Illumina?

A

Pacific and Oxford offer better length, which is essential to detect phenomena such as alternative splicing, which leads to variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some disadvantages of Pacific and Oxford?

A

high error rate
high quality RNA is needed
lower throughput, which is needed for heterogenous cancer samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do Illumina and microarrays focus on?

A

poly-A tail structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Is it better to have 50 million single hand or 25 double strand sequences?

A

the 50 million because it is important to have as many sequences as possible to ensure you have even low expressed genes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

After sequencing, what are the steps?

A

obtain FASTQ file from instrument
align unknown genome
assemble
quantification
normalization
model of differential expression is obtained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How long do you have to analyze data?

A

depends but there is an expiration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Biological Replicate

A

biological are done with different cell types

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Technical Repliate

A

same cell type, repeated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Biological Replicates vs Technical Replicates

A

biological is more important

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What has been introduced to minimize bias?

A

Standarized Operational Procedures (SOPs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When analyzing data for large experiments, what can be done to minimize bias?

A

we can orthogonalize variables to prevent bias from being introduced (seen in batched)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is normalization?

A

removes background and non-biological variation as much as possible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the 2 key assumptions normalization relies on?

A

expression levels of most genes remain the same across replicate groups

different sample groups do not exhibit a meaningful difference in overall mRNA levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we normalize data from RNA-seq?

A

we use all the genes in the expression to do

normalisation, we calculate the average amount, so we know the average expression level and then

we bring it down to the same level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is MYC?

A

gene expressed most cancers?

17
Q

How does MYC work?

A

MYC affects only specific targets, so we can normalize the data and report all genes 1 to 1 among other cells

low MYC → quiescent cells
increased MYC → selective gene regulation

proliferating cells with increased MYC have a lot of RNA but the MYC targets are few

18
Q

What are the 2 steps to filtering?

A

identify and remove a set of genes that seem to generate an uninformative signal

statistical testing only to genes that pass the filter

19
Q

What is the goal of filtering?

A

for true-differential expression to emerge and silence the non-informative

20
Q

What is the importance of Filtering on Overall Variance?

A

all samples (healthy and cancerous: without labels) are filtered

all genes that have low expression are removed…this leads to less genes with higher signifigance