Transcriptomics Flashcards
What is the term ‘omics’?
- Coined to describe the field of large-scale data-rich biology
- New technologies are becoming increasingly high throughput and are generating huge numbers (high
resolution) of molecular measurements within a tissue or cell - Data generated provide a snapshot of what is going on in a cell at a given time
How are proteins made from DNA?
Transcription:
1) RNA polymerase binds to promoter regions of DNA. Initiates transcription of gene into pre-mRNA
2) Introns spliced and exons join together
3) mRNA undergoes polyadenylation, adding poly-A tail to 3’ end, protecting from degradation and aids in export from nucleus
4) Mature mRNA exits through nuclear pore into cytosol
Translation:
1) Ribosomes bind to mRNA, each tRNA has an anticodon that is complementary to the mRNA codon, ensuring the correct amino acid is added to the growing polypeptide chain
2) After synthesis, undergoes various post-translational modifications to become fully functional
3) Process continues until a terminator codon is reached, signalling the end of translation. The completed protein is then released to perform its functions within the cell
What is transcriptomics?
Transcriptomics Methods
What is RNA Sequencing
What are Illumina Systems
Adv + Dis of Illumina Systems
PacBio Systems
Adv + Dis of PacBio Systems
Oxford Nanopore
Adv + Dis of Oxford Nanopore
RNASeq Data Processing
- Are there any contaminants
- Are there reference genomes
- Low quality reads are filtered out to increase overall dataset quality
RNASeq Data Alignment
RNAseq reads can be mapped to the transcriptome
- Need reliable gene models
- Can’t detect novel genes
RNAseq reads can be mapped to a reference genome
- RNAseq reads will contain regions covering an exon junction (where introns in the original DNA were spliced out) and these can be thousands of bp apart
- Junction reads need to be broken, or they will not be accurately mapped to the genome
RNASeq Data Normalization
Determining outliers, anomalies and noise
- When comparing different groups of samples, it’s ideal to have bigger inter-group variability than intra-group variability
PCA Plot
- can reduce a large dataset into 1 or 2 principal components which describe most of the variation
Pearson’s Coefficient
- measure the correlation (direction and strength) between two variables
-