L7 - Big Data Flashcards

1
Q

What is big data?

A

-These are datasets too large or complex to process using traditional data processing methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is big data used to measure? (6)

A

-Big data is gathered from large populations of DNA, RNA, protein molecules as well as cells, tissues and organisms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is used to analyse the following:
Short read sequence -
Long read sequence -
Proteomics/metabolomics -
Epigenomics -

A

Short read sequence - Illumina programme
Long read sequence - PacBio
Proteomics/metabolomics - Mass spectrometry
Epigenomics - ChIP-Seq

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is genomics used for
what is transciptomics used for?

A

-analysing DNA
-analysing RNA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Fold-change?
What is Significance?

A

Fold-change - is how much gene expression is increased or decreased by the treatment

Significance - the statistical significance of the difference in gene expression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What happens in a single cell RNA-Seq? (5)

A

-Collects the transcriptome of many individual cells
-Instead of measuring all the mRNA, measure from the individual cells of the sample
-Do this by breaking down the tissue into a single cell suspension that contains different cell types from the sample
-Link the sequencing data to particular cells and find out which mRNA belongs to which cell
-Use a UMAP plot to present the data where each dot is a cell

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In a Genome-wide association study (GWAS) what is identified?

A

-(single nucleotide variants) SNPs, high scoring SNPs may be associated with the diseases and play causative roles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly