HC6: Big data analyses in immunology Flashcards
HC 6
Big Data: the three V’s
- Volume of data: a lot
- Velocity of processing of data: fast processing
- Variety of data sources: multiple sources, omics part of it
Ad-hoc tools
For Big Data analysis: to gain speed, certain programs needed
> scripting with known functions: just inputs required and know what to use
Big Data experiments
Data-driven hypothesis testing from publicly available large datasets without performing every experiment
> because: raw data needs to be stored in every experiment: re-use
> make hypothesis on previous data and verify in vitro or in vivo
> some questions can be answered with data that is already there
Omics role in immunology
- Transcriptome
- Cytome: expression proteins on cell surface
Why use of single cell RNA sequencing (scRNAseq)
- More detailed information of individual cells
- Reveals expression heterogeneity and subpopulations
> However: more expensive and more complex analysis.
scRNAseq technologies
- Plate based approach
- Droplet
- Microwell separation
Plate based scRNAseq: SMART-seq2, MARS-seq
- FACS
- One cell in one well: 384 well plate
- Physical separation
- Cells are individually sorted in wells plate with lysis buffer > cell lysis, RT, and downstream processes
Droplet based scRNAseq: 10xGenomics
- Microfluidic chip system: each cell incapsulated into oil droplets together with barcoded gel bead > cell lysis and RT within each droplet
- Barcoded cDNA is pooled for downstream processing, but distinguished with barcode
- Oil and water will divide: separation cells with gel beads with probes with barcodes (hairy bead): hybridization RNA with probe with barcode on gel beads
- flow creates droplets with beads and cells
Microwell separation scRNAseq: BD Rhapsody
- Cells are loaded in microwells together with barcoded magnetic bead
- Very tiny wells: flow across plate to get one cell per well
- Cell lysis is performed within each microwell where RNA of each cell binds > barcoded magnetic bead > also tagged with probes with barcode: all unique barcodes for cells
- Downstream processing performed on pooled beads
scRNAseq technologies and tissue atlas generation
Good with 10x and Rhapsody, less with plate based
Rare population profiling with scRNAseq techniques
Best with plate based
> less cells loaded (96 plate or 384 plate), less good for tissue atlas
> deep sequencing depth (10000 genes)
> 10x and Rhapsody: Shallow depth (2500 genes)
> rare populations: rare and you know the sorting: deep sequencing with plate based: more information of this little populations
> shallow sequencing won’t cover it: for example only B-cells taken: top genes are the same, but deeper genes might expose rare subpopulation
Plate based scRNAseq and throughput
Lower throughput > time-consuming
Increase information for multi-omics after scRNAseq
- Protein markers: CITE-seq
> for plate based, 10x and Rhapsody - Epigenomics: ATAC-seq
> 10x and Rhapsody
BCR/TCR sequencing
> mostly 10x and Rhapsody
Limitation scRNAseq
No spatial organization taken > the way the single cells were organized in the tissue is completely lost
> for some tissues: highly organized in which the same cell might behave differently (transcriptome) depending on where it is located eg lymph nodes
Spatial transcriptomics techniques
- Visium
- MERFISH
MERFISH
In situ hybridization
> high-throughput fluorescence in situ hybridization (FISH) in which expression of RNA is visualized by fluorescent probe with complementarity
> tissue placed on slide
> DNA probes for DNA of interest (gene of interest) will hybridize with RNA of interest > look where gene expressed
> picture on microscope
> FISH: white dots where expression gene of interest
> MERFISH: use barcode multiplexing, visualize up to 10,000 genes: make for each gene barcode sequence and do multiple rounds of fluorochroma labelled oligo flow for parts of barcode to identify all genes
> targeted analysis: only knwon transcripts are visualized: difficult quantification and comparison but nice picture