HC6: Big data analyses in immunology Flashcards
HC 6
Big Data: the three V’s
- Volume of data: a lot
- Velocity of processing of data: fast processing
- Variety of data sources: multiple sources, omics part of it
Ad-hoc tools
For Big Data analysis: to gain speed, certain programs needed
> scripting with known functions: just inputs required and know what to use
Big Data experiments
Data-driven hypothesis testing from publicly available large datasets without performing every experiment
> because: raw data needs to be stored in every experiment: re-use
> make hypothesis on previous data and verify in vitro or in vivo
> some questions can be answered with data that is already there
Omics role in immunology
- Transcriptome
- Cytome: expression proteins on cell surface
Why use of single cell RNA sequencing (scRNAseq)
- More detailed information of individual cells
- Reveals expression heterogeneity and subpopulations
> However: more expensive and more complex analysis.
scRNAseq technologies
- Plate based approach
- Droplet
- Microwell separation
Plate based scRNAseq: SMART-seq2, MARS-seq
- FACS
- One cell in one well: 384 well plate
- Physical separation
- Cells are individually sorted in wells plate with lysis buffer > cell lysis, RT, and downstream processes
Droplet based scRNAseq: 10xGenomics
- Microfluidic chip system: each cell incapsulated into oil droplets together with barcoded gel bead > cell lysis and RT within each droplet
- Barcoded cDNA is pooled for downstream processing, but distinguished with barcode
- Oil and water will divide: separation cells with gel beads with probes with barcodes (hairy bead): hybridization RNA with probe with barcode on gel beads
- flow creates droplets with beads and cells
Microwell separation scRNAseq: BD Rhapsody
- Cells are loaded in microwells together with barcoded magnetic bead
- Very tiny wells: flow across plate to get one cell per well
- Cell lysis is performed within each microwell where RNA of each cell binds > barcoded magnetic bead > also tagged with probes with barcode: all unique barcodes for cells
- Downstream processing performed on pooled beads
scRNAseq technologies and tissue atlas generation
Good with 10x and Rhapsody, less with plate based
Rare population profiling with scRNAseq techniques
Best with plate based
> less cells loaded (96 plate or 384 plate), less good for tissue atlas
> deep sequencing depth (10000 genes)
> 10x and Rhapsody: Shallow depth (2500 genes)
> rare populations: rare and you know the sorting: deep sequencing with plate based: more information of this little populations
> shallow sequencing won’t cover it: for example only B-cells taken: top genes are the same, but deeper genes might expose rare subpopulation
Plate based scRNAseq and throughput
Lower throughput > time-consuming
Increase information for multi-omics after scRNAseq
- Protein markers: CITE-seq
> for plate based, 10x and Rhapsody - Epigenomics: ATAC-seq
> 10x and Rhapsody
BCR/TCR sequencing
> mostly 10x and Rhapsody
Limitation scRNAseq
No spatial organization taken > the way the single cells were organized in the tissue is completely lost
> for some tissues: highly organized in which the same cell might behave differently (transcriptome) depending on where it is located eg lymph nodes
Spatial transcriptomics techniques
- Visium
- MERFISH
MERFISH
In situ hybridization
> high-throughput fluorescence in situ hybridization (FISH) in which expression of RNA is visualized by fluorescent probe with complementarity
> tissue placed on slide
> DNA probes for DNA of interest (gene of interest) will hybridize with RNA of interest > look where gene expressed
> picture on microscope
> FISH: white dots where expression gene of interest
> MERFISH: use barcode multiplexing, visualize up to 10,000 genes: make for each gene barcode sequence and do multiple rounds of fluorochroma labelled oligo flow for parts of barcode to identify all genes
> targeted analysis: only knwon transcripts are visualized: difficult quantification and comparison but nice picture
Visium by 10xGenomics
- Spacial glass slide which contains 4x capture area
- Stick tissue slide on glass slide
- Spots on glass slide: hairy spots: barcoded spots containing oligo (DNA probe) containing barcode for each spot (location on slide) and sequence to catch RNA (hybridize with polyA of mRNA)
> additional barcode for location this time! - Tissue placed on capture area and slides go into visium machine and picture is taken
- Then: tissue is permeabilized and RNA of each cell is released and hybridizes with spot-barcoded oligos
- RNA is sequenced and expression per spot is obtained
- overlay expression with first picture taken
Problem Visium
Not single cell resolution
> up to 10 cells can release RNA and get hybridized with one barcoded hairy spot (multiple hairs on spot)
> 1-10 cells per spot
> bulk spatial RNA seq of small biopsy of 1-10 cells really
Visium HD by 10x Genomics
- Barcoded spots so close to each other, single cell resolution reached
- Single cell transcriptome with spatial orientation
Immunotyping
Characterization of immune cells based on their cellular phenotype
THE technique for immunotyping
Flow cytometry
> differentiation based on expression specific markers on immune cells
> use antibodies specific for these markers, labeled with different fluorochromes, which can distinguish the different cells
> measured due to different specific emission spectra after excitation by laser
CD markers for immune cells and what is CD
CD: cluster of differentiation: determining which cell it is, but do have functions
> CD3: T-cell
> CD19: B-cell
> CD14: monocyte
> CD16 / CD56: NK-cell
Deep immunotyping
Simultaneous detection of large numbers (up to 50) cellular markers
Different ways to achieve deep immunotyping
- Increased number of lasers
- Tandem dyes
- Spectral flow cytometry
- CyTOF