HC6: Big data analyses in immunology Flashcards

Question

Deep immunotyping: increased number of lasers

Answer 1

- More lasers widen the range of excitation wavelengths that can be used > additional fluorochromes can be used (UV/IR lasers increase fluorochrome availability even further) > more markers and antibodies can be included

Answer 2

- Two fluorochromes are coupled (tandem dye) so that the light emitted by the first, laser-excited fluorochrome excites the second > the emission spectrum shifts > more fluorochromes, and therefore more antibodies, can be used

Answer 3

Detection of the entire emission spectrum of a fluorochrome rather than specific wavelengths > no single detector per filtered wavelength band (fewer filters) > entire spectrum measured > fluorochromes that look similar within a narrow wavelength band can still be distinguished > more fluorochromes can be used >> requires a completely different machine!

Answer 4

- Antibodies are labelled with metals
- Each cell is subjected to mass cytometry (combination of FC and MS) by time of flight
- For each cell you obtain a distinctive metal mass spectrum (intensity vs mass)
- No lasers used > works with mass rather than light
- Less overlap between the masses of different metals than between fluorochrome spectra > more antibody-metal conjugates can be used for more markers > up to 100 markers

Answer 5

Big data analysis > increasing number of markers detected simultaneously > need for advanced computational tools for analysis > dimensionality reduction for an overview: UMAP > down to 2 dimensions > cells with similar expression / transcriptome cluster together

Answer 6

Sequencing of the BCR and TCR repertoire > these receptors let B and T cells recognize antigen at the antigen-binding site > each receptor is unique and antigen-specific > generated by V(D)J recombination for both TCR and BCR >> through cutting and pasting at the DNA level

Answer 7

- V(D)J recombination: random choice of gene fragments
- Cutting and pasting creates overhangs: these are filled with random nucleotides, which creates enormous variability > these parts form e.g. CDR3, one of the highly variable Ag-binding loops: the V-(D)-J junction, main site of Ag interaction
>> This process happens independently in every B and T cell during development
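To make the two sources of diversity concrete (random segment choice plus random junction nucleotides), here is a minimal toy sketch. The segment names and sequences are invented for illustration and are not real germline genes; real recombination also trims segment ends, which is left out here.

```python
import random

# Toy gene segments: names and sequences are invented for illustration only.
V_SEGMENTS = {"V1": "TGTGCCAGC", "V2": "TGTGCTAGT"}
D_SEGMENTS = {"D1": "GGGACA", "D2": "CTAGCG"}
J_SEGMENTS = {"J1": "GAGCAGTTC", "J2": "ACTGAAGCT"}


def random_insert(rng, max_len=4):
    """Return 0..max_len random nucleotides used to fill a junction."""
    return "".join(rng.choice("ACGT") for _ in range(rng.randint(0, max_len)))


def recombine(rng):
    """Simulate one cell: pick V, D and J at random and fill both junctions."""
    v = rng.choice(sorted(V_SEGMENTS))
    d = rng.choice(sorted(D_SEGMENTS))
    j = rng.choice(sorted(J_SEGMENTS))
    sequence = (
        V_SEGMENTS[v] + random_insert(rng)
        + D_SEGMENTS[d] + random_insert(rng)
        + J_SEGMENTS[j]
    )
    return {"v_gene": v, "d_gene": d, "j_gene": j, "sequence": sequence}


rng = random.Random(0)  # fixed seed so the sketch is reproducible
repertoire = [recombine(rng) for _ in range(1000)]
unique_sequences = {cell["sequence"] for cell in repertoire}
germline_combinations = len(V_SEGMENTS) * len(D_SEGMENTS) * len(J_SEGMENTS)
```

Segment choice alone gives only `germline_combinations` (here 8) possible receptors; the random junction nucleotides are what push `unique_sequences` far beyond that number.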

Answer 8

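No, these rely on a reference genome that holds the same gene sequences for everyone > shotgun fragmentation > sequencing and alignment to the reference > for the receptor loci the reference genome only contains separate small blocks of V, (D), J and C segments, not the rearranged receptor with its random junction: alignment will never work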

Answer 9

- Do not chop DNA or RNA (no reference)
- Long-read sequencing
- Alignment within each gene area (gene segments) > for V and J segments (variable and joining; not diversity D, which is too short)
- No reference for the stretch between the end of V and the start of J (junction, CDR3): it has to be identified separately
- Everyone has their own repertoire of TCRs and BCRs that is sequenced > expressed as: V-gene, J-gene and CDR3
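The V-gene / J-gene / CDR3 description can be sketched as a small data structure, with a repertoire summarized by counting identical clonotypes. This is only an illustration: the `Clonotype` type and `clonotype_frequencies` function are hypothetical names, and the CDR3 strings are made up.

```python
from collections import Counter
from typing import NamedTuple


class Clonotype(NamedTuple):
    """One receptor expressed as V-gene, J-gene and CDR3."""
    v_gene: str
    j_gene: str
    cdr3: str


def clonotype_frequencies(cells):
    """Return (clonotype, fraction of cells) pairs, most frequent first."""
    counts = Counter(cells)
    total = sum(counts.values())
    return [(clonotype, n / total) for clonotype, n in counts.most_common()]


# Illustrative input: one Clonotype per sequenced cell (CDR3 strings are made up).
cells = (
    [Clonotype("IGHV3-23", "IGHJ4", "CARDGYSSGWYFDYW")] * 6
    + [Clonotype("IGHV1-69", "IGHJ6", "CARGGYYYGMDVW")] * 3
    + [Clonotype("IGHV3-23", "IGHJ4", "CAKDLGYFDYW")] * 1
)
frequencies = clonotype_frequencies(cells)
dominant_clone, dominant_fraction = frequencies[0]
```

Two cells count as the same clonotype only if all three fields match, so the first and third entries stay separate despite sharing V and J genes.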

Answer 10

- Get antibodies for treatment or research: antibody discovery
> Antibody discovery: immunize mice and harvest all B cells, screen for antigen-specific BCRs, sequence the BCR, express the antibodies and test them for efficacy, e.g. mAbs against a CD marker
> Diagnostics and monitoring: find the dominantly expressed BCR in B-cell lymphoma and use it as a treatment target; when the sequence comes back during monitoring, the tumor has reawakened
> Research: understanding the immune system in health and disease
>> Understand TCR-epitope binding: very powerful for cellular therapies, diagnostics and vaccine design, e.g. designing a specific CAR that recognizes an epitope and kills the tumor, because finding a TCR that works in the patient and expanding it ex vivo is expensive
>> Track antibody formation and maturation: vaccination, autoimmunity, alloimmunization (how to steer the response to get the best antibody response)

Answer 11

CD71+ activated B cell > recent GC graduate (rGCG) > becomes either a switched memory B cell or a long-lived ASC (antibody-secreting cell) > activated B cells are present only a short time after infection > decision point when exiting the GC, expression profiles are very similar > Immunotyping: lineage tracing in mice, BCR as barcode: V-J-CDR3 barcode for every B cell > multimodal / multi-omics scRNAseq with BD Rhapsody > sequence the BCR > phenotype cells with Ab-oligos for specific markers (e.g. CD71) (not plate-based, too many cells) > UMAPs can be made RNA-based or protein-marker-based.

Answer 12

0: Input data
1: Quality control
2: Normalization, HVG selection and scaling
3: Dimensionality reduction 1
4: Clustering
5: Dimensionality reduction 2
6: Define cell identity
6A: Find marker genes
6B: Check expression of defining genes
6C: Algorithm for cell type identification

Answer 13

- Remove outliers: empty wells, dead cells, doublets, dying cells
> Dying cells: fraction of mitochondrial genes (should be <20%): cytoplasmic mRNA leaks out of damaged cells while mitochondrial transcripts are retained, so the mitochondrial fraction gets too high
> Doublets: high gene count
> Empty wells/droplets: low gene count
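The three QC criteria above can be sketched as a simple per-cell filter. Only the 20% mitochondrial cutoff comes from the card; `min_genes` and `max_genes` are assumed placeholder thresholds that would have to be chosen per dataset, and the `qc_label` function and input layout are hypothetical.

```python
def qc_label(cell, min_genes=200, max_genes=6000, max_mito_fraction=0.20):
    """Classify one cell as 'empty', 'dying', 'doublet' or 'keep'.

    cell is a dict with 'n_genes', 'total_counts' and 'mito_counts'.
    min_genes and max_genes are assumed example thresholds.
    """
    if cell["n_genes"] < min_genes:
        return "empty"  # too few genes detected: empty well or droplet
    mito_fraction = cell["mito_counts"] / cell["total_counts"]
    if mito_fraction > max_mito_fraction:
        return "dying"  # mitochondrial fraction too high
    if cell["n_genes"] > max_genes:
        return "doublet"  # suspiciously many genes: likely two cells
    return "keep"


# Made-up example cells, one per outcome.
cells = [
    {"n_genes": 2500, "total_counts": 9000, "mito_counts": 450},
    {"n_genes": 50, "total_counts": 120, "mito_counts": 10},
    {"n_genes": 1800, "total_counts": 5000, "mito_counts": 2000},
    {"n_genes": 9000, "total_counts": 60000, "mito_counts": 3000},
]
labels = [qc_label(cell) for cell in cells]
kept = [cell for cell, label in zip(cells, labels) if label == "keep"]
```

A gene-count cutoff is only a crude proxy for doublets, so this should be read as an outline of the logic rather than a complete QC procedure.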

Answer 14

Pre-processing step performed to reduce experimental and technical confounders in the dataset in order to highlight biological signals
1: Data normalization
2: Selection of highly variable genes (HVGs): important step: select the genes that show the highest variation across cells and are probably related to specific cell behavior or phenotype. Downstream analysis is performed on these genes
3: Data scaling, for visualization
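The three steps can be sketched on a tiny cells-by-genes count matrix. This is a simplified illustration: library-size normalization with a log transform and ranking genes by plain variance are assumptions made here, the function names are hypothetical, and analysis packages typically use more refined HVG statistics.

```python
import math
import statistics


def normalize(counts, target_sum=10_000):
    """Scale every cell to the same total count, then log-transform."""
    normalized = []
    for cell in counts:
        total = sum(cell)
        if total == 0:
            raise ValueError("empty cell: run quality control first")
        normalized.append([math.log1p(c / total * target_sum) for c in cell])
    return normalized


def top_variable_genes(matrix, n_top):
    """Return the indices of the n_top genes with the highest variance."""
    n_genes = len(matrix[0])
    variances = [
        statistics.pvariance([cell[g] for cell in matrix]) for g in range(n_genes)
    ]
    ranked = sorted(range(n_genes), key=lambda g: variances[g], reverse=True)
    return sorted(ranked[:n_top])


def scale(matrix, genes):
    """Z-score the selected genes so each has mean 0 and unit variance."""
    columns = []
    for g in genes:
        values = [cell[g] for cell in matrix]
        mean = statistics.fmean(values)
        sd = statistics.pstdev(values)
        columns.append([(v - mean) / sd if sd > 0 else 0.0 for v in values])
    return [list(row) for row in zip(*columns)]


# Made-up raw counts: 4 cells x 4 genes; gene 1 is on in only two cells.
counts = [
    [10, 0, 5, 100],
    [12, 30, 5, 90],
    [9, 0, 6, 110],
    [11, 25, 4, 95],
]
normalized = normalize(counts)
hvg = top_variable_genes(normalized, n_top=1)
scaled = scale(normalized, hvg)
```

In this toy matrix the gene that is switched on in only half of the cells is the one selected, which is the behavior HVG selection is meant to capture.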

Answer 15

Big data: too many dimensions for easy analysis > Principal Component Analysis (PCA): linear method that finds the components (weighted groups of genes) that together explain most of the variation in the data > Elbow plot: shows how much variance each of the top PCs explains > when adding another PC yields little extra explained variance > stop including > choose a set number of PCs
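The elbow-plot reasoning can be sketched numerically: given the variance captured by each PC, keep PCs until one adds less than some minimum share. The per-PC variances below are made-up numbers and the 5% cutoff is an assumption; in practice the elbow is usually judged by eye from the plot.

```python
def explained_variance_ratio(pc_variances):
    """Fraction of total variance explained by each principal component."""
    total = sum(pc_variances)
    return [v / total for v in pc_variances]


def choose_n_pcs(pc_variances, min_gain=0.05):
    """Keep PCs until the next one explains less than min_gain of the variance."""
    ratios = explained_variance_ratio(pc_variances)
    for n_kept, ratio in enumerate(ratios):
        if ratio < min_gain:
            return n_kept
    return len(ratios)


# Made-up variances per PC, sorted from PC1 downwards (they sum to 100).
pc_variances = [40, 25, 15, 4, 3, 3, 2, 2, 2, 2, 2]
n_pcs = choose_n_pcs(pc_variances)
explained = sum(explained_variance_ratio(pc_variances)[:n_pcs])
```

Here the curve flattens after the third PC, so three PCs are kept and together they cover 80% of the variance in this toy example.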

Answer 16

Comparing every cell with every other cell in scRNAseq to identify differences is time-consuming and computationally demanding > single cells can be of the same cell type > clustering: cells with very similar transcriptional profiles are grouped together and treated as one entity in comparative analysis >> too strict clustering: separation of cells that are the same type >> too loose clustering: grouping of cells that are different types
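The effect of too strict versus too loose clustering can be illustrated with a deliberately simple stand-in: cells closer than a distance threshold are linked, and linked groups form clusters. This is not the graph-based community detection commonly used for scRNAseq; the function name, the 2D coordinates and the thresholds are all assumptions for illustration.

```python
import math


def cluster_by_distance(points, max_distance):
    """Link points within max_distance and label the connected groups."""
    parent = list(range(len(points)))

    def find(i):
        # Follow parent links to the group representative (with path halving).
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if math.dist(points[i], points[j]) <= max_distance:
                parent[find(i)] = find(j)

    # Relabel group representatives as 0, 1, 2, ... in order of appearance.
    label_of_root = {}
    return [label_of_root.setdefault(find(i), len(label_of_root))
            for i in range(len(points))]


# Made-up cells from two cell types, placed in a 2D expression space.
cells = [(0.0, 0.0), (0.5, 0.0), (0.0, 0.5), (5.0, 5.0), (5.5, 5.0), (5.0, 5.5)]

too_strict = cluster_by_distance(cells, max_distance=0.1)
balanced = cluster_by_distance(cells, max_distance=1.0)
too_loose = cluster_by_distance(cells, max_distance=10.0)
```

With the strict threshold every cell ends up alone, with the loose threshold both cell types merge into one cluster, and only the intermediate setting recovers the two groups.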

Answer 17

For visualization purposes > too many dimensions to see > reduce to a two-dimensional picture for a biologically meaningful representation > tSNE or UMAP > tSNE focuses on local data structures > UMAP focuses on both local and global data structures

Answer 18

A: Unsupervised approach: find marker genes > find the marker genes of each cluster, i.e. genes specifically expressed in that cluster whose pattern could define its cells > differential expression analysis of all clusters against each other
B: Supervised approach: check expression of defining genes > when you know which cells are included in the dataset and you know their marker genes > check expression of the marker genes
C: Semi-supervised approach: algorithm for cell type identification > you roughly know which cells might be in the dataset, but have no defining marker genes for the cell types > algorithm that tries to match the total expression profile of all cells with that of specifically defined and sorted immune cell populations from databases
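Approach A can be sketched as a one-versus-rest comparison: for every cluster, a gene counts as a marker when its mean log-expression inside the cluster exceeds its mean in all other cells by some margin. The expression values are made up, the `find_marker_genes` function and the `min_log_fc` cutoff are assumptions, and a real differential expression analysis would add a statistical test.

```python
from statistics import fmean


def find_marker_genes(expression, labels, gene_names, min_log_fc=1.0):
    """Return, per cluster, the genes enriched versus all other cells.

    expression holds log-normalized values (cells x genes).
    Assumes at least two clusters are present.
    """
    markers = {}
    for cluster in sorted(set(labels)):
        inside = [cell for cell, lab in zip(expression, labels) if lab == cluster]
        outside = [cell for cell, lab in zip(expression, labels) if lab != cluster]
        hits = []
        for g, name in enumerate(gene_names):
            diff = fmean([c[g] for c in inside]) - fmean([c[g] for c in outside])
            if diff >= min_log_fc:
                hits.append((name, diff))
        # Strongest enrichment first.
        hits.sort(key=lambda hit: hit[1], reverse=True)
        markers[cluster] = [name for name, _ in hits]
    return markers


# Made-up log-expression for 4 cells; cluster 0 looks T-like, cluster 1 B-like.
gene_names = ["CD3E", "CD19", "ACTB"]
expression = [
    [3.0, 0.1, 4.0],
    [2.8, 0.0, 4.2],
    [0.2, 3.1, 4.1],
    [0.0, 2.9, 3.9],
]
labels = [0, 0, 1, 1]
markers = find_marker_genes(expression, labels, gene_names)
```

The housekeeping gene is expressed equally in both clusters and is therefore not reported as a marker for either, which is the distinction between "expressed" and "specifically expressed" that approach A relies on.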

HC 6 (42 cards)