Abstract 1451

Simultaneously examining multiple epitopes in single cells has become increasingly useful as improvements are made to multi-parametric flow techniques. Increased parameterization has enabled subdivision of functionally distinct cell populations based on an increasing variety of physiological attributes. It has not only helped better define the landscape of “normal” hematopoiesis, but also has been clinically applied in detection of minimal residual disease (MRD) in hematopoietic malignancies. Detection of rare “abnormal” cells is the crux of MRD-based risk stratification where sporadic residual cancer cells are identified through multi-parameter flow cytometry.

Current methods for the detection and characterization of cellular populations are generally reliant on manual examination and targeted gating techniques. This approach relies almost entirely on prior knowledge and leaves little room for discovery of novel populations. As the number of parameter per cell increases there is a rising need for dimensionality reduction (DR) methods to resolve high dimensional data “down” into a human-interpretable space. Classical DR, such as principle component analysis (PCA), fail to address the non-linear relationships in cellular phenotypes while newer approaches, such as spanning-tree progression of density normalized events (SPADE), have an inherent level of stochasticity that might adversely affect the robust separation of cellular phenotypes (i.e. discriminating healthy vs. diseased cells). Here, we present a novel algorithm that identifies and characterizes distinct cell populations, preserving the high dimensional information, but providing an interpretable visualization of their phenotypic relationships. This approach was applied to a cohort of normal human bone marrow (BM) specimens to discern a landscape of normal hematopoietic phenotypes. We then contrasted this to overlays of human leukemic bone marrow aspirates (AML and ALL) to understand the extent to which cancer corrupts the shape and form of the landscape. We illustrate the application for automated MRD detection in human leukemia (Figure 1).

Method:

Our approach, CellSNE, is an adaptation of t-Distributed Stochastic Neighbor Embedding (t-SNE), a non-linear dimensionality reduction algorithm. CellSNE finds a low dimensional mapping of cells that preserves their pairwise distances in a high dimensional space. A distance between each cell to every other cell in the dataset is calculated, based on a vector defined by the combined values of cellular parameters measure. An optimization algorithm then searches for a projection of the points into 2D, in such a way that maximizes the similarity in pairwise distances between the high-dimensional and two dimensional spaces. The resulting 2D projection organizes the sample into subpopulations that conserve the shape and relative distances between each cell.

Results/Conclusion:

Application of CellSNE to healthy BM clearly separated cells based on their known immune subtype and was confirmed by manual analysis (Figure 1A). The results are robust across data collected from different individuals on different days as well as in analyses conducted using low numbers of single cell parameters, suggesting that healthy BM generally maintains the same cellular population characteristics (or “shape”) across samples. When applied to leukemic BM from patients with AML and ALL CellSNE demonstrates a unique cancer landscape (“shape”) for each patient that is dramatically different from normal (Figure 1B).

It is critical to note that despite the overwhelming infiltration by cancer cells, rare “normal” cell populations can still be discerned in the ALL BM. CellSNE succeeded in automatically identifying rare (<1%) abnormal ALL cells (tracked using a CellSNE independent parameter) in an otherwise normal BM (Figure 1C). As such, CellSNE achieves in identifying and characterizing rare cellular populations that can be applied in both normal and malignant hematopoiesis. Thus, it provides opportunities for the automated analysis of both large cytometry datasets and clinical MRD detection.

Disclosures:

No relevant conflicts of interest to declare.

This icon denotes a clinically relevant abstract

Author notes

*

Asterisk with author names denotes non-ASH members.

Sign in via your Institution