Using gene expression profiling, we and others identified a novel subgroup of B-precursor acute lymphoblastic leukemia (B-ALL) with a gene expression signature similar to Philadelphia (Ph) chromosome (BCR-ABL1)-positive ALL. Termed “Ph-like” or “BCR-ABL1-like” ALL, this subgroup constitutes 10-15% of pediatric and 25% of adolescent/young adult ALL cases and is associated with a very poor clinical outcome. Using next generation sequencing, we have shown that Ph-like ALL is characterized by a highly heterogeneous spectrum of activating mutations or gene fusions targeting genes regulating cytokine receptor and tyrosine kinase signaling (JAK1/2, ABL1/2, PDGFRB, EPOR, CSF1R, AKT2, STAT5B, CRLF2, IL7R, SH2B3). As Ph-like ALLs may be sensitive to tyrosine kinase inhibitors (TKIs) in vivo, incorporating TKIs into therapy may significantly improve clinical outcomes. Here we report the development and validation of a robust gene expression classifier that can prospectively identify Ph-like ALL patients for therapeutic intervention.
Supervised learning methods were applied to gene expression profiles (Affymetrix U133_Plus_2.0; RMA normalized) generated from pre-treatment leukemic samples from 811 B-ALL patients accrued to COG High-Risk ALL Trials P9906 and AALL0232. Patients were partitioned into a training (P9906: n=207; AALL0232: n=278) and an independent test set (AALL0232: n=325). Next generation sequencing was used to identify Ph-like ALL-associated genomic lesions in these cohorts. The 54,675 Affymetrix probe sets were evaluated using Prediction Analysis of Microarrays (PAM), applying the method of nearest shrunken centroids to identify those probe sets best distinguishing Ph-like ALL. These probe sets were then distilled by 100 iterations of 10-fold cross-validation using three optimization criteria (overall error, average error, and ROC accuracy), leading to the identification of the 64 most predictive probe sets (derived from 38 unique genes). Quantitative RT-PCR assays were developed for each of the 38 genes by selecting optimized primer/probe sets and assays were run on 384-well Low Density Microarray (LDA) cards; 780/811 cases had residual material for LDA testing. LDA data were remodeled in the training set using double loop cross validation, resulting in a best and final predictive model and statistical algorithm containing 15 of the 38 genes (IGJ, SPATS2L, MUC4, CRLF2, CA6, NRXN3, BMPR1B, GPR110, CHN2, SEMA6A, PON2, SLC2A5, S100Z, TP53INP1, IFITM1). The sensitivity and specificity of the predictor was then evaluated in the independent test set.
The 15 gene LDA classifier was able to predict Ph or Ph-like ALL in the test set with a high degree of sensitivity (93.0%) and specificity (89.7%) and identified the heterogeneous genomic lesions associated with Ph-like ALL with very high frequency (Table 1). When compared to non-Ph-like ALL, Ph-like cases had a significantly poorer event-free survival (HR 3.58; p<.0001) (Fig. 1, left). A second predictive classifier modeled on the same training/test sets but with true BCR-ABL1 cases excluded yielded a virtually identical performance (97.2% sensitivity, 87.1% sensitivity; HR: 2.9; p<.0001). Strikingly, Ph-like ALL cases with IKZF1 deletions had a significantly worse outcome when compared to ALL cases with IKZF1 deletions alone, emphasizing the clinical importance of the Ph-like signature (Fig. 1, right). Concordance between the LDA predictor and our previously reported PAM method (NEJM 360:470, 2009) was 87.2%, with the largest difference being additional CRLF2 lesions identified by LDA.
We have developed and validated a highly robust gene expression classifier for the prospective identification of Ph-like ALL. Rapidly screened patients will then undergo targeted sequencing to confirm the presence of specific genomic lesions. This approach will facilitate the therapeutic targeting of Ph-like ALL patients to novel clinical trials, hopefully leading to improved outcomes.
No relevant conflicts of interest to declare.