Abstract
We examined the efficacy of reduced-intensity conditioning (RIC) and compared outcomes of 93 patients older than 16 years after RIC with 1428 patients receiving full-intensity conditioning for allografts using sibling and unrelated donors for Philadelphia-negative acute lymphoblastic leukemia (ALL) in first or second complete remission. RIC conditioning included busulfan 9 mg/kg or less (27), melphalan 150 mg/m2 or less (23), low-dose total body irradiation (TBI; 36), and others (7). The RIC group was older (median 45 vs 28 years, P < .001) and more received peripheral blood grafts (73% vs 43%, P < .001) but had similar other prognostic factors. The RIC versus full-intensity conditioning groups had slightly, but not significantly, less acute grade II-IV graft-versus-host disease (39% vs 46%) and chronic graft-versus-host disease (34% vs 42%), yet similar transplantation-related mortality. RIC led to slightly more relapse (35% vs 26%, P = .08) yet similar age-adjusted survival (38% vs 43%, P = .39). Multivariate analysis showed that conditioning intensity did not affect transplantation-related mortality (P = .92) or relapse risk (P = .14). Multivariate analysis demonstrated significantly improved overall survival with: Karnofsky performance status more than 80, first complete remission, lower white blood count, well-matched unrelated or sibling donors, transplantation since 2001, age younger than 30 years, and conditioning with TBI, but no independent impact of conditioning intensity. RIC merits further investigation in prospective trials of adult ALL.
Introduction
Allogeneic hematopoietic stem cell transplantation (HSCT) has an established role in the management of adults with acute lymphoblastic leukemia (ALL) in first or second complete remission (CR1 and CR2). Recent large prospective donor versus no donor analyses showed that matched sibling allografting yields superior outcomes compared with chemotherapy,1 and its use is recommended in the highest-risk, Philadelphia chromosome–positive (Ph+) cohort.1 A recent report described encouraging results with allografting from unrelated donors (URDs) for patients with Ph-negative (Ph−) ALL in CR1 at high risk of relapse.2 Although only a minority of adult patients achieve a second remission and remain fit for a transplantation,3 allogeneic HSCT is still the treatment of choice for adults with ALL in CR2 where sibling donor allografts yield disease-free survival (DFS) of 25% to 40%.4 Most conditioning regimens involve the use of total body irradiation (TBI) with some evidence favoring doses exceeding 13 Gy.5,6 Reports of non–TBI-containing regimens for ALL are limited.
Allogeneic HSCT may be the most effective antileukemic therapy in adult ALL, but results are compromised by high transplantation-related mortality (TRM), approaching 35% in patients older than 35 years.1 In an attempt to avoid the high TRM seen with full-intensity (FI) HSCT and to exploit the graft-versus- ALL effect, investigators have been exploring the role of reduced- intensity conditioning (RIC) regimens. Results have been difficult to evaluate because of regimen heterogeneity and small numbers.
Various conditioning regimens have been used, including very low-intensity (eg, fludarabine and TBI 2 Gy) or more intense regimens, such as fludarabine and melphalan.7 Most regimens used in the United States have been T-cell replete, whereas a significant proportion of RIC transplantations performed in Europe have used in vivo T-cell depletion (TCD) with either alemtuzumab8 or antithymocyte globulin (ATG).9 Mohty et al10 from the European Group for Blood and Marrow Transplantation reported outcomes in 97 adults with ALL and the 42% 2-year leukemia-free survival seen in CR1 patients was encouraging, but only 53 of the 97 patients had early-phase disease. Interestingly, the presence of chronic graft-versus-host disease (GVHD) was associated with superior survival (P = .01). The outcome of patients not in remission was dismal. Heterogeneity in the definitions of RIC limits interpretation of some data from this report.
Other groups are prospectively investigating the efficacy of RIC HSCT in adult ALL.11 However, these transplants are being performed in the absence of any large-scale data about outcome and with insufficient knowledge of the prognostic factors that determine that outcome and thus influence the decision to transplantation. There are also limited data to help determine the choice of conditioning regimen and in particular whether TBI should be used. The use of in vivo TCD and the effect of acute and chronic GVHD on relapse and overall outcome also require examination. In this retrospective study using the large multicenter observational database of the Center for International Blood and Marrow Transplant Research (CIBMTR), we examined the outcome of RIC and FI conditioning to address these issues.
Methods
Data sources
The CIBMTR is a research affiliation of the International Bone Marrow Transplant Registry, Autologous Blood and Marrow Transplant Registry, and the National Marrow Donor Program established in 2004 that received data from more than 450 transplantation centers worldwide on consecutive allogeneic and autologous hematopoietic HSCT to a Statistical Center at the Medical College of Wisconsin in Milwaukee and the National Marrow Donor Program Coordinating Center in Minneapolis with computerized checks for discrepancies, physicians' review of submitted data, and on-site audits of participating centers ensure data quality. The CIBMTR collects both Transplant Essential Data and Comprehensive Report Form data at pretransplantation, 100 days and 6 months after transplantation, and annually thereafter.
Observational studies conducted by the CIBMTR are performed with approval of the Institutional Review Boards of the National Marrow Donor Program and the Medical College of Wisconsin.
Patient population
Eligible patients were 16 years of age and older had Ph-ALL in CR1 or CR2 and received a matched sibling or URD HSCT using RIC (n = 93) between 1995 and 2006. They were compared with the 1428 patients who received a myeloablative, FI regimen for ALL. It was anticipated that there would be significant differences between the 2 study populations; thus, adjusted multivariate regression was planned.
The following subgroups were excluded: L3 ALL, HTLV1-associated T-cell, aggressive NK-cell, and large granular lymphocytic leukemia; and also patients with missing or inadequate conditioning regimen or follow-up information. A total of 1428 FI and 93 RIC cases met the on-study criteria from 219 reporting centers in 36 different countries. For 27 patients, conditioning information was missing, and in 6 it was inadequate; these patients were excluded. Median follow-up of survivors was 54 months (range, 3-166 months) for the FI group and 38 months (range, 3-93 months) for the RIC group.
Conditioning regimens and definitions
The CIBMTR definition of RIC has been published12 and included: busulfan 9 mg/kg or less (n = 27), melphalan 150 mg/m2 or less (n = 23), TBI less than 500 cGy single dose or TBI less than 800 cGy fractionated (n = 18), and fludarabine plus TBI dose = 200 cGy (n = 12) and other (n = 13, 6 containing TBI). Most patients in the FI arm received TBI (n = 1252), usually in combination with cyclophosphamide (n = 1037). Smaller numbers had high-dose busulfan (n = 175), usually in combination with cyclophosphamide (n = 142).
Endpoints
Primary endpoints were TRM, morphologic leukemia relapse (hematologic and/or extramedullary), acute and chronic GVHD, DFS, and overall survival (OS). TRM was defined as death during continuous complete remission after transplantation. Relapse was defined as clinical or hematologic leukemia recurrence. For analyses of DFS, failures were clinical or hematologic relapses or deaths from any cause; patients alive and in complete remission were censored at time of last follow-up. For analyses of OS, failure was death from any cause; surviving patients were censored at the date of last contact.
Statistical analysis
Patient-, disease-, and transplantation-related variables among the conditioning regimen groups (RIC vs FI) were compared using the χ2 statistic for categorical variables and the Kruskal-Wallis test for continuous variables. Univariate probabilities of DFS and OS were calculated using the Kaplan-Meier estimator; the log-rank test was used for univariate comparisons. Probabilities of TRM and relapse were calculated using cumulative incidence curves to accommodate competing risks.13 Assessment of potential risk factors for outcomes of interest were evaluated in multivariate analyses using Cox proportional hazards14 or competing hazards regression.15 The variables considered in the multivariate analyses were age at transplantation (< 30 years vs 30-39 years vs 40-49 years vs ≥ 50 years), sex, Karnofsky performance score (KPS; < 80% vs ≥ 80%), lineage (T-cell vs B-cell vs unclassified/other), white blood cells (WBCs, < 25 × 109/L vs 25-100 × 109/L vs > 100 × 109/L), extramedullary disease before HSCT, cytogenetic abnormalities (t(4,11)/t(8:14)/hypodiploid/complex cytogenetics, > 5 abnormalities/others vs no abnormalities),16 CR1 versus CR2, CR1 duration (< 12 months vs ≥ 12 months) for CR2 patients, time from diagnosis to HSCT (< 6 months vs ≥ 6 months) for CR1 patients, TBI in conditioning regimen, type of donor (HLA-identical sibling vs matched URD vs partially matched URD vs mismatched or unknown matching URD), donor-recipient sex match (female-male vs others), donor-recipient cytomegalovirus status (−/− vs others), bone marrow versus granulocyte colony-stimulating factor–mobilized peripheral blood stem cell, GVHD prophylaxis (TCD/ATG or alemtuzumab vs tacrolimus or cyclosporin A-based ± methotrexate ± others vs other), and year of transplantation (1995-2001 vs 2002-2007). The proportional hazard assumption was first tested by adding a time-dependent covariate for each factor. Factors violating the proportionality assumption were adjusted by stratification. A stepwise model selection approach was used to identify all significant risk factors. Each step of model building contained the main effect for conditioning regimen group. Factors that were significant at a 5% level were kept in the final model. The potential interactions between the main effect and all other significant risk factors were tested. Adjusted probabilities of DFS and OS were generated from the final Cox models. All computations were made using the proportional hazards regression procedure in the statistical package of SAS, Version 9. All P values are 2-sided.
Results
Patients and clinical characteristics
The demographic, clinical, and biologic characteristics of 93 RIC and 1428 FI patients who received allogeneic HSCT for ALL are shown in Table 1. The RIC group was older (median, 45 vs 28 years, P < .001). Forty-three percent of the RIC group were 50 years of age and older compared with 7% of the FI group (P < .001). Other factors probably influenced the choice of RIC. Of the 38 RIC patients younger than 40 years, 7 had a KPS less than 80, one had a creatinine more than 2 mg/dL, 10 had a history of fungal disease, 4 had a significant cardiac history, and 13 had pulmonary dysfunction. In total, 23 of 38 patients had at least one of these adverse clinical factors.
More RIC patients received peripheral blood grafts (73% vs 43%, P < .001), and more RIC were performed in the more recent time period (73% vs 51%, P < .001). There were no significant differences in other characteristics, including performance status, WBC at diagnosis, poor prognosis cytogenetics, remission status, or percentage of patients with a sibling or well-matched URD.
Hematopoietic recovery and GVHD
Hematopoietic (neutrophil) recovery at day 100 occurred in 98% of the RIC arm and 96% of the FI arm (Table 2, P = .18). There was slightly, but not significantly, more grade II-IV acute GVHD in the FI group (46%, 95% confidence interval [CI], 43%-49% vs 39%, 95% CI, 29%-49%, Table 2, P = .16). Similarly, somewhat more frequent chronic GVHD at 3 years occurred in the FI group (42%, 95% CI, 39%-44% vs 34%, 95% CI, 24%-44%), but this was not statistically significant (Table 2, P = .16). TCD or in vivo T-depleting agents (ATG or alemtuzumab) did not affect engraftment (data not shown) but were associated with a somewhat lower incidence of grade II-IV acute GVHD (FI with TCD 42%, 95% CI, 37%-48%, RIC with TCD 30%, 95% CI, 12%-51%, P = .26).
Relapse
The RIC conditioning group did not experience significantly more relapse at 3 years (35% vs 26%, P = .08; Table 2; Figure 1), but comparisons between the 2 groups are difficult. Factors associated with higher relapse rates included transplantation in CR2, particularly with a short (< 12 months) CR1 (short CR1, relapsed risk [RR] of relapse = 2.75; long CR1 > 12 months, RR = 1.51, P < .001; Table 3). The incidence of relapse was lower if there was grade II-IV acute GVHD (RR = 0.54, P = .023), but not with chronic GVHD (RR = 0.80, P = .08). Relapse during CR1 was 21% (95% CI, 18%-24%) for FI and 39% (95% CI, 26%-53%) for RIC (P = .013) compared with CR2 (FI 31%, 95% CI, 28%-35% vs RIC 30%, 95% CI, 17%-46%, P = not significant). TCD did not affect relapse incidence (data not shown).
TRM and causes of death
Conditioning regimen intensity had no significant impact on TRM with a similar incidence in the 2 groups (TRM at 3 years, FI 33%, 95% CI, 31%-36% vs RIC 32%, 95% CI, 23%-43%, P = .86; Table 4; Figure 2). However, there may have been other factors that contribute to TRM that differed between the 2 groups. Poor performance status (KPS), transplantation in CR2, and the use of TCD were significantly associated with a higher TRM on multivariate analysis, yet none had a significant interaction with conditioning intensity. Increasing age was associated with a higher TRM, yet in patients older than 40 years, TRM at 3 years was similar in the RIC and FI arms (36% vs 38%, P = not significant). Similarly, donor selection was critical with both partially matched and mismatched URD having higher TRM on multivariate analysis (RR = 1.43 and 2.47, respectively) but did not differ in RIC or FI HSCT. Compared with cyclosporin A or tacrolimus-based GVHD prophylaxis, TCD was associated with a higher 3-year TRM (RR = 1.45, P = .001) (FI 44%, 95% CI, 38%-50% vs RIC 35% 95% CI, 15%-59%; Table 3). Thirteen patients with KPS less than 80 were treated with RIC, and 6 died of TRM (48%, 95% CI, 22%-75%) at 3 years.
Causes of death are shown in Table 5. Relapse was the most common cause of death and was slightly more common in the RIC group (46% vs 34%, P = not significant). Infection was the next most common cause, followed by GVHD (5% in the RIC group vs 11% in the FI patients).
Survival
Despite the older age in the RIC group, OS and LFS at 3 years were similar to those receiving FI HSCT (Table 2; Figures 3–4; supplemental Figure 1; available on the Blood Web site; see the Supplemental Materials link at the top of the online article). However, the differences between the 2 conditioning groups again suggest caution in interpreting the direct comparisons. Age younger than 30 years was associated with significantly better survival, and the RR of death increased with each additional decade. Other factors associated with improved survival were a higher KPS, WBC less than 25 000/μL at diagnosis, the use of TBI in the conditioning regimen, and HSCT using a sibling or well-matched URD. Of the 13 patients treated with RIC who had a KPS less than 80, only 1 survives leukemia-free. No factors that significantly influenced survival showed a significant interaction with conditioning intensity.
Survival for HSCT in CR1 at 3 years was similar in both cohorts (FI 51%, 95% CI, 48%-55% vs RIC 45%, 95% CI, 31%-59%) and in CR2 (FI 33%, 95% CI, 30%-37% vs RIC 28%, 95% CI, 14%-44%). In addition, DFS was similar for FI vs RIC conditioning in both CR1 (FI 49%, 95% CI, 45%-53% vs RIC 36%, 95% CI, 23%-51%) and in CR2 (FI 32%, 95% CI, 29%-36% vs RIC 27, 95% CI, 14%-43%), although smaller numbers in the RIC group limit the power of the comparisons (supplemental Figure 2).
Discussion
The results of therapy for adult ALL are globally unsatisfactory. Allografting has an important and growing role in its management, but the outcomes of this treatment modality still need improvement, particularly for older patients where FI regimens lead to excess TRM and unsatisfactory survival.1 Advances will probably come from better patient and donor selection, but also from exploring safer conditioning protocols. The main finding of our study was that the RIC group experienced survival similar to the FI group despite a substantially older median age. RIC transplantations can be performed in patients with high-risk ALL with reasonable TRM and survival. Reducing the intensity of conditioning was not associated with a significant increase in relapse, although a larger study would be required to demonstrate any small differences in relapse rates. It is possible that pretransplantation chemotherapy intensity or minimal residual disease differed between the 2 groups, but these data were not available for analysis. Importantly, the time to transplantation was similar, suggesting that the number of induction and consolidation cycles administered before HSCT were similar as well.
Although originally anticipated to reduce toxicity, as seen in many other studies,7 TRM for the older RIC group was comparable with that seen in the younger FI cohort. The Medical Research Council/Eastern Cooperative Oncology Group study reported 35% 2-year TRM for FI sibling transplantations in patients older than 35 years. The 32% 3-year TRM we observed in sibling and well-matched URD RIC allograft patients warrants further study. Comorbidities (assessed in this study only as lower KPS) and older age both had significant adverse effects on TRM. TRM was higher in patients with less well-matched URD; and as reported from Minnesota,11,17 the use of cord blood donors after either RI or FIC deserves investigation in patients with ALL. The use of ATG, albeit in limited numbers (n = 37) of patients, appeared to increase TRM and remains unexplained. It is notable that a multicenter United Kingdom study of alemtuzumab-based conditioning in myeloablative transplantations reported excellent survival and very low TRM.18
ALL is the last major leukemia in which RIC allografting has been investigated. RIC relies on an immune graft versus tumor effect, and some have doubted the clinical potency of that effect in ALL. Although there is a clear and strong graft-versus-leukemia (GVL) effect with some studies showing a 2.5-fold reduction in relapse in patients with GVHD,19-21 it is also postulated that RIC may not achieve the minimal residual disease state where this GVL effect could be operative. Although we found only modestly reduced relapse in patients with acute GVHD, this study does provide further evidence of a GVL effect because, even with less intense conditioning, low relapse rates and promising DFS were observed. However, relapse remains frequent, and investigation of posttransplantation maintenance strategies may be of value.
Heterogeneity in patients confounds the interpretation of this and other series describing RIC allografting in adult ALL. In this analysis, we did not detect a beneficial effect of chronic GVHD on survival, even though approximately one-third of patients in each cohort developed chronic GVHD. More patients in the European Group for Blood and Marrow Transplantation series had received ATG (39% vs 20%).10
Martino et al22 reported 27 high-risk adult ALL patients (44% alternative donor, 85% beyond CR1, 44% chemorefractory, 41% Ph+) who had a 31% 2-year survival. Massenkeil et al9 from Berlin reported 9 patients with 44% survival after fludarabine, busulphan, and ATG conditioning. Hamaki et al23 from Tokyo reported 33 patients (20 siblings, 13 alternative donors) given fludarabine/busulphan and ATG conditioning (19 in remission and 14 refractory) with a 30% 1-year relapse-free survival. Stein et al7 from City of Hope reported 22 patients given fludarabine and melphalan conditioning using either sibling or URD, and this resulted in excellent survival at 1 year after transplantation. These reports suggest that more than 30% of patients survive through the highest risk period and that TRM ranges from 4% to 27%.
This study has some limitations that should influence data interpretation. The number of patients treated with RIC allografts in CR1 and CR2 (n = 93), although the largest reported, is still relatively small, and it should be noted that they were transplanted at 57 centers using various conditioning regimens.24 More of the RIC group received a transplantation after 2002 (73% vs 51%), and more received peripheral blood stem cell grafts. Given the number of patients in the RIC arm, the study was not powered sufficiently to detect a 10% difference in relapse (35% vs 25%). There was also an undoubted selection bias in ALL patients chosen for either FI or RIC HSCT, and the reasons for selected RIC are not always apparent. Further research is required to determine the preferred conditioning regimens and, in particular, whether the use of TBI, even at low dose, improves outcomes.
These data suggest that RIC conditioning is worthy of further investigation in prospective clinical trials of adult ALL. These trials should include both well-matched URD and related donors and should be confined to patients in remission. Caution should be exercised in the most difficult clinical situations (patients with a poor performance status and those with less well-matched donors); thus, careful consideration of other available approaches is required. Adult Ph− ALL is a rare disease. Multicenter and probably multinational collaboration is required to define further the role of RIC allografting in this disease.
The online version of this article contains a data supplement.
The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.
Acknowledgments
The authors thank Drs Mitchell Cairo, Steven C. Goldstein, Luis Isola, Michael Lill, Mark Litzow, Richard T. Maziarz, Alan Miller, Bipin Savani, Gary Schiller, Tsiporah B. Shore, Brian Bolwell, Joachin Deeg, Alison W. Loren, Gustavo A. Milone, Harry C. Schouten, Robert Soiffer, Michael Bishop, Roger Herzig, Osman Illhan, Dipnarine Maharaj, and Reinhold Munker for helpful comments and insights.
The CIBMTR is supported by the National Cancer Institute, the National Heart, Lung and Blood Institute, and the National Institute of Allergy and Infectious Diseases (Public Health Service Grant/Cooperative Agreement U24-CA76518); the National Heart, Lung and Blood Institute and National Cancer Institute (Grant/Cooperative Agreement 5U01HL069294); the Health Resources and Services Administration (contract HHSH234200637015C); the Office of Naval Research (grants N00014-06-1-0704 and N00014-08-1-0058); and AABB; Aetna; American Society for Blood and Marrow Transplantation; Amgen Inc; anonymous donation to the Medical College of Wisconsin; Astellas Pharma US Inc; Baxter International Inc; Bayer HealthCare Pharmaceuticals; Be the Match Foundation; Biogen IDEC; BioMarin Pharmaceutical Inc; Biovitrum AB; BloodCenter of Wisconsin; Blue Cross and Blue Shield Association; Bone Marrow Foundation; Canadian Blood and Marrow Transplant Group; CaridianBCT; Celgene Corporation; CellGenix GmbH; Centers for Disease Control and Prevention; Children's Leukemia Research Association; ClinImmune Labs; CTI Clinical Trial and Consulting Services; Cubist Pharmaceuticals; Cylex Inc; CytoTherm; DOR BioPharma Inc; Dynal Biotech, an Invitrogen Company; Eisai Inc; Enzon Pharmaceuticals Inc; European Group for Blood and Marrow Transplantation; Gamida Cell Ltd; GE Healthcare; Genentech Inc; Genzyme Corporation; Histogenetics Inc; HKS Medical Information Systems; Hospira Inc; Infectious Diseases Society of America; Kiadis Pharma; Kirin Brewery Co Ltd; Leukemia & Lymphoma Society; Merck & Company; Medical College of Wisconsin; MGI Pharma Inc; Michigan Community Blood Centers; Millennium Pharmaceuticals Inc; Miller Pharmacal Group; Milliman USA Inc; Miltenyi Biotec Inc; National Marrow Donor Program; Nature Publishing Group; New York Blood Center; Novartis Oncology; Oncology Nursing Society; Osiris Therapeutics Inc; Otsuka America Pharmaceutical Inc; Pall Life Sciences; Pfizer Inc; Saladax Biomedical Inc; Schering Corporation; Society for Healthcare Epidemiology of America; StemCyte Inc; StemSoft Software Inc; Sysmex America Inc; Teva Pharmaceutical Industries; THERAKOS Inc; Thermogenesis Corporation; Vidacare Corporation; Vion Pharmaceuticals Inc; ViraCor Laboratories; ViroPharma Inc; and Wellpoint Inc.
The views expressed in this article do not reflect the official policy or position of the National Institutes of Health, the Department of the Navy, the Department of Defense, or any other agency of the US government.
National Institutes of Health
Authorship
Contribution: D.I.M. and D.J.W. designed the study and prepared the manuscript; T.W. and W.S.P. performed the statistical analysis; and J.H.A., E.C., R.P.G., B.G., V.G., J.H., H.J.K., T.R.K., H.M.L., V.A.L., P.M., D.A.R., M.S., J.S., and M.S.T. participated in interpretation of data and approval of the final manuscript.
Conflict-of-interest disclosure: The authors declare no competing financial interests.
Correspondence: Daniel J. Weisdorf, University of Minnesota Medical Center, 420 Delaware Street SE, MMC 480, Minneapolis, MN 55455; e-mail: weisd001@umn.edu.