Key Points
The microbe-derived SCFAs butyrate and propionate in the systemic circulation are associated with protection from cGVHD.
cGVHD is associated with gastrointestinal dysbiosis late after HCT.
Abstract
Studies of the relationship between the gastrointestinal microbiota and outcomes in allogeneic hematopoietic stem cell transplantation (allo-HCT) have thus far largely focused on early complications, predominantly infection and acute graft-versus-host disease (GVHD). We examined the potential relationship of the microbiome with chronic GVHD (cGVHD) by analyzing stool and plasma samples collected late after allo-HCT using a case-control study design. We found lower circulating concentrations of the microbe-derived short-chain fatty acids (SCFAs) propionate and butyrate in day 100 plasma samples from patients who developed cGVHD, compared with those who remained free of this complication, in the initial case-control cohort of transplant patients and in a further cross-sectional cohort from an independent transplant center. An additional cross-sectional patient cohort from a third transplant center was analyzed; however, serum (rather than plasma) was available, and the differences in SCFAs observed in the plasma samples were not recapitulated. In sum, our findings from the primary case-control cohort and 1 of 2 cross-sectional cohorts explored suggest that the gastrointestinal microbiome may exert immunomodulatory effects in allo-HCT patients at least in part due to control of systemic concentrations of microbe-derived SCFAs.
Introduction
Chronic graft-versus-host disease (cGVHD) affects 40% to 50% of long-term survivors of allogeneic hematopoietic stem cell transplantation (allo-HCT) and is the leading cause of nonrelapse mortality among patients who survive to 2 years.1 In contrast to acute GVHD (aGVHD), characterized by apoptotic tissue damage, particularly in the intestine, cGVHD is a fibrotic condition predominantly involving the skin and mouth that shares features with the autoimmune condition systemic sclerosis.2 We and others have demonstrated relationships between the microbiota and overall survival, aGVHD mortality, infection, organ toxicity, engraftment, and relapse3-5 in the allo-HCT patient population, but cGVHD remains unexamined.
Preclinical models have demonstrated that T follicular helper and T helper 17 cells are critical for cGVHD development, partly due to production of profibrotic cytokines which have been linked to gastrointestinal (GI) microbiota composition.6-8 Mouse studies also suggest that the GI microbiome modulates alloreactivity in aGVHD, and that these effects are dependent upon microbe-derived metabolites.5,9,10 In preclinical models, the microbe-derived short-chain fatty acid (SCFA) butyrate has been shown to exert local effects on intestinal epithelia and immune populations in the GI tract,8,11,12 and clinical studies have associated colonization with butyrate producers with risk of viral infection.13,14 We have previously demonstrated that allo-HCT is accompanied by a decrease in microbial diversity within the GI tract, particularly loss of commensal anaerobes.3,15,16 We hypothesized that microbiota perturbations may be associated with the development of cGVHD and investigated this using a case-control study design using patient stool and plasma samples collected at our center.
Study design
Patient selection
We used the cohort of adult transplant recipients at Memorial Sloan Kettering Cancer Center (MSKCC) who had ≥1 stool sample between day −30 and day +365 (8164 samples, 1081 unique patients). Patients provided written consent to an institutional review board–approved biospecimen-collection protocol. cGVHD cases were identified using National Institutes of Health criteria.17 Diagnoses and onset dates were confirmed by chart review. Controls were randomly selected from the overall cohort on the basis of matched graft type (ie, CD34-selected/T-cell–depleted or unmodified graft) in a 1:3 ratio. Three patients in the cGVHD case group received fecal microbiota transplantation, as did 4 controls as part of a randomized clinical trial.18 Healthy volunteers (plasma samples included in Figure 2A) provided written informed consent prior to the collection of samples for inclusion in this study.
For the analysis of the Duke University Medical Center (Duke) and University Hospital Regensburg (Regensburg) cohorts, a cross-sectional design was applied, whereby adult allo-HCT recipients were eligible for inclusion on the basis of the availability of paired stool and plasma (Duke) or serum (Regensburg) samples in the time window of interest (days 70 to 130), and without specifying clinical variables or cGVHD outcomes. Patients provided consent in a fashion consistent with each institution’s institutional review board requirements.
Stool sequencing
16S ribosomal-RNA gene sequencing was performed and analyzed as previously described.3,4,16 Shotgun sequencing was analyzed by HUMAnN2,19 and the association between pathway abundances and cGVHD was analyzed using linear discriminant analysis of effect size.20 Stool samples from Duke and Regensburg were aliquoted and frozen locally; DNA extraction and sequencing were performed at MSKCC. Bacterial DNA extraction was performed as previously described for the MSKCC samples15 and with a modified protocol that extracts both bacterial and fungal DNA from stool.21 Duke had 37 unique patients, with 27 evaluable, and Regensburg had 89 unique patients, with 44 stool samples evaluable; missing data were due to sample extraction or sequencing failure.
Plasma metabolomic analysis
Plasma samples were processed as previously described22 and analyzed using quantitative gas chromatography time-of-flight (Agilent 7890B GC System, Agilent 7200A MS Detector). Data analysis was performed using MassHunter Profinder software (version B.08.00, Agilent Technologies).
Statistical analysis
P values for α-diversity were calculated using a Wilcoxon test using R (version 3.5.0). PERMANOVA testing for β-diversity was performed using the R package vegan (using the adonis function).23 P values for the plasma SCFA analysis were calculated using analysis of variance followed by Mann-Whitney U testing using GraphPad Prism (Figure 2A). Fisher’s exact test, where reported, was also performed using GraphPad Prism. For Figure 2, the log10 abundances of the most abundant microbial genera, as well as genera that have been associated with immune-system dynamics in cancer patients (Faecalibacterium, Clostridium, Rothia, and Staphylococcus),24-26 were analyzed with a Bayesian logistic regression using Python and the pymc3 module,27 further described in supplemental Methods (available on the Blood Web site). P values for accuracy and recall of the trained model were calculated by drawing 100 000 sets of n = 45 binomial samples with P = .2, as per the case-control design and each time calculating accuracy and recall of this null model (for the other centers, the binomial parameter p was set to the fraction of observed cases across all patients per center data set).
Results and discussion
Allo-HCT is accompanied by a loss of fecal microbiota α-diversity.28 Since samples in our previous studies were almost exclusively collected from inpatients, the data from later time points were biased toward patients who either remained admitted for prolonged periods after HCT or were readmitted, perhaps reflecting a more complicated transplant course. Here, we describe a cohort of patients who were sampled both during their inpatient stay and, in some cases, following discharge (n = 1079 patients, 8164 samples; 522 of these samples [6.4%] were collected while the patients were out of the hospital; supplemental Table 1) and demonstrate that loss of α-diversity can be observed for up to 12 months (Figure 1A). We speculate that hospitalization, dietary changes, and exposure to antibiotics and other medications establish lasting changes in microbial communities in the GI tract.29,30
To examine associations between microbiota composition and cGVHD, we identified 54 cGVHD patients from MSKCC with banked stool samples (collected between day −30 and day 365 relative to HCT). The majority of these patients had either classic cGVHD (n = 23; 42.5%) or an overlap syndrome (n = 23; 42.5%), with the minority developing a late-onset aGVHD syndrome (n = 8; 14.8%). The overlap subtype of cGVHD is characterized by the simultaneous presence of the clinical features of aGVHD and cGVHD and has been associated with a poorer overall prognosis than classic cGVHD.31 Most cGVHD cases (29) were mild (53.7%), 9 cases were moderate (16.6%), and 8 were severe (14.8%); the 8 late-acute patients did not have severity recorded. The median time to onset of cGVHD was 184 days (range, 55-451 days), consistent with the reported median onset in the literature of 7.4 months (range, 0.8-45.1 months).32 To draw comparisons between patients who developed cGVHD and those who did not, we employed a case-control study design and randomly selected 171 controls (∼3 controls per case) with stool samples from our institutional database by matching for graft type (CD34-selected or unmodified bone marrow or peripheral blood stem cell grafts). The rate of preceding aGVHD was similar in the 2 groups (45.1% vs 55.1%; P = .36; subtype included in supplemental Table 2). Early exposure to broad-spectrum antibiotics was similar between groups (supplemental Table 3). We observed similar α-diversity patterns over time in the selected cGVHD patients and controls (Figure 1B; supplemental Table 4).
We next analyzed the microbiota composition of samples collected in the pretransplant, periengraftment, and peri-day 100 windows. Prior to allo-HCT, samples from both cases and controls were comparably diverse, and bacteria of the phylum Firmicutes dominated the community compositions (Figure 1C). Consistent with our previous data,18,29 we observed many dysbiotic microbial communities in the periengraftment period, as exemplified by expansion of the Enterococcus genus (shown in green), but these patterns were similar between cases and controls (monodomination incidences: 8/42 cases [19.0%], 38/144 controls [26.4%]; difference was nonsignificant using Fisher’s exact test [P = .68]), and we did not detect significant differences in α-diversity. In the peri-day 100 window, several patient samples remained dominated in a fashion similar to that seen periengraftment, while others reverted to a composition that was more similar to their pretransplant composition, but differences between cases and controls were not significant. A comparison of α-diversity values and β-diversity distances between the groups at the different time points of interest is provided in supplemental Tables 4 and 5.
Since cGVHD is a late complication of allo-HCT, we focused on the peri-day 100 time window to investigate features of the microbiota that may be associated with the development of cGVHD. All day 100 samples for these analyses were collected prior to the onset of cGVHD, but of note, among the 225 patients in this cohort, only 45 had samples available in the day 100 window (36 controls and 9 cases). We performed shotgun metagenomic sequencing on available stool samples, and multiple microbial metabolic pathways were enriched in the cGVHD patients, many of which relate to SCFA metabolism (marked in Figure 1D with green squares). This suggests that there may be functional differences in the microbiome of patients who go on to develop cGVHD compared with those who do not. To examine if this difference in encoded microbial metabolic capacity was correlated with differences in circulating SCFAs, we subjected peri-day 100 plasma samples from cGVHD patients (n = 10) and controls (n = 43) to metabolomic analysis. Of note, these samples were not matched to stool, and in fact only 3 cGVHD plasma samples and 13 control samples are from the same individuals who had stool samples available for analysis in this time window. Plasma concentrations of butyrate and propionate were both significantly lower in patients who went on to develop cGVHD compared with controls (Figure 2A). These molecules have been reported to promote differentiation of regulatory T cells in preclinical models,11,33 and butyrate is thought to protect mice from lethal GVHD due by enhancing the repair of damaged intestinal epithelial cells.9 Hexanoate concentration was also lower in patients who went on to develop cGVHD, and isobutyrate concentration was elevated in both of the transplanted patient groups compared with healthy controls.
SCFAs are produced in the GI tract as the end products of microbial fermentation by anaerobic taxa. Therefore, to understand what may be causing the changes in plasma SCFA in our patients, we reexamined the peri-day 100 stool samples, focusing on the 10 most-abundant taxa as well as taxa previously associated with immune dynamics in cancer patients, yielding a list of 14 genera (Figure 2B). Of note, the abundance of genus Faecalibacterium, previously associated with success of immune checkpoint blockade therapy25 and increased total white cell count after allo-HCT,26 was below the limit of detection in all peri-day 100 samples from patients who developed cGVHD (Figure 2B, arrow). To explore the association of gut microbiota taxa with cGVHD, we performed a Bayesian logistic regression (supplemental Figures 1 and 2). This revealed that high abundances of the genus Akkermansia (log-odds ratio, 1.0 [0.2, 1.8]; mean and 95% highest posterior density intervals [HDI95], respectively) were associated with cGVHD development, and, conversely, low abundance of the commensal obligate anaerobe Lachnoclostridium was associated with remaining cGVHD-free (−1.5, [−2.6, −0.4] HDI95; Figure 2C). Additionally, we detected associations of the genus Streptococcus with cGVHD (1.2, [−0.1, 2.5] HDI95) and the commensal genus Clostridium (−0.6 [−1.4, 0.2] HDI95) with the absence of cGVHD. The logistic regression model using the stool sample data predicted all cGVHD patients correctly (recall = 1) but also yielded 9 false positives. Accuracy and recall were significantly improved (compared with the null) by including microbiota information in form of 16S relative abundances (Figure 2D-E). The presence of the fermentatively active, butyrate-producing genera Lachnoclostridium, Clostridium, and, to a lesser degree, Faecalibacterium was associated with reduced incidence of cGVHD. To demonstrate the influence of the abundances of these commensal obligate anaerobe genera on the probability of cGVHD, we developed an additional predictive model (described in supplemental Methods). The predictive model demonstrated that as the abundance of Lachnoclostridium, Clostridium, and Faecalibacterium (Figure 2F-G; supplemental Figure 3A) approached 0 in a simulated microbiome, the predicted probability of cGVHD approached 1 (conversely for Akkermansia and Streptococcus, supplemental Figure 3B-C). These metagenomic and microbiota community composition data, combined with the independent plasma data, provide evidence that maintenance of butyrate producers within the GI is correlated with a lower risk of cGVHD; however, given the small study size, we sought to validate these findings in additional patients.
We assembled sets of peri-day 100 samples from 2 additional transplant centers: Duke (United States) and Regensburg (Germany) (demographics in supplemental Table 6). In contrast to the matched case and control design for the MSKCC data set, the evaluable samples from the additional 2 centers are cross-sectional in nature; ie, patients were selected for inclusion on the basis of the availability of paired stool and blood samples within the peri-day 100 window and without regard to their baseline clinical characteristics or their cGVHD outcomes. Initial analysis of stool sample composition was similar to the MSK data, and we did not observe differences in α-diversity or community composition (as assessed by PERMANOVA test of β-diversity distances) between cases and controls in these additional centers (supplemental Figure 4). The Bayesian logistic regression model predicted 8 out of 11 cGVHD cases in the Duke cohort but only 10 out of 23 cGVHD cases in the Regensburg cohort; neither result was statistically significant. The reduced accuracy and recall of our model in the other centers (compared with the MSKCC set) may in part be due to different treatment regimens and patient demographics (not all patients matched the case-control criteria used for MSKCC patients). We also analyzed circulating SCFAs in blood samples from these cross-sectional data sets and observed a similar trend as in MSK patients toward a lower concentration of butyrate in patients who went on to develop cGVHD in the Duke sample set (P = .07) and a significantly lower concentration of propionate (Figure 2H; P = .027). No differences were seen in the Regensburg patient group (supplemental Figure 5); however, these samples were serum, not plasma as in the MSKCC and Duke samples, and the concentrations were lower overall, which may explain the lack of observed difference between groups.
These data highlight that the lasting damage to the microbiome observed after HCT may be clinically significant. Within a matched case and control group, features of the GI microbiota are associated with subsequent cGVHD onset. The observation that higher circulating concentrations of SCFAs are associated with lower rates of cGVHD in 2 independent patient cohorts supports further exploration of the immunotherapeutic role of SCFAs within the gut lumen and systemic butyrate and propionate.
The MSK Fecal Biobank contains >10 000 samples from >1000 allo-HCT recipients and has been used for many retrospective observational studies. There is overlap between the patients in this study and some of these prior papers, but none have examined cGVHD as a clinical outcome. Data sharing requests can be e-mailed to the corresponding author.
The online version of this article contains a data supplement.
The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.
Acknowledgments
This research was supported by National Institutes of Health (NIH), National Cancer Institute MSKCC Cancer Center core grants P30 CA008748, R01-CA228358, R01-CA228308, R01-HL147584, and P01-CA023766 (M.R.M.v.d.B.); NIH, National Heart, Lung, and Blood Institute awards R01-HL125571 and R01-HL123340 (M.R.M.v.d.B.) and K08HL143189 (J.U.P.); NIH, National Institute on Aging Project 2 award P01-AG052359 (M.R.M.v.d.B.); NIH, National Institute of Allergy and Infectious Diseases awards U01 AI124275 (M.R.M.v.d.B.) and R01 AI032135, AI095706, and U01 AI124275 (E.G.P.); Tri-Institutional Stem Cell Initiative award 2016-013 (M.R.M.v.d.B.); The Lymphoma Foundation (M.R.M.v.d.B.); The Susan and Peter Solomon Divisional Genomics Program (M.R.M.v.d.B.); the Parker Institute for Cancer Immunotherapy at MSKCC (K.A.M., M.R.M.v.d.B., and J.U.P.); the Sawiris Foundation (J.U.P.); the Society of Memorial Sloan Kettering Cancer Center (J.U.P.); and Seres Therapeutics (M.R.M.v.d.B., J.U.P., J.B.S., A.L.C.G., A.G.C., A.E.S., and A.D.S.). J.S. is supported by NIH, National Institute of Allergy and Infectious Diseases grant U01 AI124275. K.A.M. wishes to acknowledge funding received from the Haematology Society of Australia and New Zealand, The American Australian Association, The Royal Australasian Society of Physicians, Endeavour Australia, and the Parker Institute for Cancer Immunotherapy.
Authorship
Contribution: K.A.M. designed the study, performed research, and wrote the manuscript; J.S. wrote the manuscript and performed Figure 2 analyses; A.L.C.G. and E.R.L. contributed to analyses in Figure 1; A.J.P., B.P.T., P.A.G., D.W., M.D.D., A.D., G.K.A., A.E.S., J.B.S., K.B.N., D.G.B., A.G.C., R.J.R., A.R., A.B., L.B., M.C., M.V.L., D.A.R., and N.C. contributed data and performed research; M.M., C.C., I.P., S.G., Y.T., E.G.P., M.-A.P., E.H., and D.M.P. contributed clinical data and expert review; S.M.D. provided statistical consultation; A.S., J.U.P., J.R.C., and J.B.X. contributed critical data analysis and expertise and edited the manuscript; and M.R.M.v.d.B. contributed to study design and the manuscript.
Conflict-of-interest disclosure: K.A.M. reports consulting with Karius. J.B.S., A.E.S., G.K.A., A.G.C., D.G.B., and P.A.G. report partial salary support from Seres Therapeutics. J.U.P. reports research funding and licensing fees from Seres Therapeutics and consulting with DaVolterra. I.P. has received research funding from Merck and serves on a data and safety monitoring board for ExCellThera. S.G. reports research funding from Amgen, Actinuum, Celgene, Johnson and Johnson, Miltenyi, and Takeda and serves on advisory boards for Amgen, Actinuum, Celgene, Johnson and Johnson, Jazz Pharmaceutical, Takeda, Novartis, Kite, and Spectrum Pharma. M.-A.P. reports honoraria from AbbVie, Bellicum, Bristol-Myers Squibb, Incyte, Merck, Novartis, Nektar Therapeutics, Omeros, and Takeda; serves on data and safety monitoring boards for Servier and Medigene and scientific advisory boards of MolMed and NexImmune; has received research support for clinical trials from Incyte, Kite/Gilead, and Miltenyi Biotec; and serves in a volunteer capacity as a member of the board of directors of the American Society for Transplantation and Cellular Therapy and Be The Match (National Marrow Donor Program), as well as on the Center for International Blood & Marrow Transplant Research Cellular Immunotherapy Data Resource Committee. M.R.M.v.d.B. has received research support from Seres Therapeutics; has consulted for, received honorarium from, or participated in advisory boards for Seres Therapeutics, Flagship Ventures, Novartis, Evelo, Jazz Pharmaceuticals, Therakos, Amgen, Magenta Therapeutics, WindMIL Therapeutics, Merck & Co., Acute Leukemia Forum, and DKMS Medical Council (board); has intellectual property licensing with Seres Therapeutics and Juno Therapeutics; and has stock options with Seres Therapeutics. The remaining authors declare no competing financial interests.
Correspondence: Kate A. Markey, Memorial Sloan Kettering Cancer Center, 1275 York Ave, New York, NY 10065; e-mail: markeyk@mskcc.org.