Key Points
A comprehensive transcriptome analysis of aged murine HSC identifies an aging signature.
The HSC aging signature is highly enriched for cell membrane–related transcripts and identifies age-associated heterogeneity.
Abstract
We surveyed 16 published and unpublished data sets to determine whether a consistent pattern of transcriptional deregulation in aging murine hematopoietic stem cells (HSC) exists. Despite substantial heterogeneity between individual studies, we uncovered a core and robust HSC aging signature. We detected increased transcriptional activation in aged HSCs, further confirmed by chromatin accessibility analysis. Unexpectedly, using 2 independent computational approaches, we established that deregulated aging genes consist largely of membrane-associated transcripts, including many cell surface molecules previously not associated with HSC biology. We show that Selp (P-selectin), the most consistent deregulated gene, is not merely a marker for aged HSCs but is associated with HSC functional decline. Additionally, single-cell transcriptomics analysis revealed increased heterogeneity of the aged HSC pool. We identify the presence of transcriptionally “young-like” HSCs in aged bone marrow. We share our results as an online resource and demonstrate its utility by confirming that exposure to sympathomimetics or deletion of Dnmt3a/b molecularly resembles HSC rejuvenation or aging, respectively.
Introduction
Aging is a natural and time-dependent process, leading to the functional decline of many cell types and tissues, including those of the hematopoietic system. Aged hematopoietic stem cells (HSCs) have less regenerative potential, and their overall contribution to formation of mature blood populations, at the clonal level, is restricted and lineage skewed when compared with their young counterparts, suggesting stem cell exhaustion, one of the hallmarks of aging.1-9 Transplantation of aged HSCs into young recipients does not improve HSC function,3,5 indicating that cell-intrinsic aspects play a pivotal role in HSC aging. The differential expression of genes is a prominent readout of multiple cell-intrinsic factors, and various studies have compared the transcriptomes of young and aged HSCs.10-24 These studies have suggested a plethora of distinct aging-associated mechanisms, including aberrant regulation of genes involved in lymphoid and platelet differentiation14,22 ; elevated inflammation and stress responses12,17 ; chromatin remodeling/DNA methylation12 ,23-25; changes in cell cycle and maintenance of quiescence15,16,19 ; and age-associated replicative stress in HSCs.13 It remains unclear whether all suggested factors equally contribute to HSC aging and why different studies report different mechanisms.
Although a substantial number of HSC aging transcriptome studies has been published, no cross-validation of all transcriptomic HSC aging data has yet been carried out. In fact, it is unknown whether a common HSC aging transcriptome exists, and if so, which genes are involved. In order to address this question, we selected 15 studies in which transcriptomes of young and aged murine HSCs were compared and added to these our own recently published transcriptome analysis.26
Despite uncovering considerable heterogeneity among different studies, we were able to identify a robust and biologically relevant HSC transcriptomic aging signature (AS). This signature is mainly composed of genes that encode for membrane-associated proteins, many with unknown functions in HSC biology. A large fraction of AS genes are upregulated in aged HSCs, indicating transcriptional activation in this population. We also show that aging increases transcriptional heterogeneity among single HSCs and provide evidence that transcriptionally “young” HSCs exist in aged mice. Furthermore, we demonstrate that the top age-associated gene, Selp (P-selectin), contributes to functional decline of HSCs. Finally, we provide an open Web-based resource to the community and use independent data sets to demonstrate how our approach can be used to validate novel biological paradigms.
Methods
Mice
All experiments were approved by the Central Commission for Animal Testing and the Animal Ethical Committee. Young (4-5 months) and aged (20-24 months) C57BL/6, C57BL/6.SJL (10-14 weeks), and C57BL/6J-kitW-41J/kitW-41J (W41) mice (originally from European division of Harlan) were obtained from the Central Animal Facility at the University Medical Center Groningen. TetOP-H2B-GFP mice (provided by Rob Coppes, University Medical Center Groningen [UMCG], Groningen, The Netherlands). H2B-GFP expression was induced by providing mice (6 weeks) with food containing doxycycline for 6 weeks. Mice were then fed with normal food for 14 weeks. Mice were kept in group cages and housed under conventional temperature- and day cycle–controlled conditions.
Flow cytometry
Bone marrow was isolated from the tibia, femur, pelvis, sternum, and spine by crushing, and red blood cells were lysed with erylysis buffer. All samples were analyzed on a BD FACSCanto II (BD Biosciences) and sorted on MoFlo Astrios or XDP cell sorters (Beckman Coulter). Detailed information of the antibodies used for isolation and identifications can be found in the supplemental Data (available on the Blood Web site).
Retroviral constructs and transduction
Selp was amplified from C57BL/6J long-term HSC (LT-HSC) complementary DNA and then cloned into pSF912–IRES–GFP vector (originally as pSF91 from C. Baum, Hannover Medical School, Hannover, Germany, further modified in our laboratory). Virus was produced by transfecting Platinum-E retroviral packaging cells (Cell Biolabs) according to the manufacturer’s protocol. Viral supernatant was harvested 48 hours after transfection. LT-HSCs were transduced as previously described.6 Cells were harvested 20 to 22 hours after transduction for transplantation.
BM transplantation
For SELPlow and SELPhigh transplantation, 1000 SELPlow and SELPhigh LT-HSCs were isolated from an old CD45.2+ C57BL/6 mouse and transplanted together with 2 million CD45.1+ W41 bone marrow into lethally irradiated (9 Gy) CD45.1+ C57BL/6.SJL recipients, respectively. For Selp overexpression transplantation, empty vector (SF91–IRES–GFP empty) and SELP OE CD45.2+ cells harvested after retroviral transduction were transplanted together with 2 million CD45.1+ W41 bone marrow into lethally irradiated CD45.1+ C57BL/6.SJL recipients, respectively.
Assay for transposase-accessible chromatin using sequencing (ATAC-seq)
A total of 2.5 × 104 LT-HSCs were sorted from young and aged mice in biological replicates. Library preparation was performed according to Buenrostro et al.27 In brief, cells were lysed and tagmented for 1 hour at 37°C. Samples were then polymerase chain reaction amplified using provided adapter primers for 11 or 12 cycles. Following, samples were size selected for fragments of 150 to 1000 bp. Samples were pooled and sequenced to obtain 30 to 50 × 106 reads using a NextSeq 500. Data are deposited under accession number GSE166674.
Immunofluorescence
Images were acquired on a Leica Sp8 confocal microscope and quantified on Fiji ImageJ.
RNA content
A total of 4000 to 6000 LT-HSCs were seeded onto spots on an adhesion immunofluorescent slide (VWR). For RNA staining, cells were fixed with 100% methanol for 10 minutes and washed with phosphate-buffered saline. Cells were subsequently stained with 1:10 000 SYTO RNA Select Green Fluorescent Stain (Thermo Fisher Scientific) for 20 minutes at room temperature. After washing, coverslips were mounted with ProLong Diamond Antifade Mountant with 4′,6-diamidino-2-phenylindole (DAPI).
RNA polymerase II
For RNA polymerase II staining, cells were fixed and permeabilized with Fixation/Permeabilization Solution Buffer (BD Bioscience) for 20 minutes on ice. After washing, cells were blocked first with Endogenous Biotin-Blocking Kit (Thermo Fisher Scientific) followed by 4% bovine serum albumin for 30 minutes. Cells were stained with 1:100 biotin mouse monoclonal RNA polymerase II antibody (Novus Bio) at 4°C overnight. Cells were then washed with 0.1% Triton-X-100 and stained with 1:1000 streptavidin Alexa Fluor 488 secondary antibody. After washing, coverslips were mounted with ProLong Diamond Antifade Mountant with DAPI.
Further information
More detailed information about the methods used can be found in the supplemental Data.
Additional resources
Additional resources include the following: AS Web tool site (https://eriba.umcg.nl/agingsignature) and GitHub with all resources (https://github.com/LeonidBystrykh/data-for-manuscript).
Results
Systematic collection of publications reporting gene expression changes in aged HSCs
We took advantage of PubMed’s API E-utility to select all informative studies on HSC aging in C57BL/6 mice (Figure 1A; supplemental Data). This search resulted in 16 published studies that met all criteria and included our own recently published data set26 (Figure 1B; supplemental Table 1). The various studies used different sequencing platforms, ranging from microarray hybridization, bulk RNA sequencing to single-cell RNA sequencing (scRNA-seq) (Figure 1B; supplemental Table 1) and included mainly 2 age groups (Figure 1B; supplemental Figure 1A). We employed 2 alternative approaches to perform the analysis. First, we compared the self-reported, preanalyzed lists of DEGs from the selected manuscripts and referred to this as “meta-analysis.” In parallel, we reanalyzed all available raw data in the most uniform way; these are referred to as “reanalysis” (Figure 1A; supplemental Table 1).
Gene symbol . | Consistency (out of 12) . | Average log2FC . | Function . | Phenotype . | References . |
---|---|---|---|---|---|
Itgb3 | 9 | 1.25 ± 0.48 | Cell adhesion; signal transduction | Itgb3high identifies myeloid-biased HSCs | 17,39 |
Vwf | 8 | 1.37 ± 0.66 | Cell adhesion; signal transduction | Vwfhigh identifies myeloid-biased HSCs | 14,40,41 |
Neo1 | 8 | 1.84 ± 1.18 | Cell adhesion; signal transduction | Neo1high identifies more dormant HSCs | 26,42 |
Alcam | 8 | 1.39 ± 0.62 | Cell adhesion | Alcam+ identifies myeloid-biased HSCs and with reduced potential | 43 |
CD86 | 6 | −1.39 ± 0.36 | Signal transduction | Required for lymphoid differentiation | 44 |
Gadd45g | 6 | 1.54 ± 0.99 | Stress response | Gadd45g−/− shows increased self-renewal and deficient megakaryocytic differentiation | 45 |
Satb1 | 6 | −1.21 ± 0.50 | Epigenetic effector | Required for lymphoid differentiation | 46 |
Sdpr | 6 | 1.88 ± 1.61 | Transcription regulation | Involved in PRC2 expression regulation | 47 |
Slamf1 | 5 | 0.92 ± 0.16 | Cell adhesion; signal transduction | Slamf1high identifies myeloid-biased HSCs with reduced potential | 48,49 |
Dnmt3b | 4 | −1.09 ± 0.62 | Epigenetic regulator | Dnmt3a/b−/− shows differentiation blockage/increased self-renewal | 29,50,51 |
Gene symbol . | Consistency (out of 12) . | Average log2FC . | Function . | Phenotype . | References . |
---|---|---|---|---|---|
Itgb3 | 9 | 1.25 ± 0.48 | Cell adhesion; signal transduction | Itgb3high identifies myeloid-biased HSCs | 17,39 |
Vwf | 8 | 1.37 ± 0.66 | Cell adhesion; signal transduction | Vwfhigh identifies myeloid-biased HSCs | 14,40,41 |
Neo1 | 8 | 1.84 ± 1.18 | Cell adhesion; signal transduction | Neo1high identifies more dormant HSCs | 26,42 |
Alcam | 8 | 1.39 ± 0.62 | Cell adhesion | Alcam+ identifies myeloid-biased HSCs and with reduced potential | 43 |
CD86 | 6 | −1.39 ± 0.36 | Signal transduction | Required for lymphoid differentiation | 44 |
Gadd45g | 6 | 1.54 ± 0.99 | Stress response | Gadd45g−/− shows increased self-renewal and deficient megakaryocytic differentiation | 45 |
Satb1 | 6 | −1.21 ± 0.50 | Epigenetic effector | Required for lymphoid differentiation | 46 |
Sdpr | 6 | 1.88 ± 1.61 | Transcription regulation | Involved in PRC2 expression regulation | 47 |
Slamf1 | 5 | 0.92 ± 0.16 | Cell adhesion; signal transduction | Slamf1high identifies myeloid-biased HSCs with reduced potential | 48,49 |
Dnmt3b | 4 | −1.09 ± 0.62 | Epigenetic regulator | Dnmt3a/b−/− shows differentiation blockage/increased self-renewal | 29,50,51 |
Individual studies identify highly variable numbers of DEGs
Strikingly, whereas some studies reported thousands of genes to be differentially expressed, others only reported dozens (Figure 1C). When DEGs from the meta-analysis and reanalysis approaches were compared, both demonstrated a significant correlation between fold-change (FC) expression values, indicating that reanalysis of the raw data did not deviate drastically from the originally reported data sets (supplemental Figure 1C). The observed study-to-study variations might result from differences in mouse gender, group size, exact age, or HSC immunophenotyping; intrinsic differences between expression platforms; or unknown environmental effects, including differences in mouse facilities or breeding companies from which the mice originated (supplemental Table 1). Principal-component analysis performed in the reanalyzed sets demonstrated that studies clustered according to their sequencing platform (supplemental Figure 1B), although degree of their separation was minor. Overall, this analysis demonstrates that the variability between studies partially has a technical origin, but the discrepancy between studies is not fully explained.
Most reported age-associated HSC genes have low reproducibility
Collectively, these 16 studies reported >6000 genes being differentially expressed between young and aged HSCs. Surprisingly, the vast majority of reported genes demonstrated poor reproducibility, as almost 80% of all putative age-associated genes were reported in a single study only (supplemental Figure 1D). Moreover, pairwise analysis of all different studies showed that some studies had little to no overlapping genes with any other study (supplemental Figure 1E-F).
Identification of an HSC aging signature (AS)
Despite limited reproducibility between individual data sets, we wondered whether it would be possible to identify consistently DEGs across all studies. To this end, we ranked all reported genes by their occurrence in different studies (consistency) (Figure 1D). This revealed a limited number of highly consistent genes that were frequently documented to be differentially expressed upon HSC aging. The single most consistently deregulated gene was Selp. We provide a complete gene list in supplemental Table 2. We restricted further analysis to genes that were consistently reported in ≥4 studies, resulting in a list of 142 genes for the meta-analysis and 220 genes for the reanalyzed data sets. We refer to this as the “aging signature” (supplemental Figure 1G; supplemental Table 2).
The AS is robust and provides higher resolution than individual studies
In order to assess the minimal number of publications required to generate a robust list of consistent aging genes, we modeled all possible combinations of most consistent genes from the reanalyzed data, starting from 1 data set at a time to all combinations of 11 out of 12 data sets (Figure 1E). This analysis generated multiple consistencies for the same genes across different permutations. When the analysis was performed with a small number of data sets, we observed a high degree of variation, which started to diminish when the number of data sets increased. When >6 data sets were used, the number and consistency of genes stabilized (Figure 1E; supplemental Data). Complementarily, we also modeled the predicted resolution (the number of aging genes found) for the same range of studies (Figure 1F). The model predicted that the number of studies that we used for our analyses (11 for meta-analysis and 12 for reanalysis) have high resolution; with the current number of included studies, >90% of all the possible age-associated genes have been found. This also implies that the gene collection in the HSC AS remains stable and that the addition of novel HSC transcriptomes is unlikely to significantly alter the AS.
We evaluated the degree of similarity between the AS and the individual studies from which it is derived (Figure 1G). All studies demonstrated approximately the same similarity to the AS, with no clear outliers. No single study encompassed all aging genes, but collectively, they contribute to the robustness of the AS. We conclude that the AS can be used as a reference for DEGs upon HSC aging.
The AS reveals high enrichment for membrane-related processes in HSC transcriptional activation
We restricted subsequent analyses to the AS derived from the reanalyzed data and observed several remarkable features. First, approximately half of the aging genes encode for membrane-associated proteins (Figure 2A-B). These included many cell adhesion molecules or channels (Selp, Alcam, Cd34, Dsg2, Itgb3, and Clca3a1) and cytokine receptors (Csf2rb, Ebi3, Flt3, Ghr, Il1rapl2, and Osmr), many of which have not been previously associated with HSC biology.
Gene Ontology (GO) analysis identified a total of 46 enriched GO function terms (supplemental Table 3). As expected, the most significant GO terms revolve around membrane-associated functions (supplemental Figure 2A). Among upregulated processes, we observed significant enrichment of different GO terms associated with cell adhesion and activation and the organization of cell junction and projection (Figure 2C). Within the downregulated processes, we identified significant enrichment for DNA replication, cytokine signaling, tissue homeostasis, and cell proliferation (Figure 2D). Pathway analysis also demonstrated analogous pathways (supplemental Figure 2B).
We next collected all GO terms associated with every gene present in the AS (supplemental Figure 2C), resulting in >2000 GO terms. The majority of these terms were very specific and contained <20 AS genes (supplemental Figure 2C, lower panel). We then searched in the AS GO term database for key terms that represent previously described HSC aging mechanisms, including “inflammation,”12 “replicative stress,”13 “cell cycle,”15,16 and “epigenetic drift,”12,23,28 and indeed found these. However, the large majority of genes (149) were associated with “membrane” (supplemental Figure 2D), denoting that significant changes occur in the HSC membrane upon aging.
Secondly, we observed that a disproportional number (181 genes, ∼80%) from the AS were upregulated in aged HSCs (Figure 3A). We assessed whether this would be reflected in global transcriptional activation. Comparing young and aged LT-HSCs (LSK CD48− CD150+) (supplemental Figure 3A), we observed an approximately threefold higher overall RNA yield per cell in aged HSCs (supplemental Figure 3B). We confirmed this by intracellular staining of young and aged LT-HSCs with the RNA-specific dye SYTO (supplemental Figure 3C). In order to investigate if increased RNA abundance in aged LT-HSCs was directly linked to increased transcription, we performed intracellular staining for RNA polymerase II in young and aged HSCs and indeed observed increased RNA polymerase II activity in aged cells (Figure 3B).
Finally, we evaluated the number of chromatin accessible sites by ATAC-seq. In agreement with the data above, aged HSCs showed an increased number of overall accessible sites compared with young HSCs (supplemental Figure 3D). In addition, differential analysis of accessible sites demonstrated strong skewing toward more open sites in aged HSCs (Figure 3C). We then speculated that aged-specific accessible sites could contribute to deregulated expression. To investigate this, we overlapped the age-restricted accessible sites with the genes in the aging list, which would reflect genes more accessible in aged HSCs and therefore upregulated, and performed a GO analysis (Figure 3D). This revealed high enrichment for transcription-related processes, further reinforcing our hypothesis that aged HSCs are transcriptionally activated.
Taken together, the AS suggests that aged HSCs undergo adaptation at cellular and functional levels through membrane-related processes and that aged HSCs demonstrate higher transcriptional activation.
The AS as a reference for age-associated studies
We aimed to test whether the AS can be used as a reference to (re)analyze existing and future HSC gene expression profiles. To this end, we developed 2 independent approaches where custom transcriptome sets can be compared with the AS.
First, we designed 2 scoring methods, the discrete gene set enrichment (dGSE) algorithm and a rank score (RS), both of which are able to cope with semiordered (discrete) lists of genes (supplemental Data) and can be used for testing bulk RNA-sequencing sets. To benchmark both methods, we individually tested the 12 HSC aging transcriptome studies that comprise the AS (Figure 4A). As expected, we observed that the different data sets had a wide range of enrichment scores when compared with the AS.
We next used transcriptomic data sets from 3 unrelated studies. The first independent data set compared young and physiologically aged HSCs (K.K., A.M.-M. and B.v.E., manuscript submitted December 2020). The second involved HSCs that were “rejuvenated” by a sympathomimetic drug acting on adrenoreceptor β3.18 The third study used expression data from HSCs that were derived from Dmnt3a/b-deficient mice that show accelerated aging phenotypes.29 As a negative control, we included a transcriptomic signature of embryonic stem cells.30 As expected, the embryonic stem cell signature demonstrated low scores for aging genes. In contrast, the independent set of young and physiologically aged HSCs demonstrated enrichment when compared with the AS. When we compared the list of genes that were downregulated in “rejuvenated” old HSCs to the AS, we also found strong enrichment. The transcriptome of young HSCs isolated from Dmnt3a/b knockout mice also scored high for HSC aging genes. These data suggest that both β3 sympathomimetic signaling and loss of Dnmt3a/b molecularly phenocopy normal HSC aging, additionally confirming the value of our method (Figure 4B).
This approach allowed us to investigate the relationship of our HSC AS to a previously established multiorgan interspecies transcriptomic signature.31 Expectedly, we observed a mild enrichment between both signatures, denoting that some more general aging-associated genes are also present in HSC aging (supplemental Figure 4A). Genes present in both signatures are related to inflammatory response. We also investigated the involvement of transforming growth factor β (TGF-β) signaling, which has been implicated in HSC aging,21,23,32 in the AS. We compared different molecular signatures of TGF-β signaling33,34 to our AS, but none of the sets demonstrated high scores. Furthermore, only 5 TGF-β target genes from all sets (Bmpr1a, Id2, Jun, Itgb3, and Mef2c) are present in the AS. Thus, TGF-β signaling, from a transcriptional perspective, is not robustly involved in HSC aging (supplemental Figure 4A).
Interestingly, single-cell transplantations have revealed that the overall engraftment potential of aged HSCs is substantially lower than that of young HSCs.2 However, some aged HSCs display engraftment performance similar to young cells, suggesting that the aged HSC pool is functionally heterogenous. Thus, we wondered whether the AS can be used to also identify intrinsic variation between cells from the same population. To identify such “young-like” cells using single-cell transcriptomic studies, we assigned an AS score to individual cells (Figure 4C; supplemental Figure 4B). This revealed that separation of young and aged cells is in good agreement with their assigned AS scores, where aged cells score higher and young cells lower, also predicting their age. Interestingly, some aged cells had a low AS score and were close to the young cells cluster, suggesting that these cells are, at least transcriptionally, young-like.
Taken together, the HSC AS demonstrated to be a bode fine transcriptional reference to physiological aging and thus a valuable resource for the scientific community. It allowed us to demonstrate that the aged HSC pool is heterogeneous and that some aged cells are transcriptionally similar to young HSCs. We provide both (bulk and single-cell) analyses as a resource to the community at https://eriba.umcg.nl/agingsignature.
Machine learning algorithms independently identify the best age-associated gene predictors
Employing machine learning strategies in conjunction with scRNA-seq data have allowed for prediction and detection of different hematopoietic populations.35,36 We envisioned that by applying this approach to the age-related scRNA-seq data sets would allow for the identification of genes which best divide young and old HSCs. This approach would also serve as an independent and alternative approach to DEG analysis. We trained and tested 2 different machine learning algorithms, ADAboost and Random Forest (RNDforest), for identification of the best gene predictors. Interestingly, both algorithms determined that Nupr1, Selp, and Sult1a1 were the best predictors (supplemental Figure 5A). We tried to restrict the aging-related list to the best 20 age-associated genes. With those 20 genes, both algorithms performed well on identifying young and aged HSCs (Figure 5A-B; supplemental Figure 5B-C) when compared with the full list of AS genes. Attempts to further narrow down the age predictors from 20 to 5 genes, however, showed considerable deterioration in prediction capacity compared with larger lists of age predictors (supplemental Figure 5D). Out of 20 gene predictors, the majority are also membrane related (Figure 5C).
The top aging gene, Selp, affects HSC functioning
The most consistent gene from the AS, Selp (supplemental Table 2), was also among the best machine learning predictors (Figure 5). Selp, which encodes for the membrane adhesion molecule P-selectin, was confirmed to be differently expressed also at the protein levels in young and aged hematopoietic progenitor subsets by flow cytometry (Figure 6A; supplemental Figure 6A). To investigate whether Selp affects HSC functioning, we competitively transplanted 1000 SELPlow or 1000 SELPhigh aged LT-HSCs (supplemental Figure 6B) into lethally irradiated young recipients. Whereas overall engraftment levels were similar for SELPlow and SELPhigh LT-HSCs, we found that SELPhigh LT-HSCs produced more myeloid cells, suggesting that SELPhigh HSCs are myeloid biased (Figure 6B; supplemental Figure 6C).
To determine the role of Selp in the functioning of LT-HSCs, we overexpressed Selp in young LT-HSCs and competitively transplanted these in young recipients. Whereas Selp overexpression did not affect differentiation to leukocytes and platelets, erythroid production was almost completely abolished (Figure 6C). We also employed a Tet-On H2B-GFP reporter mouse to determine the dormancy or activation status of SELPlow and SELPhigh LT-HSCs. GFP fluorescence was significantly higher in SELPlow LT-HSCs, indicating that myeloid-biased SELPhigh LT-HSCs were more actively cycling (supplemental Figure 6D).
Thus, higher levels of SELP in LT-HSCs functionally contribute to myeloid bias, erythroid blockage, and HSC activation. Collectively, this demonstrates that SELP is not only a marker of aged HSCs but has the capacity to contribute to the functional impairment of HSCs.
Discussion
In this study, we demonstrate that individual studies have limited resolution to identify genes altered in HSC aging, but collectively, they provide an unprecedented profile of transcriptomic alterations in HSC aging (Figure 1D). From this extensive repertoire of studies, we were able to identify a comprehensive, robust, and stable transcriptomic signature of HSC aging, which we validated by 3 independent data sets.
The HSC AS is highly enriched for membrane-associated transcripts (Figure 2A). Several of these have been previously associated with aspects of the HSC aging process and thus document the rigor of our approach (Table 1). However, the role of most other genes in HSC (aging) biology remains unknown. The strong involvement of cell-surface molecules does provide insight into the long-standing discussion whether HSC aging results from cell-intrinsic or cell-extrinsic molecular alterations. This enrichment of cell membrane–associated transcripts suggests that physiologically aged HSCs communicate differently with their immediate environment compared with their young counterparts.
Our study also provides novel insight into the transcriptional activation state of HSCs (Figure 3). In line with recent reports that loss of heterochromatin leads to age-associated phenotypes in HSCs,37 RNA-specific quantification, RNA polymerase II activity, and chromatin accessibility analyses all indicate that aged HSCs are, at a population level, more transcriptional active than young HSCs.
The AS can be employed as a bona fide transcriptional reference for physiological HSC aging for both bulk and single-cell transcriptomic studies (Figure 4). We provide protocols and a user-friendly and interactive website (https://agingsignature.webhosting.rug.nl/) to allow users to execute custom queries and compare new data sets against the AS that we present here.
Our approaches also allowed powerful analyses and uncovered novel perspectives into HSC aging from a single-cell perspective. By applying our AS score to single-cell transcriptomic studies, we determined that HSCs isolated from aged mice contain a fraction of cells which behave, at least transcriptionally, like young HSCs (Figure 4C). Further studies are necessary to understand if such cells indeed are functionally superior and also why such cells do not seem to “age.” We also successfully employed machine leaning to further identify best age-associated gene predictors (Figure 5).
Both computational approaches demonstrated that Selp expression is strongly associated with HSC aging. Previous data have shown that Selp-deficient HSCs display a competitive advantage over wild-type HSCs.38 Here, we document that high levels of SELP expression induced myeloid-biased, erythroid blockage and cell activation, all hallmarks of HSC aging8,9 (Figure 6). Together with Selp, which is a well-known inflammatory marker, Nupr1 is one of the top-listed predictors and belongs to a group of inflammatory and stress-responding transcription factors not earlier detected in relation to the aging phenotype. Although we were able to reduce the list of age-related genes, it is clear that HSC aging cannot be reduced to a single gene. Instead, our machine learning analysis suggests that multiple age-related genes are involved in the HSC aging process. The hypothesis of a single gene explaining most of the aging phenotype sounds unlikely, and it is more likely that multiple cell surface markers are required to prospectively separate young and aged HSCs.
Nevertheless, our data and those of others demonstrate that (at least some of) the aging genes that we identify not only are markers of aged HSCs but also are involved in age-dependent functional decline. The identification of the HSC AS does not resolve whether altered expression is the cause or consequence of loss of HSC functioning during aging. Future functional studies are required to assess whether perturbation of AS genes can improve functioning of aged stem cells. We believe that this resource is valuable in assisting the aging research community to verify involvement of genes in physiological HSC aging.
Acknowledgments
The authors thank T. Bijma, G. Mesander, and J. Teunis from University Medical Center Groningen (UMCG) Flowcytometry Unit facilities for their assistance on cell sorting; Klaas Sjollema from UMCG Microscopy and Imaging Center for assistance with confocal microscopy; and B. Bakker, M. Gerritsen, E. Verovskaya, and all members of the Ageing Biology and Stem Cells Laboratory for discussions.
This work was supported by the Netherlands Organization for Scientific Research/Mouse Clinic for Cancer and Aging, the Landsteiner Foundation for Blood Transfusion Research (LSBR1703), a China Student Council Fellowship (D.Y.), Marriage, a EU FP7 Marie Curie Initial Training Network (contract 316964), and ARCH (a European Union’s Horizon 2020 Research and Innovation Program) under Marie Skłodowska-Curie grant agreement 813091. B.v.E. was supported by grants from the Deutsche Forschungsgemeinschaft (EY 120/1-1), Else Kröner-Fresenius Foundation (2016_A58), and the German Cancer Aid (Deutsche Krebshilfe, 70113138). The Fritz Lipmann Institute is a member of the Leibniz Association and is financially supported by the Federal Government of Germany and the State of Thuringia.
Authorship
Contribution: A.F.S., D.Y., S.L., G.d.H., and L.B. conceptualized the study; A.F.S., D.Y., and L.B. carried out the methodology; A.F.S., D.Y., S.S., A.A., K.K., A.M.-M., B.v.E., and L.B. performed the investigation; A.F.S., D.Y., S.S., K.K., A.M.-M., B.v.E., and L.B. performed the formal analysis; A.F.S., E.Z., and L.B. curated the data; E.Z. and A.A. collected the data; A.F.S. wrote the original manuscript draft; A.F.S., D.Y., G.d.H., and L.B. reviewed and edited the manuscript; and G.d.H. and L.B. supervised the study.
Conflict-of-interest disclosure: The authors declare no competing financial interests.
Correspondence: Gerald de Haan, Sanquin Research and Landsteiner Laboratory, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands; e-mail: g.dehaan@sanquin.nl.
ATAC-seq data were deposited under the accession number GSE166674.
The online version of this article contains a data supplement.
There is a Blood Commentary on this article in this issue.
The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked "advertisement" in accordance with 18 USC section 1734.
REFERENCES
Author notes
G.d.H. and L.B. contributed equally to this study.
This feature is available to Subscribers Only
Sign In or Create an Account Close Modal