Limiting dilution transplantation assay (LDTA) is considered as the gold standard method to assess hematopoietic stem cell (HSC) content. Traditionally, HSC frequency estimates are based on the single-hit Poisson model (SHPM), which posits that one donor HSC is sufficient to generate a progeny of detectable differentiated cells above a threshold value in hosts. However, there is no clear support for this statement, and it is receivable that more than one donor HSC may be necessary to provide detectable reconstitution in hosts above the threshold level for detection, usually 0.5% to 1% of donor-derived cells. To address this hypothesis, we evaluated the ability of a class of multiCell Poisson models (C≥1PMs) to fit to LDTAs. In 7 of the 8 reanalyzed LDTAs, C≥1PMs plausibly compete with the traditional SHPM. Model averaging across the set of plausible models gives 1.32- to 5.88-fold increases in HSC frequencies compared with the SHPM.

Although limiting dilution transplantation assay (LDTA) in recipient animals coupled to the single-hit Poisson model (SHPM) is considered as one of the best methods for quantitating hematopoietic stem cells (HSCs),1  investigators should be aware of the potential problems associated with this assay.2  In particular, HSC frequency estimates traditionally rely on the SHPM,1,3  which posits that one donor HSC is sufficient to generate a progeny of detectable differentiated cells above a threshold value in hosts, usually considered approximately 1% of donor-derived cells as estimated by flow cytometry.2  It turns out that the reliability of HSC frequency estimates is critically dependent on this hypothesis. Indeed, it is perfectly acceptable that the progeny of one HSC may be unable to reach this limit of detection imposed by standard flow cytometric analysis. In this context, several recipients having received one, or even more than one, HSC(s) may be falsely classified as negative for reconstitution. This potential situation disqualifies the use of the SHPM. To address this problem, we demonstrate that it is possible to accurately quantitate HSCs, providing that the traditional SHPM is mathematically remodeled to turn to a new class of Poisson models termed multicell Poisson models (C≥1PMs), which take into account the possibility of recipients misclassified as negative. The validity of this new modeling approach is demonstrated by reassessing 8 previously published LDTAs4,5  aimed at comparing HSC frequencies in Hoxa-9−/− versus wild-type mice (2 LDTAs)4  and in Notch ligand–stimulated versus –unstimulated CD34+ cord blood cells (6 LDTAs).5  In 7 of the 8 reanalyzed LDTAs, C≥1PMs plausibly compete with the traditional SHPM, leading to significant changes in HSC frequency estimates compared with the SHPM.

Source of the 8 reanalyzed LDTAs

Details of the LDTAs numbered 1a (Hoxa-9−/− mice), 1b (wild-type mice), and 2a to 2f (Notch ligand–stimulated CD34+ cells and controls) are described in supplemental Tables 1 and 2 (available on the Blood Web site; see the Supplemental Materials link at the top of the online article).4,5 

Model assumptions

The equation of the SHPM model is as follows:

This is the first term in the Poisson series, describing the equation of the SHPM.3,6  In this equation, πi is considered as the proportion of the recipient population with no detectable donor-derived blood cells (negative outcome), fc1 is the HSC frequency estimate, and xi is the number of cells at each cell dose group i with i = 1, 2, …, k, where k is the total number of cell doses. The major assumption of the SHPM is that only animals that receive zero HSCs do not produce positive outcome, whereas animals transplanted with 1, 2, …, C HSC(s) will result in positive outcome.

The general equation of the models of the class of C≥1PMs is as follows:

This equation describes a series of more sophisticated Poisson models that extend the basic SHPM by including the second term, third term, …, nth term of the Poisson series in addition to the first term of the Poisson series, where fc>1 is the HSC frequency estimate. This equation can be written as follows:

It describes the general form of the multicell PMs (C≥1PMs). C appears for each model as the minimum number of HSCs necessary to promote a detectable positive outcome. A set of 20 models is tested, C taking the value 1, 2, …, 20. The C≥1PMs include the SHPM (C = 1) but leave open the possibility that more than one HSC (C > 1) is necessary to give rise to a progeny of detectable differentiated cells in transplanted animals (positive outcome). In the case where C > 1, our hypothesis is that C≥1PMs mathematically capture the situation in which a proportion of recipients having received HSC(s), but not in sufficient number (ie, 1 to C − 1), have been falsely scored as negative outcome.

Fitting a generalized linear model for assessing the fit of the SHPM to LDTAs

In a previously published paper,7  we advised a statistical test aimed at estimating the fit of the SHPM to the data and based on a generalized linear modeling approach. Briefly, testing the null hypothesis that the slope β is equal to 1 (β = 1) explores the SHPM hypothesis, and this can be performed by a standard likelihood ratio test.8  Standardized deviance residuals9  were also used as SHPM checking diagnostics after fitting the SHPM3  to the LDTAs.

Fitting the C≥1PMs to LDTAs and computation of HSC frequency estimates

Calculations of C≥1PM-based HSC frequencies were obtained by a standard maximum likelihood procedure applied to binomial data.9  Comparison of the C≥1PMs models was performed with Akaike information criterion (AIC) and the related Akaike weights, called w, grounded on an extension of the AIC known as information-theoretic approach, applied to model uncertainty and multimodel inference.10-15  Finally, C≥1PM-based HSC frequency estimates were averaged across the set of all plausible C≥1PMs, along to a statistical procedure that takes into account the HSC frequency estimates and the relative Akaike weights of all plausible C>1PMs10,11  (supplemental Methods).

The SHPM is rejected for the LDTA nos. 1a, 2a, and 2e (P < .05) and is barely acceptable for the LDTA nos. 1b and 2d (P < .1; supplemental Table 3), casting doubt on the accuracy of the previously reported HSC frequency estimates4,5  for these 5 of the 8 LDTAs. The standardized deviance residuals under the SHPM are presented in supplemental Figure 1A-B. Overall, it can be observed that the residuals are usually positive at low cell doses and negative at high cell doses. Given the structure of the deviance residuals,9  this means that the fraction of negative mice under the SHPM tends to be underestimated at low cell doses and overestimated at high cell doses. The occurrence of such a systematic pattern favors the hypothesis that the SHPM is not correct and motivates the development of alternative Poisson models. Next, C≥1PMs are fitted to the LDTA data with C taking the values 1 to 20. Based on the log-likelihood values, it can be computed AIC, ΔAICm (the AIC difference between the best model and a given model in the set), and weights w that represent the probability of each model to be the expected best model (Supplemental data); a representative example of calculation is given in supplemental Table 4. Based on each Akaike weight w, it can be observed that the best model probability w obtained for a given C≥1PM is not large relative to the weights for the other competing C≥1PMs (supplemental Figure 2 left panels). The conclusion is that there is no model from the C≥1PMs, including the traditional SHPM, that can be considered as the best-approximating model to the data, except in LDTA no. 2f exhibiting w close to 1 for the model with C = 1, strong evidence that the SHPM is the expected best model. Considering this model uncertainty for 7 of the 8 LDTAs, HSC frequency estimates were computed by a model averaging procedure across the plausible C≥1PMs, defined as the set of models with ΔAICm ≤ 10 (supplemental Figure 2 right panels). C≥1PM-averaged HSC frequency estimates are higher than HSC frequency estimates fc1 based on the SHPM, with 1.32- to 5.88-fold increases (Table 1). Fitted SHPM and C≥1PMs regression lines for the 8 LDTAs are presented in Figure 1A-H, with χ2/df ratios (χ2-dispersion statistics; supplemental data) favoring the conclusion that C>1PMs fit better to the data than the SHPM. This study strongly suggests that C≥1PMs should be routinely used to more accurately estimate HSC frequencies in LDTAs. In line with our main finding that a single HSC may not be sufficient to generate detectable hematopoietic reconstitution in recipients, single-cell transplantations performed with various HSC-enriched populations may have chronically underestimated the total HSC frequencies.16 

Figure 1

Fitted SHPM and C≥1PM-averaged regression lines for the 8 LDTAs. Vertical axis represents expected fraction, termed yi, of negative mice predicted by the SHPM, or expected fraction, termed yi, of negative mice predicted by model averaging across the set P of plausible C≥1PMs; and horizontal axis, number of injected cells xi at each cell dose group i. (A) LDTA no. 1a: Hoxa-9−/− bone marrow cells. (B) LDTA no. 1b: wild-type bone marrow cells. (C) LDTA no. 2a: Notch ligand (delta1-IgG) stimulated CD34+ cells, 3 weeks after transplantation. (D) LDTA no. 2b: CD34+ cultured with control IgG, 3 weeks after transplantation. (E) LDTA no. 2c: noncultured CD34+, 3 weeks after transplantation. (F) LDTA no. 2d: Notch ligand (delta1-IgG) stimulated CD34+ cells, 9 weeks after transplantation. (G) LDTA no. 2e: CD34+ cultured with control IgG, 9 weeks after transplantation. (H) LDTA no. 2f: noncultured CD34+, 9 weeks after transplantation. Blue line indicates fitted SHPM regression line; red line, fitted C≥1PM-averaged regression line; and black symbols, experimental data (yi/Ni, xi), where yi is the number of mice with negative outcome, Ni is the total number of mice, and xi is the number of injected cells, at each cell dose i. (A-H) The values of χ2/df ratios (Pearson χ2-dispersion statistics) highlight that the C≥1PM class better fit to the data than the SHPM in 7 of the 8 LDTAs: the lower value of this ratio, the better fit of the model to the data (supplemental data).

Figure 1

Fitted SHPM and C≥1PM-averaged regression lines for the 8 LDTAs. Vertical axis represents expected fraction, termed yi, of negative mice predicted by the SHPM, or expected fraction, termed yi, of negative mice predicted by model averaging across the set P of plausible C≥1PMs; and horizontal axis, number of injected cells xi at each cell dose group i. (A) LDTA no. 1a: Hoxa-9−/− bone marrow cells. (B) LDTA no. 1b: wild-type bone marrow cells. (C) LDTA no. 2a: Notch ligand (delta1-IgG) stimulated CD34+ cells, 3 weeks after transplantation. (D) LDTA no. 2b: CD34+ cultured with control IgG, 3 weeks after transplantation. (E) LDTA no. 2c: noncultured CD34+, 3 weeks after transplantation. (F) LDTA no. 2d: Notch ligand (delta1-IgG) stimulated CD34+ cells, 9 weeks after transplantation. (G) LDTA no. 2e: CD34+ cultured with control IgG, 9 weeks after transplantation. (H) LDTA no. 2f: noncultured CD34+, 9 weeks after transplantation. Blue line indicates fitted SHPM regression line; red line, fitted C≥1PM-averaged regression line; and black symbols, experimental data (yi/Ni, xi), where yi is the number of mice with negative outcome, Ni is the total number of mice, and xi is the number of injected cells, at each cell dose i. (A-H) The values of χ2/df ratios (Pearson χ2-dispersion statistics) highlight that the C≥1PM class better fit to the data than the SHPM in 7 of the 8 LDTAs: the lower value of this ratio, the better fit of the model to the data (supplemental data).

Close modal

The online version of this article contains a data supplement.

The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

This work was supported by Association pour la Recherche sur le Cancer (grant 3218).

Contribution: T.B. performed mathematical and statistical data modeling; and M.C. and T.B. cowrote the paper.

Conflict-of-interest disclosure: The authors declare no competing financial interests.

Correspondence: Thierry Bonnefoix, Inserm, U823, Oncogenic Pathways in the Haematological Malignancies, Institut Albert Bonniot, Grenoble Cedex 9, France; e-mail: thierry.bonnefoix@ujf-grenoble.fr.

1
Szilvassy
 
SJ
Humphries
 
RK
Lansdorp
 
PM
Eaves
 
AC
Eaves
 
CJ
Quantitative assay for totipotent reconstituting hematopoietic stem cells by a competitive repopulation strategy.
Proc Natl Acad Sci U S A
1990
, vol. 
87
 
22
(pg. 
8736
-
8740
)
2
Purton
 
LE
Scadden
 
DT
Limiting factors in murine hematopoietic stem cell assays.
Cell Stem Cell
2007
, vol. 
1
 
3
(pg. 
263
-
270
)
3
Taswell
 
C
Limiting dilution assays for the determination of immunocompetent cell frequencies: I. Data analysis.
J Immunol
1981
, vol. 
126
 
4
(pg. 
1614
-
1619
)
4
Lawrence
 
HJ
Christensen
 
J
Fong
 
S
et al. 
Loss of expression of the Hoxa-9 homeobox gene impairs the proliferation and repopulating ability of hematopoietic stem cells.
Blood
2005
, vol. 
106
 
12
(pg. 
3988
-
3994
)
5
Delaney
 
C
Heimfeld
 
S
Brashem-Stein
 
C
et al. 
Notch-mediated expansion of human cord blood progenitor cells capable of rapid myeloid reconstitution.
Nat Med
2010
, vol. 
16
 
2
(pg. 
232
-
236
)
6
Porter
 
HE
Berry
 
RJ
The efficient design of transplantable tumour assays.
Br J Cancer
1963
, vol. 
17
 (pg. 
583
-
595
)
7
Bonnefoix
 
T
Bonnefoix
 
P
Verdiel
 
P
Sotto
 
JJ
Fitting limiting dilution experiments with generalized linear models results in a test of the single-hit Poisson assumption.
J Immunol Methods
1996
, vol. 
194
 
2
(pg. 
113
-
119
)
8
Hu
 
Y
Smyth
 
GK
ELDA: extreme limiting dilution analysis for comparing depleted and enriched populations in stem cell and other assays.
J Immunol Methods
2009
, vol. 
347
 
1–2
(pg. 
70
-
78
)
9
Collett
 
D
Modelling Binary Data
2003
Boca Raton, FL
Chapman & Hall
10
Burnham
 
KP
Anderson
 
DR
Model Selection and Multimodel Inference: A Practical Information—Theoretic Approach
2002
New York, NY
Springer
11
Anderson
 
DR
Model Based Inference in the Life Sciences
2008
New York, NY
Springer
12
Claeskens
 
G
Carroll
 
RJ
An asymptotic theory for model selection inference in general semiparametric problems.
Biometrika
2007
, vol. 
94
 
2
(pg. 
249
-
265
)
13
Buena
 
F
Consistent covariate selection and post model selection inference in semiparemetric regression.
Ann Stat
2004
, vol. 
32
 
3
(pg. 
898
-
927
)
14
Hjort
 
NL
Claeskens
 
G
Frequentist model average estimators.
J Am Stat Assoc
2003
, vol. 
98
 
464
(pg. 
879
-
899
)
15
Claeskens
 
G
Model Selection and Model Averaging
2008
Cambridge, United Kingdom
Cambridge University Press
16
Kent
 
DG
Copley
 
MR
Benz
 
C
et al. 
Prospective isolation and molecular characterization of hematopoietic stem cells with durable self-renewal potential.
Blood
2009
, vol. 
113
 
25
(pg. 
6342
-
6350
)
Sign in via your Institution