To the editor:

We read with interest the report from Hong et al1  on the incremental prognostic value of gene expression signatures in diffuse large B-cell lymphoma (DLBCL). We were surprised by the conclusions that these “signatures are inferior to clinical factors and provide little added value in risk assessment” when considering a 2-gene score (TGS) and a 6-gene score described by us2-4  and others.5  The authors’ claim that “all studies assess predictive significance based on P value from multivariable Cox regression” was also surprising, because dedicated parts of prior studies specifically addressed added prognostic value. We reported robust risk reclassification by integration of molecular indices (TGS) with clinical factors within the international prognostic index (IPI) (TGS-IPI),2  and both our study and the study of Lenz et al5  demonstrated splitting of IPI strata using molecular signatures.

We favored risk reclassification as a measure of added value from molecular indices over more complex statistical tests for several reasons (pie charts in figure 4B of Alizadeh et al2 ). First, even statistically significant biomarkers may yield minimal improvement in the area under the curve.6,7  Therefore, many statisticians consider C-statistics unsuitable for assessing improvement in prediction, preferring the use of measures such as net reclassification improvement and integrated discrimination improvement (IDI).7-10  Second, C-statistics and IDI lack an intuitive interpretation for clinicians. However, most surprisingly, we came to the opposite conclusion as the authors in considering these indices in 3 cohorts of DLBCL patients (n = 561), including an important validation cohort that they did not consider. We provide these new data here (Figure 1A), having originally opted not to present them for space considerations.

Figure 1

Gene expression signatures add to risk assessment in DLBCL. (A) Estimates of prognostic value added by consideration of expression of two genes (TGS) to the five components of the IPI for predicting risk of death with DLBCL reflect significant improvement in a combined risk model (TGS-IPI) than with clinical indices alone (IPI). NRI, net reclassification improvement.9  NRI was considered both with infinite strata and with 3 predefined risk strata considering 0% to 10%, 11% to 30%, and >30% risk of death within the follow-up interval. Use of alternate microarray probe set summarization methods to estimate expression levels of (B) LMO2 and (C) TNFRSF9 and double logarithm transformation result in significant distortion of the (D) TGS, whether in patients treated with CHOP (black) or RCHOP (red).

Figure 1

Gene expression signatures add to risk assessment in DLBCL. (A) Estimates of prognostic value added by consideration of expression of two genes (TGS) to the five components of the IPI for predicting risk of death with DLBCL reflect significant improvement in a combined risk model (TGS-IPI) than with clinical indices alone (IPI). NRI, net reclassification improvement.9  NRI was considered both with infinite strata and with 3 predefined risk strata considering 0% to 10%, 11% to 30%, and >30% risk of death within the follow-up interval. Use of alternate microarray probe set summarization methods to estimate expression levels of (B) LMO2 and (C) TNFRSF9 and double logarithm transformation result in significant distortion of the (D) TGS, whether in patients treated with CHOP (black) or RCHOP (red).

Close modal

Although Hong et al do not provide detailed methods to fully address this, several aspects of the analysis may have led to the discrepancy in their conclusions.

  1. There is an error in the LIM domain only 2 (rhombotin-like 1) (LMO2) coefficient (0.032 instead of 0.32) for calculating the TGS.

  2. The model parameters we reported weight gene expression values measured by quantitative real-time reverse-transcription polymerase chain reaction, with array data needing appropriate rescaling. The TGS values obtained by the authors are thus incorrect, because rescaling is equivalent to changing the relative weightings of the genes in the model, similar to how an equation considering “age” would differ depending on whether it was measured in days or years.

  3. We used a custom gene-level chip definition to renormalize probe-level Affymetrix array data from Lenz et al,5  providing more accurate gene expression quantification (Figure 1B-C).

  4. The publicly available normalized expression data used by the authors are already log2 transformed. Doing this again as suggested in the methods would result in data distortion (Figure 1D).

  5. The authors state that gene expression scores do not add to “clinical factors,” but it is not stated whether they considered the 4 original IPI risk groups or its individual components, which yield up to 6 strata. If the latter, then the authors have refit (over-fit) the components of IPI to the test cohort and instead should have compared IPI to TGS-IPI, a composite model that we defined and tested in independent cohorts receiving the R-CHOP regimen (combining Rituximab with Cyclophosphamide, Hydroxydaunorubicin, Oncovin [vincristine], and Prednisone) (Figure 1A).

Importantly, the authors only assess TGS (derived in patients receiving rituximab) in an older cohort of patients not receiving rituximab. This is of little direct clinical relevance because RCHOP is the current therapeutic standard for DLBCL. They also suggest that gene signatures may provide additional information only in intermediate-risk patients. We argue that this is exactly the population where more refined prognostic indices are most valuable, because these are the patients for whom it is most difficult to make therapeutic decisions based on the current care standards.

Contribution: A.J.G. and A.A.A. designed and performed analyses, and wrote the article.

Conflict-of-interest disclosure: The authors declare no competing financial interests.

Correspondence: Andrew J. Gentles, Stanford University Lucas Center for Imaging, 1201 Welch Rd, Stanford, CA 94305; email: andrewg@stanford.edu.

1
Hong
 
F
Kahl
 
B
Gray
 
R
Incremental value in outcome prediction with gene expression-based signatures in diffuse large B-cell lymphoma.
Blood
2013
, vol. 
121
 
1
(pg. 
156
-
158
)
2
Alizadeh
 
AA
Gentles
 
AJ
Alencar
 
AJ
et al. 
Prediction of survival in diffuse large B-cell lymphoma based on the expression of 2 genes reflecting tumor and microenvironment.
Blood
2011
, vol. 
118
 
5
(pg. 
1350
-
1358
)
3
Alizadeh
 
AA
Gentles
 
AJ
Lossos
 
IS
et al. 
Molecular outcome prediction in diffuse large-B-cell lymphoma.
N Engl J Med
2009
, vol. 
360
 
26
(pg. 
2794
-
2795
)
4
Lossos
 
IS
Czerwinski
 
DK
Alizadeh
 
AA
et al. 
Prediction of survival in diffuse large-B-cell lymphoma based on the expression of six genes.
N Engl J Med
2004
, vol. 
350
 
18
(pg. 
1828
-
1837
)
5
Lenz
 
G
Wright
 
G
Dave
 
SS
et al. 
Lymphoma/Leukemia Molecular Profiling Project
Stromal gene signatures in large-B-cell lymphomas.
N Engl J Med
2008
, vol. 
359
 
22
(pg. 
2313
-
2323
)
6
Lyssenko
 
V
Jonsson
 
A
Almgren
 
P
et al. 
Clinical risk factors, DNA variants, and the development of type 2 diabetes.
N Engl J Med
2008
, vol. 
359
 
21
(pg. 
2220
-
2232
)
7
Cook
 
NR
 
Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115(7):928-935
8
McGeechan
 
K
Macaskill
 
P
Irwig
 
L
et al. 
Assessing new biomarkers and predictive models for use in clinical practice: a clinician’s guide.
Arch Intern Med
2008
, vol. 
168
 
21
(pg. 
2304
-
2310
)
9
Pencina
 
MJ
D'Agostino
 
RB
Pencina
 
KM
et al. 
 
Intepreting incremental value of markers added to risk prediction models. Am J Epidemiol. 2012;176(6):473-481
10
So
 
H-C
Sham
 
PC
A unifying framework for evaluating the predictive power of genetic variants based on the level of heritability explained.
PLoS Genet
2010
, vol. 
6
 
12
pg. 
e1001230
 
Sign in via your Institution