The usage of the immunoglobulin (Ig) VH3-21 gene is associated with poor prognosis in B-cell chronic lymphocytic leukemia (B-CLL) despite VH gene mutation status. Many VH3-21+ patients also display restricted heavy- and light-chain Ig gene rearrangements, implying a role of antigen selection in disease development. To explore the specific phenotypic/genotypic features among VH3-21+ B-CLLs, we compared gene expression patterns in 15 VH3-21+ and 24 non-VH3-21 patients (11 with unmutated and 13 with mutated VH genes) using Affymetrix microarray analysis (∼12 500 genes). A distinct expression profile was identified for VH3-21+ patients in contrast to the Ig-unmutated and -mutated groups. By applying different algorithms, the data enabled an efficient class discrimination of the VH3-21+ subset based on 27 or 57 genes. A set of genes was sorted out which, using different analytical methods, consistently gave a distinction between VH3-21+ and non-VH3-21 samples. Several of these genes are involved in regulation of DNA replication/cell-cycle control, transcription and protein kinase activity, which may render the VH3-21+ cells with a higher proliferative drive. However, no clear evidence of increased B-cell receptor signaling was found in the VH3-21+ group. Altogether, our identification of a specific VH3-21 profile may provide insights into the pathogenesis of the VH3-21+ subgroup. (Blood. 2005;106:681-689)

The somatic hypermutation status of the immunoglobulin variable heavy chain (IgVH) genes has in the last few years been identified as one of the most important prognostic markers in B-cell chronic lymphocytic leukemia (B-CLL). Roughly half of patients with B-CLL display mutations within their VH genes (Ig-mutated), which correlates with favorable prognosis, whereas the remaining patients have unmutated VH genes and a worse outcome (Ig-unmutated).1-4  The selection of VH genes has been shown to be a nonrandom process in B-CLL with overrepresentation of the VH1-69, VH3-7, VH3-21, and VH4-34 genes.2,5-11  The implication of preferential VH gene usage has been hypothesized to be a result of antigen selection of B cells with certain B-cell receptors (BCRs), which then might render a higher probability to undergo malignant transformation. In Ig-unmutated B-CLL, biased usage of the VH1-69 gene has been shown by several groups in a significant fraction of patients.2,5-11  Lately, small subsets of VH1-69+ cases with more or less identical heavy- and light-chain Ig rearrangement features have also been reported.12,13  These findings have led to the speculation that unknown antigens could be involved in the pathogenesis of VH1-69+ B-CLL.

Recently, an overrepresentation of the VH3-21 gene was demonstrated in B-CLL with mutated VH genes.10  Interestingly, VH3-21-utilizing patients showed short overall survival, similar to patients with unmutated VH genes, despite the fact that 2-thirds of them had mutated VH genes.10,11,14,15  Furthermore, many of these VH3-21+ rearrangements (both mutated and unmutated) displayed unusually short and highly homologous complementarity-determining regions 3 (CDR3s), predominant λ light chain expression, and restricted use of 1 particular Vλ gene, Vλ2-14.11  These latter findings have implicated that an unknown antigen epitope could be recognized by the homologous VH3-21+/Vλ2-14+ BCRs expressed in the different VH3-21+ tumors.16  Thus, VH3-21-utilizing patients comprise a new separate B-CLL entity independent of mutation status and an effort is therefore warranted to identify distinguishing features at the molecular level in order to reveal the biology behind VH3-21+ B-CLL.

Gene expression patterns typical for different hematologic malignancies have been identified using microarray techniques.17-19  Expression profiling of B-CLL has shown a characteristic B-CLL “signature” compared with other malignant lymphomas, but has so far revealed relatively uniform gene expression patterns between the Ig-unmutated and Ig-mutated form of B-CLL.20,21  Interestingly, the expression profiles of both these subsets resembled the signature of memory B cells,20  which is in contrast with the initial speculation that Ig-unmutated B-CLL should derive from a naive B cell.22 

In the present study, we have analyzed gene expression patterns in samples from 39 patients with B-CLL with the specific aim to identify genes that may reflect disturbed regulatory pathways, such as genes involved in BCR signaling, in the subset of patients with VH3-21 usage compared with Ig-mutated and Ig-unmutated cases using other VH genes.

Patient material

Tumor samples were collected from 39 patients with B-CLL at the University Hospitals in Uppsala (n = 17), Linköping (n = 2), and Huddinge (n = 20), Sweden, between 1982 and 2002. Twenty-four samples were obtained from B-CLLs with non-VH3-21 usage, 13 with mutated and 11 with unmutated VH genes, and 15 samples with VH3-21 usage. Tumor material was mainly obtained from peripheral blood but also from bone marrow and lymph nodes. Morphologic classification was performed according to the World Health Organization (WHO) classification and the tumors typically expressed CD5 and CD23 and showed a weak Ig expression.23  The median age at diagnosis was 63 years (range, 45 to 83 years), 25 were men, and 14 women with a male-female ratio of 1.8:1. The Binet/Rai stage was as follows: 10 patients A/0, 10 patients A/I, 4 patients A/III, 3 patients B/I, 1 patient B/II, 3 patients B/III, 2 patients C/III, and 2 patients C/IV. In 4 cases either Rai or Binet stage was available: 2 cases were Rai III, 1 Rai I, and 1 Binet C. Informed consent was provided according to the Declaration of Helsinki, and the study was approved by the ethical committees of Karolinska Institutet, Huddinge University Hospital (Dnr 184/01), and Uppsala University (Ups 01-082).

Isolation of mononuclear cells

Mononuclear cells were obtained by separation on a Lymphoprep gradient (Nycomed Pharma, Roskilde, Denmark). The number of leukemic cells was assessed by immunophenotyping. The median proportion of B cells with CLL phenotype (CD5+/CD19+/CD23+) was 90% (range, 73%-98%).

VH gene analysis

High-molecular-weight DNA was prepared using standard protocols including proteinase K treatment. VH gene family-specific polymerase chain reaction (PCR) amplification was performed using consensus VH/JH primers and the clonal VH gene rearrangements were sequenced as previously described.11,24  The sequences were aligned to germline Ig sequences in the GenBank, V-BASE (MRC Centre for Protein Engineering, Cambridge, United Kingdom), and International ImMuno GeneTics (IMGT) databases. VH gene sequences deviating more than 2% from the corresponding germ-line gene were defined as mutated. The length of the heavy-chain CDR3 was calculated between codons 95 and 102.

RNA isolation, probe preparation, and hybridization

Lymphocytes (50-100 × 106) were collected for total RNA isolation from fresh or frozen samples using the Rneasy Mini Kit (Qiagen, Valencia, CA). The RNA was checked qualitatively on a 1% agarose gel, and quantitatively by spectrophotometry. Probes were prepared according to the recommendations from Affymetrix (Santa Clara, CA) from 8 μg total RNA. HG-U95Av2 arrays, complementary to more than 12 500 sequences, were hybridized, developed, and scanned to obtain quantitative gene expression levels, using standard protocols.

Microarray data analysis

Affymetrix Micro Array Suite Software (MAS) version 5.0 was applied to analyze the scanned images, convert intensities to a numerical format, and obtain a signal value for each probe on the array. Target intensity values from each array were scaled/normalized to a value of 100 to enable different arrays to be compared. A detailed description of the statistical algorithms has been presented.25 

A 3-stage approach was followed in the subsequent microarray data analyses. First, a marker analysis was performed using the GeneCluster2 software26  (Whitehead Institute/MIT Center for Genome Research [WICGR], MA) by computing a ranking list where all individual probes are sorted in descending order according to the degree of correlation between their expression level and the sample group compared. For each sample group in the comparison, this analysis calculates a correlation score to identify the genes (probes) that have consistently high average expression in this group and low in the others (ie, “marker” genes). A measure of the statistical significance of each score can be obtained through a permutation test, where the sample group assignment is randomized iteratively and the probability of obtaining a given score by chance in the present material is calculated. The scores actually obtained in a given sample group comparison can thus be assigned to significance levels of, for example, 1% and 5%.

Second, a class discrimination study was conducted. In contrast to the gene-by-gene examination above, this type of analysis seeks to identify a group of genes whose expression levels collectively can discriminate between 2 sample groups. A common application of such an analysis is to enable classification of unknown samples based on the gene expression signature alone. However, since the basis for classification of samples in the present study is a known genotype (VH3-21 and VH mutation status), such a diagnostic tool is not relevant. The purpose of the class-discriminant analyses used here was instead to suggest groups of genes that may be involved in alternative or complementary molecular pathways that are related to the phenotypic differences between sample groups. Two different algorithms were used for class discrimination: weighted voting (GeneCluster226 ) and linear discriminant analysis (dCHIP27 ; Department of Biostatistics, Harvard School of Public Health, MA).

Third, in an effort to pinpoint a more limited number of genes that may be of particular interest from a functional point of view, the gene lists from the marker and discriminant analyses were combined. Genes that appeared in more than one of the various analyses for the same group (for example, VH3-21) were identified and listed.

Detailed descriptions of all data analyses conducted in this study are included as supplemental data on the Blood website; click the Supplemental Materials link at the top of the online article.

For gene annotation, we used the Gene Ontology Tree Machine (GOTM) software (Graduate School in Genome Science and Technology, University of Tennessee-Oak Ridge National Laboratory, Oak Ridge, TN), a web-based data-mining tool using Gene Ontology (GO) hierarchies to identify significant GO categories in a gene set. A statistical analysis in GOTM indicates GO terms, such as “cell cycle” and “metabolism,” and thereby suggests biologic areas that warrant further study (P < .01).28 

Real-time quantitative PCR

For real-time quantitative PCR, TaqMan Assays-on-Demand (Applied Biosystems, Warrington, United Kingdom) including probes and primers for PPP5C, E2F4, SHARP, CDKN1B, and PPIA = (cyclophilin A) were carried out. First-strand cDNA was synthesized with 1 μg of total RNA using High-Capacity cDNA Archive Kit with random primers (Applied Biosystems). PCR was performed with TaqMan Universal PCR Master Mix and an ABI PRISM 7000 Sequence Detection System (Applied Biosystems) according to the manufacturer's instructions. The relative expression of PPP5C, E2F4, SHARP, and CDKN1B mRNA was normalized to the amount of PPIA in the same cDNA using the standard curve method described by the manufacturer.

VH gene analysis

A total of 46 VH gene rearrangements were amplified and sequenced from 39 B-CLL cases, including 7 cases with 2 rearrangements. Of the 24 non-VH3-21 patients, 13 showed somatically mutated VH genes (mean mutation frequency 7%) while 11 cases were considered unmutated. Among the 15 VH3-21+ patients, 9 were mutated (mean mutation frequency 4%) and 6 unmutated. Eight VH3-21 cases displayed VH rearrangements with a short and homologous CDR3 (7 codons without known D gene usage), whereas the remaining 7 cases had rearrangements with varying CDR3 lengths (10-23 codons). The proportion of cases with a mutated VH3-21 gene and with a short/homologous CDR3 reflects the proportion of cases within the VH3-21+ subset as previously reported11  and as found in an extended study of more than 80 VH3-21+ CLLs (R.R., unpublished data, April 2005). The sequence results have been published previously for all cases included.11  The most frequently used VH genes among the non-VH3-21 gene rearrangements were VH1-69 (n = 6, 20%), VH3-23 (n = 3, 10%), VH3-9 (n = 3, 10%), VH3-11 (n = 3, 10%), and VH2-5 (n = 3, 10%).

Identification of genes distinguishing between VH3-21+ and non-VH3-21 B-CLL samples: marker gene analysis

To identify genes correlating with VH3-21 and non-VH3-21 usage, a marker analysis was performed with the 4701 probes whose expression values passed the variation filter. A permutation testing on the 1000 best-correlated probes (see Supplemental Figure S1) showed that all of the 1000 probes were statistically significant on the 5% level for both VH3-21+ and non-VH3-21 samples, whereas at the 1% level 184 and 152 probes correlated with VH3-21 usage and non-VH3-21 usage, respectively. Figure 1 displays the 50 best-correlated probes for each group. In conclusion, the VH3-21 group was clearly and significantly distinguished by a large number of individual genes from B-CLLs with non-VH3-21 usage.

Figure 1.

Results from a marker gene analysis, where the 50 genes that correlated best with either VH3-21 or non-VH3-21 usage were identified. The expression matrix depicts the gene expression values of the individual samples, with columns representing individual samples and rows representing individual genes. Genes are ranked in a “best-correlated” order. The color scale identifies relative gene expression changes normalized by the SD of 1, with 0 representing the mean expression level of a given gene across the panel. Gene lists are included as Table S1 in the Supplemental Materials.

Figure 1.

Results from a marker gene analysis, where the 50 genes that correlated best with either VH3-21 or non-VH3-21 usage were identified. The expression matrix depicts the gene expression values of the individual samples, with columns representing individual samples and rows representing individual genes. Genes are ranked in a “best-correlated” order. The color scale identifies relative gene expression changes normalized by the SD of 1, with 0 representing the mean expression level of a given gene across the panel. Gene lists are included as Table S1 in the Supplemental Materials.

Close modal

Sample class discriminator analysis

First, the weighted voting method for class discrimination was applied to the data, in which different numbers of individual probes are systematically tested for their ability to collectively discriminate between the sample groups. The range of 22 to 27 probes rendered the best statistically significant outcome in our data set; an evaluation with Fisher test yielded a P value of less than .001 for all class discriminators in the 22- to 27-probe range with a 90% correct classification. Figure 2 shows the outcome for the 27-probe model. With the “leave one out” cross validation, each of the 39 samples that were tested generated a new set of 27 probes. In the final model, 15 out of the 27 probes were common to all 39 samples.

Figure 2.

Genes included in the weighted voting class discriminator model. Genes expressed at higher level in non-VH3-21 samples are listed on top and those that were more abundant in VH3-21+ B-CLL are shown on bottom. The highest accuracy was obtained using 27 probes: a total of 35 (90%) of 39 of samples were correctly predicted (P < .001).

Figure 2.

Genes included in the weighted voting class discriminator model. Genes expressed at higher level in non-VH3-21 samples are listed on top and those that were more abundant in VH3-21+ B-CLL are shown on bottom. The highest accuracy was obtained using 27 probes: a total of 35 (90%) of 39 of samples were correctly predicted (P < .001).

Close modal

Second, another class discriminator algorithm, linear discriminant analysis (LDA), was also applied to ascertain if an even more accurate classification was possible. Fifty-seven probes were obtained after filtering (P =.01) in a comparison analysis and using an internal validation procedure, a 100% correct classification was achieved based on these 57 probes (Figure 3A). A graphical illustration of the separability of the VH3-21 and non-VH3-21 using groups is outlined in Figure 3B.

In summary, it was possible to construct class discriminators in the form of expression patterns based on approximately 30 to 60 probes that with 90% to 100% accuracy could distinguish cells with VH3-21 usage from those with non-VH3-21 usage.

Three-group marker analysis

In order to identify the probes that best correlate with the VH3-21 status when a mutated versus unmutated comparison is performed simultaneously, a 3-group marker analysis was also carried out. Figure 4 shows the 50 best-ranked probes for each group compared with both of the other groups. The permutation testing showed no significant association in either the Ig-unmutated or the Ig-mutated group (comparison A and B, Figure 4). However, 155 and at least 1000 up-regulated probes were associated to the VH3-21+ group at the 1% and 5% significance level, respectively (comparison C, Figure 4). Thus, the overall association of the marker genes for the VH3-21+ group was stronger than the marker genes for either the non-VH3-21-using Ig-unmutated or mutated group in this 3-group comparison.

Figure 3.

LDA analysis in VH3-21+ and non-VH3-21 B-CLL. (A) Genes included in the LDA class discriminator. The expression matrix shows the 57 probes that are reliably differentially expressed between the 2 groups, VH3-21+ B-CLL and non-VH3-21 B-CLL. (B) LDA of VH3-21+ and non-VH3-21 B-CLL. Red squares represent VH3-21+ samples; blue squares, non-VH3-21 samples. LDA was applied to the data of the 39 samples by the 57-probe set. Linear discriminants were determined for each sample and plotted. The prediction rate for samples with known class was 100% (P = .01). The median false discovery rate for this predictor was 1.7%.

Figure 3.

LDA analysis in VH3-21+ and non-VH3-21 B-CLL. (A) Genes included in the LDA class discriminator. The expression matrix shows the 57 probes that are reliably differentially expressed between the 2 groups, VH3-21+ B-CLL and non-VH3-21 B-CLL. (B) LDA of VH3-21+ and non-VH3-21 B-CLL. Red squares represent VH3-21+ samples; blue squares, non-VH3-21 samples. LDA was applied to the data of the 39 samples by the 57-probe set. Linear discriminants were determined for each sample and plotted. The prediction rate for samples with known class was 100% (P = .01). The median false discovery rate for this predictor was 1.7%.

Close modal

Consistency of marker genes related to VH3-21 identified in different comparisons

A combination of the results from all the previous comparisons was performed to identify genes that most consistently could distinguish between VH3-21+ and non-VH3-21 samples, when using more than 1 comparison method. In order to obtain this list, the 20 best-ranked probes, up-regulated or down-regulated in VH3-21 B-CLLs, were listed from the 2 marker gene comparisons described previously. Out of 40 up-regulated and 40 down-regulated probes, 29 genes correlated with VH3-21 status and 38 with non-VH3-21 status. Eight genes were identified as up-regulated in VH3-21 in both marker gene analyses; 2 probes recognising VH3 family gene members, PPP5C: protein phosphatase 5, ARHGEF1: Rho guanine nucleotide exchange factor (GEF) 1, VCP: valosin-containing protein, VDAC1: voltage-dependent anion channel 1, TYMS: thymidylate synthetase, and SMARCD1: SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily d, member 1. Two genes were down-regulated in both marker gene analyses: PLCL2: phospholipase C-like 2 and SHARP: SMART/HDAC1-associated repressor protein.

Figure 4.

Three-group marker analysis in Ig-mutated, unmutated, and VH3-21+B-CLL. Results from the 3-group marker gene analysis, where the 50 genes best-correlated with either (A) Ig-mutated, (B) Ig unmutated, or (C) VH3-21+ B-CLL were identified. Gene lists are included in the Supplementary Materials.

Figure 4.

Three-group marker analysis in Ig-mutated, unmutated, and VH3-21+B-CLL. Results from the 3-group marker gene analysis, where the 50 genes best-correlated with either (A) Ig-mutated, (B) Ig unmutated, or (C) VH3-21+ B-CLL were identified. Gene lists are included in the Supplementary Materials.

Close modal

The 27 probes that showed the best class discrimination in the weighted voting were all present and highly ranked among the 2 comparisons. Of the 57 probes that were obtained in the LDA analysis, 11 genes were in common with the genes identified in the 2 marker gene analysis; 2 out of 5 up-regulated in VH3-21+ samples and 9 out of 38 down-regulated. All comparisons are summarized in Table 1, and the genes are shown in a ranking order according to how many comparisons a given gene was found to be related to VH3-21 status (the VH3-21 vs non-VH3-21 comparison is designated comparison 1 and the 3-group marker analysis as comparison 2 in Table 1).

Sample preparation control experiment

A methodologic issue regarding the collection of samples analyzed in this study relates to their variable sample collection procedure, and storage time. These considerations are normally addressed by including a large number of samples in a study, where these factors can be considered effectively randomized. In our material, however, there was a bias toward the VH3-21+ samples having been frozen before RNA extraction, whereas most of the non-VH3-21 samples had been harvested from fresh cells. We therefore performed a set of control experiments to assess the possible influence of freezing status on the results. Our conclusion is that such an effect is likely to be very minor, based on the following data. Five B-CLL samples were tested such that fresh and frozen samples from the same patient were compared in a marker analysis and probes related to freezing were identified. These probes were then compared with probes identified in the VH3-21+ versus non-VH3-21 comparison. A few probes (3 of 50) were found to be common in the comparisons. These genes were: EIF4G1: eukaryotic translation initiation factor 4 gamma, 1, probe id. 32 844_at, TOP3B: topoisomerase (DNA) III beta, probe id. 40 324_r_at and Cas-Br-M (murine) ectropic retroviral transforming sequence b, probe id. 514_at. It is therefore likely that these common probes related to freezing status and which were low ranked among VH3-21+-related probes, are of very limited relevance for the overall VH3-21+ versus non-VH3-21 comparison profile.

RQ-PCR

The differential expression of 4 genes (PPP5C, E2F4, SHARP, CDKN1B) was investigated by real-time quantitative (RQ)-PCR analysis of 4 VH3-21+ and 4 non-VH3-21 samples. RQ-PCR fold-change values for each gene were calculated and tabulated (Table 2), along with the fold-change analysis obtained from the microarray data. Three out of 4 genes showed changes in the same direction with both methods. However, SHARP, which was down-regulated in VH3-21+ B-CLLs, could not be verified with RQ-PCR. Mapping of the precise sequence coverage of the Affymetrix array probe target sequence and the RQ-PCR primers/probe in the ENSEMBL genome database (www.ensembl.org) showed a clear difference concerning which part of the SHARP mRNA that was detected with the 2 methods (data not shown). Thus, the discrepancy in the experimental results could be explained if SHARP is expressed as alternative mRNAs (eg, splice variants) and the differential regulation only concerns one of the mRNA variants. A specific investigation using multiple probes will be warranted to address this question.

Biological annotation of identified genes

In order to obtain a broader, global description of the biologic patterns in the 2 groups correlating with VH3-21 and non-VH3-21 usage, the gene identifiers were linked to the Gene Ontology database. Pathways in biologic processes and molecular functions that showed significant enrichment for the genes that differed in expression between samples with VH3-21 and non-VH3-21 usage is shown in Table 3, which is based on the 200 best-ranked up- and down-regulated genes that were identified in the marker analysis. The table shows representative GO categories in “Biologic process” and “Molecular function” with the “parent” category and its corresponding “child” categories. For example, VH3-21 samples demonstrated statistical overrepresentation of genes in the functional categories “cell organization and biogenesis,” “cell cycle,” and “protein modification.”

BCR-signaling-related genes

Since the different comparisons in the previous paragraph did not reveal any obvious dysregulation of genes involved in BCR signaling, we also compared the expression levels of a selection of known BCR-related genes29  (BCR-signaling complex genes: SYK, LYN, FYN, BLNK, BLK, VAV1, VAV2, GRB2, ZAP-70; genes associated with BCR-signaling: CD19, CD22, CD23/FCER2; genes involved in downstream pathways affected by BCR signaling: PLCG2, PIK3CD, PIK3C2B, AKT, RAF1, ERK, PRKCB1, JNK, NFKBIE, NFKB2, NFKBIB, RELA, CCND2, NFATC1, NFATC3, GAB2), but only 3 of these genes were among the 200 highest up-regulated/down-regulated genes in the VH3-21 group: NFKB2, CD22, and NFATC1, none of which showed a high-ranking position (positions 133, 105, and 84, respectively).

During recent years, the VH genes have been extensively studied in B-CLL and it has been observed, based on the mutation status, that Ig-unmutated B-CLLs have a more severe disease course than patients with B-CLL with mutated VH regions.1-4  Recently, Tobin et al11  showed that CLL patients who use the VH3-21 gene segment have a poor prognosis despite VH gene mutation status and that many of these cases also display peculiar molecular features with homologous heavy- and light-chain Ig rearrangements, which implies that antigen(s) could be involved in the disease pathogenesis. Based on these data, the VH3-21 subset has been suggested to comprise a third B-CLL entity.14  As a first attempt to investigate differences in gene expression in the VH3-21 group, we have analyzed the global gene expression profile in a group of 39 patients with B-CLL consisting of the 3 different subcategories (eg, Ig-unmutated, Ig-mutated, and VH3-21+ B-CLL).

VH3-21+ B-CLLs display a distinctive gene expression profile

First, a marker analysis revealed that VH3-21+ and non-VH3-21 B-CLLs were clearly separated and at least 1000 markers for each group were significant at the 5% level (Figure 1). Thus, VH3-21+ and non-VH3-21 B-CLLs are distinguished by pronounced differences in gene expression (Figure 1). Next, we attempted to build a model, which would correctly identify the VH3-21+ cases using the expression patterns of a number of genes. With the weighted voting algorithm and a 27-gene model, a 90% correct classification was achieved (Figure 2), whereas the LDA approach yielded an even more accurate classification based on 57 genes (Figure 3). Thus, it was possible to obtain a good discriminatory power for the distinction between VH3-21+ and non-VH3-21 B-CLLs using 2 different algorithms, which shows that these 2 sample groups are distinguishable based only on the expression patterns of a limited number of genes. In addition, the 3-group comparison (Figure 4), including all 3 subgroups in one marker analysis, demonstrated that the VH3-21 group was more clearly and significantly distinguished from the Ig-unmutated/mutated groups than either of these compared with each other. Taken together, our analyses underline the observation that the VH3-21+ group of B-CLL expresses a particular and identifiable gene expression profile, which may be searched for functional clues to the underlying pathology.

Based on the preceding analyses, a composite of the best-ranked identified genes from the 2 marker gene analyses was assembled in Table 1. This list also shows genes that were consistently identified as being related to VH3-21 status using different analytical approaches (eg, LDA and weighted voting). Two IgVH genes, 1 VH3-23 gene, and 1 VH3 family gene (which could represent either the VH3-21, VH3-07, or VH3-74 gene), were among the best reproduced VH3-21-related markers present in both marker gene analyses and the 2 class prediction models. The up-regulated VH3-23 serves as a positive marker and confirms the VH3-21 genotype, since VH3-23 is the product of the VH3-23 gene that is closely related to the VH3-21 gene.

Dysregulated transcription and cell-cycle control in the VH3-21 group

Among the highest ranked up-regulated genes in the VH3-21 group, several are involved in the regulation of transcription and cell-cycle control (eg, PPP5C, E2F4, SMARCD1, ARHGEF1, TYMS, HMGA1 and EIF4G1), which may provide cells with an increased global or specific transcription activity as well as a higher general proliferative drive. An increase in expression of protein phosphatase 5 has been shown to block p53-mediated p21 growth suppression,30  whereas overexpression of the E2F transcription factor 4 was reported to activate transcription.31  The SMARCD1/BAF60a have also been shown to be involved in regulation of transcription.32,33  The function of the up-regulated guanine nucleotide exchange factor (GEF) 1 is less well known. This protein belongs to the Rho guanosine triphosphatases (GTPases), which play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli through G protein-coupled receptors. Furthermore, the up-regulated TYMS protein has been associated with drug resistance in colorectal cancer,34  and this protein as well as the HMGA1 and EIF4G1 proteins was shown to be overexpressed in recent microarray studies of different cancers.35-37 

Among the down-regulated genes in the VH3-21 group, there are several examples of genes involved in repression of transcription and negative control of cell cycle progression (eg, SHARP, SAP18, TBL1X, MXI-1, CDKN1B, CDKN2D, and MADH2). The histone deacetylase (HDAC) complex is known to repress transcription and several proteins involved in this complex were found down-regulated in the VH3-21 group. For instance, reduced SHARP protein, SAP18, and TBL1X38  expression levels may result in reduced HDAC complex formation and thus decreased transcriptional repression. The down-regulated MXI-1 protein is another transcriptional repressor that has been shown to interact with MAX and result in MYC activation.39 

The down-regulated cyclin-dependent kinase (CDK) inhibitor 1B (CDKN1B, p27, Kip1) and cyclin-dependent kinase inhibitor 2D (CDKN2D, p19, inhibits CDK4) are both proteins that bind to CDKs and prevents G1 progression.40  The ING1 tumor suppressor interacts with p53, thereby promoting cell-cycle arrest and apoptosis, and has been reported to show decreased expression in both lung cancer and acute lymphoblastic leukemia (ALL).41-43 MADH2/SMAD2 is involved in transforming growth factor (TGF)-β signaling,44  and mutations in MADH2/SMAD2 will provide resistance of cells to TGF-β induced growth inhibition.

The categorization of genes into functional groups also gives indications of the increased growth and cell proliferation among the VH3-21+ samples. Representative GO categories, such as “regulation of cell cycle,” “DNA replication and chromosome cycle,” and “kinase activity,” contain twice as many genes than expected that are involved in these processes (Table 3), and thus indicate that these enriched functional categories might be of importance for the more aggressive phenotype in VH3-21+ tumors.

No profound evidence of increased BCR signaling in the VH3-21+ group

Considering the speculated antigen involvement in the VH3-21+ group and the possibility of ongoing antigen activation through the VH3-21-encoded BCR, we were interested to study whether this subset displayed up-regulation of BCR-signaling pathways. We therefore compared the expression levels of a selection of BCR-signaling-related genes (“Results”), but no profound overexpression was evident in the VH3-21+ group. Furthermore, 1 signaling molecule that has caught much attention lately is the tyrosine kinase zeta-associated protein (ZAP)-70, which is overexpressed in unmutated B-CLLs and thought to be involved in increased BCR signaling within this subset.21,45-47  However, since we did not deplete the T cells, which also express ZAP-70, we cannot address the issue of ZAP-70 expression in our subgroups properly, in contrast to previous microarray studies.20,21 

In conclusion, we have shown that the VH3-21+ group has a distinctive expression profile compared with Ig-unmutated and Ig-mutated CLLs, and that different class discriminator models can be applied to accurately classify a sample as being VH3-21+ solely based on gene expression data. Among the highest ranked up- and down-regulated genes, several were involved in regulation of DNA replication/cell cycling as well as transcriptional activation and kinase activity. It is thus tempting to speculate that these dysregulated genes could reflect a more proliferative phenotype in VH3-21+ tumors, which is ultimately expressed as a clinically more aggressive disease.

Prepublished online as Blood First Edition Paper, April 7, 2005; DOI 10.1182/blood-2004-10-4073.

Supported by grants from the Swedish Cancer Society and the Stockholm Cancer Society. The Wallenberg Consortium provided support to the Affymetrix core facility.

The online version of the article contains a data supplement.

The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 U.S.C. section 1734.

The authors are grateful to Emma Emanuelsson, Ann-Marie Forsblom, Monika Jansson, Ann Wallblom, Karin Karlsson, and Anna Laurell for skillful technical assistance and for assistance with clinical data, and to Ola Söderberg and Christer Sundström for providing CLL samples.

1
Damle RN, Wasil T, Fais F, et al. Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia.
Blood.
1999
;
94
:
1840
-1847.
2
Hamblin TJ, Davis Z, Gardiner A, Oscier DG, Stevenson FK. Unmutated Ig V(H) genes are associated with a more aggressive form of chronic lymphocytic leukemia.
Blood.
1999
;
94
:
1848
-1854.
3
Kröber A, Seiler T, Benner A, et al. V(H) mutation status, CD38 expression level, genomic aberrations, and survival in chronic lymphocytic leukemia.
Blood.
2002
;
100
:
1410
-1416.
4
Oscier DG, Gardiner AC, Mould SJ, et al. Multivariate analysis of prognostic factors in CLL: clinical stage, IGVH gene mutational status, and loss or mutation of the p53 gene are independent prognostic factors.
Blood.
2002
;
100
:
1177
-1184.
5
Kipps TJ, Tomhave E, Pratt LF, Duffy S, Chen PP, Carson DA. Developmentally restricted immunoglobulin heavy chain variable region gene expressed at high frequency in chronic lymphocytic leukemia.
Proc Natl Acad Sci U S A.
1989
;
86
:
5913
-5917.
6
Schroeder HW, Jr., Dighiero G. The pathogenesis of chronic lymphocytic leukemia: analysis of the antibody repertoire.
Immunol Today.
1994
;
15
:
288
-294.
7
Johnson TA, Rassenti LZ, Kipps TJ. Ig VH1 genes expressed in B cell chronic lymphocytic leukemia exhibit distinctive molecular features.
J Immunol.
1997
;
158
:
235
-246.
8
Fais F, Ghiotto F, Hashimoto S, et al. Chronic lymphocytic leukemia B cells express restricted sets of mutated and unmutated antigen receptors.
J Clin Invest.
1998
;
102
:
1515
-1525.
9
Rosenquist R, Lindstrom A, Holmberg D, Lindh J, Roos G. V(H) gene family utilization in different B-cell lymphoma subgroups.
Eur J Haematol.
1999
;
62
:
123
-128.
10
Tobin G, Thunberg U, Johnson A, et al. Somatically mutated Ig V(H)3-21 genes characterize a new subset of chronic lymphocytic leukemia.
Blood.
2002
;
99
:
2262
-2264.
11
Tobin G, Thunberg U, Johnson A, et al. Chronic lymphocytic leukemias utilizing the VH3-21 gene display highly restricted Vlambda2-14 gene use and homologous CDR3s: implicating recognition of a common antigen epitope.
Blood.
2003
;
101
:
4952
-4957.
12
Tobin G, Thunberg U, Karlsson K, et al. Subsets with restricted immunoglobulin gene rearrangement features indicate a role for antigen selection in the development of chronic lymphocytic leukemia.
Blood.
2004
;
104
:
2879
-2885.
13
Widhopf GF, 2nd, Rassenti LZ, Toy TL, Gribben JG, Wierda WG, Kipps TJ. Chronic lymphocytic leukemia B cells of more than 1% of patients express virtually identical immunoglobulins.
Blood.
2004
;
104
:
2499
-2504.
14
Kröber A, Benner A, Bühler A, Döhner H, Stilgenbauer S. Multivariate Survival Analysis of Specific VH-Genes in CLL: V3-21 and V3-23 Are Prognostic Factors Independently of the VH Mutation Status.
ASH.
2002
;
100
.
15
Lin K, Manocha S, Harris RJ, Matrai Z, Sherrington PD, Pettitt AR. High frequency of p53 dysfunction and low level of VH mutation in chronic lymphocytic leukemia patients using the VH3-21 gene segment.
Blood.
2003
;
102
:
1145
-1146.
16
Tobin G, Soderberg O, Thunberg U, Rosenquist R. V(H)3-21 gene usage in chronic lymphocytic leukemia—characterization of a new subgroup with distinct molecular features and poor survival.
Leuk Lymphoma.
2004
;
45
:
221
-228.
17
Alizadeh AA, Eisen MB, Davis RE, et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.
Nature.
2000
;
403
:
503
-511.
18
Golub TR, Slonim DK, Tamayo P, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.
Science.
1999
;
286
:
531
-537.
19
Shipp MA, Ross KN, Tamayo P, et al. Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.
Nat Med.
2002
;
8
:
68
-74.
20
Klein U, Tu Y, Stolovitzky GA, et al. Gene expression profiling of B cell chronic lymphocytic leukemia reveals a homogeneous phenotype related to memory B cells.
J Exp Med.
2001
;
194
:
1625
-1638.
21
Rosenwald A, Alizadeh AA, Widhopf G, et al. Relation of gene expression phenotype to immunoglobulin mutation genotype in B cell chronic lymphocytic leukemia.
J Exp Med.
2001
;
194
:
1639
-1647.
22
D'Arena G, Di Renzo N, Brugiatelli M, Vigliotti ML, Keating MJ. Biological and clinical heterogeneity of B-cell chronic lymphocytic leukemia.
Leuk Lymphoma.
2003
;
44
:
223
-228.
23
Matutes E, Owusu-Ankomah K, Morilla R, et al. The immunological profile of B-cell disorders and proposal of a scoring system for the diagnosis of CLL.
Leukemia.
1994
;
8
:
1640
-1645.
24
Li AH, Rosenquist R, Forestier E, et al. Clonal rearrangements in childhood and adult precursor B acute lymphoblastic leukemia: a comparative polymerase chain reaction study using multiple sets of primers.
Eur J Haematol.
1999
;
63
:
211
-218.
25
Affymetrix. GeneChip Expression Analysis Data Analysis Fundamentals. http://www.affymetrix.com/support/index.affx; 2003. Accessed on May 27, 2005.
26
Tamayo P, Slonim D, Mesirov J, et al. Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation.
Proc Natl Acad Sci U S A.
1999
;
96
:
2907
-2912.
27
Li C, Wong WH. Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection.
Proc Natl Acad Sci U S A.
2001
;
98
:
31
-36.
28
Zhang B, Schmoyer D, Kirov S, Snoddy J. GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies.
BMC Bioinformatics.
2004
;
5
:
16
.
29
Niiro H, Clark EA. Regulation of B-cell fate by antigen-receptor signals.
Nat Rev Immunol.
2002
;
2
:
945
-956.
30
Urban G, Golden T, Aragon IV, et al. Identification of a functional link for the p53 tumor suppressor protein in dexamethasone-induced growth suppression.
J Biol Chem.
2003
;
278
:
9747
-9753.
31
Yoshida K, Inoue I. Regulation of Geminin and Cdt1 expression by E2F transcription factors.
Oncogene.
2004
;
23
:
3802
-3812.
32
Ring HZ, Vameghi-Meyers V, Wang W, Crabtree GR, Francke U. Five SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin (SMARC) genes are dispersed in the human genome.
Genomics.
1998
;
51
:
140
-143.
33
Hsiao PW, Fryer CJ, Trotter KW, Wang W, Archer TK. BAF60a mediates critical interactions between nuclear receptors and the BRG1 chromatin-remodeling complex for transactivation.
Mol Cell Biol.
2003
;
23
:
6210
-6220.
34
Wang TL, Diaz LA, Jr., Romans K, et al. Digital karyotyping identifies thymidylate synthase amplification as a mechanism of resistance to 5-fluorouracil in metastatic colorectal cancer patients.
Proc Natl Acad Sci U S A.
2004
;
101
:
3089
-3094.
35
Takaha N, Hawkins AL, Griffin CA, Isaacs WB, Coffey DS. High mobility group protein I(Y): a candidate architectural protein for chromosomal rearrangements in prostate cancer cells.
Cancer Res.
2002
;
62
:
647
-651.
36
Cromer A, Carles A, Millon R, et al. Identification of genes associated with tumorigenesis and metastatic potential of hypopharyngeal cancer by microarray analysis.
Oncogene.
2003
;
23
:
2484
-2498.
37
Ulger C, Toruner GA, Alkan M, et al. Comprehensive genome-wide comparison of DNA and RNA level scan using microarray technology for identification of candidate cancer-related genes in the HL-60 cell line.
Cancer Genet Cytogenet.
2003
;
147
:
28
-35.
38
Zhang J, Kalkum M, Chait BT, Roeder RG. The N-CoR-HDAC3 nuclear receptor corepressor complex inhibits the JNK pathway through the integral subunit GPS2.
Mol Cell.
2002
;
9
:
611
-623.
39
Hoffman B, Amanullah A, Shafarenko M, Liebermann DA. The proto-oncogene c-myc in hematopoietic development and leukemogenesis.
Oncogene.
2002
;
21
:
3414
-3421.
40
Hirai H, Adachi T, Tsubata T. Involvement of cell cycle progression in survival signaling through CD40 in the B-lymphocyte line WEHI-231.
Cell Death Differ.
2004
;
11
:
261
-269.
41
Tsang FC, Po LS, Leung KM, Lau A, Siu WY, Poon RY. ING1b decreases cell proliferation through p53-dependent and -independent mechanisms.
FEBS Lett.
2003
;
553
:
277
-285.
42
Kameyama K, Huang CL, Liu D, et al. Reduced ING1b gene expression plays an important role in carcinogenesis of non-small cell lung cancer patients.
Clin Cancer Res.
2003
;
9
:
4926
-4934.
43
Nouman GS, Anderson JJ, Wood KM, et al. Loss of nuclear expression of the p33(ING1b) inhibitor of growth protein in childhood acute lymphoblastic leukaemia.
J Clin Pathol.
2002
;
55
:
596
-601.
44
Heldin CH, Miyazono K, ten Dijke P. TGF-beta signalling from cell membrane to nucleus through SMAD proteins.
Nature.
1997
;
390
:
465
-471.
45
Chen L, Widhopf G, Huynh L, et al. Expression of ZAP-70 is associated with increased B-cell receptor signaling in chronic lymphocytic leukemia.
Blood.
2002
;
100
:
4609
-4614.
46
Crespo M, Bosch F, Villamor N, et al. ZAP-70 expression as a surrogate for immunoglobulin-variable-region mutations in chronic lymphocytic leukemia.
N Engl J Med.
2003
;
348
:
1764
-1775.
47
Wiestner A, Rosenwald A, Barry TS, et al. ZAP-70 expression identifies a chronic lymphocytic leukemia subtype with unmutated immunoglobulin genes, inferior clinical outcome, and distinct gene expression profile.
Blood.
2003
;
101
:
4944
-4951.

Supplemental data

Sign in via your Institution