TO THE EDITOR:
The PAX5 (paired-box domain gene 5) gene encodes a transcription factor with a key role in regulating B-cell differentiation.1 PAX5 abnormalities are observed in ∼30% of B-lymphoblastic leukemia (B-ALL) cases, occurring as secondary changes in association with different subtype-defining genetic drivers,2 but recently also described as primary oncogenic alterations in 2 novel genetic subtypes of the disease: PAX5 P80R and PAX5Alt.3,4
A rare but recurrent PAX5 abnormality in B-ALL is the presence of multiple additional copies of several exons within the 5′ end of the gene, referred to as “PAX5 intragenic tandem multiplication (PAX5-ITM)” or “PAX5 intragenic amplification” in the literature.5,6
Although PAX5-ITM has been investigated by a variety of methods, many of its characteristics remain unknown. These include the number of extra copies of the affected region, their position and orientation within the gene, and the effects of the extra copies on the structure and function of the PAX5 gene and protein.
Optical genomic mapping (OGM) is a powerful new technology that enables genome-wide, high-resolution, high-throughput detection of balanced and unbalanced structural variants.7,8 It is based on fluorescent labeling of high-molecular-weight genomic DNA at a short nucleotide motif, followed by linearization and imaging of labeled DNA and comparison of the genome-wide labeling patterns with reference genomes to identify structural variants. Direct visualization of long DNA molecules allows for successful utilization of OGM to study repetitive regions.9,10 These characteristics led us to hypothesize that OGM may be an optimal technique for characterization of PAX5-ITM in B-ALL.
In the present study, we used OGM to perform the first direct visualization of PAX5-ITM, determine the specific number of extra copies of the multiplied region, and confirm their intragenic position and orientation.
We used a retrospective clinical cohort of 412 consecutive pediatric B-ALL cases tested at the Children’s Hospital Los Angeles Center for Personalized Medicine at the time of diagnosis or relapse. Genetic evaluation of leukemia samples included karyotype analysis, fluorescence in situ hybridization studies, chromosomal microarray (CMA) analysis, and the OncoKids® next-generation–sequencing panel, which were conducted as described previously.11 This testing had detected the following subtypes of B-ALL: hyperdiploid (n = 94; 22.8%); ETV6::RUNX1 (n = 61; 14.8%); Philadelphia (Ph)-positive (n = 21; 5%); Ph-like, CRLF2+ (n = 63; 15%); Ph-like, ABL-type (n = 12; 2.9%); KMT2A-rearranged (n = 17; 4%); iAMP21 (n = 15; 3.6%); hypodiploid (n = 12; 3%); TCF3::PBX1 (n = 11; 2.6%); dic(9;20) (n = 15; 3.6%); ERG-deleted/DUX4-rearranged (n = 5; 1.2%); ZNF384-rearranged (n = 5; 1.2%); and IGH-rearranged, MEF2D-rearranged, PAX5 P80R, and PAX5Alt (≤1% each; supplemental Table 1). In 65 cases (15.8%), the subtype-defining genetic driver remained unknown. OGM processing of 42 selected cases (including those positive for PAX5-ITM) was performed at either Bionano Genomics Service Laboratory (Bionano Genomics, San Diego, CA) (19 cases) or Children’s Hospital Los Angeles CPM (23 cases), and the data were analyzed with the Bionano Solve software. Technical details of OGM testing are described in Supplemental Materials.
PAX5-ITM was detected by CMA in 6 of 412 B-ALL cases for a prevalence of 1.4%, consistent with previous reports.5 Basic clinical characteristics, immunophenotype, and other genetic findings for the 6 PAX5-ITM+ cases are provided in Supplemental Table 2. CMA findings for the PAX5-ITM abnormality in the 6 positive cases are shown in Table 1 and Figure 1. CMA results were consistent with 2 extra copies (triplication) of a region that was ∼25 kb in length in 5 cases and ∼50 kb in the remaining case (no. 5). Based on the breakpoints determined by CMA, the affected region included PAX5 exons 2 to 5 in 5 cases and exons 1 to 5 in case 5.
Material for OGM was available for 3 of 6 PAX5-ITM cases. OGM results are displayed in Figure 1. OGM showed the presence of 4 additional copies of the 5′ PAX5 region in cases 1 and 6 and 5 additional copies in case 4. These insertions were not detected by OGM in any of the remaining 39 B-ALL cases in our cohort or in >300 normal control samples tested at the Bionano Genomics Laboratory. The multiplied copies were inserted in situ and in direct orientation, and their approximate size was concordant between OGM and CMA. Notably, in all 3 cases, CMA analysis showed only 2 extra copies of the affected region, demonstrating that CMA analysis, as well as previously published Multiplex Ligation-dependent Probe Amplification (MLPA)–based testing,5 may significantly underestimate the number of extra copies of the affected region in PAX5-ITM.
PAX5-ITM was first comprehensively studied by Schwab et al, who identified this abnormality by MLPA in ∼1% of B-ALL cases in their cohort. The MLPA data suggested the presence of >1 extra copy of the affected region, but the exact copy number was not determined. The PAX5-ITM+ cases lacked other primary, subtype-defining molecular changes, leading to the hypothesis that PAX5-ITM may be associated with a novel genetic subtype of B-ALL.5
In a more recent study, Gu et al used integrated genomic analysis of 1988 childhood and adult B-ALL cases to delineate 23 genetic subtypes, some of them novel. PAX5-ITM was detected in 10 cases in this cohort (0.5%), of which 8 were classified based on their expression profiles as PAX5Alt, whereas no classification was provided for the remaining 2.3 This finding suggested that PAX5-ITM could be one of the abnormalities defining PAX5Alt B-ALL, supporting its association with the unique molecular subtype of B-ALL proposed by Schwab et al.5
Gu et al performed detailed characterization of 1 PAX5-ITM+ case using genome, exome, and transcriptome sequencing.3 The authors showed that multiplication involved exons 2 to 5, which code for the paired-box/DNA binding domain of the PAX5-encoded protein. Transcriptome sequencing confirmed expression of the multiplied exons, whereas reverse transcription-polymerase chain reaction and Sanger sequencing showed that the extra copies resulted in an in-frame junction between exons 5 and 2. Gu et al hypothesized that the abnormality resulted in the presence of extra copies of the paired-box/DNA binding domain within the mutant protein. However, accurate determination of copy number for small gains and losses from short-read sequencing data remains challenging,12 and even this comprehensive genomic analysis did not detect the exact number of copies of the multiplied region; furthermore, the structure of the mutant PAX5 allele was indirectly inferred.
Herein, we describe the first direct visualization and accurate determination of the structure of the PAX5-ITM abnormality. Although the number of the studied cases is small, to our knowledge, this is one of the largest cohorts of B-ALL cases with PAX5-ITM to date at a single institution. Using OGM, we showed intragenic position and tandem orientation of the repeated segments; both of these features had been inferred only indirectly by previous studies. In the 3 characterized cases, OGM showed >1 extra copy of the multiplied region. Although CMA analysis indicated the total copy number to be 4 in all the cases (2 extra copies), OGM showed 4 extra copies in 2 cases and 5 extra copies in 1 case, for a total of 5 and 6 copies, respectively. The underestimation of the copy number by CMA may be related to intrinsic limitations of CMA technology (loss of linearity for the log2 ratio increase for high copy gains) and by the presence of normal (nonleukemic) cells in the tested bone marrow aspirate samples. The presence of normal cells did not interfere with OGM analysis, which allowed separate visualization of mutant and wild-type DNA molecules. Interestingly, previous studies also reported that PAX5-ITM consisted of >1 extra copy of the multiplied region.5 Consistent presence of multiple additional copies of the critical region is intriguing, and may be related to the mechanisms through which the abnormality originates or may reflect the need for multiple copies for the PAX5-ITM abnormality to be oncogenic. Accurate determination of the number of extra copies in a large series of PAX5-ITM+cases, as well as functional studies of the abnormal protein, will be needed to address these questions. Considering that it is likely to result in the presence of extra copies of the DNA-binding region in the PAX5-encoded transcription factor, PAX5-ITM may change its binding affinity or specificity, thus altering its transcription program and disrupting B-cell differentiation.
In summary, we have shown the feasibility and advantages of using OGM to determine the exact number of copies and accurate structure of PAX5-ITM through direct visualization of the repeat region, suggesting OGM as the optimal approach for characterization of this and similar abnormalities in further studies.
Acknowledgments: The authors thank Nick Ranelli and Hayk Barseghyan from Bionano Genomics for technical assistance with figures.
Contribution: G.R., A.E.K., and D.B. initiated the cohort study; J. Jean and G.R. analyzed the OGM data; all authors analyzed the remaining cohort data; G.R., J. Jean, A.E.K., and D.B. wrote the manuscript; A.E.K., A.D., M.O., J. Ji, R.J.S., J.A.B., and D.B. took care of patients or performed clinical laboratory testing; and all authors reviewed and approved the final manuscript.
Conflict-of-interest disclosure: The authors declare no competing financial interests.
Correspondence: Gordana Raca, Children’s Hospital Los Angeles, 4650 Sunset Blvd #173, Los Angeles, CA 90027; e-mail: graca@chla.usc.edu.
References
Author notes
Requests for data sharing may be submitted to Gordana Raca (graca@chla.usc.edu).
The full-text version of this article contains a data supplement.