Key Points
MHCI-restricted TCRs exhibit an explicit preference for a single MHCI-peptide length.
Effective CD8+ T-cell immunity can only be achieved by length-matched Ag-specific T-cell clonotypes.
Abstract
αβ-TCRs expressed at the CD8+ T-cell surface interact with short peptide fragments (p) bound to MHC class I molecules (pMHCI). The TCR/pMHCI interaction is pivotal in all aspects of CD8+ T-cell immunity. However, the rules that govern the outcome of TCR/pMHCI engagement are not entirely understood, and this is a major barrier to understanding the requirements for both effective immunity and vaccination. In the present study, we discovered an unexpected feature of the TCR/pMHCI interaction by showing that any given TCR exhibits an explicit preference for a single MHCI-peptide length. Agonists of nonpreferred length were extremely rare, suboptimal, and often entirely distinct in sequence. Structural analysis indicated that alterations in peptide length have a major impact on antigenic complexity, to which individual TCRs are unable to adapt. This novel finding demonstrates that the outcome of TCR/pMHCI engagement is determined by peptide length in addition to the sequence identity of the MHCI-bound peptide. Accordingly, the effective recognition of pMHCI Ag, which is a prerequisite for successful CD8+ T-cell immunity and protective vaccination, can only be achieved by length-matched Ag-specific CD8+ T-cell clonotypes.
Introduction
MHC class I (MHCI) molecules present short peptide fragments at the cell surface, enabling inspection of the intracellular proteome by CD8+ T cells. This elegant display system allows clearance of cells that exhibit abnormal biochemistry or express non-self, pathogen-derived proteins. The MHCI complex contains an Ag-binding cleft specifically designed to cradle a single peptide fragment. CD8+ T cells engage pMHCI molecules via their clonotypic αβ-TCRs, which are encoded by variable (V), diversity (D), joining (J), and constant (C) gene fragments. These fragments recombine, with additional N-nucleotide insertions/deletions at V(D)J junctions, to produce a theoretical repertoire of 1015-1020 unique receptors.1-3 As a consequence of size limitations and thymic selection, the in vivo TCR repertoire has been estimated at 2.5 × 107 receptors.4,5 The focal point of TCR diversity is manifested in the Ag-binding site, which comprises 6 highly flexible complementarity determining regions (CDRs).6
TCRs are inherently cross-reactive toward pMHC molecules. Indeed, a recent comprehensive analysis of TCR cross-reactivity demonstrated that a single TCR is capable of recognizing more than one million different peptides of defined length presented in the context of a single MHCI restriction element.7 This feature explains how the naive TCR repertoire, which is dwarfed by the potential number of pMHC molecules that could be encountered in nature, can successfully provide broad antigenic coverage. TCR cross-reactivity is essential for normal processes such as positive selection in the thymus and survival of naive T cells in the periphery. A consequence of this cross-reactivity, however, is a high frequency of T cells capable of recognizing distinct peptides in the context of non-self MHC, a phenomenon known as alloreactivity.8 In addition, several studies have demonstrated the induction of autoimmunity with exogenous microbial peptides9 and, more recently, the ability of a single TCR to recognize both pMHCI and pMHCII Ags.10-12 The huge scope for CDR loop flexibility on binding to pMHCI13,14 and the ability of the TCR to force changes in the shape of the peptide15 provide a structural explanation for the ability of a single TCR to recognize a multitude of different pMHC shapes.15-17
Initially, it was thought that MHCI presentation was restricted to peptides 8-10 amino acids in length, but it is now established that peptides > 10 amino acids in length play a significant role in immune surveillance by CD8+ T cells.18 As such, the TCR repertoire must be able to recognize all possible MHCI-associated peptide lengths (8-14 amino acids) to achieve effective immunity. Given the highly cross-reactive nature of TCR recognition, as well as the inherent bias that TCRs are considered to have for the MHC, it seemed reasonable to assume that TCRs can adapt to and recognize peptides of a different length bound to the same allotype. To address this assumption, we synthesized 6 combinatorial peptide libraries (CPLs) ranging from 8-13 amino acid residues in length and measured TCR recognition across an array of human MHCI-restricted CD8+ T-cell clones. These αβ-T-cell clones were raised against a range of pMHCI Ags derived from both foreign and self-Ags. Unexpectedly, we found that strict limits exist to TCR plasticity in that each receptor is predisposed to engage peptides of a defined length. Thus, while any given TCR is capable of recognizing tens of thousands, if not millions, of different peptides, these peptides must be of an explicit length that varies depending on the individual TCR.
Methods
Cells
The following HLA A*0201–restricted CD8+ T-cell clones were used in this study: 1E6, specific for the preproinsulin-derived epitope ALWGPDPAAA (residues 15-24)19 ; ILA1, specific for the human telomerase reverse transcriptase (hTERT)–derived epitope ILAKFLHWL (residues 540-548)20 ; and MEL5, specific for the Melan-A–derived epitope ELAGIGILTV (residues 26-35).21 In addition, the following HLA B*3508–restricted CD8+ T-cell clones were used: SB14, specific for the EBV-derived EBNA-1 epitope HPVGEADYFEY (residues 407-417)22 ; SB10, specific for the HCMV-derived pp65 epitope CPSQEPMSIYVY (residues 103-114)23 ; and SB27, specific for the EBV-derived BZLF1 epitope LPEPLPQGQLTAY (residues 52-64).24 The recognition parameters of these CD8+ T-cell clones are detailed in Table 1. C1R-HLA A*0201 target cells and T2-HLA B*3508 target cells were generated as described previously.25
Sizing scan
The following mixtures were used to define the MHCI-peptide length preference of individual TCRs: X8, X9, X10, X11, X12, and X13 (where X is any of the 19 proteogenic L-amino acids excluding cysteine; Pepscan Presto). Sizing scan parameters are detailed in Table 2. CD8+ T-cell clones were washed and rested overnight in RPMI 1640 medium containing 100 U/mL of penicillin, 100 μg/mL of streptomycin, 2mM l-glutamine, and 2% heat-inactivated FCS (all Life Technologies). In 96-well, U-bottom plates, 6 × 104 target cells were incubated with sizing scan mixtures (at 1mM) in duplicate for 2 hours at 37°C. After peptide pulsing, 3 × 104 clonal CD8+ T cells were added and the assay was incubated overnight at 37°C. The supernatant was assayed by MIP1β ELISA (R&D Systems).
pMHCI stability assays
For T2 cell-based pMHCI-binding assays, the T-lymphoblastoid hybrid cell line 0.174 × CEM.T2 was used. These cells, referred to as T2 cells, lack the transporter associated with Ag processing (TAP); therefore, the addition of exogenous binding peptide is required for stable cell-surface expression of MHCI. A total of 0.5 × 106 T2 cells were incubated in RPMI 1640 medium (Life Technologies) with various concentrations of the indicated peptide or peptide mixture for 14-16 hours at 26°C, then for 2 hours at 37°C, before staining for HLA A*0201 surface expression with the PE-labeled mAb BB7.2 (BD Biosciences). Duplicate samples for each condition were acquired using a FACSCanto II flow cytometer (BD Biosciences). Data were analyzed with FlowJo Version 9.5.3 software (TreeStar).
Combinatorial peptide library scans
The 8mer, 9mer, 10mer, 11mer, 12mer, and 13mer CPLs were synthesized in positional scanning format26 (Pepscan). CPL parameters are detailed in Table 3. For CPL screening, CD8+ T-cell clones were washed and rested overnight in RPMI 1640 medium containing 100 U/mL of penicillin, 100 μg/mL of streptomycin, 2mM l-glutamine, and 2% heat-inactivated FCS (Life Technologies). In 96-well, U-bottom plates, 6 × 104 target cells were incubated with various library mixtures (at 100μM) in duplicate for 2 hours at 37°C. After peptide pulsing, 3 × 104 clonal CD8+ T cells were added and the assay was incubated overnight at 37°C. The supernatant was assayed by MIP1β ELISA (R&D Systems).
Peptide titration assays: MIP1β ELISA and quantification of T-cell cross-reactivity
Six × 104 target cells were pulsed with peptide at the indicated concentrations for 2 hours at 37°C. Three × 104 clonal CD8+ T cells were added and the assay was incubated overnight at 37°C. Subsequently, the supernatant was assayed for MIP1β by ELISA (R&D Systems). Functional sensitivity is expressed by the pEC50 for each peptide with respect to the TCR. This is defined as minus 1 times the base-10 logarithm (p) of the 50% efficacy concentration (EC50); a greater functional sensitivity is indicated by a larger pEC50 value, which was estimated as described previously.7 T-cell cross-reactivity was quantified by CPL-based importance sampling as described previously.7
pMHCI staining and flow cytometry
Soluble biotinylated pMHCI monomers were produced as described previously.21 Tetrameric pMHCI reagents (tetramers) were constructed by the addition of PE-conjugated streptavidin (Life Technologies) at a pMHCI:streptavidin molar ratio of 4:1. A total of 5 × 104 MEL5 CD8+ T cells were stained with PE-conjugated tetramer (25 μg/mL) folded around the indicated peptides for 15 minutes at 37°C (for review, see Wooldridge et al27 ) before staining with 5 μL of 7-aminoactinomycin D (Viaprobe; BD Biosciences) for 30 minutes at 4°C. Data were acquired using a FACSCanto II flow cytometer (BD Biosciences) and analyzed with FlowJo software (TreeStar).
Crystallization, structure determination, and refinement
HLA A*0201 in complex with ILAKFLHRL (A2-ILA) was refolded as described previously21 and then concentrated to 10 mg/mL in 10mM MES and 10mM NaCl. Screens were set up in 96-well Intelli-plates (Art Robbins Instruments) by the crystal Phoenix robot (Art Robbins Instruments) applying the sitting drop vapor diffusion technique. Next, 0.2 μL of HLA A*0201 ILAKLFHRL and 0.2 μL of crystallization buffer were dispensed into the small reaction well and 60 μL of crystallization buffer was dispensed into the large reservoir. Intelli-plates were then sealed, incubated at room temperature in a crystallization incubator (Molecular Dimensions), and analyzed for crystal formation. Crystals selected for further analysis grew in 0.1M MES (pH 7), 15% PEG 4000, 0.2M (NH4)2SO4, and were cryoprotected with ethylene-glycol to 25% and then flash cooled in liquid nitrogen in Litho loops (Molecular Dimensions). Diffraction data were collected on beamline I03.1 using a fixed wavelength of 0.9163 Å at the Diamond light source, Oxford, with a Pilatus 2M detector. Using a rotation method, 400 frames were recorded, each covering 0.5° of rotation. Reflection intensities were estimated with the MOSFLM package, and the previously solved A24-VYG structure (PDB = 2BCK)28 was used as a model for molecular replacement. Refinement and rebuilding was carried out using COOT and the data were scaled, reduced, and analyzed with the CCP4 package (PDB code: 414W).
Results
MHCI-restricted T-cell clones exhibit explicit peptide length recognition footprints
To determine the extent of peptide length cross-reactivity displayed by TCRs, we examined the functional recognition of a custom-built “sizing scan” comprising random peptide libraries of different lengths (X8, X9, X10, X11, X12, and X13, where X = any of the 19 L-amino acids excluding cysteine; Table 2) by a panel of CD8+ T-cell clones raised against peptides ranging from 9-13 residues (Table 1). The CD8+ T-cell clones used in the study were restricted by 1 of 2 different HLA class I alleles (HLA A*0201 or HLA B*3508) and targeted epitopes derived from a range of sources: EBV and HCMV, the tumor-associated Ags Melan-A and telomerase, and preproinsulin, a self-Ag implicated in type 1 diabetes (Table 1). This spectrum of recognition specificities permitted a broad overview of TCR cross-reactivity. The sizing scan suggested that each CD8+ T-cell clone exhibited a preference for MHCI-peptide length (Figure 1 and supplemental Figure 1, available on the Blood Web site; see the Supplemental Materials link at the top of the online article). This MHCI-peptide length preference was not random, but reflected the length of the peptide sequence against which each clone had been raised. Accordingly, the ILA1, 1E6, SB14, SB10, and SB27 clones demonstrated a recognition preference for 9mer, 10mer, 11mer, 12mer, and 13mer peptides (Figure 1), respectively.
Although the sizing scans indicated that individual TCRs can exhibit an MHCI-peptide length preference, more detailed data can be obtained by performing positional scanning CPL scans across multiple peptide lengths. Therefore, we examined MHCI-peptide length preference by comparing the functional recognition of 8-13mer CPLs by 4 different CD8+ T-cell clones (ILA1, 1E6, MEL5, and SB27; Table 3). Figure 2A through D show, in heat map form, the detailed TCR recognition footprints of all 4 CD8+ T-cell clones (raw data are shown in supplemental Figures 2-5). ILA1, 1E6, MEL5, and SB27 exhibited a large number of responses to peptide mixtures from the corresponding length-matched CPLs, in accordance with the results obtained from the sizing scan analysis (Figure 1). In stark contrast, CPL scans of other lengths elicited a very low number of positive responses, with a large number of mixtures failing to elicit outputs above background levels (Figure 2A-D). Some functional responses could be detected at the nonpreferred peptide length for all CD8+ T-cell clones, although it was not possible to identify responses at all peptide positions at the nonpreferred lengths (with the possible exception of the 8mer CPL scan on MEL5; Figure 2C). Therefore, detailed CPL scans of multiple CD8+ T-cell clones suggest that every TCR is characterized by a defined MHCI-peptide length recognition footprint.
MHCI-peptide length preference is not determined by MHCI binding
HLA A24, B7, and B35 molecules are capable of binding peptides of different lengths (8-14 amino acids).29,30 However, limited data are available for the peptide length binding specificity of HLA A*0201 and this has not been assessed previously using unbiased peptide mixtures. Therefore, we examined the ability of HLA A*0201 to bind sizing scan mixtures of different lengths (Table 2). To accomplish this, we measured the peptide-dependent stability of HLA A*0201 on the surface of T2 cells by flow cytometry. These experiments demonstrated that HLA A*0201 is capable of binding to all sizing scan mixtures that span 8-13 amino acids in length (Figure 3 and supplemental Figure 6). Importantly, under conditions that were identical to those used in the sizing scan assays (Figure 1 and supplemental Figure 1), sizing scan mixtures of all lengths bound efficiently to cell-surface HLA A*0201 (Figure 3). In addition, levels of binding for all peptide lengths were similar to those observed for the immunodominant influenza A matrix protein (M1)–derived epitope GILGFVFTL (residues 58-66; Figure 3), which is known to bind strongly to HLA A*0201.31,32 These data suggest that the peptide length preference exhibited by each individual TCR is not determined by differential MHCI-peptide binding. Instead, the peptide length preference of individual CD8+ T-cell clones appears to be determined strictly by the TCR.
Peptides designed from CPL scan data confirm MHCI-peptide length preference
Next, we designed peptide sequences based on the preferred amino acids at each peptide position as revealed by CPL scan analysis (Figure 2) for the CD8+ T-cell clones 1E6, MEL5, and SB27. Previously, we demonstrated that > 6 × 104 decameric peptides can act as substantially more potent agonists for the 1E6 TCR compared with the index peptide.7 Remarkably, in contrast, activation of 1E6 CD8+ T cells with a 9mer CPL scan yielded a very low number of positive responses relative to the 10mer CPL scan (Figure 2B). To study this observation in more detail, we designed panels of 9mer and 10mer peptides from combinations of the amino acids that elicited the strongest responses at each position observed in the corresponding CPL scans. The 9mer peptides tested on 1E6 were not recognized (Figure 4A-B). In contrast, the majority of 10mer peptides were recognized efficiently, with 13 of 17 acting as more potent agonists for this clone compared with the index peptide (Figure 4C-D). These data demonstrate that the 1E6 TCR exhibits a stringent preference for 10mer peptides compared with peptide sequences that are 9 amino acids in length, thereby verifying the CPL scan data (Figure 2B).
The MEL5 CD8+ T-cell clone also exhibited a strong preference for 10mer peptides in the corresponding CPL scan (Figure 2C). In addition, the best-recognized CPL sublibrary did not always correspond to the amino acid residue present in the index ELAGIGILTV (ELA) peptide (Figure 2C). Multiple substitutions at position one (P1) of the index ELA peptide produced more potent agonists (Figure 5A), which is consistent with the structure of the TCR/pMHC complex33 (supplemental Figure 7). Here, the CDR1α loop of the MEL5 TCR lies closely over the N-terminus of the ELA peptide, offering opportunities for additional atomic contacts. A strong preference for P3-Thr and P8-Ile was also observed (Figure 5B). Single, double, and triple substitutions in the ELA peptide yielded a set of very strong agonists, the majority of which were considerably more potent than the index ELA peptide (Figure 5A-B). These results corroborate the CPL scan data demonstrating that a large number of 10mer peptides can be recognized efficiently by the MEL5 TCR. In contrast, 9mer and 11mer sequences designed from the low responses observed in the corresponding CPL scans exhibited poor activation potency, with the exception of one 9mer peptide, FWLLPAWAL, which activated MEL5 CD8+ T cells with intermediate potency (Figure 5C). pMHCI tetramer staining with peptide variants was consistent with the activation data (Figure 5D). Therefore, the MEL5 TCR preferentially engages 10mer peptides.
It is now widely accepted that longer peptides play an important role in MHCI-restricted immunity.18,24,34 The CD8+ T-cell clone SB27, which recognizes the EBV BZLF1-derived 13mer peptide LPEPLPQGQLTAY (LPE), epitomizes this phenomenon. Activation of SB27 with 13mer CPL screen mixtures yielded a very large number of responses, higher than any other CPL scan performed in the present study (Figure 2D). All 13mer peptides in an extensive panel designed from the corresponding CPL scan data activated SB27 efficiently (Figure 6C-D). It was also clear from the CPL scan data that the SB27 TCR exhibited a strong preference for 13mer peptides (Figure 2D). We examined a panel of 9mer peptides designed from the corresponding CPL scan and tested them for recognition by SB27. As expected from preceding length recognition data, activation was poor and approximately 10 000-fold weaker compared with the index 13mer peptide (Figure 6A-B). This result implies that TCRs specific for longer peptides are not capable of engaging shorter peptides efficiently. Immunity to longer peptides may therefore require mobilization of specific portions of the CD8+ T-cell compartment that exhibit appropriate length specificity.
Peptide length has a profound effect on TCR degeneracy
We next examined the impact of peptide length on the level of TCR degeneracy using a previously verified method called CPL-based importance sampling,7 which incorporates a sampling bias toward strong agonists as indicated by CPL scan data and thus allows an estimation of the number of peptides of a defined length that can be recognized by a given TCR on the basis of a sample of manageable size (in this case, 30 peptides). To correct the observed distribution for this bias, the observations are weighted by the reciprocal of the original sampling probability, giving estimates of numbers of peptides of a defined length that can be recognized at varying functional sensitivities. Figures 2 and 4 show that the 1E6 TCR demonstrates a strict preference for 10mer peptides. Indeed, when we previously performed CPL-based importance sampling using this 10mer scan, we found a large number of robustly recognized peptides, with more than 25% being recognized at higher functional sensitivity than the index peptide sequence.7 Responses to scan mixtures at all other peptide lengths were either entirely absent or very poor (Figure 2). To characterize the diminished efficacy of the interaction with 11mers and 12mers, CPL-based importance sampling was performed. In stark contrast to previous results obtained using 10mer CPL scan data, sampled 11mer peptides were all recognized with > 1000-fold lower efficiency compared with the index peptide (Figure 7A and supplemental Table 1). Similarly, sampled 12mer peptides were also poorly recognized by the 1E6 TCR (Figure 7B and supplemental Table 1).
Whereas CPL-based importance sampling performed at the preferred peptide length yielded small peptide libraries that were highly enriched with agonists, the extent of enrichment observed at the nonpreferred peptide lengths was lower, possibly because CPL scans performed at these peptide lengths contain less information. Alternatively, the degree of enrichment may be of comparable magnitude in all cases, but the occurrence rate of strong agonists (determined by both TCR degeneracy and degree of enrichment) remains too low to render agonists readily detectable in the samples. Degeneracy curves for the 1E6 TCR are shown in Figure 7C; the abscissa is the functional sensitivity of peptides relative to the index peptide (ie, peptides with a value < 0, 0, or > 0 are either poor agonists compared with index, equivalent to index, or better agonists than index, respectively), whereas the ordinate axis is the number of peptides that exhibit a functional sensitivity of at least the abscissa. The degeneracy curves suggest that no 11mer or 12mer peptides were recognized with a functional sensitivity equivalent to index, indicating that peptide length has a major impact on the degeneracy profile of a TCR.
Different length peptide agonists can occur but are often poorly recognized and distinct in sequence
We have shown that Ag recognition by MHCI-restricted TCRs is constrained by peptide length, and that efficient recognition of peptides with alternative lengths is less likely. Interestingly, we identified one 12mer peptide that could be recognized by 1E6, although the efficiency of recognition was approximately 10-fold lower compared with the index peptide (Figure 7B). The sequence of this 12mer peptide (MMLWLRWIDVPT) was entirely different from the index sequence (ALWGPDPAAA) for the 1E6 TCR. Furthermore, 2 × 8mer epitopes and 1 × 9mer epitope were recognized by the MEL5 TCR (Figure 5C-F). Again, the sequences of these peptides (WLLPAWGV, WLLPTWGV, and FWLLPAWAL) were entirely different from the index sequence for the MEL5 TCR (ELAGIGILTV). Therefore, although rare, recognition of different length peptides is possible. However, the eliciting peptide sequence can be distinct from that of the index peptide and recognition typically occurs with reduced levels of functional sensitivity.
Peptide length has a major impact on the complexity of antigenic structure
To examine the underlying molecular mechanism of peptide length specificity exhibited by MHCI-restricted TCRs, we conducted a structural comparison of pMHCI complexes recognized by the T cells used in this study. The atomic structure of the nonligated ELAGIGILTV (10mer),35 ALWGPDPAAA (10mer),36 HPVGEADYFEY (11mer),37 CPSQEPMSIYVY (12mer),38 and LPEPLPQGQLTAY (13mer)24 peptides, in complex with their corresponding MHCIs, have been solved previously. We also included previously solved structures of an 8mer39 and 14mer40 peptide in complex with MHCI (Figure 8). To complete our structural comparison of peptides corresponding to the CD8+ T-cell clones included in this study, we solved the atomic structure of HLA A*0201-ILAKFLHRL (9mer), recognized by ILA1, to a resolution of 1.8Å (supplemental Table 2). The overall structure of HLA A*0201-ILAKFLHRL was similar to previously solved pMHCI structures, with the peptide anchored at position 2 and the peptide C-terminus, enabling the central residues to bulge out away from the groove for potential TCR interactions.
The analysis demonstrated that the positions of the anchor residues for each of the different length peptides were superimposable (Figure 8). Therefore, the distance between peptide residue 2, and the C-termini of every peptide investigated, irrespective of differences in length, was very similar (approximately 20Å). Consequently, the only option available to accommodate extra residues as MHCI-bound peptides increase in length was an incremental increase in: (1) the degree of backbone “bulging” (ie, the height of the central peptide bulge away from MHCI groove; Figure 8A), (2) sideways displacement of the peptide toward the MHCI α1 or α2 helices (Figure 8B), or (3) a combination of both. For example, the central residues in the LPEPLPQGQLTAY 13mer peptide bulged out of the groove by approximately 20Å, compared with only approximately 8Å for the ILAKFLHRL (9mer) peptide. The increasingly complex antigenic shape of the longer peptides revealed by this structural meta-analysis may restrict some TCRs from mediating stabilizing interactions with the MHCI surface, as described previously.24 This notion is further supported by our observation that individual TCRs are hard-wired to recognize peptides of a specific length and presumably cannot adapt to large structural variations in peptide conformation. Therefore, antigenic complexity (degree of peptide backbone “bulging”/sideways displacement) is determined by peptide length (in addition to peptide sequence), which in turn determines the outcome of TCR/peptide-MHCI engagement.
Discussion
Individual αβ-TCRs can recognize vast numbers of peptides encapsulated within a singular length.7 This inherent degeneracy is essential for effective immune coverage against the enormous number of antigenic peptides that could be encountered. Given the remarkable extent of such length-confined TCR plasticity, it seemed likely that TCRs could also cross-react with peptides of different lengths in the context of the same, or even different, presenting MHCI molecules. Indeed, the natural killer T-cell (NKT) TCR can recognize lipid Ags with various length headgroups, namely α-galcer (1 sugar) and iGb3 (3 sugars).41,42 Such peptide length flexibility would further maximize the ability of the numerically limited TCR repertoire to provide broad antigenic coverage. Indeed, the Ag processing/presentation machinery naturally generates MHCI-binding peptides across a range of fragment lengths, typically 8-14 residues. However, the degree to which αβ-TCRs cross-recognize these fragment lengths is ill-defined. In the present study, we used an approach that allowed for a sweeping and inclusive scan of TCR specificity. Surprisingly, we discovered a clear limit to receptor plasticity in terms of a defined peptide length preference. This unexpected finding was shown to be an inherent property of all TCRs examined across numerous biologic specificities and restricting MHCI alleles.
The data presented herein demonstrate that TCR plasticity is delimited by peptide length. Therefore, individual TCRs are predisposed to engage peptides of a defined length and cross-reactivity across peptide length boundaries is uncommon. These findings uncover a novel and fundamental feature of MHCI-restricted T-cell immunity that has wide implications. In particular, they suggest that the CD8+ T-cell repertoire is compartmentalized by length preference. Effective MHCI-mediated immunity would therefore require the mobilization of length-matched Ag-specific CD8+ T-cell clonotypes from the peripheral repertoire. This is especially pertinent to the design of “superagonists” or heteroclitic variants for the purposes of peptide vaccination because our results suggest that the therapeutic peptide of choice must be the same length as the target peptide on the cell surface to elicit an effective immune response. By way of an example, preferred length disparity may in part explain the poor objective response rates observed in trials using the 10mer ELAGIGILTV epitope for melanoma therapy,43 because only the 9mer AAGIGILTV epitope appears to be naturally present on the surface of melanoma cells.44
In addition, our findings have implications for T-cell selection, naive T-cell survival in the periphery, and the identification of alloreactive ligands. TCR cross-reactivity is the molecular basis of selection within the thymus, where positive and negative selection are mediated via interactions with self-derived pMHCI molecules displayed at the thymic epithelial cell surface.45,46 Our data predict that thymic epithelial cells must display peptides of multiple different lengths and that clonotype selection is mediated by peptides of the length preferred by the TCR after thymic egress into the periphery. Furthermore, survival signals in the periphery delivered via continual TCR engagement of low-affinity self-derived pMHCI molecules47,48 are likely to be provided by peptides of a defined length. In terms of non-self pMHCI molecules, it is established that the TCR repertoire is characterized by a high frequency of alloreactive T cells.8 This population represents a major barrier to transplantation and, as such, there is substantial pressure to elucidate the biochemical and structural basis of this phenomenon. Our data suggest that peptide length will be an important consideration in the identification of alloreactive ligands. Indeed, there is mounting evidence to show that alloreactive T-cell recognition is mediated in a peptide-dependent manner.17,18
Despite clear length preferences, TCRs can recognize peptides of nonpreferred lengths. However, this phenomenon is relatively uncommon and generally suboptimal in terms of CD8+ T-cell activation compared with peptides of preferred length. Length-disparate agonist peptides can be entirely distinct at the primary sequence level with different amino acid residues at every position. For example, the 1E6 TCR is specific for the preproinsulin-derived epitope ALWGPDPAAA (residues 15-24), but can recognize a 12mer peptide with almost no sequence homology (MMLWLRWIDVPT). The 1E6 clone was generated from a patient with type 1 diabetes and is a well-documented example of an autoreactive CD8+ T-cell that can kill human pancreatic islet β cells.19,36 This is a particularly interesting result because it suggests that CD8+ T cells of pathogenic relevance in autoimmune disease can be activated by cross-reactive ligands that are not amenable to a priori prediction on the basis of peptide length or sequence.
To understand the molecular mechanism that underlies the MHCI-peptide length preferences displayed by different TCRs, it is informative to consider the conformations adopted by MHCI-bound peptides of different lengths. We show that incremental increases in peptide length result in increased antigenic complexity. For MHCI alleles, the ends of the binding groove are closed. This feature both confines and compresses the peptide termini, causing the backbone of any peptide longer than 8 residues in length to either bulge away from the central groove or to be displaced sideways toward the binding groove helices. The degree of backbone “bulging” is proportional to the constrained conformation and length of the antigenic peptide, which ranges from 8-14 residues in length (Figure 8). These varied pMHCI shapes look very different structurally and present a range of diverse contours for incoming T cells. In general, 10mer and 11mer peptides are bulged and often highly flexible in their unbound state.17,22,33,49 Interestingly, 10-11mer peptides can be “bull-dozed” flat when the cognate TCR is bound,17,37,50 resulting in an MHC contact footprint much like that observed with conventional 9mer pMHCI targets. In contrast, some long peptides can form a rigid helix at the crest of the bulge,23 which introduces yet another structural variable to incoming T cells. Both 13mer24 and 14mer40 peptides also form “bulged” structures in the MHCI groove. Structural determination of a 13mer TCR/pMHCI complex showed that the TCR fights to straddle the bulged peptide while maintaining contact with both helices of the MHC presentation platform, a unique architectural feature that perhaps not all receptors may encode.50 MHCI-binding peptides are beginning to fall into groupings based on their length and accompanying structural characteristics, such as high and rigid, helical, or flexible and crushable. It appears that the CD8+ T-cell repertoire is partitioned to engage each peptide group and that individual TCRs cannot adapt to efficiently engage peptides with different length-imposed conformational features. Interestingly, a highly constrained TCR repertoire has been observed within CD8+ T-cell populations specific for longer peptides,22,49,50 which might suggest a limit to TCR diversity in response to peptides of defined length. However, the CDR3 loops of the clones used in this analysis, which vary in length by 6-13 residues,22-24,33,36 showed no correlation with peptide length specificity. Further studies are required to define a role for TCR usage as a determinant of MHCI-peptide length preference.
In summary, we have shown that MHCI-restricted T cells exhibit a preference for peptide length that is governed by the TCR and that effective recognition of pMHCI Ag can only be achieved by length-matched Ag-specific CD8+ T-cell clonotypes. This novel finding has broad implications for understanding how effective CD8+ T-cell immunity is achieved and the future design of peptide vaccination. In addition, we propose that every TCR is characterized by a unique “peptide-recognition signature” that is defined by: (1) a preference for peptide length, (2) the number of peptides that can be recognized, and (3) the amino acid sequence of these agonist peptides. The peptide recognition signature of an induced CD8+ T-cell response is likely to have a major impact on CD8+ T-cell immunity and could underlie the pathogenesis of many disease states.
The online version of this article contains a data supplement.
The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.
Acknowledgments
This work was supported by the Biotechnology and Biological Sciences Research Council (grant BB/H001085/1) and The Wellcome Trust (grant WT079848MA). L.W. is a Wellcome Trust Clinical Intermediate Fellow, D.A.P. is a Medical Research Council Senior Clinical Fellow, and D.K.C. is a Wellcome Trust Career Development Fellow. J.J.M., J.R., and S.R.B. are funded by the National Health and Medical Research Council. J.R. is a National Health and Medical Research Council Australia Fellow. A.S., G.D., M.P., and D.A.P. are funded by the Juvenile Diabetes Research Foundation (grants 7-2005-877, 1-2007-1803, and 17-2009-806). J.E-M., M.P.T., M.C., and T.W. are funded by The Wellcome Trust. Research by A.S. and M.P. is supported by the National Institute for Health Research Biomedical Research Center based at Guy's and St Thomas' National Health Service Foundation Trust and King's College London. The views expressed are those of the author(s) and not necessarily those of the National Health Service, the National Institute for Health Research, or the Department of Health.
Wellcome Trust
Authorship
Contribution: J.E.-M., J.J.M., G.D., A.J.A.S., M.P.T., J.M.P., S.L.-L., K.M.M., A.M.B., M.C., T.W., A.T., and L.W. performed the research; J.J.M., H.A.v.d.B., S.R.B., A.K.S., and L.W. designed the research and wrote the manuscript; A.S. and M.P. provided vital cells; D.K.C., M.B. and P.R. analyzed the data and constructed figures; J.R. and D.A.P. wrote the manuscript; and J.J.M. and L.W. supervised the research.
Conflict-of-interest disclosure: The authors declare no competing financial interests.
Correspondence: Dr Linda Wooldridge, Institute of Infection and Immunity, Cardiff University, Henry Wellcome Building, Heath Park, Cardiff, CF14 4XN United Kingdom; e-mail: wooldridgel@cardiff.ac.uk.
References
Author notes
J.E.-M., J.J.M., H.A.v.d.B., S.R.B., A.K.S., and L.W. contributed equally to this work.