Introduction: Several B-cell malignancies, including chronic lymphocytic leukemia (CLL) and multiple myeloma (MM), are known to have a familial component to disease risk. Although several common inherited genetic variants have been found to be associated with risk of these malignancies, a large proportion of the heritability remains unexplained. Here we set out to identify rare high-penetrance susceptibility variants for CLL and MM using the established Mayo Clinic family study of B-cell malignancies.

Methods: We performed exome sequencing on germ line DNA of 106 families with either two or more members with CLL (n=93 families) or two or more members with MM or monoclonal gammopathy of undetermined significance (MGUS) (n=13 families) using Aglient capture kits. All samples were sequenced on an Illumina HiSeq2000. Bioinformatics analyses leveraged the following software packages: Novoalign, Picard, The Genome Analysis Toolkit (GATK), and the Biological Reference Repository (bioR). Quality control filters were implemented; subjects with mis-specified relationships were removed as were variants with <95% call rate or <20x coverage, and those variants identified as sequencing artifacts. To identify potentially functional single nucleotide variants, the following was required a) uncommon in public databases, b) cosegregating variants in multiple families, c) variants to be highly conserved and in coding regions, and d) functional prediction status of deleterious (SIFT score), damaging (PolyPhen score), and a moderate, or high variant impact (SNPEffect). The association of these variants with risk was then evaluated in large samples of cases and controls in genome-wide association studies (GWAS).

Results: In the CLL familial cohort, there were 443 individuals (160 with CLL, 73 with monoclonal B cell lymphocytosis (MBL), and 210 relatives without CLL or MBL) sequenced; and in the MM cohort, there were 39 individuals (13 with MM, 13 with MGUS and 13 without MM or MGUS) sequenced. A total of 61,992 single nucleotide variants passed quality control filters in the CLL cohort and 39,456 single nucleotide variants in the MM cohort. From these, we identified 32 variants in the CLL cohort and 39 variants in the MM cohort that were cosegregating in multiple CLL or MM families, respectively, and were predicted to affect protein structure or function. Of these, one CLL variant was within 1 Mb of a known GWAS-discovered locus. Moreover, there were seven CLL and four MM variants that were within 1 Mb of single nucleotide polymorphisms (SNPs) that had statistical significance between p<10-4 and p<10-6 in GWAS. Finally, there were 3 variants found to be common between CLL and MM families (see Table).

Conclusions: Through whole exome sequencing, we identified a number of rare, highly penetrant predisposition variants for CLL and MM. Several of these variants were common to both B-cell malignancies providing evidence of shared genetic components; one variant may be a functional driver of the GWAS-discovered loci. These identified variants provide a list for future validation studies.

Table:

Exome variants common in both Chronic Lymphocytic Leukemia and Multiple Myeloma

SNP IDrs41314099rs61754114rs143025033
Chromosome 17 
Gene WSCD1 BACH2 TNS3 
Penetrance (CLL) 0.70 0.75 0.80 
Penetrance (MM) 1.00 0.80 0.75 
Allele Frequency (CLL) 0.015 0.025 0.017 
Allele Frequency (MM) 0.077 0.077 0.058 
1000 Genome Frequency (European) 0.010 0.020 0.010 
SIFT Term Deleterious Deleterious Deleterious 
PolyPhen Term Probably Damaging Probably Damaging Probably Damaging 
SNP IDrs41314099rs61754114rs143025033
Chromosome 17 
Gene WSCD1 BACH2 TNS3 
Penetrance (CLL) 0.70 0.75 0.80 
Penetrance (MM) 1.00 0.80 0.75 
Allele Frequency (CLL) 0.015 0.025 0.017 
Allele Frequency (MM) 0.077 0.077 0.058 
1000 Genome Frequency (European) 0.010 0.020 0.010 
SIFT Term Deleterious Deleterious Deleterious 
PolyPhen Term Probably Damaging Probably Damaging Probably Damaging 

SNP: Single Nucleotide Polymorphism; CLL: Chronic Lymphocytic Leukemia; MM: Multiple Myeloma; Penetrance is defined as proportion of individuals carrying the alternate allele that are also affected; SIFT: Sorting Intolerant From Tolerant, predicts whether an amino acid substitution affects protein function; PolyPhen: Polymorphism Phenotyping, predicts possible impact of an amino acid substitution on the structure of a human protein

Disclosures

No relevant conflicts of interest to declare.

Author notes

*

Asterisk with author names denotes non-ASH members.

Sign in via your Institution