Molecular evidence of SB100X-mediated transposition in multilineage cells derived from human HPCs
| Cells/transposition site sequences . | Chromosome location:hit from . | Located gene . | Gene symbol . | Cancer-related gene . | Proximal TSS . | Distance to TSS, bp . |
|---|---|---|---|---|---|---|
| T | ||||||
| CAACTGTACATCCTTCATTCTAACTACTGAGTTAACTATCCA | chr12:32279251 | Intronic | BICD1 | BICD1 | chr12:32151168 | 128630 |
| CAACTGTACAGTATGGATGGCTCTCATAAATAGAATGTTGAG | chrX:40617753 | Intergenic | NA | NA | NA | NA |
| CAACTGTACAGTATGATTTCGTTTGGGTAAAAACAATGACAG | chr16:25261378 | Intergenic | NA | NA | NA | NA |
| CAACTGTATTATATGGAAATTATTATGCTAGTCCCTTAAGCG | chr12:27063331 | Intergenic | NA | NA | NA | NA |
| CAACTGTATTATTAAGTGCTAGTCCCTTAAGCGGAGCCCTAA | chrX:71598419 | Intronic | HDAC8 | HDAC8 | chrX:71731790 | 133350 |
| CAACTGTAAAATCTGCCCTTACTTACCTGCCCGCATCCTCGT | chr11:43002377 | Intergenic | NA | NA | NA | NA |
| CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAGA | chr3:58431902 | Intergenic | NA | NA | NA | NA |
| CAACTGTATTTCAAACACTGAAGATCTGACTCAGGAAGTGCT | chr10:59705796 | Intronic | CISD1 | CISD1, MITONEET | chr10:59698939 | 6857 |
| B | ||||||
| CAACTGTACTAAGTATAGGCATCCTTAATTGGTGCAATTCTA | chr7:147222151 | Intronic | CNTNAP2 | CNTNAP2, CASPR2, NRXN4, CDFE, AUTS15 | chr7:146999704 | 222447 |
| CAACTGTATGTTGGAATGCCCCAGAATTTGGAGTTTATCTCT | chr13:45671894 | Intergenic | NA | NA | NA | NA |
| CAACTGTAGGCCTGAAAGCGCTCCAAATGTCCACTTCCAGA | chr2:91688345 | Intergenic | NA | NA | NA | NA |
| CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAG | chr3:58431902 | Intergenic | NA | NA | NA | NA |
| CAACTGTATGTTATATATATATGCAAATATAAACACAGAAAA | chr11:72120697 | Intronic | CENTD2 | CENTD2, ARAP1, KIAA0782 | chr11:72141107 | 19946 |
| CAACTGTAAATCAGGTGAAGCCCTATTAAAGATGTCCTGAAA | chr18:35338281 | Intronic | AK090603 | (PIK3C3) | chr18:35634274 | 295573 |
| CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
| CAACTGTACAAATCTGGAGTCCTTCCAAAACAGGACAAGTAA | chr12:3737128 | Intergenic | NA | NA | NA | NA |
| NK | ||||||
| CAACTGTAAGTTCCTTCCACAAAAATTGGGCAGCTTCTAGAAT | chr8:102548406 | Intergenic | NA | NA | NA | NA |
| CAACTGTACATATATAGTCTATTAATTGAGATAATATCTGTAA | chr2:192099107 | Intergenic | NA | NA | NA | NA |
| CAACTGTAGGTGTTTAGAGGGAAAGAAGAAAGGACATTCTGT | chr17:41583312 | Intronic | KIAA1267 | KIAA1267 | chr17:41605366 | 21942 |
| CAACTGTATAATTTTAGGTTACCATCTTCCATGGGGGAAATAT | chr12:69015183 | Intronic | CNOT2 | CNOT2, NOT2 | chr12:68923454 | 91729 |
| CAACTGTATATGGCACATGGGCTTTTGCAGGTGTGATGAAACT | chr16:30803923 | Intronic | BCL7C | BCL7C | chr16:30812887 | 8898 |
| CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGTT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
| CAACTGTAATATCCCAAGACTCTTTAAAGGTGGCAATGGCCG | chr7:294398 | Intronic | FAM20C | FAM20C, DMP4 | chr7:291895 | 2503 |
| M | ||||||
| CAACTGTACATACTTTCTTTCTTAAGGTAGTGTTTTGACAGAG | chr8:108578342 | Intronic | ANGPT1 | ANGPT1, ANG1 | chr8:108579262 | 920 |
| CAACTGTAGTTGAGGTCACACAAGACCTAAGTAGGGGAAACT | chr5:172192814 | Intergenic | NA | NA | NA | NA |
| CAACTGTACAATCATGTCGTCTGCGAACAGGGACAATTTGACT | chr5:38207284 | Intergenic | NA | NA | NA | NA |
| CAACTGTATATGTAAAGGTTTTTTTAAGTGGGTATATTGCGTGA | chr4:126348665 | Intergenic | NA | NA | NA | NA |
| CAACTGTAGATGTTGTGAGCATAATGAGTTAGGTGTTCCAAAG | chr3:178255770 | Intronic | TBL1XR1; TBLR1 | TBL1XR1 | chr3:178397855 | 141978 |
| CAACTGTAGCAACATGTTTAAGAGATTATACACCATGACCCAC | chr2:115950838 | Intronic | DPP10 | DPP10, DPRP3, KIAA1492 | chr2:115635383 | 315455 |
| CAACTGTATATACAGACTCTAAGTATGCTTACCTAGTCCCTTA | chr6:13990710 | Intergenic | NA | NA | NA | NA |
| CAACTGTATGTCCATCTATTGAGGCCCTAAATTAAGTCTACAG | chr5:131806233 | Intronic | LOC441108 | (IRF1, MAR) (SLC22A5, OCTN2, CDSP, SCD) | chr5:131774556 | 31677 |
| Cells/transposition site sequences . | Chromosome location:hit from . | Located gene . | Gene symbol . | Cancer-related gene . | Proximal TSS . | Distance to TSS, bp . |
|---|---|---|---|---|---|---|
| T | ||||||
| CAACTGTACATCCTTCATTCTAACTACTGAGTTAACTATCCA | chr12:32279251 | Intronic | BICD1 | BICD1 | chr12:32151168 | 128630 |
| CAACTGTACAGTATGGATGGCTCTCATAAATAGAATGTTGAG | chrX:40617753 | Intergenic | NA | NA | NA | NA |
| CAACTGTACAGTATGATTTCGTTTGGGTAAAAACAATGACAG | chr16:25261378 | Intergenic | NA | NA | NA | NA |
| CAACTGTATTATATGGAAATTATTATGCTAGTCCCTTAAGCG | chr12:27063331 | Intergenic | NA | NA | NA | NA |
| CAACTGTATTATTAAGTGCTAGTCCCTTAAGCGGAGCCCTAA | chrX:71598419 | Intronic | HDAC8 | HDAC8 | chrX:71731790 | 133350 |
| CAACTGTAAAATCTGCCCTTACTTACCTGCCCGCATCCTCGT | chr11:43002377 | Intergenic | NA | NA | NA | NA |
| CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAGA | chr3:58431902 | Intergenic | NA | NA | NA | NA |
| CAACTGTATTTCAAACACTGAAGATCTGACTCAGGAAGTGCT | chr10:59705796 | Intronic | CISD1 | CISD1, MITONEET | chr10:59698939 | 6857 |
| B | ||||||
| CAACTGTACTAAGTATAGGCATCCTTAATTGGTGCAATTCTA | chr7:147222151 | Intronic | CNTNAP2 | CNTNAP2, CASPR2, NRXN4, CDFE, AUTS15 | chr7:146999704 | 222447 |
| CAACTGTATGTTGGAATGCCCCAGAATTTGGAGTTTATCTCT | chr13:45671894 | Intergenic | NA | NA | NA | NA |
| CAACTGTAGGCCTGAAAGCGCTCCAAATGTCCACTTCCAGA | chr2:91688345 | Intergenic | NA | NA | NA | NA |
| CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAG | chr3:58431902 | Intergenic | NA | NA | NA | NA |
| CAACTGTATGTTATATATATATGCAAATATAAACACAGAAAA | chr11:72120697 | Intronic | CENTD2 | CENTD2, ARAP1, KIAA0782 | chr11:72141107 | 19946 |
| CAACTGTAAATCAGGTGAAGCCCTATTAAAGATGTCCTGAAA | chr18:35338281 | Intronic | AK090603 | (PIK3C3) | chr18:35634274 | 295573 |
| CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
| CAACTGTACAAATCTGGAGTCCTTCCAAAACAGGACAAGTAA | chr12:3737128 | Intergenic | NA | NA | NA | NA |
| NK | ||||||
| CAACTGTAAGTTCCTTCCACAAAAATTGGGCAGCTTCTAGAAT | chr8:102548406 | Intergenic | NA | NA | NA | NA |
| CAACTGTACATATATAGTCTATTAATTGAGATAATATCTGTAA | chr2:192099107 | Intergenic | NA | NA | NA | NA |
| CAACTGTAGGTGTTTAGAGGGAAAGAAGAAAGGACATTCTGT | chr17:41583312 | Intronic | KIAA1267 | KIAA1267 | chr17:41605366 | 21942 |
| CAACTGTATAATTTTAGGTTACCATCTTCCATGGGGGAAATAT | chr12:69015183 | Intronic | CNOT2 | CNOT2, NOT2 | chr12:68923454 | 91729 |
| CAACTGTATATGGCACATGGGCTTTTGCAGGTGTGATGAAACT | chr16:30803923 | Intronic | BCL7C | BCL7C | chr16:30812887 | 8898 |
| CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGTT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
| CAACTGTAATATCCCAAGACTCTTTAAAGGTGGCAATGGCCG | chr7:294398 | Intronic | FAM20C | FAM20C, DMP4 | chr7:291895 | 2503 |
| M | ||||||
| CAACTGTACATACTTTCTTTCTTAAGGTAGTGTTTTGACAGAG | chr8:108578342 | Intronic | ANGPT1 | ANGPT1, ANG1 | chr8:108579262 | 920 |
| CAACTGTAGTTGAGGTCACACAAGACCTAAGTAGGGGAAACT | chr5:172192814 | Intergenic | NA | NA | NA | NA |
| CAACTGTACAATCATGTCGTCTGCGAACAGGGACAATTTGACT | chr5:38207284 | Intergenic | NA | NA | NA | NA |
| CAACTGTATATGTAAAGGTTTTTTTAAGTGGGTATATTGCGTGA | chr4:126348665 | Intergenic | NA | NA | NA | NA |
| CAACTGTAGATGTTGTGAGCATAATGAGTTAGGTGTTCCAAAG | chr3:178255770 | Intronic | TBL1XR1; TBLR1 | TBL1XR1 | chr3:178397855 | 141978 |
| CAACTGTAGCAACATGTTTAAGAGATTATACACCATGACCCAC | chr2:115950838 | Intronic | DPP10 | DPP10, DPRP3, KIAA1492 | chr2:115635383 | 315455 |
| CAACTGTATATACAGACTCTAAGTATGCTTACCTAGTCCCTTA | chr6:13990710 | Intergenic | NA | NA | NA | NA |
| CAACTGTATGTCCATCTATTGAGGCCCTAAATTAAGTCTACAG | chr5:131806233 | Intronic | LOC441108 | (IRF1, MAR) (SLC22A5, OCTN2, CDSP, SCD) | chr5:131774556 | 31677 |
Bold letters represent transposon sequence. Terms in parentheses indicate the closest neighboring cancer-related gene.
M indicates myeloid cells; NA, not applicable; and TSS, transcriptional start site.