Molecular evidence of SB100X-mediated transposition in multilineage cells derived from human HPCs
Cells/transposition site sequences . | Chromosome location:hit from . | Located gene . | Gene symbol . | Cancer-related gene . | Proximal TSS . | Distance to TSS, bp . |
---|---|---|---|---|---|---|
T | ||||||
CAACTGTACATCCTTCATTCTAACTACTGAGTTAACTATCCA | chr12:32279251 | Intronic | BICD1 | BICD1 | chr12:32151168 | 128630 |
CAACTGTACAGTATGGATGGCTCTCATAAATAGAATGTTGAG | chrX:40617753 | Intergenic | NA | NA | NA | NA |
CAACTGTACAGTATGATTTCGTTTGGGTAAAAACAATGACAG | chr16:25261378 | Intergenic | NA | NA | NA | NA |
CAACTGTATTATATGGAAATTATTATGCTAGTCCCTTAAGCG | chr12:27063331 | Intergenic | NA | NA | NA | NA |
CAACTGTATTATTAAGTGCTAGTCCCTTAAGCGGAGCCCTAA | chrX:71598419 | Intronic | HDAC8 | HDAC8 | chrX:71731790 | 133350 |
CAACTGTAAAATCTGCCCTTACTTACCTGCCCGCATCCTCGT | chr11:43002377 | Intergenic | NA | NA | NA | NA |
CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAGA | chr3:58431902 | Intergenic | NA | NA | NA | NA |
CAACTGTATTTCAAACACTGAAGATCTGACTCAGGAAGTGCT | chr10:59705796 | Intronic | CISD1 | CISD1, MITONEET | chr10:59698939 | 6857 |
B | ||||||
CAACTGTACTAAGTATAGGCATCCTTAATTGGTGCAATTCTA | chr7:147222151 | Intronic | CNTNAP2 | CNTNAP2, CASPR2, NRXN4, CDFE, AUTS15 | chr7:146999704 | 222447 |
CAACTGTATGTTGGAATGCCCCAGAATTTGGAGTTTATCTCT | chr13:45671894 | Intergenic | NA | NA | NA | NA |
CAACTGTAGGCCTGAAAGCGCTCCAAATGTCCACTTCCAGA | chr2:91688345 | Intergenic | NA | NA | NA | NA |
CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAG | chr3:58431902 | Intergenic | NA | NA | NA | NA |
CAACTGTATGTTATATATATATGCAAATATAAACACAGAAAA | chr11:72120697 | Intronic | CENTD2 | CENTD2, ARAP1, KIAA0782 | chr11:72141107 | 19946 |
CAACTGTAAATCAGGTGAAGCCCTATTAAAGATGTCCTGAAA | chr18:35338281 | Intronic | AK090603 | (PIK3C3) | chr18:35634274 | 295573 |
CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
CAACTGTACAAATCTGGAGTCCTTCCAAAACAGGACAAGTAA | chr12:3737128 | Intergenic | NA | NA | NA | NA |
NK | ||||||
CAACTGTAAGTTCCTTCCACAAAAATTGGGCAGCTTCTAGAAT | chr8:102548406 | Intergenic | NA | NA | NA | NA |
CAACTGTACATATATAGTCTATTAATTGAGATAATATCTGTAA | chr2:192099107 | Intergenic | NA | NA | NA | NA |
CAACTGTAGGTGTTTAGAGGGAAAGAAGAAAGGACATTCTGT | chr17:41583312 | Intronic | KIAA1267 | KIAA1267 | chr17:41605366 | 21942 |
CAACTGTATAATTTTAGGTTACCATCTTCCATGGGGGAAATAT | chr12:69015183 | Intronic | CNOT2 | CNOT2, NOT2 | chr12:68923454 | 91729 |
CAACTGTATATGGCACATGGGCTTTTGCAGGTGTGATGAAACT | chr16:30803923 | Intronic | BCL7C | BCL7C | chr16:30812887 | 8898 |
CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGTT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
CAACTGTAATATCCCAAGACTCTTTAAAGGTGGCAATGGCCG | chr7:294398 | Intronic | FAM20C | FAM20C, DMP4 | chr7:291895 | 2503 |
M | ||||||
CAACTGTACATACTTTCTTTCTTAAGGTAGTGTTTTGACAGAG | chr8:108578342 | Intronic | ANGPT1 | ANGPT1, ANG1 | chr8:108579262 | 920 |
CAACTGTAGTTGAGGTCACACAAGACCTAAGTAGGGGAAACT | chr5:172192814 | Intergenic | NA | NA | NA | NA |
CAACTGTACAATCATGTCGTCTGCGAACAGGGACAATTTGACT | chr5:38207284 | Intergenic | NA | NA | NA | NA |
CAACTGTATATGTAAAGGTTTTTTTAAGTGGGTATATTGCGTGA | chr4:126348665 | Intergenic | NA | NA | NA | NA |
CAACTGTAGATGTTGTGAGCATAATGAGTTAGGTGTTCCAAAG | chr3:178255770 | Intronic | TBL1XR1; TBLR1 | TBL1XR1 | chr3:178397855 | 141978 |
CAACTGTAGCAACATGTTTAAGAGATTATACACCATGACCCAC | chr2:115950838 | Intronic | DPP10 | DPP10, DPRP3, KIAA1492 | chr2:115635383 | 315455 |
CAACTGTATATACAGACTCTAAGTATGCTTACCTAGTCCCTTA | chr6:13990710 | Intergenic | NA | NA | NA | NA |
CAACTGTATGTCCATCTATTGAGGCCCTAAATTAAGTCTACAG | chr5:131806233 | Intronic | LOC441108 | (IRF1, MAR) (SLC22A5, OCTN2, CDSP, SCD) | chr5:131774556 | 31677 |
Cells/transposition site sequences . | Chromosome location:hit from . | Located gene . | Gene symbol . | Cancer-related gene . | Proximal TSS . | Distance to TSS, bp . |
---|---|---|---|---|---|---|
T | ||||||
CAACTGTACATCCTTCATTCTAACTACTGAGTTAACTATCCA | chr12:32279251 | Intronic | BICD1 | BICD1 | chr12:32151168 | 128630 |
CAACTGTACAGTATGGATGGCTCTCATAAATAGAATGTTGAG | chrX:40617753 | Intergenic | NA | NA | NA | NA |
CAACTGTACAGTATGATTTCGTTTGGGTAAAAACAATGACAG | chr16:25261378 | Intergenic | NA | NA | NA | NA |
CAACTGTATTATATGGAAATTATTATGCTAGTCCCTTAAGCG | chr12:27063331 | Intergenic | NA | NA | NA | NA |
CAACTGTATTATTAAGTGCTAGTCCCTTAAGCGGAGCCCTAA | chrX:71598419 | Intronic | HDAC8 | HDAC8 | chrX:71731790 | 133350 |
CAACTGTAAAATCTGCCCTTACTTACCTGCCCGCATCCTCGT | chr11:43002377 | Intergenic | NA | NA | NA | NA |
CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAGA | chr3:58431902 | Intergenic | NA | NA | NA | NA |
CAACTGTATTTCAAACACTGAAGATCTGACTCAGGAAGTGCT | chr10:59705796 | Intronic | CISD1 | CISD1, MITONEET | chr10:59698939 | 6857 |
B | ||||||
CAACTGTACTAAGTATAGGCATCCTTAATTGGTGCAATTCTA | chr7:147222151 | Intronic | CNTNAP2 | CNTNAP2, CASPR2, NRXN4, CDFE, AUTS15 | chr7:146999704 | 222447 |
CAACTGTATGTTGGAATGCCCCAGAATTTGGAGTTTATCTCT | chr13:45671894 | Intergenic | NA | NA | NA | NA |
CAACTGTAGGCCTGAAAGCGCTCCAAATGTCCACTTCCAGA | chr2:91688345 | Intergenic | NA | NA | NA | NA |
CAACTGTACATTCCGACAGCCTGGGGAAATGGATCTTTGAG | chr3:58431902 | Intergenic | NA | NA | NA | NA |
CAACTGTATGTTATATATATATGCAAATATAAACACAGAAAA | chr11:72120697 | Intronic | CENTD2 | CENTD2, ARAP1, KIAA0782 | chr11:72141107 | 19946 |
CAACTGTAAATCAGGTGAAGCCCTATTAAAGATGTCCTGAAA | chr18:35338281 | Intronic | AK090603 | (PIK3C3) | chr18:35634274 | 295573 |
CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
CAACTGTACAAATCTGGAGTCCTTCCAAAACAGGACAAGTAA | chr12:3737128 | Intergenic | NA | NA | NA | NA |
NK | ||||||
CAACTGTAAGTTCCTTCCACAAAAATTGGGCAGCTTCTAGAAT | chr8:102548406 | Intergenic | NA | NA | NA | NA |
CAACTGTACATATATAGTCTATTAATTGAGATAATATCTGTAA | chr2:192099107 | Intergenic | NA | NA | NA | NA |
CAACTGTAGGTGTTTAGAGGGAAAGAAGAAAGGACATTCTGT | chr17:41583312 | Intronic | KIAA1267 | KIAA1267 | chr17:41605366 | 21942 |
CAACTGTATAATTTTAGGTTACCATCTTCCATGGGGGAAATAT | chr12:69015183 | Intronic | CNOT2 | CNOT2, NOT2 | chr12:68923454 | 91729 |
CAACTGTATATGGCACATGGGCTTTTGCAGGTGTGATGAAACT | chr16:30803923 | Intronic | BCL7C | BCL7C | chr16:30812887 | 8898 |
CAACTGTATTCTCAGAATATTTGCAACAATCACTCAAAAGGTT | chr2:182456397 | Intergenic | NA | NA | NA | NA |
CAACTGTAATATCCCAAGACTCTTTAAAGGTGGCAATGGCCG | chr7:294398 | Intronic | FAM20C | FAM20C, DMP4 | chr7:291895 | 2503 |
M | ||||||
CAACTGTACATACTTTCTTTCTTAAGGTAGTGTTTTGACAGAG | chr8:108578342 | Intronic | ANGPT1 | ANGPT1, ANG1 | chr8:108579262 | 920 |
CAACTGTAGTTGAGGTCACACAAGACCTAAGTAGGGGAAACT | chr5:172192814 | Intergenic | NA | NA | NA | NA |
CAACTGTACAATCATGTCGTCTGCGAACAGGGACAATTTGACT | chr5:38207284 | Intergenic | NA | NA | NA | NA |
CAACTGTATATGTAAAGGTTTTTTTAAGTGGGTATATTGCGTGA | chr4:126348665 | Intergenic | NA | NA | NA | NA |
CAACTGTAGATGTTGTGAGCATAATGAGTTAGGTGTTCCAAAG | chr3:178255770 | Intronic | TBL1XR1; TBLR1 | TBL1XR1 | chr3:178397855 | 141978 |
CAACTGTAGCAACATGTTTAAGAGATTATACACCATGACCCAC | chr2:115950838 | Intronic | DPP10 | DPP10, DPRP3, KIAA1492 | chr2:115635383 | 315455 |
CAACTGTATATACAGACTCTAAGTATGCTTACCTAGTCCCTTA | chr6:13990710 | Intergenic | NA | NA | NA | NA |
CAACTGTATGTCCATCTATTGAGGCCCTAAATTAAGTCTACAG | chr5:131806233 | Intronic | LOC441108 | (IRF1, MAR) (SLC22A5, OCTN2, CDSP, SCD) | chr5:131774556 | 31677 |
Bold letters represent transposon sequence. Terms in parentheses indicate the closest neighboring cancer-related gene.
M indicates myeloid cells; NA, not applicable; and TSS, transcriptional start site.