Fig. 1.
Analysis of the upstream region of the mouse β-globin LCR. (A) Map of the 5′ HS 4, 5 and 6 region. Upper solid black line represents the 5′ HS 4, 5, and 6 region and restriction enzyme sites used for HS mapping. Alignment: boxes with diagonal cross hatch represent regions that can be aligned with the human locus using the program yama2. Homology: boxes with vertical hatch represent regions with >40% identity with human and are a more stringent gauge of homology. HSs: arrows show HSs and their relative intensity. Mouse HSs are placed relative to the mouse map, whereas human HSs are placed relative to the homologous regions of the mouse sequence. The numbers above the arrows are the map positions relative to the Ey cap of the mouse HSs. Bottom diagram represents the targeting construct that deletes HS 5, HS 6 and the region of homology to human HS 5. Arrows and black stripes represent loxP sites that flank the construct. Restriction enzyme sites: S- SphI, H- HpaI, X- XbaI. (B) Sequence features in the 5′ HS 4, 5, and 6 region. Regions that each HS maps to are represented by open rectangles. L1 and B2 repeat elements are represented by a gray rectangle and triangle, respectively. The black bar represents the region deleted by homologous recombination. The striped line represents the thymidine-rich tract. Below the map is a PIP of the mouse-human comparison of the region. The percent identity in each gap-free aligning segment is plotted using coordinates of the mouse sequence with the cap site of the Ey gene being +1. Cross hatching indicates the region for which no human sequence is available for comparison. The lines above the map show sequence features as noted below. In each case short lines mark any occurrence of the specified sequence and tall lines mark sites that are conserved between mouse, human, and galago. Invariant: blocks of at least 7 bp that are invariant in mouse, human, and galago. GATA-1: sites matching the consensus WGATAR. MARE: sites matching the Maf-associated response element consensus TGASTCA. Several basic leucine zipper proteins contain this motif in their consensus binding sequence including NFE-2, AP-1, and LCRF1/Nrf1. EKLF: sites matching the consensus CCNCNCCC. BKLF also has been shown to bind this consensus. CACC BP: sites matching the consensus CACC, a sequence motif common to Krüppel-like Zn finger proteins such as Sp1 and EKLF. ATTTA/TATTT: sites matching the specified sequences.