Hypersensitive site 3 (HS3) of the β-like globin locus control region has been implicated as an important regulator of the β-like globin genes, but the trans factors that bind HS3 have only been partially characterized. Using a five-species alignment (human, galago, rabbit, goat, and mouse) that represents 370 million years of evolution, we have identified 24 phylogenetic footprints in the HS3 core and surrounding regions. Probes corresponding to the human sequence at each footprint have been used in binding studies to identify the nuclear factors that bind within and near these conserved sequence elements. Among the high-affinity interactions observed were several binding sites for proteins with repressor activity, including YY1, CCAAT displacement protein, and G1/G2 complexes (uncharacterized putative repressors) and several binding sites for the stage selector protein. To complement this analysis, orthologous galago sequences were also used to derive probes and the pattern of proteins binding to human and galago probes was compared. Binding interactions differing between these two species could be responsible for the different expression patterns shown by the two γ genes (galago γ is embryonic; human γ is fetal). Alternatively, binding interactions that are conserved in the two species may be important in the regulation of common expression patterns (eg, repression of γ in adult life).

EACH OF THE FIVE active human β-like globin genes is expressed in an erythroid-specific and developmental stage-specific fashion. Interestingly, the arrangement of these genes on chromosome 11 parallels their temporal order of developmental expression: 5′ ε (embryonic), γ1 (fetal), γ2 (fetal), δ (postnatal), β (postnatal) 3′. Studies in several laboratories have shown that within the 60-kb region of human chromosome 11 that contains the five active genes proper, regulatory elements exist that control the characteristic expression pattern of each gene.1-3 Thus, both the γ and the β genes show regulated developmental expression (albeit at low levels) in erythroid cells when human constructs containing only these genes and a few hundred base pairs of 5′ flanking sequence are tested in transgenic mice. However, high-level expression of each of the globin genes requires sequences located several kilobases upstream from ε.4 These upstream sequences were first identified by their altered chromatin structure5,6 and later shown to harbor domain opening and enhancing functions.7-11 The critical sequences span five super hypersensitive DNase I sites (HS1-HS5). Because these sequences are dominant regulators of the entire gene cluster, they are collectively referred to as the locus control region or LCR. The fact that deletion of the region encompassed by HS2-HS5 causes a severe γδβ0 thalassemia12 attests to the importance of the LCR in globin gene regulation. The sequences spanning the hypersensitive cores of HS1-HS4 have been extensively probed in both binding and functional assays7,10,11,13-24; site 5 is less well studied.25 26 In the course of these studies in different laboratories, different landmarks have been used to define the functional cores; effects of these cores have been studied on different genes (γ or β or γβ or γγδβ constructs), and the analyses have been performed at different times in development in transgenic mice or in different systems altogether (transient v stable assays; MEL v K562 cells). Although much meaningful information has nevertheless emerged, several important questions remain. One critical question regards the location and characterization of the key regulators that govern LCR function.

One approach to the identification of these cis regulators is the use of evolution. All eutherian mammals possess β-like globin clusters that were derived from the same ancestral five member cluster (5′-ε-γ-η-δ-β-3′).27 Within this cluster, similar developmental switches in gene activity occur in all mammalian species, indicating that the regulatory machinery that controls developmental switching and erythroid-specific expression is conserved. The fact that the human globin genes undergo a developmental switching program in the transgenic mouse confirms that many of the cis signals encoded by human globin genes are properly recognized in the murine background. Moreover, experiments in the chicken suggest that the evolution of the switching machinery predated the separation of aves and mammals.28 To detect conserved sequence elements that could be involved in globin gene regulation, we have used alignments in which sequences from multiple mammalian species are represented. In such alignments, constrained (and possibly functionally relevant) motifs appear as invariable regions (phylogenetic footprints).29-32 

In this study, we have applied phylogenetic footprinting to the region surrounding hypersensitive site 3 of the LCR. The region surveyed includes sequences spanning the hypersensitive core and surrounding regions, and extends 430 bp further 3′ from the HindIII site that has marked the 3′ end of the HS3 fragment (1.9 kb HindIII) used in the majority of functional studies. This region shows a great deal of sequence conservation but has not been previously tested in binding or expression studies. A total of 24 phylogenetic footprints were analyzed in the region surveyed; oligonucleotide probes corresponding to all of these conserved sequences were found to contain binding sites for nuclear factors. To further determine which of these interactions are actually functionally conserved, and which might represent newly gained motifs in the human genome, galago probes corresponding to all of these human sequence oligonucleotides were also tested in binding assays. Finally, to facilitate the general use of the large amount of binding data generated in this study, and to begin to correlate this binding information with binding and functional studies already in the literature, we are working to integrate these data into an existing Web Server called the Globin Gene Server. Presently, this Server site allows the user to observe multispecies alignments for any region of the globin cluster and to query for information on DNA transfer experiments, DNA-protein binding results, and location of DNaseI hypersensitive regions. Software that will allow interactive electronic exploration of sequence alignments in direct register with all existing functional data for a given region is being developed and refined.

Alignment.The alignment described here covered bases 4205 to 5606 of the human β cluster (GenBank HUMHBB). All sequences used are available through GenBank; the alignment itself can be accessed on the Globin Server (http://globin.cse.psu.edu/). A schematic view of the alignment is presented in Fig 1. Within the aligned sequences, 24 phylogenetic footprints were analyzed (FP1-FP24). For each phylogenetic footprint tested, two double-stranded oligonucleotide probes were synthesized, one corresponding to the human sequence and one corresponding to the galago sequence (Table 1). The sequences corresponding to the phylogenetic footprints are underlined in Table 1. Oligonucleotides were synthesized on an Applied Biosystems Synthesizer, Model 380B (Applied Biosystems, Foster City, CA). The 5′ end of the antisense strand of all probes contained a GATC overhang which facilitated Klenow labeling of the annealed probes.

Fig. 1.

(A) Schematic representation of the region of HS3 analyzed in this study. Stippled rectangles indicate two regions of DNaseI hypersensitivity. The darker, longer rectangle is the region referred to as the “core” of HS3 while the other, more 5′ rectangle represents a second hypersensitive region.22 The 24 phylogenetic footprints (FP1-24) are indicated as numbered black vertical bars. Beneath the footprints, the relative locations of the probes used for gel shift analysis is shown, and the position of the 5′ end of each probe with respect to the HUMHBB (GenBank) sequence is indicated. The positions of several landmark restriction sites are shown below the probes, and the regions of HS3 that have been used in functional assays are indicated. The end-points of deletions of HS3 tested in functional studies are also shown.52 53 (B) Probe overlaps are documented schematically; the length of overlap is given in base pairs. Here, as in the text, probes are referred to by the footprint number rather than the position in the sequence.

Fig. 1.

(A) Schematic representation of the region of HS3 analyzed in this study. Stippled rectangles indicate two regions of DNaseI hypersensitivity. The darker, longer rectangle is the region referred to as the “core” of HS3 while the other, more 5′ rectangle represents a second hypersensitive region.22 The 24 phylogenetic footprints (FP1-24) are indicated as numbered black vertical bars. Beneath the footprints, the relative locations of the probes used for gel shift analysis is shown, and the position of the 5′ end of each probe with respect to the HUMHBB (GenBank) sequence is indicated. The positions of several landmark restriction sites are shown below the probes, and the regions of HS3 that have been used in functional assays are indicated. The end-points of deletions of HS3 tested in functional studies are also shown.52 53 (B) Probe overlaps are documented schematically; the length of overlap is given in base pairs. Here, as in the text, probes are referred to by the footprint number rather than the position in the sequence.

Close modal
Table 1.

Oligonucleotide Probes Used in This Study

Human probes (5′ to 3′ ): 
  
FP1 TATTATTATCTTTATTTTGCCATGACAAGA 
FP2 TGACAAGACTGAGCTCAGAAGAGTCAAGCA 
FP3 ATTCTGTGGATAAAGGAGACAGCCATGAAG 
FP4 TCAAGGAATTTTGACTCAGCAAACACAAGA 
FP5 CGGTGACTTTGCGAGCTGGTGTGCCAGATG 
FP6 CGAGCTGGTGTGCCAGATGTGTCTATCAGAGGTTCCAGG 
FP7 GGCTGGCCACCAGCTATCAGGGCCCAGATGG 
FP8 GGCAGGCTCAGATAGGTGGTTAGGTCAGGTTGGT 
FP8M GGCAGGCTCAGATAGGTGGTTTCTTTGGGTTGGTGGTGC 
FP9 GGTTGGTGGTGCTGGGTGGAGTCCATGACTC 
FP10 GGGTGGAGTCCATGACTCCCAGGAGCCAGG 
FP11 GGAGCCAGGAGAGATAGACCATGAG 
FP12 CAGCATAGCAGCATTTTTCATTCTACTACT 
FP13 CTATCTCATAGGGTTATTATGAGGATCAAA 
FP14 AGTGGCCTAACAGTTCAGGACAGAGCTATGGGCTTCCTA 
FP15 ATGTTAATTTAGCAATGGGCTGGGACCCT 
FP16 TTAATGAAAGCTTAGCTTTCATTAAAGATT 
FP17 GGAGTTGAGAGGGAAACAAGATGGTGAAGG 
FP18 GAAGGGACTAGAAACCACATGAGAGACACGAGG 
FP19 AAGTGGGAATTAATACTTACTGAAATCTTTCTATATGTC 
FP20 TGAACATACTATAATTAATACTAATGATAA 
FP21 ATAATTAATACTAATGATAAGTAACACCTCTT 
FP22 TAACACCTCTTGAGTACTTAGTATAT 
FP23 GAGTACTTAGTATATGCTAGAATCAAATTT 
FP24 GCTAGAATCAAATTTAAGTTTATCATATGAGGalago probes (5′ to 3′ ): 
   
FP1 CATTTTGACATAGGGGTATTTTTGGA 
FP2 TGAGAAGATAGAGCTCAGAGGAGTCAAGAC 
FP3 GGGCTGTGGGTGAAGGAGGCAGCCAGGCAG 
FP4 TTAGGGAACTTTGACTCAGCAACTCAGCAA 
FP5 CTGTGACTTTAGGAGCTGATGTGCAAGATG 
FP6 GGAGCTGATGTGCAAGATGTGTCTATCGGAGGTTGCAGA 
FP7 GGCTAGCTGCAAACTATCAGGGCCAGAAGG 
FP8 TGTGGGCTCAGATAGTTGGTTAGGTCAGGTTGGC 
FP9 GGTTGGCAGCTCTGGGTGGAGTCCATGACTC 
FP10 TCTGGGTGGAGTCCATGACTCTTGGAGGTG 
FP11 ACTCTTGGAGGTGATAGACCAGGAGTGTGG 
FP12 CAACCCAGCAGCATTTTTTCGTTATACTAT 
FP13 CTATCATATGAGGGTATTATGAACATTAAA 
FP14 AGGGCCTTAGGAATTCAGGACAGAGCTGTGGACACCTTA 
FP15 ATGTTTAATTTAGCAATGGCCTGGGTCCTC 
FP16 AAAGTTCAGAACCTAGCTTTAATGGCTCCT 
FP17 GAAGAAGAGAGAGAAACAGGATGATGAAGG 
FP18 GAAGGGACTAGAGACCACACAAGGAACACAA 
FP19 AGATGGGACTTTAACTTACTGAAGTCTTTCTACATGCCA 
FP20 TAAGCATATTATAATTAATGATAATGATAA 
FP21 ATAATTAATGATAATGATAAGCAGAACCTCTT 
FP22 CAGAACCTCTTGAGGGCTGAGGATAG 
FP23 GAGGGCTGAGGATAGGCTATAATCTAATTT 
FP24 GCTATAATCTAATTTAAGTCCATTGTGTTA 
Human probes (5′ to 3′ ): 
  
FP1 TATTATTATCTTTATTTTGCCATGACAAGA 
FP2 TGACAAGACTGAGCTCAGAAGAGTCAAGCA 
FP3 ATTCTGTGGATAAAGGAGACAGCCATGAAG 
FP4 TCAAGGAATTTTGACTCAGCAAACACAAGA 
FP5 CGGTGACTTTGCGAGCTGGTGTGCCAGATG 
FP6 CGAGCTGGTGTGCCAGATGTGTCTATCAGAGGTTCCAGG 
FP7 GGCTGGCCACCAGCTATCAGGGCCCAGATGG 
FP8 GGCAGGCTCAGATAGGTGGTTAGGTCAGGTTGGT 
FP8M GGCAGGCTCAGATAGGTGGTTTCTTTGGGTTGGTGGTGC 
FP9 GGTTGGTGGTGCTGGGTGGAGTCCATGACTC 
FP10 GGGTGGAGTCCATGACTCCCAGGAGCCAGG 
FP11 GGAGCCAGGAGAGATAGACCATGAG 
FP12 CAGCATAGCAGCATTTTTCATTCTACTACT 
FP13 CTATCTCATAGGGTTATTATGAGGATCAAA 
FP14 AGTGGCCTAACAGTTCAGGACAGAGCTATGGGCTTCCTA 
FP15 ATGTTAATTTAGCAATGGGCTGGGACCCT 
FP16 TTAATGAAAGCTTAGCTTTCATTAAAGATT 
FP17 GGAGTTGAGAGGGAAACAAGATGGTGAAGG 
FP18 GAAGGGACTAGAAACCACATGAGAGACACGAGG 
FP19 AAGTGGGAATTAATACTTACTGAAATCTTTCTATATGTC 
FP20 TGAACATACTATAATTAATACTAATGATAA 
FP21 ATAATTAATACTAATGATAAGTAACACCTCTT 
FP22 TAACACCTCTTGAGTACTTAGTATAT 
FP23 GAGTACTTAGTATATGCTAGAATCAAATTT 
FP24 GCTAGAATCAAATTTAAGTTTATCATATGAGGalago probes (5′ to 3′ ): 
   
FP1 CATTTTGACATAGGGGTATTTTTGGA 
FP2 TGAGAAGATAGAGCTCAGAGGAGTCAAGAC 
FP3 GGGCTGTGGGTGAAGGAGGCAGCCAGGCAG 
FP4 TTAGGGAACTTTGACTCAGCAACTCAGCAA 
FP5 CTGTGACTTTAGGAGCTGATGTGCAAGATG 
FP6 GGAGCTGATGTGCAAGATGTGTCTATCGGAGGTTGCAGA 
FP7 GGCTAGCTGCAAACTATCAGGGCCAGAAGG 
FP8 TGTGGGCTCAGATAGTTGGTTAGGTCAGGTTGGC 
FP9 GGTTGGCAGCTCTGGGTGGAGTCCATGACTC 
FP10 TCTGGGTGGAGTCCATGACTCTTGGAGGTG 
FP11 ACTCTTGGAGGTGATAGACCAGGAGTGTGG 
FP12 CAACCCAGCAGCATTTTTTCGTTATACTAT 
FP13 CTATCATATGAGGGTATTATGAACATTAAA 
FP14 AGGGCCTTAGGAATTCAGGACAGAGCTGTGGACACCTTA 
FP15 ATGTTTAATTTAGCAATGGCCTGGGTCCTC 
FP16 AAAGTTCAGAACCTAGCTTTAATGGCTCCT 
FP17 GAAGAAGAGAGAGAAACAGGATGATGAAGG 
FP18 GAAGGGACTAGAGACCACACAAGGAACACAA 
FP19 AGATGGGACTTTAACTTACTGAAGTCTTTCTACATGCCA 
FP20 TAAGCATATTATAATTAATGATAATGATAA 
FP21 ATAATTAATGATAATGATAAGCAGAACCTCTT 
FP22 CAGAACCTCTTGAGGGCTGAGGATAG 
FP23 GAGGGCTGAGGATAGGCTATAATCTAATTT 
FP24 GCTATAATCTAATTTAAGTCCATTGTGTTA 
Specific competitors (probes with known factor binding sites) Reference 
   
AP1 TGGGGAACCTGTTCTGAGTCACTGGAG 65 
CDP GCTTTTCAGTTGACCGGTGATTATTAGCCAATTTCTGATAAA 46 
CP1(−88γ) CCAGCCTTGCCTTGACCATAGCCTTGACA 33 
CSBP-2(−698ɛ) ATCTTTTCTGTCTGTCCTCTCTAGGGAATC 32 
CSBP-2/SSP(−835ɛ) TCCCAGTGAGAAGTATAAGCAGGACAGACAGGCAAGCAAG 32 
ELKF TAGAGCCACACCCTGGTAAG 66 
GATA1/Oct-1(−175γ) CCACACTATCTCAATGCAAATATCTGTCTGAAACG 33 
G1/G2 AGTGACCTTGACCAATAGCCTC 37 
NF-E2 TGGGGAACCTGTGCTGAGTCACTGGAG 65 
Sp1 TGGCTAAACTCCACCCATGGGTGGCCAGC 67 
TEF-2(SV40 GT-1) CTAGACTTTCCACACCCTT 68 
SSP(−50γ) TCCCTAGCCAGCCGCCGGCCCCTGGCCTCACTGGA 33 
USF AGAAGCTGACCACCTGACTAAA 69 
YY1 CTGCAGTAACGCCATTTTGCAAGGCAT 70 
Specific competitors (probes with known factor binding sites) Reference 
   
AP1 TGGGGAACCTGTTCTGAGTCACTGGAG 65 
CDP GCTTTTCAGTTGACCGGTGATTATTAGCCAATTTCTGATAAA 46 
CP1(−88γ) CCAGCCTTGCCTTGACCATAGCCTTGACA 33 
CSBP-2(−698ɛ) ATCTTTTCTGTCTGTCCTCTCTAGGGAATC 32 
CSBP-2/SSP(−835ɛ) TCCCAGTGAGAAGTATAAGCAGGACAGACAGGCAAGCAAG 32 
ELKF TAGAGCCACACCCTGGTAAG 66 
GATA1/Oct-1(−175γ) CCACACTATCTCAATGCAAATATCTGTCTGAAACG 33 
G1/G2 AGTGACCTTGACCAATAGCCTC 37 
NF-E2 TGGGGAACCTGTGCTGAGTCACTGGAG 65 
Sp1 TGGCTAAACTCCACCCATGGGTGGCCAGC 67 
TEF-2(SV40 GT-1) CTAGACTTTCCACACCCTT 68 
SSP(−50γ) TCCCTAGCCAGCCGCCGGCCCCTGGCCTCACTGGA 33 
USF AGAAGCTGACCACCTGACTAAA 69 
YY1 CTGCAGTAACGCCATTTTGCAAGGCAT 70 

Underlined sequences represent phylogenetic footprints.

Gel shift assays, competition assays, antibody supershifts, and limited protease digestion.Labeling of probes, preparation of nuclear extracts, and performance of gel-shift assays were as previously described.33 Binding studies were performed using erythroid (K562, HEL, MEL, U937) as well as nonerythroid (HeLa, CaCo2, HepG2, Jurkat, and B cells) nuclear extracts. Competition assays were performed by preincubation of nuclear extract (5 minutes, room temperature) with a significant molar excess of unlabeled competitor oligonucleotide before addition of labeled probe. Identification of complexes containing CCAAT displacement protein (CDP) was accomplished by preincubation of nuclear extract with CDP antibody (kindly provided by Dr Ellis Neufeld, Children's Hospital, Boston, MA) in the absence of poly(dIdC). After 5 minutes at room temperature, poly(dIdC) and probe were added. These binding reactions were loaded onto 3.5% polyacrylamide gels and the presence of CDP was detected by loss of complex formation in the presence of antibody. For upstream transcription factor (USF ) supershifts, antibody (a generous gift of Dr Emory Bresnick, University of Wisconsin, Madison) was added after incubation of probe, nonspecific competitor, and nuclear extract. Under these conditions, complexes containing USF remained at the top of the well.

For confirmation of YY1 binding, semi-purified preparations of YY1 were used. These were prepared as described earlier34 and contain significant amounts of YY1 binding activity, but no binding activity for any other transcription factor tested in this study.

Twenty-four phylogenetic footprints are detected in 1.4 kb of hypersensitive site 3.The region shown schematically in Fig 1 covers 1,401 bp of the hypersensitive site 3 region (positions 4205 to 5606 of the human sequence, GenBank HUMHBB). Thus, the alignment includes the previously described functional core of HS322,23 and extends an additional 342 bp 5′ and 834 bp 3′ from this core. The relationship of the region analyzed here to the areas previously assessed in functional studies is illustrated in Fig 1A.

Over the majority of the region studied, sequences from five mammalian species are available: human, galago, goat, rabbit, and mouse. Because evolutionary time is additive within such an alignment, this alignment spans roughly 370 million years (MY) of evolution. In previous studies, we defined a phyogenetic footprint as six contiguous base pairs showing 100% conservation.31,32 However, given the relative flexibility in binding exhibited by most transcription factors, the 6-bp criterion may be overly strict. For example, GATA-1 has been shown to bind with high affinity to AGATAG, TGATTG, AGATTG, GGATAA, TGATTA, GGTTAG, GGATAG, CGATAC, CGATTA, CGATAG, and AGATAA.35 Since the alignment used here covers a large amount of evolutionary time over which small fluctuations in sequence could be tolerated by GATA-1 and other transcription factors, a more relaxed phylogenetic footprinting criterion was used: the 6-bp window was preserved, but one base pair mismatch relative to the human sequence was allowed per species. Some of the motifs selected by this criterion are not highly conserved in sequence, but may be conserved in terms of binding function. A total of 24 probes were synthesized spanning these modified footprints (FP1-FP24, Fig 1A); some covered more than one phylogenetic footprint. Because some phylogenetic footprints were clustered, overlapping oligonucleotides were synthesized to ensure that conserved motifs were centered within the probe. These overlaps are shown schematically in Fig 1B.

All 24 human probes detected nuclear proteins. Figure 2 displays the gel shift patterns for each of these probes; the side-by-side run in three gels facilitates comparison of band migration. However, because it is difficult to obtain optimum binding of all factors using the same conditions, an idealized version of the binding results is displayed in Fig 3. This schematic compilation will be placed on the Globin Server. Band intensity in Fig 2 reflects not only binding affinity, but also specific factor concentration within the extract and the presence or absence of overlapping binding sites on the probe. To more directly compare affinity of binding among all probes and to establish a relative hierarchy of binding affinity, competition experiments were done. For these experiments, oligonucleotides containing well-characterized binding sites were labeled and used as probes. Each of the 24 HS3 competitor DNAs was tested at several concentrations for its ability to compete for binding of specific complexes. The results of these experiments are tabulated in Table 2. In Fig 3, band thicknesses reflect these affinity data. Although most of the binding factors could be identified, six could not. The complexes representing these unidentified factors could not be competed by oligonucleotides containing known binding sites for Sp1, Oct-1, GATA-1, CP1, CDP, YY1, SSP,36 CSBP-2,32 G1/G2,37 NF-E2, AP1, EKLF, USF, or TEF-2.

Fig. 2.

Representative patterns of binding interactions to 24 human oligonucleotide probes spanning phylogenetic footprints. Not all interactions could be detected using standard conditions, and other complexes, not visible in this figure, were detected. An idealized illustration of the binding interactions which includes all complexes reproducibly seen is shown in Fig 3. Binding was performed with K562 protein (3 to 5 μg) and 500 ng poly (dIdC). Unidentified complexes are numbered. The band marked with the star (FP15) indicates a complex that is only seen with higher concentrations of extracts and may represent the binding of more than one protein.

Fig. 2.

Representative patterns of binding interactions to 24 human oligonucleotide probes spanning phylogenetic footprints. Not all interactions could be detected using standard conditions, and other complexes, not visible in this figure, were detected. An idealized illustration of the binding interactions which includes all complexes reproducibly seen is shown in Fig 3. Binding was performed with K562 protein (3 to 5 μg) and 500 ng poly (dIdC). Unidentified complexes are numbered. The band marked with the star (FP15) indicates a complex that is only seen with higher concentrations of extracts and may represent the binding of more than one protein.

Close modal
Fig. 3.

Idealized representation of binding results. Band intensities are corrected for affinity of binding as determined in competition assays and summarized in Table 2. Binding interactions that were only detected by competition assays are not shown here, but are listed in Table 2. The unlabeled complex observed below some of the GATA bands is a subband that often accompanies GATA-1 binding.31-33 Sp1a (FP15) is a subband that accompanies Sp1 binding33; this band comigrates with but is distinct from CSBP-2.

Fig. 3.

Idealized representation of binding results. Band intensities are corrected for affinity of binding as determined in competition assays and summarized in Table 2. Binding interactions that were only detected by competition assays are not shown here, but are listed in Table 2. The unlabeled complex observed below some of the GATA bands is a subband that often accompanies GATA-1 binding.31-33 Sp1a (FP15) is a subband that accompanies Sp1 binding33; this band comigrates with but is distinct from CSBP-2.

Close modal
Table 2.

Combined Data: Factors Bound, Relative Binding Affinity, and Evolutionary Conservation

GATAYY1Oct-1SSPCSBP-2Sp1G1/G2NF-E2CDPUSFCP1Other
FP1 ++++ +++  +c +c    +c 
FP2    (++) (++++)       [1]++ 
FP3 +c ++++   (+) 
FP4      (++)    [2]+ 
FP5    ++ (+) 
FP6 (++) (+)   (++++) 
FP7 (+++) (+c)         +c 
FP8 +c (+c)  (+++) (++)  (++++)    +c [5]+c 
FP9  (+++)   (++) (++++) 
FP10  (++++)          ([3]++) 
FP11 (++++) +++          [4]++ 
FP12 (+) (++++) (++)     ++    [3]+ 
FP13 (+) (+++) (++) (++)   (+++)     ([5]++) 
FP14  (+++)  ++ 
FP15  +++ +c +c  (++++)     +c [5]+; [*] 
FP16 ++ (++++) ++     ++   [5]+ 
FP17  (++++)  +c 
FP18   ++ (+++)     +++ 
FP19 ++  (+)      +++   ([5]+++) 
FP20 (+++)  (+) ++     ++   [5]++ 
FP21 (+++)  (+++)         [5]+ 
FP22   (++)  
FP23 (+)  (++) +++        [5]++; [6]++ 
FP24 (++)  (++)         [5]++; [6]++ 
GATAYY1Oct-1SSPCSBP-2Sp1G1/G2NF-E2CDPUSFCP1Other
FP1 ++++ +++  +c +c    +c 
FP2    (++) (++++)       [1]++ 
FP3 +c ++++   (+) 
FP4      (++)    [2]+ 
FP5    ++ (+) 
FP6 (++) (+)   (++++) 
FP7 (+++) (+c)         +c 
FP8 +c (+c)  (+++) (++)  (++++)    +c [5]+c 
FP9  (+++)   (++) (++++) 
FP10  (++++)          ([3]++) 
FP11 (++++) +++          [4]++ 
FP12 (+) (++++) (++)     ++    [3]+ 
FP13 (+) (+++) (++) (++)   (+++)     ([5]++) 
FP14  (+++)  ++ 
FP15  +++ +c +c  (++++)     +c [5]+; [*] 
FP16 ++ (++++) ++     ++   [5]+ 
FP17  (++++)  +c 
FP18   ++ (+++)     +++ 
FP19 ++  (+)      +++   ([5]+++) 
FP20 (+++)  (+) ++     ++   [5]++ 
FP21 (+++)  (+++)         [5]+ 
FP22   (++)  
FP23 (+)  (++) +++        [5]++; [6]++ 
FP24 (++)  (++)         [5]++; [6]++ 

+ to ++++, Relative affinity of binding to the human probe (++++ = most avid binding). Complexes bound by both human and galago probes (regardless of affinity) are in parentheses. Numbers in “Other” column correspond to unidentified complexes shown in Fig 2.

Abbreviation: c, competes for binding in a competition assay.

*

See Fig 2, FP15.

Binding interactions in the core of HS3.The hypersensitive core of HS3 has been finely mapped in MEL cells by Philipsen et al.22 The two areas of hypersensitivity detected in that study (corresponding to HUMHBB 4240-4375 and 4550-4780) are indicated by the stippled bars in Fig 1. Nine phylogenetic footprints (FP1, FP2, and FP5-11) are contained within these hypersensitive regions; seven of these (FP5-11) lie within the 225-bp core (HphI to Fnu4HI) that harbors most of the enhancing activity of the HS3 region.22 23 

In their analysis of binding interactions within the core of HS3, Philipsen et al22 observed six DNaseI footprints in the HphI-Fnu4HI fragment. A subset of the specific proteins binding to these footprints were also identified using gel shift studies.18,23 A number of the probes used in the present analysis overlap substantially with those used in the Philipsen study (Table 3) and have allowed further characterization of several previously undefined binding interactions. Of particular interest is the fact that factor “X,” a ubiquitously distributed protein that binds to footprint 1 of Philipsen et al,23 has now been identified as CSBP-2 (Conserved Sequence Binding Protein-2)32 in the probe FP6. Figure 4A establishes that the FP6 oligonucleotide is one of four (FP18, FP8, FP6, and FP2) that compete well for the binding of CSBP-2 to a previously characterized CSBP-2 binding site located at −835 bp upstream from the ε gene. Six additional probes showed weaker competition (data not shown; tabulated in Table 2). In an earlier study, a common 11-bp motif was noted in several strong CSBP-2 binding sites in the ε upstream region.32 As shown in Fig 4C, nine nucleotides of this motif are also well conserved in two of the four strong CSBP-2 binding oligonucleotides of HS3 (FP6 and FP18). FP8 contains a related motif that matches six of the nine positions, while FP2 contains four different motifs that each match at six of nine positions: AAGAGTCAA (sense strand); CTCAGTCTT (antisense strand); ACAAGACTG (sense strand) and CTGAGCTCA (part of a dyad, and found on both strands). To further confirm that this motif represents the core binding motif for CSBP-2, an oligonucleotide probe was synthesized that was identical to −698ε, except that four bases at the center of the putative core motif were altered (AGAC to CTCG, Fig 4C). This mutation severely curtailed CSBP-2 binding (Fig 4B). In competition studies, this mutant oligonucleotide failed to compete for CSBP-2 binding to FP18, FP8, FP6, and FP2 (not shown). Other mutations that were earlier shown to affect the binding of this complex are summarized in Fig 4C.23 All of these data support the sequence SHBAGAYAS as the recognition core motif for the CSBP-2 complex.

Table 3.

Binding Interactions in the Core of HS3

HS3 DNase I Footprint22 23 Factors Bound18 23 Corresponding PhylogeneticFactors Bound to Phylogenetic
FootprintFootprint Probes
Footprint 1 GATA-1 FP6 GATA-1 
 Factor X  CSBP-2 
Footprint 2 Sp1 Not conserved ND 
 TEF-2   
 4 unidentified proteins   
Footprint 3 GATA-1 FP7 GATA-1 
Footprint 4 Sp1 FP8, FP9, FP10, FP11 (partial) Sp1 
 CACCC binding protein  SSP 
 7 unidentified proteins  G1/G2 
   CSBP-2 
   YY1 (two sites) 
Footprint 5 GATA-1 FP9 (partial) GATA-1 
  FP10 (partial), FP11 YY1 
   CSBP-2 
Footprint 6 Sp1 Not conserved ND 
 4 unidentified proteins   
HS3 DNase I Footprint22 23 Factors Bound18 23 Corresponding PhylogeneticFactors Bound to Phylogenetic
FootprintFootprint Probes
Footprint 1 GATA-1 FP6 GATA-1 
 Factor X  CSBP-2 
Footprint 2 Sp1 Not conserved ND 
 TEF-2   
 4 unidentified proteins   
Footprint 3 GATA-1 FP7 GATA-1 
Footprint 4 Sp1 FP8, FP9, FP10, FP11 (partial) Sp1 
 CACCC binding protein  SSP 
 7 unidentified proteins  G1/G2 
   CSBP-2 
   YY1 (two sites) 
Footprint 5 GATA-1 FP9 (partial) GATA-1 
  FP10 (partial), FP11 YY1 
   CSBP-2 
Footprint 6 Sp1 Not conserved ND 
 4 unidentified proteins   

Abbreviation: ND, not determined.

Fig. 4.

(A) Competition survey of FP probes shows binding sites for CSBP-2, SSP, and YY1. The labeled probe was described in an earlier study and spans a phylogenetic footprint located at −835 upstream from the ε gene.32 This probe binds CSBP-2 and SSP at moderate affinity and YY1 weakly (indicated at left). Each lane contains 4 μg of K562 extract, 0.2 ng of labeled probe (20,000 cpm), 500 ng poly (dIdC), and 100 ng of the specific competitor indicated above the lane. Lane 1 contains no added competitor. (B) Effect of a 4-bp mutation on CSBP-2 binding. The labeled probe spans a characterized CSBP-2 binding site located −698 bp upstream from the ε gene.32 In lane 2 (WT), this probe was used in a binding assay with K562 extract, poly(dIdC) and no specific competitor. In lane 1, the same probe, but containing base changes at the four sites shown in (C) was used in the binding (M). (C) Alignment of five moderate- to high-affinity CSBP-2 sites found at conserved sites in the globin cluster. The −835 ε, −698 ε, and −49 ε sequences were described earlier.32 The location and exact sequence of several mutations that have been shown to disrupt binding of the CSBP-2 complex are shown below the consensus sequence. Mutants 2 and 4 are from Philipsen et al23 and were shown to severely inhibit the binding of factor “X,” here identified as CSBP-2. Effect of the −698 ε mutation is shown in (B). At the bottom is the sequence of the portion of the HS3 core known as “footprint 1” described by Philipsen et al.23 The location of the CSBP-2 consensus on the antisense strand is shown by the long arrow below the sequence. A GATA-1 consensus sequence is also located on the bottom strand (overline). S, G or C; B, not A; H, not G; Y, T or C. (D) Effect of chelators on binding of the zinc finger protein YY1. The probe for (D) is a well-described YY1 binding site (Table 1). Lane 1 is a standard binding reaction with K562 extract (described in Materials and Methods). In lane 2, 100 ng of specific competitor (unlabeled probe) was added to show specificity of binding. In lanes 3 and 4, 3 mmol/L orthophenanthroline and 1 mmol/L EDTA, respectively, was added to the binding reaction. The position of the YY1 complex (large arrowhead) and a minor subband described earlier (small arrowhead)31 is shown. Both complexes are sensitive to the presence of chelators. (E) The CSBP-2 complex is not sensitive to chelation. In lane 1, the −698ε probe was used in a standard binding reaction. Chelators were added to other lanes as follows: lane 2, 3 mmol/L orthophenanthroline; lane 3, 6 mmol/L orthophenanthroline; lane 4, 1 mmol/L EDTA; lane 5, 3 mmol/L EDTA. The position of the CSBP-2 complex is marked with an arrowhead.

Fig. 4.

(A) Competition survey of FP probes shows binding sites for CSBP-2, SSP, and YY1. The labeled probe was described in an earlier study and spans a phylogenetic footprint located at −835 upstream from the ε gene.32 This probe binds CSBP-2 and SSP at moderate affinity and YY1 weakly (indicated at left). Each lane contains 4 μg of K562 extract, 0.2 ng of labeled probe (20,000 cpm), 500 ng poly (dIdC), and 100 ng of the specific competitor indicated above the lane. Lane 1 contains no added competitor. (B) Effect of a 4-bp mutation on CSBP-2 binding. The labeled probe spans a characterized CSBP-2 binding site located −698 bp upstream from the ε gene.32 In lane 2 (WT), this probe was used in a binding assay with K562 extract, poly(dIdC) and no specific competitor. In lane 1, the same probe, but containing base changes at the four sites shown in (C) was used in the binding (M). (C) Alignment of five moderate- to high-affinity CSBP-2 sites found at conserved sites in the globin cluster. The −835 ε, −698 ε, and −49 ε sequences were described earlier.32 The location and exact sequence of several mutations that have been shown to disrupt binding of the CSBP-2 complex are shown below the consensus sequence. Mutants 2 and 4 are from Philipsen et al23 and were shown to severely inhibit the binding of factor “X,” here identified as CSBP-2. Effect of the −698 ε mutation is shown in (B). At the bottom is the sequence of the portion of the HS3 core known as “footprint 1” described by Philipsen et al.23 The location of the CSBP-2 consensus on the antisense strand is shown by the long arrow below the sequence. A GATA-1 consensus sequence is also located on the bottom strand (overline). S, G or C; B, not A; H, not G; Y, T or C. (D) Effect of chelators on binding of the zinc finger protein YY1. The probe for (D) is a well-described YY1 binding site (Table 1). Lane 1 is a standard binding reaction with K562 extract (described in Materials and Methods). In lane 2, 100 ng of specific competitor (unlabeled probe) was added to show specificity of binding. In lanes 3 and 4, 3 mmol/L orthophenanthroline and 1 mmol/L EDTA, respectively, was added to the binding reaction. The position of the YY1 complex (large arrowhead) and a minor subband described earlier (small arrowhead)31 is shown. Both complexes are sensitive to the presence of chelators. (E) The CSBP-2 complex is not sensitive to chelation. In lane 1, the −698ε probe was used in a standard binding reaction. Chelators were added to other lanes as follows: lane 2, 3 mmol/L orthophenanthroline; lane 3, 6 mmol/L orthophenanthroline; lane 4, 1 mmol/L EDTA; lane 5, 3 mmol/L EDTA. The position of the CSBP-2 complex is marked with an arrowhead.

Close modal

To further characterize the CSBP-2 protein, binding was performed in the presence of the metal chelators EDTA or orthophenanthroline. Most zinc finger proteins show reduced binding in the presence of these compounds, as illustrated by the response of YY1, a well-characterized zinc finger protein (Fig 4D, lanes 3 and 4). In contrast, CSBP-2 binding is not reduced in the presence of chelators (Fig 4E, lanes 2 through 5). Thus, CSBP-2 is not likely to be a member of the zinc finger class of transcription factors. Insensitivity of this complex to chelators was confirmed using all four probes with high affinity CSBP-2 sites (FP18, FP8, FP6, and FP2).

Other previously undescribed binding interactions of note within the HphI-Fnu4HI core include two strong sites for SSP (stage selector protein)36 in FP5 and FP8 (Figs 4A and 5). The high-affinity binding site for G1/G2, a complex of putative repressor proteins, within FP8 was described in an earlier study37 and is further discussed below.

Fig. 5.

Competition scan for SSP binding sites. The probe in all lanes is derived from the -50 region of the human γ promoter, a well-characterized SSP binding site.36 Binding assays were done using standard conditions and K562 extracts. Unlabeled competitor was added in the prebinding step at a level of 100 ng. Lane 1 contains no added competitor. Lane 2 represents self competition. All other lanes contain one of the FP probes (indicated above the lane) as specific competitor. The position of the Sp1 complex, SSP complex, and another Sp1-related complex (S) are indicated at left.

Fig. 5.

Competition scan for SSP binding sites. The probe in all lanes is derived from the -50 region of the human γ promoter, a well-characterized SSP binding site.36 Binding assays were done using standard conditions and K562 extracts. Unlabeled competitor was added in the prebinding step at a level of 100 ng. Lane 1 contains no added competitor. Lane 2 represents self competition. All other lanes contain one of the FP probes (indicated above the lane) as specific competitor. The position of the Sp1 complex, SSP complex, and another Sp1-related complex (S) are indicated at left.

Close modal

Binding to conserved sequences 5′ to the HS3 core.The DNaseI hypersensitive region 5′ to the functional core contains two phylogenetic footprints. One of these, FP1, binds both YY1 and GATA-1 with high affinity. Simultaneous binding of these two factors has also been observed in the ε promoter, and it has been postulated that the differential binding of these two factors during development could be involved in the switching mechanism.38 39 Simultaneous strong binding of YY1 and GATA-1 also occurs at FP11. Several other probes are bound by both YY1 and GATA-1, but in these cases one of the two factors binds strongly and the other weakly (FP3, FP6, FP7, FP12, FP13, and FP16; see Fig 2 and Table 2). FP2, which also falls within the 5′-most region of hypersensitivity, is bound with high affinity by SSP (Figs 4A and 5), CSBP-2 (Fig 4A), and an unidentified protein (last column of Table 2, factor 1) which appears to be enriched in erythroid cells, but not restricted to this cell type (data not shown).

The region between the two DNaseI hypersensitive regions contains two conserved sequence motifs. FP3 binds YY1 with high affinity, and CSBP-2 and GATA-1 weakly. FP4 contains a highly conserved binding site for NF-E2 of moderate affinity which had been previously noted.18,22,23 In functional assays in MEL cells, this site caused a moderate increase in expression level when linked to fragments from the HS3 core and the human β-globin promoter.18 Furthermore, restriction fragments containing this site can enhance expression of an ε globin-luciferase construct in transiently transfected K562 cells,40 and in vivo footprinting studies show contacts at this site in U11 cells.41 

Binding to conserved sequence elements 3′ to the hypersensitive core region.Probes FP12 through FP15 lie outside of the HS3 core but within the 1.9-kb HindIII fragment that has been tested in functional studies (Fig 1). The FP12 probe is bisected by the Fnu4HI site that defines the 3′ end of the most active core fragment and the FP16 probe is bisected by the 3′ HindIII site at 5172. Several binding interactions in this region are of particular note. FP13 contains a high-affinity binding site for the G1/G2 complex of proteins (Fig 6), proposed repressors of embryonic globin genes in fetal cells.37 Thus, two sites for the G1/G2 complexes exist in HS3 (FP8 and FP13). Figures 6A and B illustrate the cross competition between G1/G2 complexes binding to FP8, FP13, and the galago proximal CCAAT box probe (G), the first binding site described for these complexes.37 In Fig 6C, the cell-type specificity of the G1/G2 complex is confirmed using the FP13 probe; the complexes are present in K562 and absent in Jurkat cells. This had been shown earlier for the FP8 probe and the GALCAT probe.37 

Fig. 6.

Two binding sites for the G1/G2 complexes. (A) Binding of G1/G2 to FP8. The binding assay was done under standard conditions, using a HeLa extract. Competitor DNA (100 ng) was added as indicated above the lanes. 0, no competitor; G, GALCAT competitor (see Table 1); this competitor binds the G1/G2 complexes37; 13 and 14, human FP probes. (B) Competition assay to define the complexes that bind probe FP13. K562 extracts were used. The identities of the complexes, established by these and other competition and binding assays are indicated at left. Competitors (100 ng) added were: 0, no competitor; S, FP13; 8, FP8; G, GALCAT; γ-175, from the human γ promoter −175 region (see Table 1), an oligonucleotide that binds GATA-1, GATA-like proteins and Oct-1.32,33 Although the GATA complex bound by FP13 comigrates with GATA-1, it is not erythroid specific (data not shown). A similar complex (called NOTAGATA) has been noted in HS2 (Zhu et al, unpublished data). (C) Cell-type specificity of the factors that bind to FP13. The G1/G2 complexes are not found in Jurkat cells (J) as indicated earlier,37 but are present in other nonerythroid cells (not shown). Likewise, complex (5) is missing in Jurkat cells, and present in K562 cells (K) as well as several other cell types (CaCo2 and HeLa cells, data not shown).

Fig. 6.

Two binding sites for the G1/G2 complexes. (A) Binding of G1/G2 to FP8. The binding assay was done under standard conditions, using a HeLa extract. Competitor DNA (100 ng) was added as indicated above the lanes. 0, no competitor; G, GALCAT competitor (see Table 1); this competitor binds the G1/G2 complexes37; 13 and 14, human FP probes. (B) Competition assay to define the complexes that bind probe FP13. K562 extracts were used. The identities of the complexes, established by these and other competition and binding assays are indicated at left. Competitors (100 ng) added were: 0, no competitor; S, FP13; 8, FP8; G, GALCAT; γ-175, from the human γ promoter −175 region (see Table 1), an oligonucleotide that binds GATA-1, GATA-like proteins and Oct-1.32,33 Although the GATA complex bound by FP13 comigrates with GATA-1, it is not erythroid specific (data not shown). A similar complex (called NOTAGATA) has been noted in HS2 (Zhu et al, unpublished data). (C) Cell-type specificity of the factors that bind to FP13. The G1/G2 complexes are not found in Jurkat cells (J) as indicated earlier,37 but are present in other nonerythroid cells (not shown). Likewise, complex (5) is missing in Jurkat cells, and present in K562 cells (K) as well as several other cell types (CaCo2 and HeLa cells, data not shown).

Close modal

FP14 contains binding sites for CSBP-2, SSP, and YY1 (Figs 1 and 5). The binding of all three of these proteins to the same probe was previously noted at three sites (−1095, −835, and −49) upstream from the ε gene.32 FP15 contains a strong Sp1 binding site (Table 1; Fig 5). It has been suggested that Sp1 is capable of protein-protein interactions with itself,42 with GATA-1,43 and with YY1,44,45 and it has been speculated that such protein-protein interaction could be important in generating contacts between the LCR and the individual promoters.43 Thus, HS3 contains two high-affinity Sp1 sites located within or near conserved regions (FP9 and FP15) and two additional avid Sp1 sites, neither of which is conserved (Table 3).22 23 

FP16 is bound by CDP (CCAAT displacement protein) as shown by competition assays (Fig 7A) and antibody-mediated binding inhibition (Fig 7B). This is one of four CDP binding sites within HS3 (FP1, FP16, FP19, FP20).

Fig. 7.

CDP binding sites in HS3. (A) FP19, FP20, FP16, and FP1 contain binding sites for CDP. In this competition scan, the FP19 probe was labeled and binding assays were performed with K562 extracts. Incubations were performed at 4°C and gels (3.5%) were also run at 4°C for 3.5 hours to enable detection of the CDP complex, which is large and has a tendency to degrade. Competitors (100 ng) were added as indicated above the lane. Lane 1 has no competitor (0). The final lane (CDP) confirms that the slowest complex is efficiently competed by an oligonucleotide containing a well-characterized CDP binding site.46 (B) Anti-CDP antibody inhibits formation of the CDP complex. In the left panel, the CDP oligonucleotide (Table 1) was used as a probe; in the right panel, the probe was FP19. For both panels, lane 1 consists of a standard binding assay (no antibody preincubation); in lane 2, 0.5 μL of anti-CDP antibody was preincubated with extract protein before probe addition as described in Materials and Methods; in lane 3, a similar volume of preimmune serum was added in the preincubation step. Formation of the CDP complex (arrowhead) is abolished in the presence of antibody.

Fig. 7.

CDP binding sites in HS3. (A) FP19, FP20, FP16, and FP1 contain binding sites for CDP. In this competition scan, the FP19 probe was labeled and binding assays were performed with K562 extracts. Incubations were performed at 4°C and gels (3.5%) were also run at 4°C for 3.5 hours to enable detection of the CDP complex, which is large and has a tendency to degrade. Competitors (100 ng) were added as indicated above the lane. Lane 1 has no competitor (0). The final lane (CDP) confirms that the slowest complex is efficiently competed by an oligonucleotide containing a well-characterized CDP binding site.46 (B) Anti-CDP antibody inhibits formation of the CDP complex. In the left panel, the CDP oligonucleotide (Table 1) was used as a probe; in the right panel, the probe was FP19. For both panels, lane 1 consists of a standard binding assay (no antibody preincubation); in lane 2, 0.5 μL of anti-CDP antibody was preincubated with extract protein before probe addition as described in Materials and Methods; in lane 3, a similar volume of preimmune serum was added in the preincubation step. Formation of the CDP complex (arrowhead) is abolished in the presence of antibody.

Close modal

Finally, numerous YY1 binding sites were found in this region (Fig 1, Fig 2, Fig 4A, Table 1, Table 2). Interestingly, eight of the highest affinity sites in HS3 are located within a region that begins at the 3′ end of the HS3 core and stretches over 650 bp: FP9/10 (the same YY1 site is probably detected by overlapping FP9 and FP10 probes), FP11, FP12, FP13, FP14, FP15, FP16, FP17. For all of these probes, the identity of the YY1 complex was confirmed by performing binding assays with highly purified YY1 protein prepared as described previously34 (data not shown).

Binding interactions in functionally untested areas 3′ to HS3.Probes FP17 through FP24 lie 3′ to the HindIII site at 5172. Only one study using transgenic mice has included this region in an HS3-containing fragment47; however, the function of the region 3′ to the HindIII site was not specifically tested in that study. FP17 contains a very high-affinity site for YY1 and little additional binding activity. A strong binding site for USF was identified in FP18 (Fig 8A) and confirmed by supershift analysis with an anti-USF antibody (Fig 8B). The FP18 probe is also bound with high affinity by CSBP-2 (Fig 4A) and with low affinity by SSP and YY1 (Figs 4A and 5A). Binding sites for CDP are present in probes FP19 and FP20 (Fig 7).

Fig. 8.

Identification of a binding site for USF in FP18. (A) Cross competition between an oligonucleotide containing a known USF binding site (Table 1) and the FP18 oligonucleotide (K562 extract). Probes and competitors are indicated above the lanes. The amount of competitor added in lanes marked a, b, and c was 20, 40, and 80 ng, respectively. The band marked with the arrowhead is USF; the band marked with the small arrow is CSBP-2. The faster migrating complex formed on the USF probe was not identified, but this complex is also competed efficiently by FP18. (B) Antibody supershifts of the USF complex on the USF oligonucleotide and on FP18. Assays were performed as described in Materials and Methods. In each set of three lanes, lane 1 contains a standard binding assay, lane 2 contains 0.3 mL of anti USF antibody, and lane 3 contains a similar amount of pre-immune serum. Arrowhead, USF complex.

Fig. 8.

Identification of a binding site for USF in FP18. (A) Cross competition between an oligonucleotide containing a known USF binding site (Table 1) and the FP18 oligonucleotide (K562 extract). Probes and competitors are indicated above the lanes. The amount of competitor added in lanes marked a, b, and c was 20, 40, and 80 ng, respectively. The band marked with the arrowhead is USF; the band marked with the small arrow is CSBP-2. The faster migrating complex formed on the USF probe was not identified, but this complex is also competed efficiently by FP18. (B) Antibody supershifts of the USF complex on the USF oligonucleotide and on FP18. Assays were performed as described in Materials and Methods. In each set of three lanes, lane 1 contains a standard binding assay, lane 2 contains 0.3 mL of anti USF antibody, and lane 3 contains a similar amount of pre-immune serum. Arrowhead, USF complex.

Close modal

Moderate to strong binding sites for the erythroid-specific GATA-1 protein are detected with FP19, FP20, FP21, and FP24 and a weaker GATA-1 interaction is seen with FP23 (Figs 2 and 3). FP20 and FP21 probes overlap and most likely detect the same GATA-1 interaction. The detection of four GATA-1 binding sites in this region is of interest considering the central role played by this factor in erythroid development48,49 and the possibility that GATA-1 could mediate LCR-promoter contact via protein-protein interactions.43 As has been noted in earlier studies, Oct-1 binding often accompanies GATA-1 binding,31 32 and this is the case with each of these probes. Oct-1 binding is weak on FP19 and FP20 and strong on FP21, FP23, and FP24. Competition studies suggest that the binding of CDP to FP19 and FP20 interferes with the binding of Oct-1 to these probes (data not shown). Finally, a strong binding site for SSP is detectable with the FP23 probe (Figs 4A and 5A).

Patterns of factor binding.When all data are accumulated (Fig 2), three overall patterns are apparent. First, a striking set of strong YY1 interactions is visible in the center of the region surveyed. While FP9 and FP10 likely share the same YY1 binding site, none of the other eight probes (FP11 through FP18) that exhibit YY1 binding are overlapping. Additional weak sites (FP6, FP7, FP8, FP18, see Table 2) extend this YY1-rich region. A second pattern is shown by the distribution of Oct-1 sites, which appear to be concentrated in the 3′ end of the region surveyed. In this region, except for FP22, Oct-1 sites are always accompanied by GATA-1 sites. In the 5′ end of the region, in contrast, strong GATA-1 sites are seen in the absence of Oct-1 binding. A final pattern concerns the distribution of factor 5, for which eight sites were detected. Five of these were located in the 3′ end of the region tested.

Conservation of binding interactions.To further pinpoint the subset of interactions that are most highly conserved, all 24 probes were resynthesized using the galago sequence at each site. Human and galago last shared a common ancestor approximately 55 MY ago.29 Although the galago is a primate, its globin genes (particularly the γ and β globin genes) are expressed in a pattern that is more similar to mouse and rabbit than to human. Binding interactions that are conserved between human and galago may be significant for common aspects of globin gene regulation that are shared by both species (eg, expression of ε in embryonic life and its repression in the fetal liver) while binding interactions that differ could be important for the evolution of these new patterns of expression (eg, expression of the γ gene as an embryonic gene in galago and a fetal gene in human). For each set of probes binding patterns were compared directly and six such comparisons are shown in Fig 9. In addition, the ability of the galago oligonucleotide (100 ng) to compete for binding of identified human complexes was tested. Proteins bound by galago and not human probes were not further identified. Binding interactions were designated “conserved” if evidence for binding to both probes could be shown by these two assays (regardless of differences in affinity of binding). In Table 2, all interactions that were found to be conserved between human and galago are shown in parentheses.

Fig. 9.

Comparison of human and galago binding patterns. A subset of the comparison gel shifts are shown. In some cases, complexes identified with the human probe are more intense with the galago probe: CSBP-2 in FP3; YY1 in FP7; YY1 in FP13. In other cases, complexes are less intense, but still present with the galago probe: G1/G2 and GATA in the FP13 probe. All such results were retested using competition assays and the results are summarized in Table 2. C2, CSBP-2; GAT, GATA-like; G, G1/G2.

Fig. 9.

Comparison of human and galago binding patterns. A subset of the comparison gel shifts are shown. In some cases, complexes identified with the human probe are more intense with the galago probe: CSBP-2 in FP3; YY1 in FP7; YY1 in FP13. In other cases, complexes are less intense, but still present with the galago probe: G1/G2 and GATA in the FP13 probe. All such results were retested using competition assays and the results are summarized in Table 2. C2, CSBP-2; GAT, GATA-like; G, G1/G2.

Close modal

The HS3 region has been implicated in stage specificity,13 position independence,10 chromatin alteration,50,51 and transcriptional regulation.18,22,23 47 These diverse functions are more than likely the result of the very complex set of transcription factors that bind to this region of the LCR. Through the use of phylogenetic footprinting, we have identified a large number of evolutionarily conserved and potentially important cis elements within HS3 and have identified the trans factors binding at or near these elements. In addition, the resulting “maps” of binding factors reveal interesting patterns within the binding interactions that may be of functional importance. Finally, analysis of the sequences bound adds further information about the binding requirements of uncharacterized transcription factors such as CSBP-2. Several of the specific binding interactions identified in this study are of particular interest.

First, eight moderate- to high-affinity binding sites for SSP were observed (FP2, FP5, FP8, FP13, FP14, FP18, FP20, and FP23). Binding of SSP to the −50 region of the γ promoter appears to be responsible for the competitive advantage, in cis, of the γ promoter over the β promoter in transient assays in K562 cells.36 Those transient assays were performed using a construct containing HS2. We have detected several SSP sites in HS2 (Zhu et al, in preparation), but most of these HS2 sites are lower in affinity than those in HS3. It is not known whether the cis competition effects seen in the K562 assay require the SSP sites in HS2 nor whether HS3 would also mediate such an activity. It is also noteworthy that five strong binding sites for SSP were identified upstream from the ε gene in an earlier study.32 Thus, in addition to its involvement in the γ to β switch, SSP may play a role in ε gene transcription and/or LCR:ε interaction.

Several strong binding sites for the CSBP-2 protein were detected in this study. An earlier phylogenetic footprinting analysis of the ε upstream region showed seven binding sites for this protein; in contrast, no CSBP-2 binding sites were found in phylogenetic footprints upstream from the γ gene. One of the strongest CSBP-2 sites in HS3 was recognized in FP6. In an earlier analysis, Philipsen et al23 noted this same complex (called “Factor X” in that study) within the region known as DNase footprint 1. Deletion of footprint 1 resulted in loss of 70% of the activity of the entire core fragment in MEL assays.18 The investigators concluded that the transcriptional activity of footprint 1 is due to the presence of a strong binding site for GATA-1, because mutation of the factor X site in the presence of an intact GATA-1 site appeared to have no effect in transgenic mice.23 However, that analysis was carried out only on 13.5-day founders and the test construct contained only the β globin gene. Given the number of CSBP-2 binding sites upstream from ε, and the apparent importance of the HS3 region in embryonic expression,13 52 it would be of interest to test the role of this factor in ε gene expression. Derivation of a consensus binding sequence, SHBAGAYAS, will facilitate the identification of additional CSBP-2 sites in the globin cluster.

A striking arrangement of 13 clustered YY1 sites was detected in the center of the area surveyed (FP6-FP18, Table 2). Eight of these sites are of high affinity (Figs 2 and 3) and are located within a 650-bp region beginning at the end of the HS3 core and stretching 3′. Most of the high-affinity sites (6/8) are conserved in the galago (Fig 9 and data not shown). The location of these clustered YY1 sites is interesting in light of two recent studies in which deletions of different regions of HS3 were shown to produce dramatically different phenotypes in mice.52,53 In one study, deletion of the region that includes FP1-FP16 (and removes all but one of the clustered YY1 sites) had only mild effects on globin gene transcription.53 In a second study,52 deletion of the HS3 core (the region spanning FP5-FP11) seriously reduced the expression of the ε, γ, and β genes. Among the possible explanations for the dramatic effect of this smaller deletion, the investigators suggested a dominant negative effect mediated by sequences surrounding the deleted core. In fact, this deletion removes only two (FP9/10, FP11) of the nine consecutive YY1 binding sites immediately downstream from the core (Figs 2 and 3), and juxtaposes the strong YY1 binding sites at FP1 and FP3 to the remaining YY1 sites downstream from the core. Strong silencing effects mediated through these multiple juxtaposed YY1 binding sites and/or other nearby repressor sites (such as the three CDP binding sites in FP16, FP19, and FP20 or the strong G1/G2 site in FP15) could potentially lead to a dominant negative phenotype in the absence of the core region. In fact, an enhancer silencing property of YY1 was recently shown in studies of the embryonically expressed chicken ρ-globin gene in K562 cells.54 The identification of this string of YY1 binding sites provides an opportunity to design experiments to test whether or not they can mediate a dominant negative silencing effect and to investigate the mechanism of such an effect.

Besides its repressor activities, YY1 has additional interesting properties. It is known to interact with other proteins such as Sp1.44,45 Because Sp1 binds to conserved CACCC motifs in the promoter regions of all β-like globin genes55 these clustered YY1 sites could stabilize important enhancer/promoter interactions. In fact, YY1 is also known to interact with TAF55, a coactivator that is part of the basal transcription machinery.56 Moreover, several systems have been described in which YY1 mediates a switch from activation to repression57-60 and all of these switching systems rely on the ability of YY1 to interact with other proteins. YY1 can also bend DNA58 and could thus be involved in the generation of the three-dimensional structure of the LCR holocomplex. Finally, YY1 has recently been shown to preferentially associate with the nuclear matrix.61 All of these attributes make further functional analysis of the multiple high-affinity binding sites for YY1 found in HS3 a compelling goal.

Downstream from the string of YY1 binding sites are three binding sites for CDP, another protein which has been implicated as a repressor.46 In Drosophila, the CDP homologue, cut, is involved in fate determination.62,63 The combination of CDP sites, YY1 sites, and a strong binding site for the G1/G2 complex of proteins, also putative repressors,37 make this region of HS3 (FP9 through FP20) remarkably rich in repressor binding proteins.

The tabular compilation of the binding results obtained in this study suggests that the functional properties of HS3 are carried out by multiple redundant sites for a limited number of transcription factors (Table 2). Conservation of many of the high-affinity YY1, GATA-1, Oct-1, G1/G2, SSP, CSBP-2, and Sp1 sites suggests that these factors could be important in the control of regulatory properties necessary for overall LCR function that are shared between human and galago globin genes. In contrast, several binding interactions are not conserved, including the strong USF site in FP18, all four CDP sites, the strong YY1 site in FP3, and avid SSP sites in FP5, FP18, FP20, and FP23. The evolutionary gain of some of these binding sites in the human LCR could have facilitated the gain of new LCR functions that affect the pattern of globin gene expression (eg, the recruitment of the previously embryonic γ globin gene to a fetal expression pattern).

Integration of the data in this report with the large number of other published binding and functional studies of the globin gene locus will be greatly aided by electronic collation. The Globin Gene Server64 provides an electronic database with the capacity to respond to user queries; software available at this web site should greatly aid in the integration of this information and the generation of new hypotheses regarding globin gene regulation.

The authors thank Dr Ellis Neufeld and Dr Emory Bresnik for generously providing antibodies for the supershift experiments performed with CDP and USF, respectively.

Support from National Institutes of Health Grants No. HL-48802 (D.L.G.), HL-33940 (M.G.), LM-05773 (R.H.), LM-05110 (R.H.), and DK-27635 (R.H.) is gratefully acknowledged. Supported in part by the Michigan General Clinical Research Center Grant MO1-RR00042, funded by the National Center for Research Resources (NIH-USPHS).

Address reprint requests to Deborah L. Gumucio, PhD, University of Michigan Medical School, Department of Anatomy and Cell Biology, 5793A Medical Science II, Ann Arbor, MI 48109-0616.

1
Chada
K
Magram
J
Costantini
F
An embryonic pattern of expression of a human fetal globin gene in transgenic mice.
Nature
319
1986
685
2
Kollias
G
Wrighton
N
Hurst
J
Grosveld
F
Regulated expression of human Aγ-, β-, and hybrid γβ-globin genes in transgenic mice: Manipulation of the developmental expression patterns.
Cell
46
1986
89
3
Townes
TM
Lingrel
JB
Chen
HY
Brinster
RL
Palmiter
RD
Erythroid-specific expression of human β-globin genes in transgenic mice.
EMBO J
4
1985
1715
4
Grosveld
F
Greaves
DR
Kollias
G
Position-independent, high-level expression of the human β-globin gene in transgenic mice.
Cell
51
1987
975
5
Tuan
D
Soloman
W
Li
Q
London
IM
The “β-like-globin” gene domain in human erythroid cells.
Proc Natl Acad Sci USA
82
1985
6384
6
Forrester
WC
Thompson
C
Elder
JT
Groudine
M
A developmentally stable chromatin structure in the human β-globin gene cluster.
Proc Natl Acad Sci USA
83
1986
1359
7
Ney
PA
Sorrentino
BP
McDonagh
KT
Nienhuis
AW
Tandem AP-1-binding sites within the human β-globin dominant control region function as an inducible enhancer in erythroid cells.
Genes Dev
4
1990
993
8
Lowrey
CH
Bodine
DM
Nienhuis
AW
Mechanisms of DNase I hypersensitive site formation within the human globin locus control region.
Proc Natl Acad Sci USA
89
1992
1143
9
Curtin
PT
Liu
D
Liu
W
Change
JC
Kan
YW
Human β-globin gene expression in transgenic mice is enhanced by a distant DNase I hypersensitive site.
Proc Natl Acad Sci USA
86
1989
7082
10
Fraser
P
Hurst
J
Collis
P
Grosveld
F
DNaseI hypersensitive sites 1, 2, and 3 of the human β-globin dominant control region direct position-independent expression.
Nucleic Acids Res
18
1990
3503
11
Ryan
TM
Behringer
RR
Martin
NC
Townes
IM
Palmiter
RD
Brinster
RL
A single erythroid-specific DNase I super-hypersensitive site activates high levels of human β-globin gene expression in transgenic mice.
Genes Dev
3
1989
314
12
Driscoll
MC
Dobkin
CS
Alter
BP
γδβ-Thalassemia due to a de novo mutation deleting the 5′ β-globin gene activation-region hypersensitive sites.
Proc Natl Acad Sci USA
86
1989
7470
13
Fraser
P
Pruzina
S
Antoniou
M
Grosveld
F
Each hypersensitive site of the human β-globin locus control region confers a different developmental pattern of expression on the globin genes.
Genes Dev
7
1993
106
14
Collis
P
Antoniou
M
Grosveld
F
Definition of the minimal requirements within the human β-globin gene and the dominant control region for high level expression.
EMBO J
9
1990
233
15
Forrester
WC
Novak
U
Gelinas
R
Groudine
M
Molecular analysis of the human β-globin locus activation region.
Proc Natl Acad Sci USA
83
1989
5439
16
Talbot
D
Philipsen
S
Fraser
P
Grosveld
F
Detailed analysis of the site 2 region of the human β-globin dominant control region.
EMBO J
9
1989
2169
17
Moon
AM
Ley
TJ
Functional properties of the β-globin locus control region in K562 erythroleukemia cells.
Blood
77
1991
2272
18
Pruzina
S
Antoniou
M
Hurst
J
Grosveld
F
Philipsen
SS
Transcriptional activation by hypersensitive site three of the human β-globin locus control region in murine erythroleukemia cells.
Biochem Biophys Acta
35
1994
351
19
Caterina
JJ
Ryan
TM
Pawlik
KM
Palmiter
RD
Brinster
RL
Behringer
RR
Townes
TM
Human β-globin locus control region: Analysis of the 5′ DNase I hypersensitive site HS2 in transgenic mice.
Proc Natl Acad Sci USA
88
1991
1626
20
Caterina
JJ
Clavatta
DJ
Donze
D
Behringer
RR
Townes
TM
Multiple elements in human β-globin locus control region 5′ HS2 are involved in enhancer activity and position-independent transgene expression.
Nucleic Acids Res
22
1994
1006
21
Liu
D
Chang
JC
Moi
P
Liu
W
Kan
YW
Curtin
PT
Dissection of the enhancer activity of β-globin 5′ DNase I-hypersensitive site 2 in transgenic mice.
Proc Natl Acad Sci USA
89
1992
3899
22
Philipsen
S
Talbot
D
Fraser
P
Grosveld
F
The β-globin dominant control region: Hypersensitive site 3.
EMBO J
9
1990
2159
23
Philipsen
S
Pruzina
S
Grosveld
F
The minimal requirements for activity in transgenic mice of hypersensitive site 3 of the β-globin locus control region.
EMBO J
12
1993
1077
24
Pruzina
S
Hanscombe
O
Whyatt
D
Grosveld
F
Philipsen
S
Hypersensitive site 4 of the human β globin locus control region.
Nucleic Acids Res
19
1991
1413
25
Li
Q
Stamatoyannopoulos
G
Hypersensitive site 5 of the human β locus control region functions as a chromatin insulator.
Blood
84
1994
1399
26
Yu
J
Bock
JH
Slightom
JL
Villeponteau
B
A 5′ β-globin matrix-attachment region and the polyoma enhancer together confer position-independent transcription.
Gene
139
1994
139
27
Goodman
M
Koop
BF
Czelusniak
J
Weiss
ML
Slightom
JL
The eta-globin gene. Its long evolutionary history in the beta-globin gene family of mammals.
Mol Biol
180
1984
803
28
Mason
MM
Lee
E
Westphal
H
Reitman
M
Expression of the chicken β-globin gene cluster in mice: Correct development expression and distributed control.
Mol Cell Biol
15
1995
407
29
Tagle
DA
Koop
BF
Goodman
M
Slightom
JL
Hess
D
Jones
RT
Embryonic ε and γ globin genes of a prosimian primate (Galago crassicaudatis): Nucleotide and amino acid sequences, developmental regulation, and phylogenetic footprints.
J Molec Biol
203
1988
439
30
Gumucio DL, Blanchard-McQuate KL, Heilstedt-Williamson H, Tagle DA, Gray TA, Tarlé SA, Gragowski L, Goodman M, Slightom JL, Collins FS: γ-Globin gene regulation: Evolutionary approaches, in Stamatoyannopoulos G, Nienhuis A (eds): Proceedings of the 7th Conference on Hemoglobin Switching. Baltimore, MD, Johns Hopkins University Press, 1991
31
Gumucio
DL
Heilstedt-Wlliamson
H
Gray
TA
Tarlé
SA
Shelton
DA
Tagle
DA
Slightom
JL
Goodman
M
Collins
FS
Phylogenetic footprinting reveals a nuclear protein which binds to silencer sequences in the human γ and ε globin genes.
Mol Cell Biol
12
1992
4919
32
Gumucio
DL
Shelton
DA
Bailey
WL
Slightom
JL
Goodman
M
Phylogenetic footprinting reveals unexpected complexity in trans factor binding upstream from the ε globin gene.
Proc Natl Acad Sci USA
90
1993
6018
33
Gumucio
DL
Rood
KL
Gray
TA
Riordan
MF
Sartor
CI
Collins
FS
Nuclear proteins that bind the human γ-globin gene promoter: Alterations in binding produced by point mutations associated with hereditary persistence of fetal hemoglobin.
Mol Cell Biol
8
1988
5310
34
Yant
SR
Zhu
W
Millinoff
D
Slightom
JL
Goodman
M
Gumucio
DL
High affinity YY1 binding motifs: Identification of two core types (ACAT and CCAT) and distribution of potential binding sites within the human β globin cluster.
Nucleic Acids Res
23
1995
4353
35
Merika
M
Orkin
SH
DNA-binding specificity of GATA family transcription factors.
Mol Cell Biol
13
1993
3999
36
Jane
SM
Ney
PA
Vanin
EF
Gumucio
DL
Nienhuis
AW
Identification of a potential stage selector element in the human γ-globin gene promoter that fosters preferential interaction with the HS2 enhancer when in competition with the β-promoter.
EMBO J
11
1992
2961
37
Gumucio
DL
Shelton
DA
Blanchard-McQuate
K
Tarlé
SA
Gray
TA
Heilstedt-Williamson
H
Slightom
JL
Collins
FS
Goodman
M
Differential phylogenetic footprinting as a means to identify base changes responsible for recruitment of the anthropoid γ gene to a fetal expression pattern.
J Biol Chem
269
1994
15371
38
Peters
B
Merezhinskaya
N
Diffley
JF
Noguchi
CT
Protein-DNA interactions in the epsilon-globin gene silencer.
J Biol Chem
268
1993
3430
39
Raich
N
Clegg
CH
Grofti
J
Romeo
P
Stamatoyannopoulos
G
GATA1 and YY1 are developmental repressors of the human ε-globin gene.
EMBO J
14
1996
801
40
Jackson
JD
Petrykowska
H
Philipsen
S
Miller
W
Hardison
R
Role of DNA sequences outside the cores of DNase hypersensitive sites (HSs) in functions of the β-globin locus control region.
J Biol Chem
271
1996
11871
41
Strauss
EC
Orkin
SH
In vivo protein-DNA interactions at hypersensitive site 3 of the human β-globin locus control region.
Proc Natl Acad Sci USA
89
1992
5809
42
Pascal
E
Tjian
R
Different activation domains of Sp1 govern formation of multimers and mediate transcriptional synergism.
Genes Dev
5
1991
1646
43
Merika
M
Orkin
SH
Functional synergy and physical interactions of the erythroid transcription factor GATA-1 with the Kruppel family proteins Sp1 and EKLF.
Mol Cell Biol
15
1995
2437
44
Seto
E
Lewis
B
Shenk
T
Interaction between transcription factors Sp1 and YY1.
Nature
36
1993
462
45
Lee
JS
Galvin
KM
Shi
Y
Evidence for physical interaction between the zinc-finger transcription factors YY1 and Sp1.
Proc Natl Acad Sci USA
90
1993
6145
46
Skalnik
DG
Strauss
EC
Orkin
SH
CCAAT displacement protein as a repressor of the myelomonocytic-specific gp91-phox gene promoter.
J Biol Chem
266
1991
16736
47
Li
Q
Stamatoyannopoulos
J
Position independence and proper developmental control of γ-globin gene expression require both a 5′ locus control region and a downstream sequence element.
Mol Cell Biol
14
1994
6087
48
Simon
MC
Pevny
L
Wiles
MV
Keller
G
Costantini
F
Orkin
SH
Rescue of erythroid development in gene targeted GATA-1 mouse embryonic stem cells.
Nature Genet
1
1992
92
49
Pevny
L
Simon
MC
Robertson
E
Klein
WH
Tsai
S-F
D'Agati
V
Orkin
SH
Costantini
F
Erythroid differentiation in chimaeric mice blocked by a targeted mutation in the gene for transcription factor GATA-1.
Nature
349
1991
257
50
Ellis
J
Kian
CT-U
Harper
A
Michalovich
D
Yannoutsos
N
Philipsen
S
Grosveld
F
A dominant chromatin-opening activity in 5′ hypersensitive site 3 of the human β-globin locus control region.
EMBO J
15
1996
562
51
Dhar
V
Nandi
A
Schildkraut
CL
Skoultchi
AI
Erythroid-specific nuclease-hypersensitive sites flanking the human β-globin domain.
Mol Cell Biol
10
1990
4324
52
Bungert
J
Utpal
D
Lim
K
Lieuw
KH
Shavit
JA
Liu
Q
Engel
JD
Synergistic regulation of human β-globin gene switching by locus control region elements HS3 and HS4.
Genes Dev
9
1995
3083
53
Hug
BA
Moon
AM
Ley
TJ
Structure and function of the murine β-globin locus control region 5′ HS3.
Nucleic Acids Res
20
1992
5771
54
Wandersee
NJ
Ferris
RS
Ginder
GD
Intronic and flanking sequence are required to silence enhancement of an embryonic β-type globin gene.
Mol Cell Biol
16
1996
236
55
Hartzog
GA
Myers
RM
Discrimination among potential activators of the β-globin CACCC element by correlation of binding and transcriptional properties.
Mol Cell Biol
13
1993
44
56
Chiang
C-M
Roeder
RG
Cloning of an intrinsic human TFIID subunit that interacts with multiple transcriptional activators.
Science
267
1995
531
57
Shrivastava
A
Saleque
S
Kalpana
GV
Goff
SP
Calame
K
Inhibition of transcriptional regulator Yin Yang-1 by association with c-myc.
Science
262
1993
1889
58
Natesan
S
Gilman
MZ
DNA bending and orientation-dependent function of YY1 in the c-fos promoter.
Genes Dev
7
1993
2497
59
Shi
Y
Seto
E
Chang
L-S
Shenk
T
Transcriptional repression by YY1, a human GLI-Kruppel-related protein, and relief of repression by adenovirus E1A protein.
Cell
67
1991
377
60
Lu
SY
Rodriguez
M
Liao
WS
YY1 represses rat serum amyloid A1 gene transcription and is antagonized by NF-κB during acute-phase response.
Mol Cell Biol
14
1994
6253
61
Guo
B
Odgren
PR
van Wijnen
AJ
Last
TJ
Nickerson
J
Penman
S
Lian
JB
Stein
JL
Stein
GS
The nuclear matrix protein NMP-1 is the transcription factor YY1.
Proc Natl Acad Sci USA
92
1995
10526
62
Jack
J
Dorsett
D
Delotto
Y
Liu
S
Expression of the Cut locus in the Drosophila wing margin is required for cell type specification and is regulated by a distant enhancer.
Development
113
1991
735
63
Blochlinger
K
Bodmer
R
Jan
LY
Jan
YN
Patterns of expression of Cut, a protein required for external sensory organ development in wild-type and Cut mutant Drosophila embryos.
Genes Dev
4
1990
1322
64
Hardison R, Elnitski L, Elsherbini A, Goldstrohm A, Jackson J, Peck J, Riemer C, Schwartz S, Stojanovic N, Miller W: The Globin Gene Server: An aid to studying the regulation of mammalian globin gene clusters, in Stamatoyannopoulos G, Nienhuis A (eds): Proceedings of the 7th Conference on Hemoglobin Switching. Baltimore, MD, Johns Hopkins University Press, 1991
65
Andrews
NC
Erdjument-Bromage
H
Davidson
MB
Tempst
P
Orkin
SH
Erythroid transcription factor NF-E2 is a haematopoietic-specific basic-leucine zipper protein.
Nature
362
1993
722
66
Miller
IJ
Bieker
JJ
A novel, erythroid cell-specific murine transcription factor that binds to the CACCC element and is related to the Kruppel family of nuclear proteins.
Mol Cell Biol
13
1993
2776
67
Kadanaga
JT
Jones
KA
Tjian
R
Promoter-specific activation of RNA polymerase II transcription by Sp1.
Trends Biochem Sci
11
1986
20
68
Xiao
J
Davidson
I
Macchi
M
Rosales
R
Vigneron
M
Staub
A
Chambon
P
In vitro binding of several cell-specific and ubiquitous nuclear proteins to the GT-1 motif of the SV40 enhancer.
Genes Dev
1
1987
764
69
Carthew
RW
Chodish
LA
Sharp
PA
An RNA polymerase II transcription factor binds to an upstream element in the adenovirus major late promoter.
Cell
43
1985
439
70
Flanagan
JR
Becker
KD
Ennist
DL
Gleason
SL
Driggers
PH
Levi
B-Z
Appella
E
Ozato
K
Cloning of a negative transcription factor that binds to the upstream conserved region of moloney murine leukemia virus.
Mol Cell Biol
12
1992
38
Sign in via your Institution