MMBR Figure table search 04
Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow Copyright Information
Right arrow Books from ASM Press
Right arrow MicrobeWorld
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hampsey, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hampsey, M.

Microbiol Mol Biol Rev, June 1998, p. 465-503, Vol. 62, No. 2
1092-2172/98/$04.00+0
Copyright © 1998, American Society for Microbiology. All rights reserved.

Molecular Genetics of the RNA Polymerase II General Transcriptional Machinery

Michael Hampsey*

Department of Biochemistry, Division of Nucleic Acids Enzymology, Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, Piscataway, New Jersey 08854-5635 

SUMMARY
INTRODUCTION
PROMOTER STRUCTURE
    TATA Elements
    Initiator Elements
    UAS and URS Elements
    Poly(dA-dT) Elements
RNA POLYMERASE II
    Overview
    Carboxy-Terminal Repeat Domain
    RNA Polymerase II Holoenzymes
GENERAL TRANSCRIPTION FACTORS
    TATA Binding Protein
        Overview.
        TBP-TATA interactions.
        TBP-TFIIA interactions.
        TBP-TFIIB interactions.
        Other TBP derivatives.
        Suppressors of TBP derivatives.
    TFIIB
        Overview.
        Start site selection.
        Suppressors of TFIIB derivatives.
    TFIIF
    TFIIE
    TFIIH
TRANSCRIPTIONAL COACTIVATORS
    TBP-Associated Factors
    SRB/Mediator
    TFIIA
    Histone Acetyltransferases
    Chromatin-Remodeling Complexes
    Other Coactivators
GENERAL TRANSCRIPTIONAL REPRESSORS
    Mot1
    Ccr4-NOT Complex
    BUR Proteins
    Dr1-DRAP1/NC2
    Spt4-Spt6
    Histone Deacetylases
    Ssn6-Tup1
CONCLUSIONS AND PERSPECTIVES
ACKNOWLEDGMENTS
REFERENCES

SUMMARY
Top
Next
References

Transcription initiation by RNA polymerase II (RNA pol II) requires interaction between cis-acting promoter elements and trans-acting factors. The eukaryotic promoter consists of core elements, which include the TATA box and other DNA sequences that define transcription start sites, and regulatory elements, which either enhance or repress transcription in a gene-specific manner. The core promoter is the site for assembly of the transcription preinitiation complex, which includes RNA pol II and the general transcription fctors TBP, TFIIB, TFIIE, TFIIF, and TFIIH. Regulatory elements bind gene-specific factors, which affect the rate of transcription by interacting, either directly or indirectly, with components of the general transcriptional machinery. A third class of transcription factors, termed coactivators, is not required for basal transcription in vitro but often mediates activation by a broad spectrum of activators. Accordingly, coactivators are neither gene-specific nor general transcription factors, although gene-specific coactivators have been described in metazoan systems. Transcriptional repressors include both gene-specific and general factors. Similar to coactivators, general transcriptional repressors affect the expression of a broad spectrum of genes yet do not repress all genes. General repressors either act through the core transcriptional machinery or are histone related and presumably affect chromatin function. This review focuses on the global effectors of RNA polymerase II transcription in yeast, including the general transcription factors, the coactivators, and the general repressors. Emphasis is placed on the role that yeast genetics has played in identifying these factors and their associated functions.

INTRODUCTION
Top
Previous
Next
References

This review presents an overview of the RNA polymerase II (RNA pol II) core transcriptional machinery. I discuss promoter elements and then review recent advances pertaining to three classes for transcription factors: general transcription factors (GTFs), transcriptional coactivators, and general transcriptional repressors. I focus on the transcriptional machinery from the yeast Saccharomyces cerevisiae, emphasizing the combined roles of yeast genetics and biochemistry in defining factors and their associated functions. However, this subject cannot be considered separately from the RNA pol II core transcriptional machinery from higher eukaryotic organisms, where results obtained with human, rat and Drosophila systems have often led the way. Although I emphasize the yeast system, I have attempted to integrate information from both yeast and metazoan systems whenever appropriate.

A breakthrough in understanding the mechanism of transcription initiation followed the discovery in the laboratory of Roeder that purified RNA pol II would selectively and accurately initiate transcription from template DNA when supplemented with a crude cell extract (529). This activity provided an assay for the fractionation and subsequent identification of the GTFs, defined as factors required for accurate, basal-level transcription initiation in vitro (311). Similar work defined analogous factors in rats, Drosophila, and yeast, suggesting that the GTFs are indeed "general" factors, required for expression of most, perhaps all, class II genes. Thus, the process of transcription initiation by RNA pol II is highly conserved among eukaryotic organisms, allowing for the experimental advantages offered by different organisms to be exploited to identify and define these factors.

S. cerevisiae has proven to be extraordinarily valuable in these studies. In 1987, Lue and Kornberg established an in vitro transcription system derived from yeast nuclei that would accurately initiate transcription from exogenous template DNA (295). A second in vitro transcription system was derived from yeast whole-cell extracts (541, 542). These systems have been instrumental not only for identifying the GTFs and their functions, but also for defining other transcription factors that influence the rate of transcription initiation.

A second advantage of yeast is the potential to exploit the power of classical and molecular genetic methods to investigate fundamental biological problems. An array of genetic selections has been developed to identify factors affecting RNA pol II transcription. In many cases, these studies have identified novel transcription factors or unexpected activities associated with these factors that had gone undetected by biochemical means. As a notable example, genetic selections for mutants unable to ferment sucrose (snf), for mutants that suppress promoter defects caused by insertion mutations (spt), and for mutants defective in mating-type switching (swi) converged, leading to the discovery of the SWI/SNF chromatin-remodeling complex that facilitates transcriptional activation (reviewed in references 51 and 537). Another important example is the genetic selection for suppressors of the conditional growth defect associated with truncation of the RNA pol II carboxy-terminal repeat domain (srb), which led to the discovery of the RNA pol II holoenzyme (reviewed in reference 261a).

The information available from the extensive collection of well-characterized yeast mutants and the complete sequence of the yeast genome are additional advantages to the yeast system. Accordingly, sequence information for proteins identified biochemically, either from yeast or from other organisms, can be compared to the yeast database. In many cases, a biochemically identified protein corresponds to the product of a gene identified in a genetic selection or screen. This combination of biochemistry and genetics often provides novel insight into protein function. Two remarkable examples are a histone acetyltransferase from Tetrahymena and a histone deacetylase from humans. Sequence analysis of these two proteins revealed similarity to the products of the genetically defined yeast GCN5 and RPD3 genes (38, 481). Although the biochemical function of neither gene had been defined, GCN5 was identified in a genetic selection for transcriptional coactivators whereas RPD3 was identified in a selection for transcriptional repressors. This combination of biochemistry and genetics led to the identification of Gcn5 and Rpd3 as histone acetyltransferase and histone deacetylase, respectively. Moreover, it provided a direct link between histone acetylation/deacetylation and transcriptional activation/repression (reviewed in reference 181).

PROMOTER STRUCTURE
Top
Previous
Next
References

Eukaryotic promoters can be divided into core elements and regulatory elements (reviewed in reference 456). Core promoter elements define the site for assembly of the transcription preinitiation complex (PIC) and include a TATA sequence, located upstream of the transcription start site, and an initiator sequence (Inr), encompassing the start site. Promoters can include a TATA box, an Inr sequence, or both of these control elements. A third core element, the downstream promoter element (DPE), was initially described in Drosophila and is located about 30 bp downstream of the start site (48). The DPE appears to function, in conjunction with the Inr element, as a TFIID binding site at TATA-less promoters.

Regulatory elements are gene-specific sequences that are located upstream of the core promoter and control the rate of transcription initiation; they include both upstream activation sequences (UAS) and upstream repression sequences (URS), which serve as binding sites for enhancers and repressors of transcription, respectively. In addition, poly(dA-dT) sequences are bidirectional upstream promoter elements that facilitate constitutive gene expression, not as UAS-like elements but apparently by forming a structure that is less stable to repressing nucleosomes. These elements are reviewed below.

TATA Elements

TATA elements in S. cerevisiae are typically located 40 to 120 bp upstream of the transcription initiation site. This is in contrast to other eukaryotes, including Schizosaccharomyces pombe, where the TATA element is almost always located at a fixed distance of 25 to 30 bp from the start site (reviewed in reference 456). The TATA sequence is the binding site for the TATA binding protein (TBP). TBP-TATA association nucleates the assembly of an approximately 4-MDa transcription preinitiation complex, a step that can be rate limiting for transcription initiation in vivo (257).

Mutational analysis and random selection for functional TATA elements defined TATAAA as the consensus TATA sequence in yeast (74, 441, 539). Many derivatives of this sequence also confer TATA function, albeit with diminished activity. One derivative, TGTAAA, eliminated TATA function and was used to select for TBP derivatives with altered binding specificity (454). TBPm3, described below, allowed transcription from TGTAAA promoters but not from certain other single-nucleotide derivatives of TATAAA (454). This mutant demonstrated the importance of specific interactions between the TATA element and TBP for efficient initiation. Functional analysis of mutated TATA elements revealed that yeast and human TBP have nearly identical TATA sequence requirements, underscoring the evolutionary conservation of the TBP-TATA interaction (539).

Some yeast promoters contain multiple TATA elements. For example, transcriptional analysis of site-directed mutations defined two functional TATA-like sequences within the CYC1 promoter (281). These two elements differed in sequence; one is denoted beta -type, and the other is denoted alpha -type. Interestingly, when both elements were present, both were used equally to direct initiation within distinct but overlapping windows. However, if the same type (either beta  or alpha ) was present at both sites, only the upstream element was used, directing initiation within the upstream window. These results were interpreted to mean that beta - and alpha -type TATA elements are recognized by different factors of the transcriptional apparatus. However, TBP binds both consensus and nonconsensus TATA elements (178), suggesting that regulatory factors other than TBP might confer differential recognition of closely related TATA elements (281).

Yeast promoters lacking canonical TATA elements (TATA-less promoters) have also been identified. For example, the HIS3 promoter contains two TATA elements, one of which (TR) conforms to the canonical TATAAA sequence and is responsible for initiation at position +13 in response to activation by Gcn4. The other element (TC) does not resemble a consensus TBP binding site, directs initiation from position +1, and supports initiation in the absence of activators. Nonetheless, TC-directed transcription is TBP dependent in vivo (99). Interestingly, the relative utilization of TC and TR depends upon the overall level of transcription (223). TC is preferentially utilized at low levels of transcription, TC and TR are utilized equally well at moderate levels of transcription, and TR is preferentially utilized at high levels of transcription. These results suggest that transcription initiation from weak TATA elements is not mechanistically distinct from that mediated by canonical TATA elements but is determined instead by the overall level of transcription (223).

Transcription from TATA-less promoters remains TBP dependent. Accordingly, the term "TATA-less promoter" denotes relatively weak TBP-DNA affinity rather than a fundamentally distinct promoter element. Nonetheless, the rate-limiting step in PIC assembly at TATA-less promoters is unlikely to be TBP recruitment. Presumably, another component(s) of the core machinery recognizes a promoter structure other than TATA to nucleate PIC assembly. Indeed, TAFII60 and TAFII40 from human and Drosophila cells play a direct role in basal transcription by specifically binding the DPE of TATA-less promoters (47, 48).

Initiator Elements

Inr elements are DNA sequences encompassing transcription start sites. The fixed distance between TATA and the Inr element in eukaryotic organisms other than S. cerevisiae suggests that the Inr is determined simply by spacing from TATA. In contrast, the variable distance between TATA and the Inr in S. cerevisiae implies the existence of specific sequences that permit transcription initiation. Experiments to determine the relationship between TATA and the Inr established that the TATA element defines the window within which initiation can occur but that specific sequences within the window define the Inr element (179, 191, 281, 331, 403). Mutational analyses and surveys of start sites have defined preferred Inr sequences (146), yet there is no clearly defined Inr consensus sequence.

Yeast mutants that affect start site selection have been identified (11, 27, 147, 193, 216, 367). Specific mutations in TFIIB and the largest subunit of RNA pol II shift initiation downstream of normal (27, 367). However, in no case is the downstream site a "new" initiation site. Rather, these sites are normal, albeit minor initiation sites that generally conform to preferred Inr sequences. Thus, defects in TFIIB and RNA pol II do not alter the specificity of Inr element recognition but instead shift the window within which initiation can occur further downstream. Although the mechanism of Inr recognition is unclear, RNA pol II and TFIIB are key players in this process (284, 366).

The Inr element, as defined in higher eukaryotes, is not simply the DNA sequence encompassing the transcription start site. Rather, an Inr element was initially defined at the TATA-less promoter of the terminal deoxynucleotidyltransferase gene as a core promoter element, distinct from TATA, that can nucleate PIC assembly (442). Inr elements were subsequently identified at many promoters, both TATA-containing and TATA-less, and have been implicated in transcriptional control by directing accurate initiation in a TATA-independent manner (reviewed in reference 530). Proteins that bind Inr elements include CIF (241), YY1 (497), E2F (318), TFII-I, and USF (398, 399), as well as RNA pol II itself (12, 61). The CIF complex includes a homolog of Drosophila TAFII150 (241), which binds promoter DNA overlapping the Inr region (509) and has been implicated in differential recognition of two tandem Adh promoter elements (187). Furthermore, a recombinant TBP-TAFII150-TAFII250 subcomplex is minimally required for efficient utilization of Inr and downstream promoter elements in a reconstituted transcription system (507). Thus, it appears that TFIID can be recruited to a promoter by either of two distinct pathways, one involving TBP-TATA interaction and the other involving TAF-Inr interaction.

It is not clear whether Inr elements that function as distinct core promoter elements to nucleate PIC assembly exist in yeast. There are some intriguing prospects, though. For example, the GAL80 promoter has been reported to include both an Inr element and a TATA element, with these two elements directing initiation at distinct sites. The GAL80 Inr element is functionally portable, and a GAL80 Inr-binding protein has been detected (412). These results suggest that GAL80 transcription is driven by two independent pathways, one Inr dependent and the other TATA-dependent (412). This scenario is reminiscent of transcription at the HIS3 promoter (456). Thus, Inr elements that facilitate transcriptional control might be a universal feature of eukaryotic of RNA pol II transcription.

UAS and URS Elements

UAS elements are DNA sequences that function as binding sites for specific transcriptional activators. As such, UAS elements are analogous to metazoan enhancers, functioning in either orientation and at variable distances from the core promoter. An important functional distinction between enhancers and UAS elements is that UAS elements do not function when positioned downstream of the TATA box (168, 455). However, this has been reported for only a few genes and needs to be tested more thoroughly. Once associated with their cognate UAS elements, transcriptional activators facilitate assembly of the PIC, either by direct contact with GTFs or indirectly through coactivators, which in some cases mediate activator-GTF interactions. Consistent with its role in activation, deletion of a UAS element diminishes mRNA synthesis under activating conditions.

URS elements are binding sites for gene-specific transcriptional repressors. URS-repressor complexes can impair transcription by several different mechanisms, including interference with activator-UAS binding; interference with the activation domain of an activator-UAS complex; or by contact with the core transcriptional machinery, a process analogous to activation, albeit with opposite effects (reviewed in reference 233). URS-repressor complexes can also mediate repression indirectly by recruiting another complex that targets either the core transcriptional machinery or histones. For example, several URS-repressor complexes recruit the Ssn6-Tup1 complex, which appears to mediate repression by affecting histone function (214). Transcriptional repression associated with histone deacetylation is another example of this type of repression. In the best-characterized example in yeast, the GC-rich URS1 element binds the Ume6 repressor, which in turn recruits the Sin3-Rpd3 histone deacetylase complex (234). This process is analogous to transcriptional repression mediated by histone deacetylases in metazoan systems (reviewed in reference 359).

Poly(dA-dT) Elements

Homopolymeric dA-dT sequences are a common feature of yeast promoters and in several cases have been shown to be required for normal levels of transcription in vivo (reviewed in reference 224). Poly(dA-dT) sequences have distinct structural characteristics that impair nucleosome assembly or stability, which led to the proposal that poly(dA-dT) sequences function as promoter elements based on their intrinsic structure, rather than as conventional UAS elements to which sequence-specific transcription factors bind (75). However, a naturally occurring poly(dA-dT) sequence activated transcription in vitro, an effect that could be squelched by addition of a related oligonucleotide (294). This result argued for involvement of a poly(dA-dT)-specific transcription factor. Indeed, a poly(dA-dT)-binding protein, datin, has been identified (538).

The mechanism of poly(dA-dT)-mediated transcriptional activation has been investigated by using a combination of functional assays and probes of chromatin structure. A poly(dA-dT) sequence located upstream of the Gcn4 binding site in the HIS3 promoter stimulated Gcn4p-activated transcription in a length-dependent manner (224). Moreover, datin repressed, rather than stimulated, gene expression, and poly(dG-dC), which also affects nucleosome structure, functioned similarly to poly(dA-dT) (224). These results imply that poly(dA-dT) stimulates transcription as a consequence of its intrinsic structure, rather than as a conventional UAS element. This conclusion is supported and extended by the demonstration that a poly(dA-dT) element located adjacent to the Candida glabrata metal-dependent transcriptional activator gene, AMT1, plays a critical role in transcriptional autoactivation by causing a localized distortion of the nucleosomal DNA, allowing Amt1 to gain access to its cognate promoter element (565). Recently, a whole-genome analysis revealed that poly(dA-dT) tracts are abundant in S. cerevisiae and occur predominantly at unit nucleosomal length both upstream and downstream of open reading frames, leading to the proposal that such tracts modulate nucleosome positioning (382).

RNA POLYMERASE II
Top
Previous
Next
References

Overview

Yeast RNA pol II is composed of 12 subunits encoded by the RPB1 to RPB12 genes (543). There is extensive structural conservation among the subunits of eukaryotic RNA pol II. Indeed, six subunits of human RNA pol II can functionally replace their homologs in yeast (317). The two largest RNA pol II subunits, Rpb1 (~200 kDa) and Rpb2 (~150 kDa), are the most highly conserved subunits. Moreover, Rpb1 and Rpb2 are homologous to the beta ' and beta  subunits, respectively, of bacterial RNA polymerase. Rpb3 is related to the alpha  subunit of bacterial RNA polymerase based on partial amino acid sequence similarity, size similarity, identical subunit stoichiometry (two per molecule), and assembly defects associated with mutations in either subunit (543). None of the RNA pol II subunits appears to be closely related to the bacterial sigma -subunit family, although structural and functional similarities between sigma  and certain GTFs have been identified (see below).

The Rpb1, Rpb2, Rpb3, and Rpb11 subunits of RNA pol II are homologous to subunits of RNA polymerases I and III. Moreover, five subunits, Rpb5, Rpb6, Rpb8, Rpb10, and Rpb12, are common to all three RNA polymerases. Only Rpb4, Rpb7, and Rpb9 are unique to RNA pol II. Thus, RNA polymerases are assembled from common as well as class-specific subunits. Ten of the yeast genes encoding RNA pol II subunits are essential for cell viability. Only the RPB4 and RPB9 genes are dispensable, although deletion of either gene confers conditional growth phenotypes.

The sequence similarity between Rpb1 and Rpb2 and the bacterial beta ' and beta  subunits occurs in highly conserved domains, designated A to H in Rpb1, and A to I in Rpb2 (555). This structural similarity extends among the two largest subunits for all eukaryotic RNA pol II investigated. Not surprisingly, the structural similarity between Rpb1/beta ' and Rpb2/beta extends to functional similarity. Both Rpb1 and beta ' are involved in DNA binding, whereas Rpb2 and beta  bind nucleotide substrates.

Many mutations in RPB1 and RPB2 have been isolated and characterized (reviewed in reference 6). Most amino acid replacements are located within the highly conserved domains. Specific mutations in RPB1 and RPB2 affect the accuracy of transcription initiation, demonstrating a role for these subunits in defining start site selection (11, 27, 193). Other mutations in RPB1 and RPB2 confer sensitivity to 6-azauracil (6-AU), a phenotype associated with transcription elongation defects, suggesting that both subunits are also involved in overcoming transcriptional arrest (7, 374). 6-AU-sensitive rpb1 mutants can be suppressed by overexpression of PPR2, the gene encoding the elongation factor SII (7). In the case of RPB2, 6-AU-sensitive alleles encode elongation-defective forms of RNA pol II (374).

The Rpb4 and Rpb7 subunits are functionally related. These two subunits can be dissociated from RNA pol II, and RNA pol II purified from a rpb4 null mutant lacks Rpb7 (122). This form of RNA pol II is indistinguishable from wild-type RNA pol II in an in vitro elongation assay but is inactive in promoter-directed transcription initiation. Furthermore, this form of RNA pol II could be complemented in vitro by an inactive RNA pol II with a defective form of Rpb1. These results demonstrate that Rpb4 and Rpb7 function in transcription initiation and suggest that they can shuttle between RNA pol II molecules (122). Interestingly, rpb4 mutants exhibit substantially impaired growth rates at elevated temperature or under conditions of nutritional deprivation, implicating Rpb4 in tolerance of RNA pol II to stress (80).

Similar mutational analyses are needed to define functions for the other RNA pol II subunits. Limited mutational analysis revealed that Rpb3 is involved in RNA pol II assembly (262). Mutations in RPB9 affect start site selection (147, 148, 216, 460), and in one case an rpb9 allele suppresses a TFIIB defect that affects start site selection (460). Thus, Rpb9, like Rpb1 and Rpb2, affects the accuracy of initiation, perhaps through interaction with TFIIB.

Carboxy-Terminal Repeat Domain

A unique feature of the largest RNA pol II subunit is the presence of tandem repeats of a heptapeptide sequence at its carboxy-terminus. This carboxy-terminal repeat domain (CTD) has the consensus sequence Tyr-Ser-Pro-Thr-Ser-Pro-Ser is highly conserved among eukaryotic organisms. Although the CTD is a ubiquitous feature of RNA pol II, the repeat length varies. For example, yeast Rpb1 includes 26 or 27 repeats, the C. elegans CTD has 34 repeats, the Drosophila CTD has 43 repeats, and the human CTD 52 repeats, suggesting that repeat length increases with increasing genome complexity.

Although the CTD is essential for cell viability, its function is not entirely clear. There are two forms of RNA pol II in vivo, designated IIO, which is extensively phosphorylated at the CTD, and IIA, which is not phosphorylated. The IIA form preferentially enters the PIC, whereas IIO is found in the elongating complex (reviewed in reference 103). Conversion of IIA to IIO occurs concomitant with or shortly after the transition from initiation to elongation and is accompanied by extensive CTD phosphorylation (292, 346). These results implicate the CTD in conversion of RNA pol II from a form involved in promoter recognition to an elongation-competent form. Nonetheless, a form of RNA pol II lacking the CTD (IIB) is able to initiate transcription from TATA-containing promoters in vitro, although not from TATA-less promoters (2, 40).

The kinase activity of TFIIH can mediate CTD phosphorylation (132, 293, 429), although other kinases, including Cdc2 (83), Ctk1 (274), the Srb10-Srb11 kinase-cyclin pair (285), and P-TEFb (310), have also been implicated in CTD phosphorylation. Drosophila P-TEFb (positive transcription elongation factor b) affects the transition from abortive to productive transcription elongation (310). The catalytic subunit of P-TEFb is homologous to PITALRE, a Cdc2-related protein kinase (564). Human P-TEFb associates with the Tat protein of human immunodeficiency virus type 1 to potentiate transcriptional elongation (307, 564). Thus, multiple kinases appear to mediate phosphorylation of the RNA pol II CTD. Whether these CTD kinases are gene specific or affect different steps in the transition from initiation to elongation remains to be determined.

A phosphatase responsible for dephosphorylation of the CTD has also been identified (66). CTD phosphatase activity is regulated by TFIIB and TFIIF (67). The RAP74 subunit of TFIIF stimulates CTD phosphatase activity, whereas TFIIB inhibits the stimulatory activity of TFIIF. Since the dephosphorylated form of RNA pol II (IIA) preferentially enters the PIC (103), these results suggest that the CTD phosphatase, TFIIF, and TFIIB interact to regulate RNA pol II recycling.

Although the CTD is essential for cell growth, all but 8 to 10 repeats can be deleted from yeast Rpb1 without loss of viability (345, 532). However, strains with a minimum number of CTD repeats exhibit a cold-sensitive growth defect, a phenotype that was exploited to isolate extragenic suppressors of CTD truncations (345). This selection identified SRB genes, which encode components of the SRB-mediator complex required for transcriptional activation (28). Thus, the CTD functions in transcriptional activation.

The CTD has also been implicated in pre-mRNA processing. A speculative model proposed that the negatively charged, hyperphosphorylated CTD of the IIO form of RNA pol II facilitates electrostatic interactions with positively charged regions of certain splicing factors (167). This model has received considerable experimental support. Splicing is inhibited in vitro (557) and in vivo (118) by CTD repeat polypeptides and in vitro by an anti-CTD antibody (557). Furthermore, proteins that might connect the spliceosome to RNA pol II via the CTD have been identified (450, 557). The CTD has also been implicated in 5' capping of mRNA and 3'-end formation. The 5'-capping enzyme specifically binds the phosphorylated IIO form of RNA pol II, suggesting a mechanism for coupling of cap addition to RNA pol II transcription (79, 313, 556). A role for the CTD in 3'-end formation was discovered by the effects of CTD truncations on 3' processing and poly(A) addition and substantiated by the association of cleavage and polyadenylation factors with the CTD (314). These results suggest that the CTD functions as a platform for the recruitment and assembly of factors involved in pre-mRNA processing (reviewed in reference 449).

RNA Polymerase II Holoenzymes

RNA pol II and the GTFs assemble in a defined order on promoter DNA in vitro (42, 301, 503). These results suggested stepwise assembly of the PIC. However, several GTFs were known to associate with RNA polymerase in the absence of DNA, hinting at the existence of an RNA pol II holoenzyme complex (96). Antibodies directed against SRB proteins provided direct evidence for a holoenzyme complex that includes a subset of the GTFs, as well as SRB proteins (261). Unlike core RNA pol II, holoenzyme responds to transcriptional activators in vitro. Holoenzyme was independently discovered based on the association of RNA pol II with mediator, the protein complex required for transcriptional activation (252). As described in the coactivator section below, mediator includes SRB proteins, MED proteins, and a subcomplex composed of Gal11, Sin4, Rgr1, and Med3. These two holoenzymes are comparable, although one complex is reported to include TFIIB, TFIIF, and TFIIH (261) whereas the other includes TFIIF as the only GTF (252). One holoenzyme preparation is also reported to include Srb8 to Srb11 (194, 285) and the SWI/SNF chromatin remodeling complex (535), whereas the other includes neither of these sets of proteins (57, 252, 330). Regardless of the distinction between these two holoenzyme preparations, a key point is that holoenzyme supports activated transcription with only TBP and other GTFs in reconstituted transcription systems. Transcriptional activation is mediator dependent but TAF independent. This is in contrast to metazoan transcription systems, where activation is TAF dependent (reviewed in reference 183).

There are at least two distinct forms of yeast RNA pol II holoenzyme (reviewed in reference 68). A second form was discovered by a purification strategy based on an RNA pol II affinity column immobilized through the CTD (435). Isolated proteins include TFIIB, TFIIF, TFIIS, and Gal11 but not SRB/mediator components (515). Novel components of this complex include Paf1 and Cdc73 (435, 515), as well as Ccr4 and Hpr1 (68). This form of the holoenzyme affects the expression of a different spectrum of genes and is therefore functionally distinct from the SRB/mediator-containing holoenzyme (436). The percentage of total RNA pol II contained in the Paf1-Cdc73-Ccr4-Hpr1 holoenzyme has not been reported, although this holoenzyme is less abundant than the SRB/mediator holoenzyme (227).

RNA pol II holoenzyme complexes have also been identified in mammalian cells (69, 302, 352). Like the yeast holoenzymes but distinct from core RNA pol II, these complexes are able to respond to transcriptional activators in vitro. In addition to the GTFs, a provocative array of proteins has been found in these enzyme preparations, including DNA repair proteins (302), splicing and polyadenylation factors (314), and even the breast cancer tumor suppressor BRCA1 (427).

GENERAL TRANSCRIPTION FACTORS
Top
Previous
Next
References

The GTFs include TBP, TFIIB, TFIIE, TFIIF, and TFIIH and were identified biochemically as factors required for accurate transcription initiation by RNA pol II from double-stranded DNA templates in vitro (reviewed in references 395 and 558). Fractionation of whole-cell extracts from yeast also identified five factors, designated a, b, d, e, and g, that are required for promoter-specific transcription by RNA pol II (420). The yeast factors are comparable in structure and function to the mammalian GTFs and are now designated by the mammalian nomenclature. These factors are reviewed below and summarized in Table 1.

                              
View this table:
[in this window]
[in a new window]
 
TABLE 1.   Yeast general transcription factors

Order-of-addition experiments demonstrated that PIC assembly is nucleated in vitro by TBP binding to the TATA element followed by binding of TFIIB, RNA pol II-TFIIF, TFIIE, and TFIIH (42; reviewed in references 351 and 395). This scenario was challenged by the discovery of RNA pol II holoenzyme complexes, first in yeast and later in mammalian systems (see the RNA polymerase II section, above). These findings suggest that at least some GTFs associate with RNA pol II prior to promoter binding and have implications for the mechanisms of transcriptional regulation. Whether PIC assembly occurs in a stepwise manner or by holoenzyme recruitment, the order-of-addition experiments provided valuable information about GTF interactions within the PIC. A schematic representation of the PIC is presented in Fig. 1; a crystallographic representation of a DNA-TBP-TFIIA-TFIIB complex is presented in Fig. 2.


View larger version (26K):
[in this window]
[in a new window]
 
FIG. 1.   Schematic depiction of the transcription PIC. PIC assembly is nucleated by TBP binding to the TATA box, inducing a sharp bend in the DNA template, followed by association of TFIIB, RNA pol II/TFIIF, TFIIE, and TFIIH. Each pattern denotes a distinct general transcription factor. Subunit composition is indicated, except for TFIIH (9 subunits) and RNA pol II (12 subunits). Although PIC assembly can occur by stepwise addition of the general transcription factors (GTFs) in vitro, the discovery of RNA pol II holoenzyme complexes that include GTFs suggests that stepwise assembly might not occur in vivo.


View larger version (44K):
[in this window]
[in a new window]
 
FIG. 2.   Tertiary structure of a TATA-TBP-TFIIA-TFIIB complex. The amino- and carboxy-terminal direct repeats of TBP are light and dark blue, respectively. The amino- and carboxy-terminal repeats of core-TFIIB are red and orange, respectively. The Toa1 and Toa2 subunits of TFIIA are green and yellow, respectively. Reprinted from reference 351 with permission of the publisher.

TATA Binding Protein

Overview. TBP is a universal transcription factor, required for initiation by all three eukaryotic RNA polymerases (reviewed in reference 198). TBP was identified as a subunit of TFIID, the large (~750-kDa) multisubunit complex composed of TBP and TBP-associated factors (TAFs). Whereas TBP functions in basal level transcription, TFIID is required for response to transcriptional activators in metazoan in vitro transcription systems (reviewed in references 50, 378, and 508). Direct evidence that TBP plays a general role in eukaryotic transcription came from Comai et al., who found TBP to be an essential subunit of the RNA pol I transcription factor SL1 (93). TBP was also identified as a subunit of the RNA pol III general factor TFIIIB (215, 239, 290, 440, 473, 533). Furthermore, mutations in yeast TBP diminished expression by all three RNA polymerases in vivo (99, 426). Thus, TBP plays a requisite role in transcription initiation by all three RNA polymerases, functioning as a common subunit of SL1, TFIID, and TFIIIB.

Yeast TBP is a monomer of 27 kDa and is functionally interchangeable with mammalian TBP in in vitro transcription systems (43, 65, 178). This result demonstrated that GTF functions are conserved among eukaryotic organisms. Based on protein sequence information, the yeast gene encoding TBP was isolated (64, 177, 210, 424). Sequence analysis revealed identity to SPT15, which was identified in a genetic selection for suppressors of a Ty insertion in the HIS4 promoter (his4-917delta ) (124). In the his4-917delta mutant, transcription initiates within the Ty delta  element to produce abnormally long, nonfunctional transcripts. Mutations at the SPT15 locus suppressed his4-917delta by shifting initiation from the delta  promoter element to the his4 promoter, resulting in a His+ phenotype. Furthermore, spt15 mutants are pleiotropic and deletion of SPT15 is lethal. These results demonstrated that TBP is an essential transcription factor that affects promoter recognition and is required for the expression of many, if not all, genes in vivo (124).

Sequence analysis of the deduced TBP amino acid sequence revealed two direct repeats encompassing the C-terminal two-thirds of the protein. Subsequent comparison with the phylogenetic series of TBP sequences demonstrated that the C-terminal direct repeats are highly conserved. Although the N-terminal domain is more divergent, it is conserved among vertebrate forms of TBP and regulates RNA pol III transcription at the U6 promoter (326). Unlike other DNA binding proteins, TBP recognizes its binding site through minor groove contacts (273, 448).

The crystal structure of Arabidopsis TBP revealed a remarkable structure containing a new DNA binding fold, resembling a molecular "saddle" that sits astride the DNA (342; reviewed in reference 340). The protein molecule includes two similar structural domains related by approximate twofold symmetry. Each alpha /beta domain (~90 amino acids) corresponds to each of the C-terminal direct repeats and is composed of two alpha -helices and five antiparallel beta -sheets connected in the order S1-H1-S2-S3-S4-S5-H2 (Fig. 3) (343). The crystal structures of yeast and Arabidopsis TBP-TATA complexes demonstrated that the TBP saddle induces kinks at both ends of the 8-bp TATA element, bending the DNA 80° toward the major groove (246, 251). The DNA binding surface is a curved, anti-parallel beta -sheet, and the convex seat of the saddle is potentially available for interaction with other factors.


View larger version (21K):
[in this window]
[in a new window]
 
FIG. 3.   Tertiary structure of TBP-2 from Arabidopsis thaliana. The three-dimensional structure is viewed perpendicular to the intramolecular twofold symmetry axis. The alpha -helices (H1, H2 and H1', H2') and beta -sheets (S1 to S5 and S1' to S5') are labeled and can be correlated with the TBP amino acid replacement data summarized in Table 2. Reprinted from reference 342 with permission of the publisher.

TBP plays a critical role in the mechanism of transcriptional activation. This is implied by direct contact between the activation domains of many gene-specific activators and TBP (reviewed in reference 339). The best evidence that TBP plays a critical role in transcriptional activation comes from studies in yeast, where acidic activators enhance the kinetics of TBP recruitment (257). Consistent with this finding, tethering of TBP to a promoter by a heterologous DNA binding domain bypasses the need for a transcriptional activator (70, 254, 544).

Recruitment of TBP to the promoter is a slow and potentially rate-limiting step in transcriptional activation (257, 259, 286, 525). Complex formation between TBP and the TATA element has been proposed to occur by a two-step mechanism involving a slow TBP-TATA association followed by a rapid conformational change (209). Alternatively, the rate-limiting step in TBP-TATA complex formation has been proposed to be dissociation of TBP-TBP dimers. Human TBP has been reported to dimerize in solution, thereby blocking TBP-DNA association (88). TFIID also dimerizes in the absence of DNA, with dimer formation mediated by TBP-TBP association (474). Consistent with these findings, TBP from Arabidopsis crystallized as a dimer with the dimer interface overlapping the DNA binding domain (339, 342). Thus, TBP dimerization and TBP-DNA association appear to be mutually exclusive. Furthermore, TBP dimers dissociate slowly, suggesting that dimer dissociation dictates the kinetics of TBP-DNA binding (87). Kinetic analysis of yeast TBP-DNA interactions provides further support for the existence of TBP-TBP dimers that undergo slow dissociation at physiological concentrations (360). These results have important implications for the mechanisms by which certain activators and coactivators affect the rate of transcription.

A prediction based on TBP recruitment as a rate-limiting step in transcriptional activation is that TBP mutants that are specifically defective in activation yet have no effect on uninduced expression should be found. Indeed, several different genetic selections and screens have identified activation-defective TBP derivatives. These can be divided into functionally distinct classes. One class is defective in TBP-TATA interaction; a second is defective in TBP-TFIIA interaction; a third is defective in TBP-TFIIB interactions; and a fourth class is distinct from the other three. Each of these classes of TBP derivatives is reviewed below; a compilation of yeast TBP derivatives and their associated defects is presented in Table 2.

                              
View this table:
[in this window]
[in a new window]
 
TABLE 2.   Yeast TBP derivatives

TBP-TATA interactions. Several mutational studies of TBP have addressed the domains and residues that bind DNA. Random mutagenesis of SPT15 identified dominant mutations that diminished DNA binding (387). These included V71E, R105C, T112K, and F116Y single-residue derivatives. Each of these replacements occurs in the N-terminal direct repeat domain of TBP. Site-directed replacements of the analogous residues in the C-terminal repeat, V161E, R196C, V203K, and F207Y, also inhibited DNA binding, suggesting that DNA binding is partitioned between the two direct repeats (387). This prediction was confirmed by the crystal structure of the TATA-TBP complex (246, 251).

Regional mutagenesis, focusing on residues 190 to 205, identified a TBP derivative with altered TATA binding specificity. The TBPm3 derivative is the result of a triple amino acid replacement, I194F, V203T, and L205V, that supports transcription from promoters containing a TGTAAA promoter (454). Both the I194F and L205V replacements are required for this effect; V203T is dispensable but augments TGTAAA recognition. Although TBPm3 binds efficiently to the TATAAA sequence, it is unable to support cell growth. This mutant identified residues that interact directly with the TATA element, a conclusion confirmed by the TATA-TBP crystal structure. The TBPm3 derivative has been especially valuable for studying promoter-specific transcription using promoters containing a mutated TGTA element (see, e.g., reference 544).

Characterization of spt mutants also identified a TBP derivative with altered DNA binding specificity. The spt15-122 allele encodes an L205F replacement that enhances TBP binding to nonconsensus TATA elements (8). This accounts for the Spt phenotype of the spt15-122 mutant by shifting TBP recognition from the Ty promoter to the downstream his4 promoter. Based on this result, the analogous replacement, L114F, in the first repeat was constructed. L114F and L205F derivatives conferred nearly identical phenotypes (8). However, in vivo assays that utilize a complete set of point mutations in the TATAAA element found that L114F and L205F play distinct roles in TATA element recognition (10). The most notable distinction is that L114F enhances TATAAG recognition, whereas L205F enhances CATAAA recognition. These results support the premise that orientation of TBP with respect to the TATA element in the TBP-TATA crystal structure reflects the structure that occurs in vivo. These results also suggest that factors might exist to control transcriptional activation by affecting TBP recognition of nonconsensus TATA elements.

The concave surface of TBP interacts with the TATA element, and the convex surface interacts other proteins, including gene-specific activators. In an effort to understand the role of TBP in mediating transcriptional activation, a genetic selection for activation-defective derivatives of TBP that are normal for uninduced expression was devised (275). Presumably this selection would identify residues on the convex surface that are critical for protein-protein interactions that mediate activation. In this study, S118L, N159L, V161A, and F148L were identified as activation-defective TBP derivatives. Surprisingly, each of these positions, except F148, directly contacts DNA, and all four derivatives are defective for TATA binding. Each derivative interacts normally with TFIIB and GAL4-VP16. These results imply that the TBP-TATA interface is a critical determinant of transcriptional activation (275).

Activation-defective TBP mutants were identified in another, independent genetic screen. Random mutations in SPT15 were generated by error-prone PCR and subsequently screened for inositol auxotrophy (Ino-) or failure to grow on galactose medium (Gal-), phenotypes often associated with defects in transcriptional activation (9). Six different amino acid replacements at five unique positions, all within the repeat domains, were identified. Three of these replacements, V71A, F116Y, and V161A, were found at sites previously identified based on diminished DNA binding (387). Three replacements at two unique positions, P109A, P109Q, and N159D, led to mutants that were also defective in DNA binding and exhibited promoter-specific defects in activation. These results are consistent with those described above (275), strongly suggesting that transcriptional activators enhance the formation or stability of the TBP-TATA complex at certain promoters in vivo (9).

TBP-TFIIA interactions. The core domain of TBP includes helices located at the ends of each direct repeat (H2, H2') (Fig. 3). These structures are components of the convex surface of TBP, comprising the seat of the saddle, suggesting that the saddle mediates protein-protein interactions (251). Indeed, mutational analysis of basic residues within the H2 helix demonstrated that residues K133/K138 and K133/K145 are required for interaction with TFIIA, as defined by gel shift analysis (46).

In a genetic screen for TBP derivatives that cause a temperature-sensitive phenotype yet maintain normal pol III transcription, a double K138T/Y139A replacement within the H2 helix was found (447). This double mutant (designated N2-1) failed to interact with TFIIA and is defective for activation by Gal4, Gcn4, and Ace1. This result is consistent with the effect of the K133/K138 replacements (46) and demonstrates that the TBP-TFIIA interaction can be essential for activation in vivo (447).

Surprisingly, the crystal structure of a yeast TATA-TBP-TFIIA ternary complex revealed no direct contacts between TFIIA and the H2 helix (155, 475). The apparent discrepancy between this result and the effects of H2 replacements on TBP-TFIIA interaction might be accounted for by the forms of TFIIA used in the crystallographic studies. The crystal structures of two TATA-TBP-TFIIA ternary complexes have been solved. In both cases, crystals were generated with TFIIA derivatives deleted for large internal regions of the largest (Toa1) subunit (155, 475).

In a study of human TBP-TFIIA interactions, amino acids replacements in TBP (A86, N91, E93, R107 [yeast numbering system]) that are required for TBP-TFIIA interaction in vitro and for transcriptional activation in vivo were identified (39). In this case, the altered residues are located within the H1 helix and the S2 and S3 beta -sheets that comprise one of the TBP stirrups (Fig. 3). These residues directly contact TFIIA (Fig. 2), thereby confirming the importance of the TFIIA-TBP contacts for transcriptional activation. These results predict that certain gene-specific transcriptional activators will stimulate TBP-TFIIA-promoter complex assembly by direct binding to TFIIA, a prediction borne out for the Zta and VP16 activators (259).

Until recently, no TBP mutants had been described that stimulated pol II transcription. In a hunt for TBP mutants that would enhance transcription from a weak, synthetic transcriptional activator, the TBP N69S derivative was identified (29). TBP N69S selectively increases transcription from genes with weak pol II promoters, including those lacking a functional TATA box. TBP N69S does not alter the affinity of TBP for DNA but appears to enhance TBP recruitment to the promoter. Since the L205K replacement disrupts TBP-TATA binding and the double L138T/Y139A replacement disrupts TBP-TFIIA interaction, it was suggested that the N69S effect is dependent upon TBP-TFIIA interaction but independent of TBP-TATA interaction (29). Although the mechanism by which N69S enhances promoter recruitment is unknown, this mutant strengthens the premise that TBP recruitment can be rate limiting for transcription initiation.

TBP-TFIIB interactions. Site-directed mutations were generated in the C-terminal core domain of yeast TBP in an attempt to define residues critical for GAL4-VP16 activation (248). Three replacements, L114K, L189K, and K211L, selectively blocked activation with no effect on basal transcription in vitro. Each of these TBP derivatives was defective in GAL4-VP16-mediated recruitment of TFIIB to the promoter complex and one, L189K, disrupted TBP-TFIIB interaction. These results were interpreted to mean that GAL4-VP16 activation involves TBP recruitment, as well as stabilization or isomerization of an activation-specific TATA-TBP-TFIIB complex (248). A role for TBP-TFIIB interaction in transcriptional activation is consistent with activator-mediated recruitment of TFIIB to preinitiation complexes in vitro (82) and with a model invoking activator-dependent conformational changes in TBP-TFIIB interaction (discussed in reference 175).

A critical role for TBP-TFIIB interaction in transcriptional activation has additional support. Based on the crystal structure of the TATA-TBP-TFIIB ternary complex (341) and on TBP amino acid replacements that block ordered assembly of the PIC (477), amino acid replacements in TBP were made at positions known to directly interact with TFIIB (478). In this case, a human TBP E284R (yeast E186) derivative disrupted the interaction with TFIIB and blocked transcriptional activation in human cells. If the activation defect were due specifically to defective TBP-TFIIB interaction, an amino acid replacement in TFIIB at the position that interacts with TBP E284 might restore activation. Indeed, a TFIIB R169E replacement compensated for the activation defect caused by E284R, restoring activation by GAL4-VP16, GAL4-CTF, and GAL4-p53. In contrast, GAL4-Sp1 was insensitive to disruption of the TBP-TFIIB interaction, implying the selective use of the TBP-TFIIB interaction by activators in higher eukaryotes in vivo (478).

Interestingly, the opposite conclusion was reached in a study of yeast TBP, in this case performed in vivo. As described above, TBP derivatives that impaired TBP-TATA or TBP-TFIIA interactions were activation defective (275, 447). In contrast, TBP replacements (E186A, E188A, and L189A) that disrupted TBP-TFIIB interactions did not block transcriptional activation, leading to the conclusion that TBP-TFIIB interaction is not generally limiting for transcription in vivo (276). This conclusion is further supported by TFIIB derivatives that are defective for TBP-TFIIB-DNA complex formation yet support viability and respond to transcriptional activators (18, 81). To resolve the discrepancy between the effects of the L189A and L189K replacements on activation, it was suggested that the activation defect associated with L189K is not due to impaired TBP-TFIIB interaction but, rather, to impaired TBP-TATA interaction, since L189 contacts both TFIIB and template DNA (276).

In another study of human TBP, the effects of an array of single amino acid replacements on basal and activated transcription, as well as TBP-DNA and TBP-TFIIB binding, were assessed (39). This study included replacements of residues E284, E286, and L289, which are equivalent to residues E186, E188, and L189 in yeast. E284R, E286R, and L287E replacements blocked TBP-TFIIB interaction (39). However, these three mutants were defective in both activated and basal transcription, although the E286R replacement caused only a twofold drop in activated transcription in transient-transfection assays (39). This dramatic effect on TBP-TFIIB interaction, coupled with the subtle effect on activated transcription, is consistent with the conclusion that TBP-TFIIB interaction is not generally limiting for activation (276). Perhaps impaired TBP-TFIIB interactions might readily be compensated for by interaction of TFIIB other components of the PIC (39) or by TFIIB-DNA interactions (270).

The discrepancies among these studies on the role of TBP-TFIIB interaction in activated transcription is likely to be accounted for by the different experimental methods used. Some were performed in vitro, whereas other were done in vivo with either yeast or human cells. The identification and characterization of activation-defective TFIIB mutants might resolve these discrepancies and ultimately provide valuable insights into the role of TBP-TFIIB interactions in gene activation.

Other TBP derivatives. In the same genetic selection that uncovered the activation-defective K138T/Y139A double replacement that affects TBP-TFIIA interaction (447), four activation-defective single replacements, F148H, T153I, E236P, and F237D, were found (446). These replacements lie on the convex surface of TBP. None affects TATA binding, and the F148H, T153I, and E236P replacements do not affect interaction with TFIIA, TFIIB, or glutathione S-transferase-VP16. These activation defects were at least partially rescued by artificial recruitment of the F148H, T153I, and F273D derivatives to the promoter. These results were interpreted as evidence for a second step in transcriptional activation in vivo, one involving recruitment of TBP to the promoter and the other involving activator interaction with another component of the PIC after TBP recruitment (446). This conclusion is consistent with a mutational analysis of human TBP, where activation-defective mutations were identified on the convex surface of TBP that did not affect interaction with either TFIIA or TFIIB (39).

Suppressors of TBP derivatives. The extensive collection of well-defined TBP derivatives is a lucrative source of primary mutations for isolation of suppressors. In one study, dosage-dependent suppression of the temperature sensitivity associated with the K133L/K138L double mutant identified a TFIIB homolog that is a component of the RNA pol III transcriptional machinery (45). In another case, the same factor was isolated as a suppressor of the P65S replacement in TBP (86). Consistent with its role in RNA polymerase III transcription, the same factor was isolated as a dominant suppressor of a tRNA "A block" promoter mutation (291). The gene encoding this factor is designated BRF1/TDS4/PCF4 and encodes the 67-kDa subunit of TFIIB. The N-terminal half of Brf1/Tds4/Pcf4 is significantly similar to TFIIB, an observation that underscores the similarity of RNA pol II and pol III transcriptional systems and suggests that Brf1/Tds4/Pcf4 is a key factor distinguishing the polymerase specificity of a gene (45, 86).

A novel TBP replacement, P191S, was generated as an intragenic suppressor of the Spt phenotype associated with the L205F replacement encoded by spt15-122 (10). P191 is located in the S2'-S3' stirrup and, like L205, contacts the DNA backbone at the A residue of the first base pair of the TATA element. This result is consistent with the relaxed specificity of TATA element recognition associated with L205F.

The spt15-21 allele encodes a G174E replacement within the H1' helix. G174E does not affect TBP stability, DNA binding, basal transcription, or transcriptional activation (123). In an effort to further define the G174E defect, intragenic and extragenic suppressors were isolated and defined. All three intragenic suppressors encode a single residue replacement, F237V, which compensates for the Spt phenotype of G174E but does not confer an Spt phenotype on its own (123). F237 immediately follows the H2' helix, which is in proximity to the H1' helix. All extragenic suppressors of G174E were the result of recessive mutations in the same gene, SPT3. Mutations in SPT3, like mutations in SPT7, SPT8, and SPT15 but unlike all other spt mutations, suppress the his4-917delta allele. The spt3 suppressors of spt15 are allele specific, consistent with physical interaction between TBP and Spt3 (123). These results suggest that Spt3 forms a complex with TBP distinct from TFIID.

A novel gene, designated RTF1, was recently uncovered as a genomic suppressor of the TBP L205F derivative (452). Rtf1 is a nuclear protein of unknown function with no apparent similarity to other proteins. Characterization of rtf1 suppressor and null mutations suggests that Rtf1 regulates the DNA binding properties of TBP and might play a role in recognition of nonconsensus TATA elements in vivo.

TFIIB

Overview. Yeast TFIIB is a monomer of 38 kDa encoded by the SUA7 gene. SUA7 was initially identified in a genetic selection for suppressors of a translational defect at the CYC1 locus (366). Recessive sua7 mutations shifted transcription start site selection downstream of normal such that the translational impediment was eliminated from the cyc1 transcript. This result demonstrated that TFIIB functions in transcription start site selection in vivo.

TFIIB enters the PIC after TBP and as a prerequisite for recruitment of RNA pol II (42). TFIIB interacts directly with TBP and RNA pol II, as well as with other GTFs, including the RAP30 (174) and RAP74 (128) subunits of TFIIF and the TAFII40 subunit of TFIID (161). TFIIB-RAP74 binding precludes TFIIB-RAP30 binding, implying a dynamic interaction between TFIIB and TFIIF during PIC assembly (128). TFIIB has also been implicated as the direct target of many gene-specific transcriptional activators, leading to the proposal that certain activators stimulate transcription by TFIIB recruitment (288, 392).

Sequence analysis of the human and yeast genes encoding TFIIB revealed several structural motifs, including a zinc binding motif near the N terminus and two imperfect repeats encompassing most of the C-terminal two-thirds of the molecule (173, 303, 366). The C-terminal region folds into a protease-resistant core (cTFIIB) that binds TBP, whereas the N-terminal region interacts with RNA pol II-TFIIF (18, 19, 44, 174, 202, 304, 547). The N- and C-terminal domains engage in an intramolecular interaction that undergoes an activator-induced conformational change, allowing assembly of the PIC (391).

A solution structure for human cTFIIB (17) and a crystal structure for a TATA-TBP-cTFIIB ternary complex have been determined (341; reviewed in reference 340). cTFIIB consists of two similar domains, each consisting of five alpha -helices, corresponding approximately to the two imperfect repeats. The alpha -helices defined by the nuclear magnetic resonance structure are termed A1 to E1 and A2 to E2 for the first and second repeats, respectively (17). Based on primary-structure analysis, TFIIB was classified as a member of a protein superfamily that includes the cyclin A and retinoblastoma proteins (159). Consistent with that proposal, the tertiary structure of cyclin A is arranged similarly to cTFIIB, including two imperfect repeats, each arranged as five alpha -helices (228).

Footprinting (277) and cross-linking (102, 269) experiments demonstrated that TFIIB binds beneath and to one face of the TATA-TBP complex, a result confirmed by the crystal structure of the TATA-TBP-TFIIB ternary complex (Fig. 2) (341). A basic region within the D1 and E1 helices of the first repeat of cTFIIB interacts with the acidic C-terminal stirrup of TBP. Residues within the C2 and E2 helices of the second repeat contact DNA, upstream of the TATA box. Interestingly, helices D2 and E2 form a helix-turn-helix motif, which has been proposed to constitute a sequence-specific DNA binding domain (270). Accordingly, TFIIB, like TBP, is a sequence-specific GTF and the template sequence bound by TFIIB defines a novel promoter element (270). Photo-cross-linking studies demonstrated that other GTFs also directly bind promoter DNA (102, 249, 269, 390), raising the possibility that additional GTFs recognize promoter elements in a sequence-specific manner.

The N-terminal region was not included in either the crystal or solution structures of human TFIIB (17, 341). However, a solution structure for the metal binding domain of Pyrococcus TFIIB demonstrated that the N-terminal region forms a zinc ribbon (563). The zinc ribbon and core domain flank the most phylogenetically conserved region of the protein. This region plays an important but undefined role in transcription start site selection in vivo (18, 355, 367).

Start site selection. The effects of sua7 on start site selection are not limited to the CYC1 gene but also affect other genes, including ADH1 (27, 366). Interestingly, the start site shift is always downstream of normal. Furthermore, the downstream start sites are never "new" sites but represent enhanced initiation at minor sites. Therefore, TFIIB defects shift the window within which certain start sites are recognized rather than altering the specificity of start site recognition. Also, the downstream shift is not a consequence of alternative TATA usage. For example, none of the sua7 suppressors compensate for Ty element insertions at the HIS4 or LYS2 loci, which is the basis for isolation of the spt class of suppressors (536).

Neither the mechanism of start site selection nor the basis of the sua7-induced downstream shift is understood. Nonetheless, RNA pol II is known to be an integral component of this process. In addition to sua7, selection for suppressors of the cyc1 translational defect yielded sua8 mutants. Moreover, sua7 and sua8 suppressors confer identical downstream start site shifts (27). The sua8 suppressors are allelic to RPB1, the gene encoding the largest subunit of RNA pol II. Furthermore, double sua7 sua8 mutants are inviable (synthetic lethality) and sua7/SUA7+ sua8/SUA8+ heterozygous diploids display sua phenotypes (nonallelic noncomplementation). These results imply that accurate start site selection involves interaction between TFIIB and the Rpb1 subunit of RNA pol II. This interpretation is consistent with results of biochemical studies. Pairwise replacement of RNA pol II and TFIIB from S. cerevisiae by their counterparts from S. pombe was both necessary and sufficient to shift start sites from the pattern characteristic of S. cerevisiae to that of S. pombe (284).

The sequences of the sua7 suppressor alleles were determined in an effort to define the role of TFIIB in start site selection. Each of four independent sua7 alleles encodes a single amino acid replacement: E62K, R78C, or R78S (355, 367). E62 and R78 lie within the phylogenetically conserved region of TFIIB, immediately downstream of the zinc ribbon. The opposite charge of E62 and R78 and the identical effects of replacements at these two positions on both start site selection and cold sensitivity imply that E62 and R78 interact, perhaps forming an ion pair. Consistent with this hypothesis, the inviability of an R78E replacement is suppressed by an E62R replacement, confirming a functional, if not direct, interaction between E62 and R78 (367). Although this interaction is clearly important for start site selection, it cannot be essential, since mutants expressing either E62K or R78C not only are viable but also exhibit only modestly impaired growth at 30°C. Also, residues in addition to E62 and R78 can affect start site selection, as demonstrated by the downstream shift associated with an R64E replacement (18).

What is the role of TFIIB in start site selection? The N-terminal region of TFIIB is critical for interaction with RNA pol II-TFIIF (19, 44), and the TFIIB R78C derivative binds RNA pol II with ~100-fold-diminished affinity (53). The downstream start site shift therefore correlates with diminished TFIIB-RNA pol II interaction. Based on two-dimensional electron crystallographic data, the distance between TFIIB and the RNA pol II catalytic site is ~110 Å, corresponding to 32 bp of B-form DNA (280). This is the approximate distance between the TATA box and start site for most RNA pol II promoters. Template DNA has been proposed to follow a linear path from the TATA box to the RNA pol II active site and that longer distances between TATA and start sites would result from RNA pol II scanning further downstream (280). Consistent with this proposal, RNA pol II forms an open promoter complex at a fixed distance of approximately 20 bp from the TATA box, regardless of the distance between TATA and start sites (158). To account for the variable distances between TATA and start sites in yeast, RNA pol II was proposed to reach downstream sites by template scanning (158). It was recently proposed that the mechanism of start site selection involves arrest of the scanning polymerase by specific promoter sequences in a manner analogous to arrest of the elongating polymerase (16). Accordingly, altered forms of TFIIB, Rpb1, and other factors that affect start site selection might allow RNA pol II to adopt conformations either more or less compatible with sequences at the start sites. Although reasonable, this model does not address whether or how RNA pol II would clear the promoter to scan for downstream start sites. It is also not clear why scanning might occur in S. cerevisiae yet transcription seems to occur at a fixed distance from TATA in other eukaryotes.

Human TFIIB does not functionally replace yeast TFIIB in vivo (433). Since TFIIB is a determinant of accurate start site selection (366), the differential spacing between TATA and start sites in human (25 to 30 bp) and yeast (40 to 120 bp) could account for the incompatibility of human TFIIB in yeast. By constructing chimeric human-yeast TFIIB hybrids, a species-specific region of TFIIB was identified within a solvent-exposed region of the first repeat cTFIIB (433). Interestingly, mutations within this region impair gene-specific transcriptional activation yet have only subtle effects on start site selection. Thus, the functional distinction between yeast and human TFIIB does not appear to be a consequence of the role of TFIIB in start site selection but is more likely to be accounted for by the differential responses to transcriptional activators (432, 433).

The SUA7 gene was also uncovered in another genetic selection. Mutations in eight different genes, designated soh1 to soh8, were isolated as suppressors of the temperature-sensitive growth defect of an hpr1 hyperrecombination mutant of S. cerevisiae (127). Two of these genes encode components of the general transcriptional machinery. SOH2 and SOH4 are identical to RPB2 and SUA7, respectively (126). Furthermore, SOH1 encodes a novel protein that interacts with components of both the DNA repair and transcriptional machinery. These results imply a link between transcription and recombination (126). A transcription-recombination link is further supported by other genetic results. Mutations in GCR3, whose product plays an undefined role in glycolytic gene expression, and in SRB2, which encodes a component of the SRB/mediator complex, also suppress the temperature-sensitive growth defect of hpr1Delta mutants (369, 495). Also, mutations in SPT4 and SPT6, members of the histone class of SPT genes (see below), confer a hyperrecombination phenotype (300). These results are consistent with earlier observations that link transcription with recombination (242, 482, 512), although the molecular basis of this process is unresolved.

Suppressors of TFIIB derivatives. In an effort to identify factors that interact with TFIIB, the cold-sensitive phenotype of the sua7-1 mutant was exploited to isolate extragenic suppressors. Cold sensitivity is often associated with defects in assembly of multisubunit complexes (reviewed in reference 180); therefore, suppressors of cold sensitivity seemed likely to identify other components of the PIC or perhaps factors that facilitate PIC assembly.

Mutations in two genes, designated SSU71 and SSU73, were isolated as suppressors of sua7-1. In addition to suppression of cold sensitivity, the ssu71 suppressors caused heat lethality, but only in combination with sua7-1. This effect demonstrated a functional relationship between SSU71 and SUA7 and provided a phenotype for cloning SSU71. Sequence analysis of SSU71 identified an 82-kDa protein with sequence similarity to mammalian RAP74, the largest subunit of TFIIF. Subsequent comparison of the Ssu71-deduced amino acid sequence with that of Tfg1, the largest subunit of yeast TFIIF, revealed that Ssu71 and Tfg1 are identical. Whereas sua7-1 shifts start site selection downstream of normal, the ssu71 suppressors compensate for this effect, partially restoring the normal initiation pattern (459). Thus, the largest subunit of TFIIF genetically interacts with TFIIB, an interaction that can influence start site selection.

The ssu73-1 suppressor of sua7-1 is allelic to RPB9 (460). Like the ssu71 suppressors, ssu73-1 compensates for the downstream start site shift associated with sua7-1. The ssu73-1 allele encodes a nonsense mutation immediately following the second of two metal binding motifs within Rpb9. This motif is predicted to form a zinc ribbon that would be disrupted in the truncated Rpb9 derivative (460). Interestingly, the sua7-1-encoded E62K replacement lies near the end of the TFIIB zinc ribbon motif, suggesting that TFIIB and Rpb9 interact either with each other or with the DNA template via their zinc ribbon structures. In contrast to the ssu71 suppressors, which do not appear to affect start site selection in a SUA7 wild-type background (459), the ssu73 suppressor shifts initiation upstream of normal in both SUA7 wild-type and mutant backgrounds (460). This suggests that the Ssu71 (Tfg1) subunit of TFIIF and Rpb9 affect start site selection by different mechanisms.

A role for Rpb9 in start site selection was discovered in two independent studies. In one case, a selection for mutations that affect the spacing between the TATA element and start sites identified a gene designated shi (145), which is allelic to RPB9 (147). In the other study, transcription in the absence of Rpb9 shifted initiation upstream of normal both in vitro and in vivo (216). This effect was attributed solely to Rpb9 because the start site shift was rescued by recombinant Rpb9 and because RNA pol II isolated from an rpb9 null mutant was intact, lacking only the Rpb9 subunit (216). Thus, three independent studies demonstrate a role for Rpb9 in start site selection.

In contrast to SSU71 and SSU73, a mutation in the SSU72 gene was identified as an enhancer rather than a suppressor of the sua7-1 defect (459). Whereas a sua7-1 mutant is cold sensitive but not heat sensitive, a double sua7-1 ssu72-1 mutant is both cold and heat sensitive; moreover, the heat-sensitive phenotype is dependent upon both sua7 and ssu72. The ssu72-1 mutation dramatically enhances the downstream start site shift associated with sua7-1, an effect that can be rescued by either wild-type SUA7 or SSU72. SSU72 is an essential gene, encoding a novel protein of 206 amino acids with unknown function. Although a mammalian counterpart of Ssu72 has not been found in cell-free transcription assays, a human Ssu72 homolog (64% similar) exists in the databases.

The SUB1 gene was also identified as a suppressor of a TFIIB defect. In this case, high-copy-number expression of SUB1 suppressed the cold-sensitive growth defect of TFIIB R78H and E62G derivatives (258). Sub1 directly interacts with TFIIB in vitro and inhibits formation of the TATA-TBP-TFIIB promoter complex. SUB1 is identical to TSP1 and encodes a homolog of the mammalian PC4 transcriptional coactivator (195, 258).

TFIIF

The subunits of TFIIF were identified as RAP30 and RAP74 based on affinity of these two proteins for RNA pol II (52). RAP30 and RAP74 were required for accurate transcription initiation from several promoters, defining the RAP30-RAP74 complex as a general initiation factor that binds RNA pol II (141). Although TFIIF was not initially identified as one of the four HeLa cell chromatographic fractions (TFIIA, TFIIB, TFIID, and TFIIE) required for accurate initiation by RNA pol II (418), further purification resolved the TFIIE fraction into two factors, TFIIE and TFIIF, both of which are essential for initiation (142). Purification of the TFIIF fraction defined two subunits, identical to RAP30 and RAP74 (139). TFIIF was also identified in human cells as factor FC (253), in rat cells as factor beta gamma (95), and in Drosophila cells as factor 5 (377). In each case, these factors were required for specific initiation of transcription in vitro.

TFIIF has several characteristics reminiscent of bacterial sigma  factors. These include tight binding of TFIIF to RNA polymerase; suppression of nonspecific binding of RNA pol II to DNA; and stabilization of the PIC (96, 166). Both subunits exhibit limited sequence similarity to bacterial sigma  factors (149, 315, 444, 459, 552). It is not clear whether these structural similarities reflect functional similarities. However, human RAP30-RAP74 binds E. coli RNA polymerase and can be displaced by sigma 70 (315). Moreover, polymerase binding is attributed to region 2.1 of sigma 70, which is the region of similarity between sigma 70 and RAP30 (315). Thus, RAP30 appears to be partially analogous to bacterial sigma 70.

TFIIF probably does not play a significant role in promoter selectivity but contributes to PIC stability. Photo-cross-linking studies have defined the topology of a TBP-TFIIB-TFIIF-RNA pol II-TFIIE promoter complex. RAP74 and RAP30 bind promoter DNA between the TATA box and start site, a region where TFIIE and RNA pol II also cross-link (102, 249, 390). RAP74 also binds DNA upstream of TATA, inducing a conformational change that affects the position of RNA pol II relative to the DNA template (144).

In addition to its role in initiation, TFIIF functions in transcriptional elongation by suppressing transient pausing of the polymerase (24, 32, 142, 225, 377). Although RAP30 and RAP74 were initially thought to function exclusively in initiation and in elongation, respectively, both subunits are now known to function in both processes (476).

Yeast TFIIF (factor g) stably associates with RNA polymerase, is required for specific initiation at all promoters tested, and is composed of three subunits with apparent molecular masses of 105, 54, and 30 kDa (197). The genes encoding the two largest subunits of TFIIF were cloned based on the partial sequence of the purified proteins (196). Analysis of the deduced amino acid sequences of TFG1 and TFG2 revealed 50 and 51% similarity to human RAP74 and RAP30, respectively, thereby confirming the relationship between factor g and TFIIF. TFG1 and TFG2 are present in single copy, and both are essential for cell viability (196, 459). TFG3 encodes the 30-kDa subunit and is identical to ANC1, a gene originally identified based on genetic interaction with ACT1 (531). Tfg3 is less tightly associated with the complex than are Tfg1 and Tfg2 and has no counterpart in mammalian TFIIF, although Tfg3 is similar in sequence to the leukemogenic proteins ENL and AF-9 (55). Tfg3 is found in RNA pol II holoenzyme complexes (252, 261, 515), probably as a component of TFIIF. Interestingly, Tfg3 is identical to the TAFII30 subunit of the yeast TFIID complex (371) and to the Swp29 subunit of the SWI/SNF chromatin-remodeling complex (55). As such, Tfg3 establishes a connection between basal and regulatory components of the transcriptional machinery (55, 196). Tfg3 is the only yeast GTF that is not essential for cell viability (196), perhaps because yeast contains a Tfg3 homolog (YOR213c), which is a subunit of the RSC chromatin-r