Previous Article | Next Article ![]()
Microbiology and Molecular Biology Reviews, June 2008, p. 266-300, Vol. 72, No. 2
1092-2172/08/$08.00+0 doi:10.1128/MMBR.00001-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama 35294,1 Department of Biochemistry, University of Illinois, Urbana, Illinois 618012
SUMMARY INTRODUCTION REGULATORY MECHANISMS IN ENTERIC BACTERIA History and Overview ATTENUATION CONTROL BY COUPLED TRANSCRIPTION AND TRANSLATION First Examples of Attenuation Control: a Mold To Be Broken UTP-Sensitive Attenuation Control of pyrBI Expression in E. coli Attenuation Control of pyrBI Expression in Other Enteric Bacteria Attenuation Control of pyrE Expression in E. coli CONTROL OF TRANSLATION INITIATION VIA NUCLEOTIDE-SENSITIVE SELECTION OF TRANSCRIPTION START SITES Promoters and Transcription Start Sites CTP-Sensitive Regulation of pyrC Expression Rules for Selecting Transcription Start Sites and a Revised Model for pyrC Regulation CTP-Sensitive Regulation of pyrD Expression REGULATION BY UTP-SENSITIVE REITERATIVE TRANSCRIPTION Reiterative Transcription Second Mechanism to Regulate pyrBI Expression in E. coli Distribution of the ''TTT Motif'' Regulation of carAB Expression in E. coli COMPOUND MECHANISMS FOR NUCLEOTIDE-SENSITIVE REGULATION Salvage of Pyrimidine Bases Regulation of codBA Expression in E. coli Regulation of upp-uraA Expression in E. coli An Interesting Difference between the upp and codBA Operons REGULATORY MECHANISMS IN GRAM-POSITIVE BACTERIA History and Overview TRANSCRIPTION ATTENUATION BY PyrR, AN mRNA-BINDING PROTEIN Regulation of the Bacillus subtilis pyr Operon Regulatory Function of PyrR: Genetic Evidence Regulatory Function of PyrR: Biochemical Evidence Importance of RNA Secondary Structures in Regulation Regulation of pyr Operon Expression as Deduced from pyr::lacZ Fusions Occurrence and Significance of Transcription Pausing in the pyr Operon BIOCHEMICAL CHARACTERIZATION OF PyrR PyrR Is a UPRTase RNA Sequence and Structure Required for PyrR Binding High-Resolution Structures of PyrR and PyrR Complexes with Nucleotides Characterization of the RNA Binding Site of PyrR PHYLOGENETIC DISTRIBUTION OF pyrR GENES AND MECHANISMS OF PyrR-MEDIATED GENE REGULATION Distribution of pyrR Genes PyrR-Mediated Transcription Attenuation of pyr Operons PyrR-Mediated Transcription Attenuation of Unlinked pyr Genes PyrR as an Inhibitor of pyr Gene Translation Species in Which the Function of PyrR Is Unclear Species in Which pyrR Genes Are Not Identifiable REGULATION OF pyrG EXPRESSION BY A NOVEL MECHANISM BASED ON CTP-SENSITIVE REITERATIVE TRANSCRIPTION pyrG Is Regulated by CTP Levels pyrG Is Regulated by Transcription Attenuation Regulation of pyrG by CTP-Sensitive Reiterative Transcription Further Characterization of pyrG Regulation CONCLUSIONS AND SPECULATION ACKNOWLEDGMENTS REFERENCES
|
|
|---|
|
|
|---|
It is a remarkable fact that, with rare exceptions, the many mechanisms known to regulate the expression of bacterial pyr genes do not involve the participation of a DNA-binding repressor or activator protein. Rather, as will be seen in this review, the information that specifies pyrimidine-responsive regulation of pyr gene expression is generally encoded within the promoter-leader region of the regulated downstream genes. (The leader region is defined as the DNA extending from the start of transcription to the first gene of an operon.) During transcription of the leader regions, alternative sequences and/or secondary structures in the leader-specified RNA determine whether transcripts will be prematurely terminated or fully elongated or, alternatively, whether an elongated transcript will be efficiently translated. In all cases except those involving the pyr mRNA-binding regulatory protein PyrR, the concentration of pyrimidine nucleotides is sensed directly by RNA polymerase. While the predominance of such mechanisms may result from their ancient evolutionary origins, their wide distribution and retention must also reflect their efficiency and sensitivity. The importance of the novel regulatory mechanisms described in this review extends beyond pyr genes, however. Their implications for the mechanism of transcription in bacteria in general and for the ways that transcription can be harnessed for regulation of other genes will be discussed in the course of this review.
|
|
|---|
![]() View larger version (13K): [in a new window] |
FIG. 1. Pyrimidine nucleotide biosynthetic pathway of E. coli and Salmonella. Gene names are used to represent the encoded biosynthetic enzymes. The genes shown in the figure and the encoded proteins are as follows: carA, glutaminase subunit of carbamylphosphate synthetase; carB, catalytic subunit of carbamylphosphate synthetase; pyrB, catalytic subunit of aspartate transcarbamylase; pyrI, regulatory subunit of aspartate transcarbamylase; pyrC, dihydroorotase; pyrD, dihydroorotate dehydrogenase; pyrE, orotate phosphoribosyltransferase; pyrF, OMP decarboxylase; pyrH, UMP kinase; ndk, nucleoside diphosphokinase; and pyrG, CTP synthetase.
|
|
|
|---|
According to the model for regulation and using the trp operon as an example, transcription is initiated at the promoter and proceeds through the leader region specifying transcript segments 1 and 2, which then form the 1:2 hairpin. The transcribing RNA polymerase pauses at this point, permitting a ribosome to bind to the nascent transcript and initiate translation of a 14-codon ORF that encodes the leader peptide. Early in translation the ribosome releases the stalled RNA polymerase by disrupting hairpin 1:2, and then the ribosome proceeds to codons 10 and 11 of the leader ORF, which encode tandem tryptophan (Trp) residues. When Trp is limiting and the level of Trp-tRNATrp in the cell is low, the ribosome pauses at this site and covers transcript segment 1. During this time, the reengaged RNA polymerase continues transcription and synthesizes transcript segment 3, permitting formation of the 2:3 antiterminator hairpin. Continuing transcription extends the leader transcript through segment 4 and the poly(U) tract (without formation of the 3:4 terminator hairpin necessary for transcription termination) and eventually through the entire operon. Translation of the full-length trp mRNA produces the enzymes that increase the cell's capacity to make more Trp. On the other hand, when there is ample Trp and Trp-tRNATrp in the cell, the translating ribosome does not pause at the tandem Trp codons but proceeds to the stop codon at the end of the leader ORF. At this point, the ribosome covers transcript segments 1 and 2. As reengaged transcription continues, transcript segments 3 and 4 and the poly(U) tract are synthesized, allowing the 3:4 terminator hairpin to form and transcription termination to occur. As a consequence, the synthesis of more Trp biosynthetic enzymes is prevented when there is sufficient Trp-tRNATrp to support optimal cell growth. Regulation of the his operon occurs by an analogous mechanism in which seven adjacent histidine codons are used as the control codons in the leader region (72).
Soon after the elucidation of the attenuation control mechanisms of the trp and his operons, similar mechanisms were discovered for several other amino acid biosynthetic operons in enteric bacteria (86). Each example employed ribosome stalling at control codons as a regulatory signal and an alternative transcript secondary structure as a means of preventing terminator hairpin formation. These similarities raised the possibility that attenuation control was limited to amino acid biosynthetic operons and to a single mechanism for regulating transcription termination. However, this idea was soon dispelled by studies of the regulation of pyrBI expression in E. coli, which revealed an attenuation control mechanism that was fundamentally different from that described for the amino acid biosynthetic operons (159).
The pyrBI operon of E. coli encodes the catalytic (pyrB) and regulatory (pyrI) subunits of the allosteric enzyme aspartate transcarbamylase, which catalyzes the first committed step in the de novo synthesis of pyrimidine nucleotides (Fig. 1). Expression of the pyrBI operon is negatively regulated over a wide range by pyrimidine availability, specifically by the intracellular concentration of UTP (98, 158). The regulatory role of UTP was first established in studies employing an in vitro DNA-dependent coupled transcription-translation system in which the levels of nucleotides and other small molecules could be varied (158). The discovery that a substrate for transcription was a regulatory effector of pyrBI expression suggested that the pyrBI control mechanism did not sense UTP levels per se but instead detected the effects of these levels on the rate of transcription of a regulatory site within the operon. This regulatory site was most likely located within the leader region. For this and other reasons, several laboratories determined the DNA sequence of the pyrBI leader region (120, 141, 159).
These studies identified the sequence of a putative intrinsic transcription terminator, or more specifically an attenuator, located 23 base pairs (bp) before the pyrB structural gene. Additional in vitro studies indicated that pyrBI transcription was initiated at either of two promoters located upstream of the attenuator and that this transcription was efficiently (
98%) terminated at the attenuator (159). These results strongly indicated that pyrBI expression was regulated by a transcription attenuation control mechanism. However, the sequence of the pyrBI leader region revealed that the leader transcript could not adopt alternative secondary structures to regulate terminator hairpin formation, implying that attenuation control of pyrBI expression was mechanistically different from that described for the amino acid biosynthetic operons. The construction of a model for this new mechanism for attenuation control required the identification of two additional elements in the pyrBI leader region. The first was a 44-codon ORF that extends through the leader region and ends 6 base pairs before the pyrB gene (Fig. 2). In the leader transcript, this ORF is preceded by an apparent ribosome binding site, indicating that it can be translated. The second element was UTP-sensitive transcription pausing, i.e., pausing caused by low UTP levels, in the pyrBI leader region upstream of the attenuator. This pausing was detected in vitro (at 20 µM UTP) and initially appeared to be limited to a small cluster of sites at which UTP (or UMP after pyrophosphate release) was incorporated into the leader transcript (159). The pause site region was located approximately 20 nucleotides before the terminator hairpin.
![]() View larger version (36K): [in a new window] |
FIG. 2. Model for attenuation control of pyrBI expression in E. coli. The diagram shows the relative positions of RNA polymerase and the translating ribosome within the leader region when UTP concentrations are either low or high. See the text for additional details. (Modified from reference 86 with permission.)
|
For a decade after this model was proposed, numerous studies that confirmed its key features were conducted. The central role of transcription termination at the pyrBI attenuator was established by biochemical analysis of cellular pyrBI transcripts and characterization of deletion mutations in the pyrBI leader region (31, 91, 92, 96, 98, 140). These studies clearly showed that pyrBI transcripts, initiated at a single physiologically relevant promoter, were subject to UTP-sensitive termination at the pyrBI attenuator in vivo. These studies also indicated that attenuation control accounted for most, although not all, pyrimidine-mediated regulation of pyrBI expression. To determine the contribution of attenuation control to this regulation, pyrBI expression was measured in a mutant E. coli strain containing a 9-bp chromosomal deletion that removes the run of eight T·A base pairs at the end of the pyrBI attenuator plus an adjacent base pair to maintain the reading frame of the leader polypeptide (98). All intrinsic transcription termination is abolished at this mutant attenuator. When the mutant strain was grown under conditions of pyrimidine excess, pyrBI expression was approximately 50-fold higher than that in an isogenic pyrBI+ strain. When growth of the mutant was limited for pyrimidines, operon expression increased an additional sevenfold. Growth of the pyrBI+ strain under the same pyrimidine-limiting conditions resulted in a 300- to 350-fold increase in operon expression. These results indicate that attenuation control is responsible for pyrimidine-mediated regulation over a 50-fold range, while additional regulation occurs over a sevenfold range through another control mechanism. The latter mechanism, which involves reiterative transcription, will be described in detail below.
In the pyrBI attenuation control model, translation of the 44-codon ORF of the leader transcript plays a critical regulatory role. To confirm that the leader ORF was indeed translated in vivo, a gene fusion was constructed in which the pyrBI promoter-leader region through codon 11 was fused in frame to codon 9 of the lacZ gene. An E. coli strain carrying this gene fusion synthesized a β-galactosidase fusion protein with the amino-terminal sequence of the leader polypeptide (140). To show that regulation of the pyrBI operon requires translation of the leader ORF, the in vivo effects of mutations that either strongly inhibit translation initiation of the ORF or introduce stop codons early in the ORF, well before the attenuator, were measured (26, 139, 140). Each mutation greatly reduced operon expression, especially under conditions of pyrimidine limitation, and significantly reduced the range of pyrimidine-mediated regulation. Furthermore, mutant (i.e., rpsL) strains containing slowly translating ribosomes exhibited reduced pyrBI expression, apparently due to reduced coupling of transcription and translation in the pyrBI leader region (64). Although translation of the leader ORF is clearly important for regulation, the sequence of the encoded polypeptide is not. A mutant strain carrying a frameshift mutation that changes the sequence of the leader polypeptide, while still allowing translation of the entire leader region, exhibited essentially normal attenuation control (26).
One of the major assumptions of the model is that under conditions of pyrimidine limitation, tight coupling of transcription and translation in the pyrBI leader region allows the ribosome to physically prevent the formation of the terminator RNA hairpin. To test this assumption, stop codons were individually introduced at numerous sites within the 44-codon leader ORF to determine the distance that a ribosome must translate to suppress transcription termination at the attenuator (139). Based on the size of the ribosome footprint on its RNA template, translation would have to proceed to a codon located within approximately 15 nucleotides of the terminator hairpin sequence to permit the ribosome to interact directly with this sequence (79, 154). Examination of the strains carrying separate stop codon mutations showed that translation termination at or before codon 24, which is 16 nucleotides upstream of the terminator hairpin, limited operon expression to approximately 5% of the wild-type level under pyrimidine-limiting conditions. In contrast, translation termination at codon 25, which should be the first stop codon at which ribosome binding overlaps the sequence of the terminator hairpin, allowed expression at 64% of the wild-type level. The level of operon expression generally increased to near-wild-type levels as the stop codon was moved further downstream, perhaps reflecting greater disruption of the terminator hairpin. These results provide strong support for the proposed role of the ribosome. In addition, the observation that pyrBI expression increased as the stop codon mutations were moved downstream of codon 25 suggests that pyrBI expression is enhanced by coupling of translation of the leader ORF and the pyrB cistron. In the wild-type pyrBI operon, it is likely that such coupling occurs due to the close proximity of these ORFs (159).
The model requires only a single round of translation of the leader transcript to allow readthrough transcription, and more translation would presumably be wasteful. Such wasteful translation appears to be limited by the use of a relatively weak ribosome binding site preceding the leader ORF (86). In addition, sequences in the downstream half of the leader transcript are complementary to the leader ribosome binding site (120, 140). Formation of a secondary structure by these sequences could block multiple rounds of translation of readthrough transcripts and perhaps all translation of attenuated transcripts.
The discovery of UTP-sensitive transcription pausing in the pyrBI leader region was key in developing the attenuation control model. This pausing provided the regulatory sensor, equivalent to control codons in the amino acid biosynthetic operons, that responds to different levels of UTP in a way that influences transcription termination at the attenuator. In E. coli, the UTP concentration varies from approximately 50 µM in cells grown under conditions of severe pyrimidine limitation to 1 mM or slightly above in cells grown under conditions that provide ample pyrimidines (3, 124, 158). The first in vitro experiments to detect UTP-sensitive transcription pausing in the pyrBI leader region revealed only a small cluster of pause sites that correspond to a uridine-rich region located approximately 20 nucleotides before the terminator hairpin in the leader transcript (159). Subsequent in vitro transcription studies employing a more sensitive assay provided a different view of pausing in the leader region preceding the attenuator (32). Instead of one cluster of pause sites, there is a large number of sites throughout the leader transcript at which RNA polymerase pauses when the UTP concentration is low. Nearly all of these sites correspond to positions where UMP is added to the leader transcript. Pausing at these sites decreases with increasing UTP concentrations (from 20 to 200 µM) and is no longer detectable at a concentration of 400 µM. Although some degree of pausing apparently can occur before the addition of every UMP in the leader transcript at 20 µM UTP, the strength of individual pause sites is variable. This variability presumably reflects the effects of DNA sequence and RNA secondary structure (18, 19). In this regard, an upstream RNA hairpin enhances pausing within the originally identified cluster of UTP-sensitive transcription pause sites (86, 159). Although some pause sites within the leader region may be stronger than others, the large number of these sites indicates that the cumulative effect of pausing at multiple positions is the key factor in controlling coupling between RNA polymerase and the ribosome translating the pyrBI leader transcript. Consistent with this view, replacing all seven uridines in the originally identified pause cluster with adenines causes only a twofold reduction in the range of pyrimidine-mediated regulation of pyrBI expression (K. Mixter-Mayne and C. L. Turnbough, Jr., unpublished data).
In contrast to transcription pausing observed at low UTP concentrations, extensive pausing in the pyrBI leader region was not induced when the concentration of ATP, GTP, or CTP was low (i.e., 20 µM) (32). This difference appears to be due, at least in part, to a difference in the Km values for these nucleotides during transcription elongation. The apparent Km for UTP during elongation appears to be significantly higher than the Km values for the other nucleoside triphosphates (NTPs) (66, 85). This higher Km apparently results in nonsaturating binding of UTP to an elongating RNA polymerase at all physiological concentrations of UTP (i.e., in cells with limiting or ample pyrimidines). This situation appears to be unique because the physiological concentrations of the other NTPs are typically well above their Km values for transcription elongation (124, 135). Thus, the rate of transcription elongation is uniquely sensitive to the intracellular concentration of UTP, which makes UTP an ideal regulatory effector for a control mechanism based on coupling of transcription and translation.
Additional noteworthy support for the proposed role of UTP-sensitive transcription pausing in attenuation control came from studies of pyrBI regulation in S. enterica serovar Typhimurium, which is similar to that in E. coli (see below). A mutant strain was isolated that carries an altered RNA polymerase that exhibits an approximately sixfold-higher Km for the binding of UTP (and ATP) during transcription elongation (66). This mutant displayed constitutive expression of the pyrBI operon at high intracellular levels of UTP, indicating that transcription pausing during the addition of UMP (or another nucleotide) to the pyrBI leader transcript, and not the UTP level, is the key determinant in regulation. In related studies with E. coli, it was shown that the transcription elongation factor NusA enhances UTP-sensitive pausing within the pyrBI leader region in vitro and appears to be important in determining the level of pyrBI expression in vivo (3, 32). Presumably, NusA plays a key role in establishing a rate of transcription elongation that permits tight coupling of transcription and translation in cells limited for pyrimidines. These results indicate that the activity of NusA or of any factor that influences the rate of transcription elongation can affect the expression of the pyrBI operon or of similarly regulated operons.
The attenuation control mechanisms of the pyrBI operons of E. coli and S. enterica serovar Typhimurium were elucidated the old-fashioned way, i.e., by doing many experiments. These experiments identified readily recognizable regulatory sequences. Today, it is possible to inspect a large number of bacterial genomes for these regulatory sequences and thereby identify other operons that are likely to be regulated by attenuation control mechanisms similar to those described above. Although many search formats can be used, even limited searches reveal interesting information about the prevalence of particular control mechanisms. For example, a BLAST search of currently available bacterial genome sequences using the amino acid sequence of the pyrBI leader polypeptide as the query (with CLUSTAL W alignment) produced 14 matches. All matches correspond to polypeptides encoded by the pyrBI leader regions of five strains of E. coli (i.e., K-12 MG1655, K-12 W3110, O157 EDL933, O157 Sakai, and CFT073), five strains of Shigella (i.e., S. flexneri 301 and 2457T, S. dysenteriae, S. boydii, and S. sonnei), and four strains of Salmonella (i.e., S. enterica serovar Typhimurium LT2, S. enterica serovar Typhi CT18 and Ty2, and S. enterica serovar Paratyphi A). All 10 of the E. coli and Shigella polypeptides contain 44 amino acids; eight of the polypeptide sequences are identical, and two (from the S. flexneri strains) contain a single amino acid difference. All four of the Salmonella polypeptides contain 33 amino acids, due to the shorter leader ORF described above, and their sequences are identical. These four sequences differ at only five residues compared to the other 10 polypeptides. These results and further inspection of leader sequences indicate that the 14 strains listed above employ an essentially identical attenuation control mechanism for pyrimidine-mediated regulation of pyrBI expression. The results are also consistent with the established evolutionary relationships among strains of Escherichia, Shigella, and Salmonella (41).
In the search for matches to the E. coli pyrBI leader polypeptide, the misses are as interesting as the hits. For example, no matches were found in the genome sequences of many other enteric bacteria. This result may indicate that the mechanisms for regulating pyrBI expression in these bacteria are different from that described for E. coli. However, inspection of selected "missed" enteric pyrBI operons indicates that they may still be regulated by an E. coli-like attenuation control mechanism—one that employs comparable regulatory elements with distinct sequences. This situation appears to be the case for Yersinia pestis CO92 and Erwinia cartovora, which have all the regulatory elements found in the E. coli leader region, including 41- and 40-codon ORFs, respectively. These ORFs encode leader polypeptides with no sequence similarity to the leader polypeptide of E. coli and modest sequence similarity with each other. However, the leader ORFs of Y. pestis and E. cartovora both stop at the same position near the middle of the sequence for the terminator hairpin, which is similar to the situation described for S. enterica serovar Typhimurium. Interestingly, the sequence of the leader polypeptide of Y. pestis is very similar (i.e., 57% identical) to that of a 37-amino-acid leader polypeptide encoded by the pyrBI leader ORF of Serratia marcesens. On the other hand, the leader region of S. marcesens does not appear to contain the sequence for an E. coli-like intrinsic transcription terminator, suggesting another regulatory twist. It should also be noted that the search for matches to the E. coli pyrBI leader polypeptide missed all nonenteric gram-negative bacteria. Nonetheless, inspection of selected genomic sequences, e.g., that of Vibrio cholerae, suggests again that E. coli-like regulation of pyrBI expression may occur but with divergent (and perhaps some new) regulatory elements.
|
|
|---|
70. This sigma factor recognizes –10 and –35 regions for which the consensus sequences are 5'-TATAAT and 5'-TTGACA, respectively (118). The spacing between the –10 and –35 regions is typically 17 ± 1 bp, and transcription is usually initiated at one or more sites located 7 ± 1 bp downstream from the –10 region (55, 147). At about 75% of promoters, transcription is initiated with ATP or GTP (95). In some molecular genetics textbooks, this preference is used to imply that initiation with CTP or UTP is of little importance. However, initiation with pyrimidine NTPs is often an essential element in gene expression. This fact was first demonstrated in studies of pyrC expression. The experiments that eventually led to the correct mechanism for pyrimidine-mediated regulation of pyrC expression began with the determination of the sequence of the pyrC operon and primer extension mapping of its transcription start sites (9, 122, 170). Transcription initiation occurs at four adjacent sites in the initially transcribed region (ITR) of the promoter (170). The nontemplate strand sequence of these sites is 5'-TCCG, which is located 6 to 9 bp downstream from the –10 region (Fig. 3). These sites are designated T6, C7, C8, and G9, and the transcripts initiated at these sites are called the U6, C7, C8, and G9 transcripts, respectively (99). Inspection of the pyrC sequence also revealed a hyphenated dyad symmetry that includes the ITR of the promoter and a downstream region specifying part of the Shine-Dalgarno (SD) sequence of the pyrC ribosome binding site (Fig. 3) (100). This sequence indicates that U6 transcripts would form a hairpin with a 6-bp stem in which the upstream segment includes the first six nucleotides of the transcript and the downstream segment includes most of the pyrC SD sequence. Transcripts starting further downstream (i.e., at C7, C8, and G9) would form progressively shorter hairpins, with the shortest being a 3-bp hairpin formed by G9 transcripts. However, the calculated free energy of formation of the shortest possible hairpin suggests that it would not be stable in cells (39, 111), a supposition that was later confirmed experimentally (99).
![]() View larger version (20K): [in a new window] |
FIG. 3. Model for transcription start site switching and translational control of pyrC expression in E. coli and Salmonella. The nucleotide sequence of the pyrC promoter-regulatory region of E. coli is shown, with the –10 region, SD sequence, and pyrC initiation (Met) codon underlined and labeled. Asterisks indicate the four transcription start sites at the pyrC promoter, and the two major start sites, C7 and G9, are indicated. Inverted horizontal arrows indicate the region of dyad symmetry. The sequence and structure of transcripts initiated at start sites C7 (high CTP) and G9 (low CTP) are shown, with the SD sequence boxed. Only C7 transcripts form the hairpin that includes the SD sequence and prevents translation initiation.
|
According to the model (Fig. 3), nucleotide-sensitive selection of transcription start sites is used to produce alternative transcripts with different potentials for translation. When the intracellular level of CTP is high (e.g., during growth with excess pyrimidines), C7 transcripts are synthesized predominantly. These transcripts are not translated, however, because they form a stable hairpin at their 5' ends that blocks ribosome binding to the pyrC SD sequence. In contrast, when the CTP level is low and the GTP level is high, conditions found in cells limited for pyrimidines (142), G9 transcripts are synthesized primarily. The shorter G9 transcripts are unable to form the inhibitory hairpin and are readily translated. Thus, this mechanism allows the level of pyrC expression to change according to the cell's requirement for pyrimidine nucleotides. Furthermore, in this model changes in pyrC expression can be gradual in response to incremental changes in the intracellular CTP (and GTP) concentrations.
The key aspects of the model have been confirmed. The importance of the hairpin at the 5' end of the pyrC transcript was shown by using pairs of mutations in the hyphenated dyad symmetry of the pyrC leader region. Individually, these mutations cause constitutive pyrC expression. However, when a pair of complementary mutations capable of restoring complete base pairing in the leader transcript hairpin was introduced into a strain, it exhibited nearly normal levels of pyrimidine-mediated regulation of pyrC expression (153, 169). In related experiments, direct evidence for the predicted secondary structure at the 5' end of C7 transcripts and the absence of this structure in G9 transcripts was obtained by chemical and enzymatic probing of pyrC transcripts isolated from cells grown under conditions of pyrimidine excess or limitation (151). The importance of start site switching was demonstrated by showing that a strain carrying a mutant pyrC promoter unable to switch start sites (e.g., containing a C7-to-A or C7-to-G mutation [see below]) fails to exhibit pyrimidine-mediated regulation of pyrC expression (99). In addition, nucleotide (CTP/GTP)-sensitive selection of transcription starts sites was demonstrated in vitro using a transcription assay containing only highly purified RNA polymerase, DNA template, NTP substrates, and salts. These results closely mimicked those observed in vivo, indicating that additional regulatory factors are not required for transcription start site switching at the pyrC promoter (169). One seemingly wasteful feature of the model is the synthesis of untranslated C7 transcripts. It was suggested that these transcripts would be prematurely terminated, as observed in polarity (82). Such a fate for C7 transcripts is indeed likely, because multiple Rho-dependent termination sites exist early in the pyrC ORF (J. Liu and C. L. Turnbough, Jr., unpublished data). Perhaps the most intriguing feature of the model for regulation of pyrC expression was nucleotide-sensitive selection of transcription start sites. Characterizing transcription initiation at mutant pyrC promoters provided rules for this selection process.
The nontemplate strand sequence of the pyrC ITR is 5'-TCCGG, located 6 to 10 bases downstream of the –10 region (Fig. 3). Transcription at the wild-type promoter can occur at the first four positions, as described above. Therefore, if context effects are ignored and corrections are made for different transcript stabilities, the levels of C7 and C8 transcripts in cells can be used to calculate the frequency of in vivo transcription initiation at positions C7 and C8 (99). Such an experiment demonstrated that C7 was a fivefold better start site than C8. If a single-base deletion that removes the T residue immediately downstream of the –10 region is introduced into the pyrC promoter, the possible start sites are now CCGG at "new" positions 6 to 9. Repeating the experiment described above with the mutant promoter revealed that C7 was a much better start site than C6, with C6 transcript levels so low that they could not be measured. Likewise, it was possible to use the tandem G8/G9 sites to show that G8 was a 13-fold-better start site than G9. Additional mutant promoters were then constructed in which other deletions (i.e.,
TT and
TTG) or a T insertion were introduced immediately downstream of the –10 region. These promoters created more possible positions for the CC and GG pairs, and the transcripts initiated at these sites were analyzed as described above. Combining all of the results permitted the assignment of the following preferences for start site positions: 7 > 8 > 6 > 9 > 10. Similar analyses were performed to determine preferences for the initiating nucleotide, using a different set of mutant pyrC promoters. These promoters contain single-base substitutions at the best initiation position, 7, and at a relatively poor initiation position, 9. Specifically, C7 was changed to a T, G, or A, and G9 was changed to a C or A. Measuring the frequency of initiation at these sites revealed the following preferences for the initiating nucleotide: ATP
GTP > UTP >> CTP. The actual difference between the initiation efficiencies of UTP and CTP was sixfold, making CTP the poorest initiating nucleotide by far. Although the experiments described above were done with E. coli (99), the same preferences were observed at wild-type and mutant pyrC promoters in S. enterica serovar Typhimurium (152).
The preferences or "rules" for selecting transcription start sites suggest a somewhat revised version of the model for pyrC regulation. Specifically, these rules provide the basis for nucleotide-sensitive start site switching at the wild-type pyrC promoter. The worst initiating nucleotide (CTP) is used to start transcripts at the best start location (position 7), and a good initiating nucleotide (GTP) is used to start transcripts at a weak start location (position 9). These combinations establish competition between initiation at positions C7 and G9, which can be influenced dramatically by changes in intracellular levels of CTP and GTP that reflect pyrimidine availability. The same rules restrict transcription initiation at positions T6 and C8, which utilizes the combination of a suboptimal start position and a poor initiating nucleotide.
It appears that the rules for selecting transcription start sites identified with the pyrC promoter apply in general to other
70 promoters. Examination of several hundred well-characterized E. coli promoters shows frequencies for selecting initiating nucleotides (A [47%], G [28%], T [15%], and C [10%]) (57, 95) and start site positions (7 [40%], 8 [24%], 6 [11%], 9 [10%], and other sites [
5%]) (55) that reflect the preferences identified above. These results suggest that most transcripts start with efficient initiating nucleotides and favored positions to maximize transcript synthesis. It also suggests that the use of inefficient initiating nucleotides and less favored positions is evolutionarily selected to reduce or control transcript synthesis.
Finally, the rules described above for selecting transcription start sites ignore context effects. However, the local DNA sequence can be an important factor in start site selection (17, 69, 93, 166). Of particular importance is the sequence at position +2 of the transcript, which accounts for the so-called second-nucleotide effect. It was demonstrated many years ago (113, 126), and again in a clear fashion during the analysis of mutant pyrC promoters (152), that high concentrations of both the first and second NTP substrates are required for highly efficient initiation of transcription. Apparently, after formation of the first internucleotide bond, the dinucleotide product stabilizes the transcription initiation complex. Lower concentrations of NTP substrates are required for transcript extension beyond position +2, though the relaxation of the requirement for high NTP concentrations may occur gradually until promoter clearance (2). Based on these observations and the fact that the +2 nucleotide in pyrC C7 transcripts is a C, it appears necessary to make a final modification to the model for pyrC regulation. Namely, synthesis of C7 transcripts is restricted at low CTP concentrations because of insufficient levels of the first and second NTPs required to initiate transcription.
Inspection of published bacterial promoter sequences reveals many other transcription initiation regions at which nucleotide-sensitive start site switching is predicted. Such switching can produce transcripts with minor differences in sequence at their 5' ends, which produce major differences in the ability of the transcripts to be translated. This effect may be due to formation of secondary structures that inhibit translation initiation as seen with the pyrC and pyrD regulatory mechanisms. However, nucleotide-sensitive start site switching can generate sequence differences at the 5' ends of transcripts that alter gene expression in many other ways, some of which were also discovered by studying genes of pyrimidine metabolism (see below).
|
|
|---|
The sequence requirement for reiterative transcription at the mutant sar promoter, as well as at several other promoters (51, 54, 59, 106), appeared to be a short (i.e.,
3-bp) tract in the ITR that specified a homopolymeric run in the nascent transcript. Thus, the run of three T residues at positions +3 to +5 in the ITR of the pyrBI promoter appeared to be a possible site for reiterative transcription. To investigate this possibility, the pyrBI promoter-leader region was transcribed in vitro in reaction mixes containing high (
200 µM) or low (20 µM) concentrations of UTP, with high concentrations of [
-32P]ATP, GTP, and CTP. The transcripts produced were separated in a 25% polyacrylamide gel (a procedure never employed in the many previous analyses of pyrBI transcripts synthesized in vitro) and visualized by autoradiography. The results revealed a ladder of transcripts generated at high UTP concentrations, with the longest transcript containing over 30 nucleotides. Synthesis of this ladder was greatly reduced at the low UTP concentration. The sequences of the transcripts in the ladder were shown to be AAUUUn (with n = 1 to >30), which established that reiterative transcription indeed occurs at the T3 tract within the pyrBI ITR. Furthermore, transcripts containing extra (i.e., >3) U residues were always released from the transcription initiation complex without switching to normal transcript elongation (which was also demonstrated in vivo), and synthesis of the AAUUUn transcripts inhibited the production of full-length pyrBI transcripts (97). These results suggested that reiterative transcription could be involved in UTP-sensitive regulation of transcription initiation at the pyrBI promoter.
To examine the role of reiterative transcription in regulation of pyrBI expression, base substitutions were introduced into the T3 tract within the pyrBI ITR. Transcription in vitro of DNA templates carrying these substitutions showed that any change in the T3 tract abolished reiterative transcription (L. Heath and C. L. Turnbough, Jr., unpublished data). Using a mutant strain carrying one of these base substitutions, it was shown that pyrBI expression was sevenfold greater that that observed in a pyrBI+ strain when cells were grown under conditions of pyrimidine excess. When this base substitution was introduced into a strain carrying a defective pyrBI attenuator, pyrimidine-mediated regulation of pyrBI expression was effectively eliminated (97). These results demonstrate the regulatory role of reiterative transcription at the pyrBI promoter and show that UTP-dependent reiterative transcription and UTP-sensitive transcription attenuation are sufficient to account for all pyrimidine-mediated regulation of pyrBI expression in E. coli.
According to these observations, the following model was proposed for regulation of pyrBI expression by reiterative transcription (Fig. 4) (97). After the synthesis of the nascent transcript AAUUU, weak base pairing between the transcript and its DNA template allows a rapid and reversible one-base upstream shift (or slip) of the nascent transcript. When the intracellular level of UTP is high and the transcript is in the "slipped" position, the last (i.e., 5') A in the AAA tract in the DNA template efficiently directs the addition of another U residue to the 3' end of the transcript. This transcript can be released from the transcription initiation complex or it can shift again. The cycle of slippage and U addition can occur repeatedly, resulting in transcripts with progressively longer runs of U residues. However, all AAUUUUn transcripts are eventually released from the initiation complex, thereby preventing productive transcription of the pyrBI operon. On the other hand, when the intracellular level of UTP is low, slippage (if it occurs) and correct repositioning of the AAUUU transcript—without addition of extra U residues—occurs predominantly. Correct positioning of the RNA-DNA hybrid permits the addition of a G residue to the 3' end (i.e., position +6) of the transcript. Once this addition occurs, more stable base pairing between the transcript and template precludes further slippage. The AAUUUG transcript either is released from the initiation complex, as a simple aborted transcript, or is extended by the addition of a C residue, which apparently commits the transcription complex to the elongation mode (97). Therefore, high levels of full-length pyrBI transcripts are produced only when their encoded enzyme, aspartate transcarbamylase, is needed to synthesize more UTP. In this model, regulation of pyrBI expression can occur gradually, over a range of intracellular UTP concentrations, by corresponding adjustments in the efficiency of reiterative transcription.
![]() View larger version (23K): [in a new window] |
FIG. 4. Model for the regulation of pyrBI expression by UTP-sensitive reiterative transcription. DNA sequences in the transcription bubble are shown, and the sequence of the nascent transcript, starting at position +1, is italicized. For details, see the text.
|
![]() View larger version (9K): [in a new window] |
FIG. 5. Promoter-regulatory region of the carAB operon of E. coli. Promoters P1 and P2 and the binding sites for IHF, PepA, RutR, PurR, and ArgR are shown. The partial sequence of promoter P1 includes the –10 region and transcription start site (+1), which are underlined.
|
1) were not extended to include sequences specified by the carAB genes (53). Finally, +3T-to-G or +3T-to-C mutations were shown to prevent reiterative transcription at promoter P1 while increasing the production of normally elongated, full-length carAB transcripts. Each mutation also caused an approximately threefold reduction in pyrimidine-mediated regulation of carAB expression, which was independent of regulation involving IHF and PepA. Pyrimidine-mediated regulation involving IHF and PepA occurs over a six- to ninefold range (53). Taken together, these results indicate that regulation of carAB expression by UTP-sensitive reiterative transcription occurs by a mechanism analogous to that described for the pyrBI operon (Fig. 4). In this mechanism, transcription is initiated at the G7 start site in a manner independent of the UTP concentration. After the nascent transcript is extended normally to include four bases and has the sequence GUUU, weak base pairing between the transcript and DNA template permits reversible one-base slippage. With a high UTP concentration and the nascent transcript in the slipped position, an extra U residue is added to the 3' end of the transcript. Either this transcript can be released from the initiation complex or another round of slippage and U addition can occur. Repeating this cycle generates transcripts with long runs of U residues; however, these transcripts are excluded from the normal mode of transcription elongation. With a low UTP concentration, slippage (if it occurs) and correct repositioning of the GUUU transcript—without extra U addition—permit normal template-directed insertion of a G residue at position +5. This addition results in a more stable RNA-DNA hybrid and the loss of alternative alignments for the 3' end of the transcript, which precludes further slippage. The GUUUG transcript is either released as a simple aborted transcript or extended downstream with a high probability that it will become a full-length carAB transcript. In this model, the level of carAB expression is inversely proportional to UTP-sensitive reiterative transcription, and the production of carbamylphosphate synthetase corresponds to the cell's need for pyrimidine nucleotides. Although not included in this (or the pyrBI) model, it is possible that intracellular GTP levels affect operon expression by influencing the addition of a U or G residue at position +5 (or +6 in the case of pyrBI) of the nascent transcript (70). Physiological conditions that allow GTP levels to modulate reiterative and productive transcription at the carAB P1 and pyrBI promoters remain to be established. However, this possibility seems likely because pyrimidine limitation typically results in both a decrease in the UTP level and an increase in the GTP level in the cell (142).
The full range of pyrimidine-mediated regulation of carAB expression requires two independent control mechanisms that respond to the same or comparable (i.e., UTP and uracil) small-molecule effectors. Similar situations exist for UTP-sensitive regulation of pyrBI expression (i.e., transcription attenuation and reiterative transcription), Trp-sensitive regulation of the trpEDCBA expression (TrpR-mediated repression and transcription attenuation) (176), and numerous other operons in E. coli and other bacteria (48). The major advantage of such multiple control mechanisms is that regulation can respond to a wide range of concentrations of a particular effector molecule, with each control mechanism sensitive to a different range of effector concentrations. In the case of the carAB operon, it appears that IHF/PepA/RutR-mediated regulation occurs when UTP levels are relatively high (i.e., between 0.9 and 1.4 mM), while regulation by reiterative transcription occurs when UTP levels are lower (i.e., between 0.9 mM and 50 µM) (53). The lowest intracellular levels of UTP may be experienced by pyrimidine auxotrophs grown under pyrimidine-limiting conditions or by prototrophs following a shift from a pyrimidine-rich to a pyrimidine-poor environment.
Finally, an interesting difference between the reiterative transcription control mechanisms of the carAB and pyrBI operons is that the range of regulation provided by UTP-sensitive reiterative transcription at the carAB P1 promoter is smaller (by a factor of two to three) than that observed with the pyrBI promoter. This difference is presumably due to differences in the carAB and pyrBI promoter sequences. Mutational variants of the carAB P1 promoter were constructed to examine this assumption (X. Han and C. L. Turnbough, Jr., unpublished data). One variant showed that changing the G at the 5' end of the transcript to an A increases the range of regulation nearly twofold. This result suggests that stronger G/C base pairing between the 5' end of the transcript and the DNA template suppresses reiterative transcription and restricts the range of regulation. Another variant showed that changing the location of the transcription start site from position 7 to position 8 increases the range of regulation 2.5-fold. The reason for this enhancement is not obvious. However, the observations with the mutant P1 promoters indicate that promoter sequences and the architecture of the transcription initiation complex can significantly affect reiterative transcription. As a corollary, a TTT motif is necessary but not sufficient for UTP-dependent reiterative transcription. The dependence of reiterative transcription on additional promoter elements—sometimes an absolute dependence—was clearly established by the following examples of gene regulation in E. coli.
|
|
|---|
![]() View larger version (26K): [in a new window] |
FIG. 6. Salvage pathways for uracil and cytosine. Gene names are used to represent the encoded proteins. Excluding those shown in Fig. 1, the genes and their encoded proteins are as follows: cdd, cytidine deaminase; cmk, CMP kinase; codA, cytosine deaminase; codB, cytosine permease; udk, uridine kinase; udp, uridine phosphorylase; upp, UPRTase; and uraA, uracil permease.
|
The first indication that codBA expression might be regulated by UTP-sensitive reiterative transcription was the discovery that the codBA promoter contains a T6 tract in its ITR, at a position that resembles the location of the T tract in the pyrBI promoter. The sequence of the codBA promoter region containing the ITR is 5'-TAGAATGCGGCGGATTTTTTGGG, with the –10 region and predicted transcription start sites underlined. However, the T tract in the codBA promoter is twice as long as the T tract of the pyrBI operon, which suggested that regulation of reiterative transcription at the codBA promoter would be different from that at the pyrBI promoter. Specifically, it was not clear how low UTP levels would inhibit reiterative transcription. With six U residues specified by the codBA ITR and only three U residues required for reiterative transcription, it appeared that extra U residues would be added to essentially every nascent codBA transcript, regardless of the UTP level. Furthermore, if extra U addition prevented productive transcript elongation, as observed with nascent pyrBI transcripts, then transcription of the codBA operon would be precluded. Obviously, something was missing in this scenario.
To examine reiterative transcription at the codBA promoter, a DNA template containing the codBA promoter region was transcribed in vitro in reaction mixtures containing various physiological concentrations (from 20 µM to 1 mM) of UTP (137). Analysis of the transcription products showed that codBA transcripts are initiated at two sites: a G residue and an A residue located seven and eight bases downstream from the –10 region (and designated G7 and A8), respectively (Fig. 7A). Most transcripts initiated at position G7 appeared to be elongated normally (i.e., they did not engage in reiterative transcription). In contrast, all transcripts initiated at position A8 appeared to engage in reiterative transcription that produced AUUUUn (where n = 1 to >15) transcripts. These transcripts were released from the transcription initiation complex without further downstream extension. Varying the UTP concentration did not affect the extent of the reiterative transcript; i.e., AUUUUn transcript ladders were always comparable in length. Although the UTP concentration did not affect reiterative transcription, it had a major effect on start site selection: higher concentrations of UTP favored initiation at position A8. At 1 mM UTP, initiation at position A8 was strongly favored; below 100 µM UTP, nearly all initiation occurred at position G7. Thus, the UTP concentration controlled start site selection, even though UTP was not used as the initiating NTP (which will be explained below).
![]() View larger version (33K): [in a new window] |
FIG. 7. Transcription from wild-type and mutant codBA promoters. (A) Sequence of the wild-type codBA promoter region with the –10 region underlined and the G7 and A8 start sites labeled with the number 7 or 8, respectively. Horizontal arrows indicate transcription initiation at the two start sites. The two T-to-G substitutions that created the TGTG mutant promoter are shown in gray. (B) Levels of codB::lacZ transcripts initiated at the wild-type and TGTG mutant promoters. Cells carrying either the wild-type or TGTG mutant codB::lacZ fusion were grown under conditions of pyrimidine excess or limitation, and fusion transcript levels were measured by quantitative primer extension mapping (137). Transcript levels are in arbitrary units. (Modified from reference 137 with permission from Elsevier.)
|
codBA-lacZYA) strain of E. coli. This strain and an isogenic control strain with a wild-type codB::lacZ fusion were grown under conditions of pyrimidine limitation and excess, and β-galactosidase levels in these cells were compared. Wild-type codB::lacZ expression was regulated over an approximately 30-fold range, while mutant codB::lacZ expression was unregulated (excluding
1.5-fold pyrimidine-independent basal regulation). When cells were grown under conditions of pyrimidine excess, expression of the mutant codB::lacZ fusion was approximately 30-fold higher than that of the wild-type fusion. These results indicated that UTP-dependent reiterative transcription at the codBA promoter was required for all pyrimidine-mediated regulation of codBA expression. Using the same four cultures (i.e., mutant and wild-type codB::lacZ fusion strains grown with limiting or excess pyrimidines), the steady-state levels and transcription start sites of the codB::lacZ transcripts were determined by quantitative primer extension mapping (137). The primer used in these experiments was complementary to codB sequences included in the codB::lacZ fusion. In the case of the wild-type strain, essentially all detectable transcripts were initiated at position G7, and the level of G7 transcripts in cells grown under conditions of pyrimidine limitation was at least 10-fold higher than that in cells grown with excess pyrimidines (Fig. 7B). Transcripts initiated at position A8 were not detected in cells grown under either condition. In the case of the mutant strain, in which reiterative transcription at the codBA promoter is precluded, both G7 and A8 transcripts were detected in cells grown under either condition (Fig. 7B). In both cultures, the levels of total (i.e., G7 plus A8) transcripts were similar, i.e., only 1.4-fold higher in cells grown under conditions of pyrimidine limitation. Also, the levels of total transcripts in the mutant cells (grown with either pyrimidine excess or limitation) were roughly threefold higher than that in wild-type cells grown with limiting pyrimidines, with a large part of this increase due to A8 transcripts. These results indicated that position A8 was a major transcription start site at the codBA promoter and that in the wild-type fusion strain, A8 transcripts were not detected because they were not extended downstream to include codB sequences. Presumably, these A8 transcripts were produced by nonproductive reiterative transcription and contain the sequence AUUUUn. In addition, the results with the mutant strain showed a high level of pyrimidine-mediated switching between the G7 and A8 start sites. In mutant cells grown with excess pyrimidines, approximately 75% of the transcripts were initiated at position A8. In contrast, in mutant cells grown with limiting pyrimidines, the level of G7 transcripts was twice that of A8 transcripts (Fig. 7B). These responses suggest that in the case of the wild-type codBA promoter, pyrimidine (presumably UTP)-mediated switching between productive transcription initiation at position G7 and nonproductive transcription initiation at position A8 plays a major role in regulation.
Based on the in vitro and in vivo results, the following model involving both UTP-dependent reiterative transcription and UTP-sensitive transcription start site switching was proposed for pyrimidine-mediated regulation of codBA expression (Fig. 8) (137). When UTP levels are high, RNA polymerase initiates transcription primarily at position A8—the preferred start site—and synthesizes a nascent transcript with the sequence AUUU (or perhaps AUUUU). At this point, weak base pairing between the transcript and its DNA template allows slippage between the two strands, resulting in a one-base, upstream shift of the transcript. RNA polymerase then switches to the reiterative mode of transcription and adds an extra U residue, which thereafter excludes this transcript from normal elongation. This transcript can be released from the initiation complex or the slippage/extra U addition cycle can be repeated many times, producing longer AUUUUn transcripts that have a fixed probability of release after each cycle. Furthermore, synthesis of the AUUUn transcripts precludes initiation at position G7, resulting in a low level of productive transcription and operon expression. Alternatively, when UTP levels are low, initiation at position A8 is inefficient, which restricts the synthesis of AUUUn transcripts. This restriction allows efficient transcription initiation at the secondary start site G7, which results in the synthesis of high levels of full-length codBA transcripts. Translation of these transcripts provides the proteins required for cytosine uptake and conversion to pyrimidine nucleotides when the nucleotides are needed by the cell.
![]() View larger version (24K): [in a new window] |
FIG. 8. Model for UTP-sensitive regulation of codBA expression. The model shows the effects of UTP concentration on productive transcription and nonproductive reiterative transcription (or stuttering), which occurs following transcription initiation at start sites G7 and A8, respectively. (Modified from reference 137 with permission from Elsevier.)
|
![]() View larger version (11K): [in a new window] |
FIG. 9. Promoter region sequences of the upp and codBA operons. The –10 regions are underlined, and asterisks indicate the two transcription start sites at each promoter. The start sites are numbered according to their position downstream from the –10 region.
|
The regulatory role of reiterative transcription at the upp promoter was investigated by constructing isogenic E. coli strains carrying a chromosomal upp::lacZ gene fusion with either a wild-type upp promoter or a mutant promoter in which the T tract was altered (e.g., T8 to TTGTTTTT) to eliminate reiterative transcription. These strains were grown under conditions of pyrimidine limitation or excess, and upp::lacZ expression levels were measured. The results showed that wild-type upp::lacZ expression was regulated over a sixfold range and that this regulation was effectively abolished by the elimination of reiterative transcription. In addition, elimination of reiterative transcription at the upp promoter caused constitutive upp::lacZ expression. Quantitative primer extension mapping of upp::lacZ transcripts isolated from these cultures showed that in the case of the wild-type fusion strain, only G6 transcripts were detected and the level of G6 transcripts in cells grown with pyrimidine limitation was sevenfold higher than that in cells grown with excess pyrimidines. (Note that with the primer used here, AUUUUn transcripts could not be detected.) In the case of the mutant fusion strain, both G6 and A7 transcripts were detected in cells grown under either condition; however, pyrimidine availability dramatically affected the relative amounts of the two transcripts. For cells grown with excess pyrimidines, 40% of the transcripts were initiated at position G6 and 60% were initiated at position A7. For cells grown with limiting pyrimidines, 90% of the transcripts were initiated at position G6 and 10% were initiated at position A7. Under both conditions, the levels of total (G7 plus A8) transcripts were similar, i.e., only 1.4-fold higher in cells grown under conditions of pyrimidine limitation. Taken together, these results revealed UTP-sensitive selection of alternative transcription start sites and different fates of the transcripts initiated at these sites that mirror the key regulatory elements of the codBA operon. Thus, a model for pyrimidine-mediated regulation of upp expression was proposed that is completely analogous to that for codBA expression (Fig. 8).
Briefly, according to the model, when intracellular levels of UTP are high, RNA polymerase preferentially initiates transcription at position A7. The resulting nascent transcript is extended until it contains three or perhaps four U residues, at which point weak base pairing in the RNA-DNA hybrid permits the transcript to slip one base upstream. RNA polymerase then adds a U residue to the 3' end of transcript, which irreversibly directs the transcript into a nonproductive transcription pathway. The transcript can be released from the initiation complex, or another round of slippage and U addition can occur. This process can be repeated many times, with a similar probability of transcript release after every U addition. Synthesis of the resulting AUUUUn transcripts occludes the promoter, thereby reducing the opportunity for initiation at position G6 and the production of full-length upp transcripts. In contrast, when intracellular levels of UTP are low, RNA polymerase initiates transcription almost exclusively at position G6. The resulting transcripts, in general, avoid reiterative transcription due to the formation of a more stable hybrid between the transcript and DNA template. These nonstuttering transcripts are elongated normally and can be extended downstream to generate translatable upp transcripts. Consequently, high levels of the proteins required for uracil salvage are produced only under conditions of pyrimidine limitation.
Other factors that could contribute to the different ranges of upp and codBA regulation are the different positions of the transcription start sites and the preferences for selecting these sites (99). The G and A start sites are located at positions 6 and 7 in the upp promoter and at positions 7 and 8 in the codBA promoter. At the upp promoter, the G6 start site is predicted to be a much weaker start site than the A7 site. In contrast, at the codBA promoter, both start sites are predicted to be highly efficient. Therefore, much stronger competition between the start sites should occur at the codBA promoter. Consistent with this prediction, the observed pyrimidine-mediated start site switching at the codBA promoter is much more extensive than that at the upp promoter, which should permit a wider range of regulation. To examine this explanation, a mutant upp::lacZ operon was constructed by inserting a C residue into the sequence between the –10 region and transcription start sites of the upp promoter (Fig. 9), which changes the wild-type sequence CCGTC to CCGTCC. This +C mutation changes the positions of the start sites from G6 and A7 to G7 and A8, which are the same as the start sites in the codBA promoter. According to the previous arguments, the +C mutation should change the range of pyrimidine-mediated regulation of upp expression from 6-fold to 30-fold. However, the +C mutation did something quite unexpected: it completely eliminated pyrimidine-mediated regulation of upp::lacZ expression. The loss of regulation occurred because transcription was initiated almost exclusively at position G7 (i.e., without competition from position A8) in cells grown with either limiting or excess pyrimidines (E. Várady and C. L. Turnbough, Jr., unpublished data). Interestingly, partial regulation—and competition from the A8 start site—could be restored by a second mutation in the –10 region of the upp promoter. These surprising results suggest that interactions between distinct promoter elements can alter the conformation of the transcription initiation complex in ways that strongly influence the selection of transcription start sites. Additional studies are needed to provide the rules for these interactions.
|
|
|---|
The study of pyrimidine biosynthetic gene regulation in B. subtilis in the Switzer laboratory originated from an interest in the regulation of aspartate transcarbamylase activity by intracellular proteolysis (112). B. subtilis aspartate transcarbamylase differs from its enteric homolog in that it contains no regulatory subunit capable of mediating allosteric control of enzyme activity (16); allosteric regulation of de novo UMP biosynthesis is exerted at the level of a pyrimidine-repressible carbamylphosphate synthetase (131). The B. subtilis pyrB locus was sequenced to determine the amino acid sequence of aspartate transcarbamylase (90). The DNA sequences flanking the pyrB gene showed that, in contrast to the case in enteric bacteria, the pyrB gene was part of a 10-gene pyr operon that encoded all of the enzymes required for the de novo synthesis of UMP (Fig. 10) (89, 138, 161). The sequence of the pyr operon revealed three putative intrinsic transcription terminators in the promoter-proximal region, and the positions of these terminators suggested their involvement in a transcription attenuation control mechanism (Fig. 10). The characterization of this mechanism is described in detail in this review. The B. subtilis pyrG gene, which as in enteric bacteria encodes CTP synthetase, is not located in the multigene pyr operon. Regulation of B. subtilis pyrG expression is also regulated by an attenuation control mechanism, but this mechanism, which is described below, is quite different from that of the pyr operon.
![]() View larger version (8K): [in a new window] |
FIG. 10. Map of the B. subtilis pyr operon. Shaded bars indicate ORFs, and the bent arrow denotes the pyr promoter. The proteins encoded by the genes shown were identified in Fig. 1, except as follows: pyrR, pyr mRNA-binding attenuation regulatory protein; pyrP, uracil permease; pyrAA, glutamine-utilizing subunit of carbamylphosphate synthetase equivalent to carA; pyrAB, catalytic subunit of carbamylphosphate synthetase equivalent to carB; pyrK, electron-transferring accessory protein to dihdroorotate dehydrogenase. Numbers indicate the positions of nucleotides in the B. subtilis genome. (Modified from reference 155 with permission from Elsevier.)
|
|
|
|---|
The B. subtilis pyr operon is transcribed from a single promoter, which appears to be constitutive (105, 161). Operon expression is controlled by an attenuation mechanism in which transcription termination at three sites in the 5' region of the operon is regulated by uridine and guanosine nucleotides via the PyrR regulatory protein. It was evident from the earliest studies that each of the three untranslated segments of the operon contains a typical intrinsic transcription terminator, which specifies a terminator hairpin (the 3:4 stem-loops in Fig. 11, left) followed immediately by a run of U residues in the mRNA. These untranslated segments are the 5' pyr leader (attenuation region 1, 151 nucleotides), the pyrR-pyrP intercistronic region (attenuation region 2, 173 nucleotides), and the pyrP-pyrB intercistronic region (attenuation region 3, 145 nucleotides) (Fig. 10 and 11). Clearly, if the action of these terminators is not suppressed, little expression of the downstream genes, which include all of the enzymes of de novo pyrimidine biosynthesis, can occur. It was also recognized that the RNA from all three untranslated segments is capable of folding into an alternative secondary structure, which is a large hyphenated stem-loop (Fig. 11, right) (161). These structures were named antiterminators because they prevent formation of the downstream terminator hairpin by sequestering residues of the 5' segment of the terminator stem-loop via base pairing with upstream sequences. The key to regulation of the pyr operon lies in the ability of PyrR to favor formation of the terminator hairpins, which results in premature termination of transcription and reduced expression of the downstream genes. PyrR does this by binding to pyr mRNA when the protein is activated by binding of uridine nucleotides. Furthermore, binding of PyrR to pyr mRNA is antagonized by guanosine nucleotides, which effectively activates pyr operon expression. The segment of pyr mRNA to which PyrR binds was first identified from conserved regions within the nucleotide sequences of the three B. subtilis attenuator regions. These conserved regions were always found in the upstream segment of the antiterminator structure, so that binding by PyrR would be predicted to prevent base pairing in the antiterminator RNA and allow the downstream segment of the antiterminator to fold into the alternate terminator hairpin (Fig. 11).
![]() View larger version (13K): [in a new window] |
FIG. 11. Predicted secondary structures of the regions of pyr transcripts specified by attenuation regions 1 (5' leader), 2 (pyrR-pyrP), and 3 (pyrP-pyrB). Nucleotides are numbered from the start of transcription (i.e., +1). (Left side) RNA is shown folded into the binding loop, formed by base pairing of segments 1 and 2, and the terminator hairpin, formed by base pairing of segments 3 and 4, conformation. Bases involved in the formation of the alternative antiterminator stem-loop conformation are circled. (Right side) RNA is shown folded into the antiterminator stem-loop conformation. (Modified from reference 178.)
|
![]() View larger version (18K): [in a new window] |
FIG. 12. Mechanism of PyrR-mediated transcription attenuation control of pyr operon expression in B. subtilis. For simplicity, only transcription of the pyr 5'-leader attenuation region is shown. For details, see the text. (Modified from reference 178.)
|
A second line of genetic evidence for a role of PyrR in regulation of the pyr operon came from characterization of 12 point mutations that caused constitutive expression of the pyr operon (46). All of the mutations were located in or near the pyrR gene. Two were premature chain termination mutations, one altered the pyrR ribosome binding site, and the others were missense mutations in the pyrR gene. The most direct demonstration of the crucial role of PyrR in the regulation of pyr operon expression came from the analysis of a B. subtilis strain carrying an in-frame deletion in the chromosomal pyrR gene (161). In the mutant strain, expression of the pyrB gene was 250-fold greater than the repressed levels found in pyrR+ cells, and the elevated expression was completely refractory to repression by exogenous pyrimidines. The regulation of pyrB expression in the mutant strain was restored to normal by the introduction of a plasmid-borne copy of pyrR.
Purified B. subtilis PyrR has been shown to possess two other properties required by the regulatory model: it binds with high specificity to pyr binding loop RNA sequences, and its affinity for these RNAs is increased by UMP and UTP (14, 160). More recent studies with purified PyrR from Bacillus caldolyticus led to the previously unrecognized finding that guanosine nucleotides cause greatly reduced affinity of PyrR for pyr RNA and antagonize the effects of uridine nucleotides (21, 73). Similar opposing effects of uridine nucleotides and guanosine nucleotides were observed in the allosteric regulation of the pyrAA/pyrAB-encoded carbamylphosphate synthetase from B. subtilis (131). Such effects have been proposed to provide a means of coordinating rates of pyrimidine biosynthesis with the size of intracellular purine nucleotide pools (21, 73).
The properties of a set of deletion mutations introduced into a gene fusion in which the pyr promoter-leader region was joined to lacZ also illustrated the importance of RNA secondary structures in regulation (161). Deletion of the terminator in the leader region resulted in high and constitutive lacZ expression, whereas deletion of the antiterminator region eliminated expression under all growth conditions. A detailed, systematic deletion analysis of the pyr leader region has not been conducted, however.
As noted above, the secondary structure of binding loop 2 of the pyrR-pyrP intercistronic attenuation region has been studied by nuclease digestion (14). The patterns of cleavage by single-strand-specific and double-strand-specific nucleases were consistent with the secondary structure shown in Fig. 13B, although these results do not establish this structure unequivocally. Weak cleavage of the RNA in the terminal loop regions provided an indication that this loop may fold into a relatively compact structure, as is known for some other terminal RNA loops (38, 58). Unexpectedly strong single-strand cleavages in the region of A·U base pairs upstream of the bulge also suggested that the single-stranded segment of the lower 5' strand of the stem-loop is longer than predicted from computer analysis of RNA folding. The other predicted RNA secondary structures shown in Fig. 11 have not been subjected to nuclease digestion analysis.
![]() View larger version (22K): [in a new window] |
FIG. 13. Elements in the pyr binding loop RNA that are important for PyrR binding and pyr operon regulation. Nucleotides are numbered as in Fig. 11. (A) Mutations in the pyr 5' leader RNA that cause a loss of repression by pyrimidines. (B) Protection against hydroxyl radical cleavage of binding loop 2 RNA by PyrR and mapping of secondary structure by nuclease digestion. Sites adjacent to nucleotides shown in blue were strongly protected against hydroxyl radical cleavage, the site adjacent to the nucleotide in red was moderately protected, and sites adjacent to nucleotides in green were weakly protected. Arrows with circles indicate sites of cleavage by a single-strand-specific nuclease (RNase I), arrows with squares indicate sites of cleavage by a double-strand-specific nuclease (RNase V1), and S and W denote strong and weak cleavage, respectively. Suggested alternative structures for the terminal loop of the RNA hairpin are shown in circles. (C) Specificity of PyrR binding to binding loop 2 RNA determined by gel mobility shift analysis. Residues shown in red cannot be replaced without loss of binding, residues in green can be replaced as long as the secondary structure of the RNA is preserved, and residues in black can be replaced or deleted. (D) Consensus sequence and structure of the PyrR binding site. The consensus structure was derived from 20 binding loops identified in pyr operons of gram-positive bacteria. Secondary structures were predicted by MFOLD (R = A or G, Y = U or C, and N = any nucleotide). Parentheses indicate nucleotides present in only some species. In 8 of 20 examples, the U nucleotide shown in a dashed circle is part of the internal loop, which directs the predicted alternative base pairing shown with dashed lines. The sequences and base pairs shown in boxes are highly but not universally conserved. (Panel A is reprinted from reference 155 with permission from Elsevier; panels B and D are reprinted from reference 14 with permission from Oxford University Press.)
|
For the quantitative analysis of pyr transcription, Lu et al. analyzed a series of chromosomal pyr::lacZ fusions expressed in cells under conditions that repress or derepress pyr operon expression (105). The first set of transcriptional fusions included the pyr promoter followed by DNA containing either attenuator region 1, attenuator region 2, or attenuator region 3. For the latter two fusions, the native intervening upstream attenuators and ORFs were excised. Expression of all fusions was repressed to about the same extent by exogenous pyrimidines. Repression was approximately 4-fold relative to that in cells grown on minimal medium and 20-fold relative to that in cells starved for pyrimidines by slow growth on orotate, a poor pyrimidine source. Repression was completely dependent on a wild-type pyrR gene. In addition, the expression levels of the three pyr::lacZ fusions were similar. In another set of transcriptional fusions, two or three attenuators were linked in tandem. In these cases, repression of lacZ expression was cumulative. That is, the extents of repression by excess uracil relative to that in cells starved for pyrimidines were 18-fold, 136-fold, and 200-fold when the fusions contained one, two, and three attenuators, respectively. These observations were entirely consistent with the proposed regulatory model. Studies with other pyr::lacZ fusions provided clear evidence that the DNA sequences upstream of the pyr promoter are not involved in regulation of operon expression and that possible translation of a small ORF in the pyrP-pyrB intercistronic region is not required for attenuation control.
In a subsequent study, Zhang et al. attempted to evaluate the function of transcription pausing in the pyr operon in vivo by constructing and analyzing mutations that greatly reduce pausing (177). The mutations changed selected pyrimidine nucleotides to purine nucleotides near pause sites, which was shown to substantially reduce transcription pausing at the mutant site in vitro. These mutations were then incorporated into pyr::lacZ fusions, and their effect on pyrimidine-mediated regulation of pyr::lacZ expression was determined. No consistent correlation between elimination of transcription pausing in vitro and defects in cellular regulation was observed. A major complication in these experiments was the inability to separate the effects of the mutations on transcription pausing from their effects on transcription termination at an attenuator.
|
|
|---|
The three-dimensional structure of PyrR demonstrates that the protein is a member of the type I phosphoribosyltransferase structural family in spite of its low sequence relatedness to them (149). The kinetic mechanism of the UPRTase reaction catalyzed by B. subtilis PyrR has been characterized (49). In most respects, this enzyme is a typical phosphoribosyltransferase. Other than the requirement for the binding of nucleotides to the UPRTase active site, there is no obvious relationship between the protein's enzymatic and regulatory activities. Mutant forms of PyrR that fail to bind RNA and regulate pyr transcription, but which have normal UPRTase activity, have been described (143). A mutant PyrR that has lost catalytic activity while retaining the ability to regulate pyr expression has not been isolated, but it has been reported that Lactococcus lactis PyrR lacks UPRTase activity (109). Thus, it is unlikely that PyrR must be able to catalyze this reaction to exert its regulatory function. We believe that the UPRTase activity of PyrR reflects its evolutionary origin as a phosphoribosyltransferase that later acquired the ability to bind to RNA. Of all the members of the type I phosphoribosyltransferase family, PyrR most strongly resembles, in both sequence and tertiary structure, hypoxanthine guanine phosphoribosyltransferase (76). This similarity suggests that PyrR did not evolve from the upp family (156). The recent observation of a dimer of B. caldolyticus PyrR with GMP in one active site and UMP in the other reinforces the idea that the protein is evolved from the hypoxanthine guanine phosphoribosyltransferases (21). The purine repressor PurR of B. subtilis provides another example of a phosphoribosyltransferase structural domain that has acquired a regulatory function (148). The binding of PurR to pur operator DNA is regulated by PRPP, which binds to the conserved phosphoribosyltransferase active site of the protein. PurR apparently does not catalyze an enzymatic reaction, however, and the nucleic acid binding site resides in an additional helix-turn-helix domain that is not found in PyrR or other phosphoribosyltransferases (148).
Hydroxyl radical footprinting experiments with an RNA specifying B. subtilis binding loop 2, which is known to bind tightly to PyrR, were used to map the portions of the RNA that were protected by PyrR (14). Three segments were protected from hydroxyl radical cleavage (Fig. 13B). The terminal loop and upper stem were strongly protected, as were three nucleotides at the top of the lower stem opposite the conserved element that initiates the purine-rich bulge. The purine-rich bulge itself was weakly protected.
Electrophoretic gel mobility shift experiments were used to examine the binding of purified PyrR to 37 structural variants of B. subtilis binding loop 2 (14). The results are summarized in Fig. 13C. The requirements for tight PyrR binding to pyr RNA appeared to be quite exacting. Numerous single-nucleotide substitutions or deletions led to a reduction in the apparent affinity by as much as three orders of magnitude. The importance of maintaining the RNA secondary structure shown in Fig. 13C was documented. The requirement for specific nucleotide residues in positions in the upper stem and loop and in the lower stem just below the purine-rich bulge was confirmed, as was the requirement for the bulge itself. The smallest RNA that bound well to PyrR contained the 28 nucleotides from position 708 to 735 in Fig. 13C (14). Recently, examination of the specificity of PyrR binding to all three B. subtilis binding loop RNAs in vivo using a yeast three-hybrid method confirmed the generalizations shown in Fig. 13C (60).
The results of genetic, footprinting, and RNA binding studies yielded a consistent picture of the PyrR binding site. However, several aspects of PyrR binding suggested by gel shift assays were puzzling. Specifically, PyrR binding to binding loops 1 and 3 appeared to be much weaker and less affected by nucleotides than binding to binding loop 2. These differences were inconsistent with the essentially equivalent regulation in vivo of pyr::lacZ fusions containing individual attenuation regions. For this reason, a detailed study of the binding of B. caldolyticus PyrR to the three B. caldolyticus pyr attenuation regions was undertaken using a filter binding assay (73). (For technical reasons, B. subtilis PyrR binding to RNA could not be measured with this assay.) The apparent dissociation constants (0.1 to 1 nM) for PyrR binding and the effects of nucleotides on this binding were similar for all three binding loops. These results were consistent with regulation of all three attenuators by physiological concentrations of nucleotides. The ratio of uridine to guanine nucleotides appeared to be the primary determinant of PyrR binding, a conclusion supported by the effects of exogenous uridine and guanosine on pyr operon expression in growing cells of B. subtilis. Recent studies by Jørgensen et al. revealed a requirement for Mg2+ in the gels used for gel shift assays, which may have led to substantial artifacts in the previous determinations of binding constants for PyrR binding to B. subtilis attenuation regions 1 and 3 (73). Furthermore, some of the conclusions from the study of RNA structural variants summarized in Fig. 13C should be reexamined, especially those results with variants that appeared to bind very poorly to PyrR. Many of the conclusions of Bonner et al. (14) concerning the RNA sequence and secondary structure required for binding to PyrR were confirmed, however.
How well do the studies of B. subtilis PyrR binding to attenuation regions predict the binding requirements of PyrR proteins from other species? A mutational analysis of PyrR binding to attenuation regions in Lactobacillus plantarum indicates that the required RNA features are similar to those found in B. subtilis (125). A phylogenetic comparison of 20 known or probable PyrR binding loops from nine different species suggests that there is little variation in PyrR selectivity among these species (14). As seen in Fig. 13D, the loops vary in the lengths of the upper and lower stems and the size of the bulge/internal loop but not in the overall secondary structure or identity of critical nucleotides. The structure of the binding loop can vary as much from attenuator to attenuator within a given species (e.g., in B. subtilis or L. lactis) as it does from species to species. We have relied on this conservation of structure and sequence to identify likely modes of action of PyrR in other bacteria (see below).
PyrR folds into a core domain consisting of a curved central sheet formed by five parallel β-strands and flanked by three
-helices (Fig. 14B). A small subdomain called the "hood," which is made up of three antiparallel β-strands, caps the major core domain. The core domain strongly resembles the architecture found in many other type I phosphoribosyltransferases (149). The conserved amino acid residues of the PRPP/nucleotide binding site in all type I phosphoribosyltransferases are located in the central β-strand of the core domain and the following
-helix; PyrR obeys this generalization. Binding of a sulfate ion in this site in unliganded B. subtilis PyrR and subsequent determination of the location of nucleotides bound to B. caldolyticus PyrR confirmed this identification of the nucleotide binding site (21). The "hood" domain varies greatly from one phosphoribosyltransferase to another; residues in this domain are involved in determining the specificity of nucleotide binding.
![]() View larger version (49K): [in a new window] |
FIG. 14. Ribbon diagram of the crystal structure of PyrR from B. caldolyticus. Each polypeptide chain is shown in a distinct color. (A) The native tetrameric structure. (B) One of the two identical dimeric structures that combine to form the tetramer. Very similar dimeric structures are found in PyrR from other species. The black circles indicate the location of bound Mg2+ ions. The stick structures indicate the locations of 5'-UMP bound to the active site of the green subunit, 5'-GMP bound to the active site of the purple subunit, and 3'-GMP bound in a crystal contact lattice. (Reprinted from reference 21.)
|
Crystals of B. subtilis PyrR with UMP bound could not be obtained under conditions in which unliganded PyrR crystallizes, and the addition of UMP to PyrR crystals led to dissolution of the crystals (J. L. Smith, personal communication). These observations suggest that substantial conformational changes accompany binding of uridine nucleotides to PyrR. Such conformational changes are of particular interest because they might help to explain how the binding of nucleotides increases the affinity of PyrR for RNA. Crystals of B. caldolyticus PyrR with nucleotides bound were first obtained inadvertently in attempts to obtain a PyrR-RNA cocrystal (21). An RNase contaminant led to degradation of the RNA, and PyrR was obtained with UMP in the active site of one monomer of the dimeric unit and with GMP in the active site of the other monomer (Fig. 14B). These nucleotides interact with Mg2+ and with conserved active-site amino acid residues in a manner that differs in interesting ways from that observed with other phosphoribosyltransferases (21). Specifically, the magnesium ions in the active sites of PyrR are not ligated to the phosphate moiety or to the vicinal 2', 3' hydroxyls of the nucleotide, as is usual in the substrate complexes of other phosphoribosyltransferases. UMP and GMP are bound in a very similar manner to the active sites; the same amino acid residues are hydrogen bonded to the uracil and guanine bases. A third nucleotide, probably 3'-GMP, was found located between tetramers in the crystal lattice. More recently, the structure of B. caldolyticus PyrR with only 5'-UMP bound to the active sites was solved (20). The structure of this form of the protein was identical to the structure of the unliganded (but sulfate-bound) state and the UMP- plus GMP-bound state of PyrR. Curiously, no protein conformational changes among unliganded and two nucleotide-bound PyrR crystal structures were detectable, so the structures do not reveal how nucleotide binding alters the affinity of PyrR for RNA. Perhaps the association of PyrR subunits in the tetrameric state in the crystal obscures the changes that are induced by nucleotides in the dimeric RNA binding form of PyrR.
Site-directed mutagenesis of B. subtilis PyrR was used to test the hypothesis that conserved amino acid residues whose side chains lie on the concave basic surface are required for RNA binding and regulation of the pyr operon (143). Glutamine substitution mutations in four residues that lie in this surface (Thr18, His22, Arg141, and Arg146) clearly identified them as involved in normal RNA binding and pyr regulation; the mutants had no detectable loss of UPRTase activity or structural integrity. Two other residues (Arg27 and Lys152) were similarly implicated, with the reservation that small changes in their average apparent native molecular weight were observed, which might indicate that these mutant proteins did not fold into fully native tertiary structure (143). These six residues are generally conserved in bacterial PyrR sequences. It seems likely that they form part of the RNA binding surface, although one cannot conclude that they interact directly with RNA. The elucidation of a detailed map of PyrR-pyr RNA interactions must await a high-resolution X-ray diffraction analysis of PyrR-RNA cocrystals.
|
|
|---|
The chromosomes of L. plantarum and a number of other members of the Lactobacillus genus contain two pyrR genes (8). The product of one of these, PyrR1, mediates regulation of the L. plantarum pyr operon in response to pyrimidines, as shown for B. subtilis and other species described above (125). Recently, the product of the second gene, PyrR2, was shown to regulate expression of the L. plantarum pyr operon and the unlinked pyrP operon in response to the CO2/HCO3– level in the medium (8). The two PyrR proteins operate independently and respond to different physiological signals. The biochemical mechanism of PyrR2 action is not yet clear. Two possible mechanisms have been discussed (8). PyrR2 could act by forming heterodimers with PyrR1 that are unable to repress pyr expression, or PyrR2 could interact with pyr RNA regulatory sequences to promote antitermination instead of favoring termination, the known action of PyrR1. This discovery of dual regulation of pyr expression in lactobacilli adds a fascinating new chapter to the study of PyrR function in metabolic regulation.
The combination of pyrR genes and PyrR binding sequences that overlap the ribosome binding sites for pyr genes, which we take to be suggestive of translational repression of those genes by PyrR, is quite widespread in other bacterial species. Numerous species have been identified in which ORFs for putative pyrP (uraA) genes and/or another putative transport protein of the major facilitator protein superfamily fit this pattern (C. J. Fields, unpublished data). In some species (e.g., Bacillus, Clostridium, Lactobacillus, and Streptococcus spp.), this arrangement coexists with regulation of pyr operon expression regulated by PyrR-mediated transcription attenuation. In other cases, translational repression of the genes for transport proteins appears to be the only mode of PyrR regulation (e.g., in Haemophilus influenzae and Pasteurella maltocida).
A very interesting hybrid regulatory mechanism appears to exist in Thermus strain ZO5 and related Thermus species (162). In Thermus strain ZO5 an ORF for short leader polypeptide precedes the pyrR gene and other downstream genes of a pyr operon. The ribosome binding site in the mRNA for the leader polypeptide overlaps a consensus PyrR binding sequence and could be subject to translational repression by PyrR. The RNA encoding the leader polypeptide is also capable of forming a transcription terminator. Van de Casteele et al. (162) have suggested that attenuation at this terminator is regulated by the rate of translation of the leader polypeptide, which is in turn responsive to pyrimidines via PyrR-mediated translational repression. Furthermore, 6 of 28 codons in the leader polypeptide encode arginine, which might account for stimulation of pyr gene expression in Thermus strain ZO5 by this amino acid. This interesting model has not yet been tested by genetic or biochemical means, however.
pyrR mutant strain but that pyrR from Synechocystis sp. strain PCC 6803 could not (24). This result indicates that H. influenzae PyrR is capable of binding to B. subtilis pyr RNA in a pyrimidine-dependent manner; we have suggested above that it may do so to regulate genes for uracil transport proteins. In the case of Synechocystis, the role of PyrR in regulation, if any, remains obscure. It is conceivable that PyrR serves only as a phosphoribosyltransferase in some species, even though they also have upp genes. The function of the PyrR homolog in Pseudomonas presents a particularly interesting unsolved problem. In the Pseudomonas aeruginosa and Pseudomonas putida genomes, a pyrR gene lies at the 5' end of the operon pyrRBC'. Genetic evidence indicates that the product of the pyrR gene possesses UPRTase activity and is involved in repression of pyrB by uracil (A. P. Kumar, C. J. Fields, and G. A. O'Donovan, personal communication). However, none of the RNA structures involved in PyrR binding or transcription attenuation as in B. subtilis can be identified in the pyrRBC' operon. Furthermore, the deduced sequences of Pseudomonas PyrR homologs differ from that of Bacillus PyrR in numerous residues that are thought to be important for RNA recognition and binding. If PyrR is involved in the regulation of pyr gene expression in Pseudomonas, its mechanism of action must be quite different from the previously characterized mechanisms. In preliminary studies by O'Donovan and colleagues, it was proposed that the P. putida PyrR acts as a DNA-binding protein to activate pyr gene expression, but detailed experiments have not been published. Future research on this system will be of great interest.
|
|
|---|
These conclusions, which were based on studies with B. subtilis (115), were confirmed and extended in experiments performed with L. lactis by Jørgensen et al. (74). These investigators used similar genetic methods to manipulate nucleotide pools, and they employed pyrG::lacLM fusion strains to measure pyrG expression and to determine nucleotide pool levels. Their studies demonstrated that pyrG expression was directly correlated with the intracellular concentration of CTP.
Comparison of the pyrG leader regions of several low-G+C gram-positive bacteria yielded valuable clues concerning the mechanism of attenuation control of pyrG expression (115). Only three short nucleotide sequences are conserved. These leader sequences specify GGGCUC at the 5' end of the pyrG transcript and two complementary sequences, GCUCCC and GGGACG, at the bottom of the stem of the terminator hairpin of the attenuator. The nucleotides between the conserved sequences apparently do not play a role in regulation because they can be deleted (114). Systematic mutagenesis of the pyrG leader region of B. subtilis indicated that the conserved sequences 5'-GGGC at the start of the transcript and 5'-GCUCCC in the upstream segment of the terminator hairpin are the only cis-acting elements required for normal pyrG regulation (114). The lack of a requirement for an antiterminator stem-loop led Meng and Switzer to suggest that a regulatory protein might bind to critical transcript sequences at low CTP levels and prevent formation of the terminator hairpin (114). However, a search for the gene encoding such a protein by transposon mutagenesis was unsuccessful. Furthermore, Jørgensen et al. demonstrated that the regulation of the expression of pyrG::lacLM fusions in L. lactis was essentially the same whether the fusions were present as a single chromosomal copy or carried on a multicopy plasmid (74). This observation indicated either that a regulator protein was not titrated by multiple copies of pyrG mRNA or that no such protein was involved in regulation.
10 nucleotides longer than the 5'-GGGCUC... transcripts was also detected. Copying of these extended transcripts by reverse transcription, followed by cloning and sequencing, demonstrated that these longer transcripts contained 5'-end extensions produced by the addition of a variable number of G residues (116). To identify the source and function of the longer transcripts, mutant pyrG promoters were constructed in which one of the first four residues in the ITR (i.e., GGGC) was replaced by another base. Analysis of transcripts initiated at the mutant promoters in cells grown under conditions of pyrimidine excess and limitation revealed that substitution of any residue in the GGG sequence eliminated the formation of poly(G) extensions. Furthermore, the elimination of poly(G) extensions caused very low and essentially unregulated pyrG (actually pyrG::lacZ) expression (116). Analysis of a mutant promoter in which the C at position +4 in the ITR (i.e., +4C) was changed to a T (specifying a U in the transcript) showed that pyrG expression and regulation were similar to those observed with the wild-type pyrG promoter. However, a +4C-to-A substitution in the pyrG promoter abolished regulation. Finally, a mutant pyrG promoter was constructed in which four G residues were inserted after position +G3 in the ITR, creating a promoter with an ITR with the sequence 5'-GGGGGGGCUC. A pyrG::lacZ fusion containing this mutant promoter exhibited constitutive expression (116).
These observations are accounted for by conditional reiterative transcription at the pyrG promoter, which provides the basis for the current model for CTP-sensitive regulation of pyrG expression in B. subtilis (Fig. 15). When the intracellular level of CTP is high, pyrG transcripts are faithful copies of the DNA template and transcription elongation continues until termination at the attenuator. Therefore, when CTP is plentiful, transcription of the pyrG gene is suppressed (Fig. 15). On the other hand, when the intracellular level of CTP is low, pyrG transcription pauses after the synthesis of the nascent transcript 5'-GGG (and before position +4C) because of insufficient substrate. This pause provides time for the nascent transcript to slip upstream (relative to the DNA template) and allow an extra G residue to be added to the nascent transcript. This process can be repeated multiple times (e.g., up to at least 10 times) until eventually a C residue is inserted. The transcript is then elongated normally until RNA polymerase transcribes the attenuator sequence that specifies the upstream segment of the terminator hairpin. The sequence of this segment in B. subtilis is 5'-GCUCCCUUUCAA, which includes a tract of nine pyrimidines. Because both C and U residues base pair with G residues, the run of pyrimidines will immediately base pair with the poly(G) tract at the 5' end of the transcript, forming an antiterminator stem-loop. As RNA polymerase continues to elongate the pyrG transcript, formation of the terminator hairpin is precluded by the antiterminator secondary structure and full-length pyrG transcripts are formed (Fig. 15). These transcripts are translated to make CTP synthetase, which is needed to overcome the CTP deficiency. Although the model describes pyrG expression at high and low intracellular concentrations of CTP, regulation can occur continuously over a wide range of CTP concentrations that control the extent of pausing at position +4.
![]() View larger version (23K): [in a new window] |
FIG. 15. Model for CTP-mediated regulation of pyrG expression in B. subtilis. The figure shows the effects of CTP concentration on the fate of the pyrG transcript after the first three G residues have been incorporated into the nascent transcript. A high CTP concentration allows normal transcript elongation until intrinsic termination occurs in the pyrG leader region. A low CTP concentration induces a transcription pause that allows reiterative transcription and the addition of extra G residues, which participate in the formation of an antiterminator hairpin. The extra G residues are boxed. The figure shows the insertion of six extra G residues, but as many as 10 extra residues can be added. (Modified from reference 116 with permission of the publisher. Copyright 2004 National Academy of Sciences, U.S.A.)
|
Reiterative transcription is a central element in the regulation of several pyrimidine biosynthetic and salvage operons in enteric bacteria, as described above, but the reiterative transcription reaction in these examples is fundamentally different from the reiterative transcription reaction that occurs at the pyrG promoter of B. subtilis. Specifically, in the case of the enteric promoters, all transcripts produced by reiterative transcription are aborted during the initiation phase of transcription, while transcripts that undergo reiterative transcription at the pyrG promoter switch to the normal mode of elongation, which allows transcription through the pyrG gene. A major part of this difference clearly involves the substrate for reiterative transcription, as discussed above. Reiterative transcription with UTP produces aborted transcripts, while reiterative transcription with non-UTP substrates produces transcripts that can be productively extended. The ability to make this distinction appears to be an intrinsic property of all RNA polymerases, although not much else is known about it.
The conserved sequences required for regulation of pyrG expression in B. subtilis—namely, the GGGC sequence that starts the ITR and the leader sequence specifying the polypyrimidine tract in the leader transcript—are found in the pyrG leader regions of at least 17 low-G+C gram-positive bacteria, including the genera Bacillus, Listeria, Lactococcus, Enterococcus, Staphylococcus, and Streptococcus (116). While reiterative transcription that produces runs of G residues has been demonstrated only for the pyrG operon of B. subtilis and CTP-sensitive regulation of pyrG expression has been shown only for the pyrG operons of B. subtilis and L. lactis, it seems highly likely that many species of the genera listed above will share these capabilities. On the other hand, the pyrG operons of enteric bacteria, which also encode CTP synthetase, possess leader regions that do not contain these conserved elements and are regulated by other mechanisms.
To determine the minimum number of G residues at the 5' end of the pyrG transcript that is required to prevent formation of the terminator hairpin, a set of mutant pyrG promoters was constructed with the number of G residues in the ITR varied systematically (34). The effects of these mutations on pyrG expression and CTP-sensitive regulation were measured, with decreases in the range of regulation used to indicate the inability of the terminator hairpin to form. The range of regulation with a promoter containing four G residues was slightly less than half of that with the wild-type promoter (i.e., 5.6-fold instead of 14-fold). Regulation with a promoter containing five or more G residues was essentially absent, and pyrG expression was constitutive. These phenotypes indicate that five G residues are sufficient to form the antitermination stem—with the exclusion of the terminator hairpin—in essentially every pyrG transcript. The mutant pyrG promoters were also used to determine the number of G residues in the ITR that are needed to support maximum reiterative transcription in vivo. The results showed that a minimum of three G residues (as in the wild-type promoter) is required for reiterative transcription and that promoters with three or four G residues exhibit comparable levels of reiterative transcription. Compared to these levels, reiterative transcription is slightly reduced at a promoter with five G residues. In contrast, reiterative transcription is severely reduced or eliminated at promoters with six or more G residues, even in cells starved for pyrimidines. Apparently, an rG6·dC6 RNA-DNA hybrid is too stable to permit transcript slippage, even with extensive transcription pausing. Taken together, the results of this analysis reveal that the wild-type pyrG promoter, with its G3 tract in the ITR, permits the widest range of regulation by the reiterative transcription mechanism.
It is interesting to note that recent studies with yeast mitochondrial RNA polymerase suggest that progressively lower concentrations of NTP substrates are required for active-site binding as the nascent transcript is extended from position +3 through +11 during transcription initiation (2). The end of this gradient at position +11 apparently reflects the transition to the elongation phase of transcription, when even lower concentrations of NTP substrates are needed. If these findings can be generalized to the B. subtilis RNA polymerase—which seems likely—they suggest that the sequence of the pyrG ITR, which directs CTP to position +4 of the transcript, was selected to maximize the sensitivity of reiterative transcription to the intracellular concentration of CTP. If the site for reiterative transcription in the ITR were followed by a C residue located downstream of position +4, the affinity of RNA polymerase for CTP might be too great to permit pausing even at low levels of CTP in the cell, and reiterative transcription producing poly(G) tracts would be prevented.
|
|
|---|
Although these models are unlikely to be modified significantly in the future, many questions remain regarding fundamental processes that underlie the regulatory mechanisms. For example, some of the mechanisms involve transcription start site switching, which to a first approximation can be predicted from promoter sequences. However, the rules for start site selection do not take into account context effects that can significantly influence the process, as illustrated with mutant upp promoters of E. coli. Clearly, more work is required to understand these context effects. In addition, some regulatory mechanisms involve reiterative transcription, but this reaction can have quite different outcomes. Reiterative transcription at the pyrBI promoter of E. coli produces AAUUUUn transcripts that are always released from the initiation complex, whereas reiterative transcription at the pyrG promoter of B. subtilis produces GGGGn transcripts that can return to the normal mode of transcription elongation. The basis of these alternative outcomes is presently a mystery. Furthermore, the extent of reiterative transcription at a promoter can be influenced by the location of the transcription start site, as observed with a mutant carAB promoter. Apparently, there are undefined architectural features of the transcription initiation complex that modulate the reiterative transcription reaction. Finally, the structure-function relationships of the PyrR-RNA complex remain undefined. In future studies, it will be important to describe amino acid-RNA interactions, conformational changes that occur in both PyrR and its RNA substrate upon binding, and the structural basis for the uridine nucleotide-mediated increase and guanosine nucleotide-mediated decrease in the affinity of PyrR for RNA. Analysis of high-resolution structures of PyrR with RNA and nucleotides bound would likely accomplish most of these goals.
In this review, we have emphasized that the mechanisms and concepts elucidated by the study of pyr gene regulation can serve as useful guides in the analysis of unknown regulatory mechanisms. One interesting example is the study of pyrimidine (CTP)-mediated regulation of pyrG expression in E. coli. It appears that the regulatory mechanism, which is clearly different from the mechanism controlling pyrG expression in B. subtilis, requires CTP-sensitive start site switching in much the same way as described for the pyrC and pyrD regulatory mechanisms (T. Bedekovics and C. L. Turnbough, Jr., unpublished data). However, the mechanism by which pyrG transcripts initiated at neighboring start sites are differentially expressed remains to be determined. Other intriguing examples are the unresolved and apparently novel mechanisms of regulation by PyrR. Included in this list are the mechanism of PyrR2-mediated regulation of pyr gene expression in response to inorganic carbon in L. plantarum and PyrR-mediated regulation of pyr gene expression in the absence of recognizable PyrR binding sequences in Pseudomonas species.
Importantly, the information garnered from the study of pyr gene regulation can also facilitate the analysis of regulatory mechanisms that are unrelated to pyrimidine biosynthesis. In fact, this information has already contributed to the discovery of such mechanisms. Two noteworthy examples are the regulation of expression in E. coli of the rRNA operons (42, 144) and the fis operon (165) by the concentration of the initiating NTP. In these cases, promoters contain sequences that restrict initiation to a single "unfavorable" position (e.g., A9 in the case of most rRNA operons), as defined by the rules established with mutant pyrC promoters. As a consequence, the efficiency of transcription initiation at the rRNA and fis promoters can be determined by the intracellular concentration of the initiating NTP (i.e., ATP or GTP for the rRNA promoters and CTP for the fis promoter). Furthermore, the concentrations of the initiating NTPs vary under different growth conditions in a manner that causes appropriate levels of rRNA and fis operon expression, providing simple yet elegant regulation of gene expression.
Many other bacterial operons that are unrelated to pyrimidine biosynthesis contain elements that are critical components in the mechanisms of pyr gene regulation described in this review, so it seems likely that at least some of these operons will employ these features in comparable ways. However, the number of combinations and permutations of these elements is large, and thus there may be a high degree of flexibility in the assembly of regulatory mechanisms. In addition, it seems highly likely that hybrid mechanisms have evolved that combine features described above with a different set of regulatory elements. For example, many of the pyr gene regulatory mechanisms rely on nucleotide-sensitive reiterative transcription, start site switching, or transcription pausing as the key regulatory event in conditional gene expression. It is certainly conceivable, especially with the nuances described in this review, that factors other than the intracellular levels of nucleotides could also modulate these phases of transcription. Activating or inactivating these other factors by a cellular metabolite or condition would then make gene expression responsive to a nonnucleotide signal. On the other hand, the analysis of bacterial genome sequences reveals many pyr genes that do not appear to be regulated by any of the mechanisms discussed in this review. It is not known whether expression of these genes is altered in response to intracellular nucleotide pools, but this seems to be a likely possibility. The identification and examination of such regulated pyr genes offers rich possibilities for the discovery of novel modes of metabolic control.
Bioinformatic analysis of genomic DNA sequences was a useful tool in the development of models for many of the regulatory mechanisms discussed in this review, particularly in the identification of transcription terminators, antiterminators, and transcription pause sites that play roles in these mechanisms. However, a major limitation to this approach was illustrated by the mechanism of pyrG regulation in B. subtilis. In this mechanism, the antiterminator RNA hairpin contains an essential tract of nucleotides that was added by reiterative transcription and could not have been predicted from the sequence of the pyrG operon. In general, regulatory mechanisms involving reiterative transcription rely on subtle features in the DNA sequence of the promoter-leader region. These features are not readily identified by current bioinformatic analyses.
It should also be noted that many of the regulatory features described in this review involve only the basic transcriptional machinery of the cell and simple DNA sequences. Therefore, it is reasonable to suspect that some of the mechanisms and regulatory elements described here are also operative in eukaryotic cells. In fact, it would be surprising if they were not. The obstacle in their discovery is likely to be the education of more investigators about the surprises provided by the study of pyrimidine nucleotide biosynthesis in bacteria.
Finally, this review is yet another example of the importance of collaboration with other scientists. The combination of independent, intelligent minds that bring divergent experiences and ways of analyzing problems, when brought to bear on an unsolved scientific puzzle, is almost always more likely find a solution that survives the most thorough experimental testing. We also emphasize our shared pleasure in discovery. The more unexpected the answer, the greater the joy in finding it.
Research in the authors' laboratories was supported by NIH grants GM47112 to R.L.S. and GM29466 to C.L.T.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»