Microbiology and Molecular Biology Reviews, December 2003, p. 475-490, Vol. 67, No. 4
1092-2172/03/$08.00+0 DOI: 10.1128/MMBR.67.4.475-490.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
and Juke S. Lolkema*
Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
SUMMARY INTRODUCTION STRATEGY PROTEIN FAMILIES HPr Kinase HPr-Like Proteins DISTRIBUTION AND GENOME ANALYSIS Firmicutes Proteobacteria Other Phyla SEQUENCE ANALYSES Full-Length and Short-Version HPr Kinases Phylogenetic Relationships of HPr-Like Proteins Sequence Motifs in HPr-Like Proteins CONCLUSIONS PROSPECTS ACKNOWLEDGMENTS REFERENCES
In the α and β subdivisions of the Proteobacteria, the presence of HPr kinase appears to be common, while in the γ subdivision it is more of an exception. The genes coding for the HPr kinase homologues of the Proteobacteria are in a gene cluster together with an HPr-like protein, termed XPr, suggesting a functional relationship. Moreover, the XPr proteins contain the serine phosphorylation sequence motif. Remarkably, the analysis suggests a possible relation between CcpA-dependent gene regulation and the nitrogen regulation system (Ntr) found in the γ subdivision of the Proteobacteria. The relation is suggested by the clustering of CCR and Ntr components on the genome of members of the Proteobacteria and by the close phylogenetic relationship between XPr and NPr, the HPr-like protein in the Ntr system. In bacteria in the phylum Proteobacteria that contain HPr kinase and XPr, the latter may be at the center of a complex regulatory network involving both CCR and the Ntr system.
P) is involved in sugar transport and PRD-mediated regulation, and HPr(Ser-P) is involved in CCR. The primary sensor in the regulatory pathway is HPrK, which is activated by glycolytic intermediates. HPr(Ser-P) binds to the transcriptional regulator CcpA, thereby inducing the binding of the complex to so-called cre sites (for "catabolite responsive element") in the promoter region of the target genes, which prevents transcription of the genes (13).
FIG. 1. Schematic representation of CRP-dependent (A) and CcpA-dependent (B) carbon catabolite repression pathways and the Ntr regulatory pathway (C). (A and B) Shown on the left-hand side is PTS-mediated glucose uptake in the model organisms E. coli and B. subtilis, respectively. The phospho-carrier protein HPr is phosphorylated at the catalytic histidine residue by enzyme EI at the expense of phosphoenolpyruvate (PEP). The phosphoryl group is then transferred to EIIA, which is a cytoplasmic protein in E. coli (A) or part of the multidomain complex EIIABC in B. subtilis (B). From EIIA, the phosphoryl group is transferred to EIIB, a soluble domain attached to the integral membrane transporter domain EIIC. The glucose molecule is transported into the cell and at the same time phosphorylated by EIIB, yielding glucose-6-phosphate in the cell. Shown on the right-hand side is transcriptional regulator-mediated CCR. The transcriptional regulators CRP (A) and CcpA (B) are indicated by a dark background, and the PTS components involved, EIIA and HPr in panels A and B, respectively, are indicated by a white background. In E. coli (A), the degree of phosphorylation of EIIA determines the activity of adenylate cyclase (AC) and, consequently, the concentration of cAMP in the cell. Binding of cAMP to CRP results in a complex that stimulates transcription of target genes (positive regulation). In B. subtilis (B), fructose-1,6-bisphosphate produced from glucose-6-phosphate in glycolysis activates HPrK, which phosphorylates HPr at the regulatory-site serine at the expense of ATP or PPi. Binding of HPr-Ser-P to CcpA results in a complex that inhibits the transcription of target genes (negative regulation). The HPr molecule in HPr-mediated signal transduction is the same as the HPr involved in glucose uptake. In Crh-mediated signal transduction, the HPr molecule is not part of the uptake system and is termed Crh. (C) Phosphoryl group transfer chain in the nitrogen regulation pathway Ntr. The HPr-like protein NPr is phosphorylated by enzyme INtr at the expense of phosphoenolpyruvate, after which the phosphoryl group is transferred to EIIANtr. The pathway is thought to operate independently of the PTS sugar uptake pathway.
A difference between CRP-dependent CCR found in gram-negative bacteria and CcpA-dependent CCR found in gram-positive bacteria is the strictness of the coupling between the PTS transport and regulatory functions. In the CRP-dependent mechanism (Fig. 1A), regulation of expression is directly coupled to turnover of IIAGlc. The level of phosphorylation of IIAGlc is determined by the uptake rate of glucose from the medium and by the uptake of other PTS sugars which compete for P-HPr and, thereby, limit the rate of phosphorylation of IIAGlc. In the CcpA-dependent mechanism (Fig. 1B), the primary sensor is HPrK that is activated by glycolytic intermediates (15, 17, 28) but, in principle, may be activated by other metabolites as well, making the mechanism more versatile. Our studies of the regulation of expression of the Mg2+ citrate transporter of B. subtilis demonstrate that, in addition to glucose, the pathway is potentiated by the non-PTS sugar inositol and by the nonsugars succinate and glutamate (44). Moreover, the signal transduction pathway in the HPr molecule is physically separated from the phosphoryl group transfer chain of the PTS transport function, i.e., via the regulatory-site Ser residue and active-site His residue, respectively. The regulatory state is modulated rather than being determined by the uptake system by virtue of the phosphorylation state of the active-site histidine (3, 27). Crh-mediated regulation may be a manifestation of this loose coupling between regulatory and transport function of the PTS; a single mutation in HPr (His to Gln) results in a regulatory pathway that is independent of the uptake system. The mechanism found in gram-positive bacteria, involving HPr and HPrK, may be a more general gene regulation system. 
Gram-negative bacteria may have compensated for their specialized but inflexible mechanism by developing the Ntr (nitrogen regulation) system, a PTS-based regulatory mechanism composed of a complete phosphotransfer chain that operates independently of the uptake system and is not found in gram-positive bacteria (Fig. 1C) (39). The Ntr system is involved in the regulation of nitrogen metabolism, but its influence may be much broader (23), including a role in virulence (33, 38).
In the present study, we investigated the distribution of CcpA-dependent CCR in the bacterial kingdom by searching the available sequence databases for HPrK homologues and HPr-like proteins. We also investigated the evolutionary origin of the pathway by analyzing the relationships between the proteins and searching the genome databases for evolutionary links between regulatory systems. The analysis shows that homologues of HPrK are found in many gram-negative bacteria; more importantly, the results suggest an evolutionary link between CcpA-dependent CCR and the Ntr type of regulation found in gram-negative bacteria.
, and
subdivisions of the Proteobacteria and in the phyla Fusobacteria, Spirochaetes, and Chlorobi (see also references 8, 10, and 14) (Table 1). The different groups of bacteria are indicated in the phylogenetic tree of a subset of 24 typical sequences (Fig. 2). A small number of hits from the BLAST search represent proteins that are considerably smaller than the consensus length of HPrK/P (14). They contain between 141 and 159 residues, and, remarkably, all are found in bacteria belonging to the α subdivision of the Proteobacteria. Multiple sequence alignments of the full-length and short versions show that the latter corresponds to an internal fragment of the former, which corresponds to part of the catalytic domain. This is discussed in more detail later in this review.
TABLE 1. The HPrK/P family
FIG. 2. Phylogenetic tree of HPrK/P. A subset of 24 typical HPrK/P sequences (the bold sequences in Table 1) was aligned using the Clustal X program (40), and then the tree was constructed using the DrawTree program in the Phylip package (6). The typical sequences represent groups of sequences with pairwise sequence identities of at least 60%. Organisms are indicated by a shorthand following the protein that consists of the first letter of the genus followed by the first three letters of the species. The short-version HPrKs that form a separate branch in the tree were left out of the analysis.
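The organism shorthand used in the tree labels (the first letter of the genus followed by the first three letters of the species) can be illustrated with a short Python sketch. It only restates the naming rule from the legend; the function name `organism_shorthand` and the capitalization of the result are assumptions made for illustration and are not part of the original analysis.

```python
def organism_shorthand(binomial: str) -> str:
    """Build a tree label from a binomial name.

    Rule from the figure legend: first letter of the genus followed by
    the first three letters of the species. Capitalizing the genus letter
    and lowercasing the species part is an assumption.
    """
    genus, species = binomial.split()[:2]
    return genus[0].upper() + species[:3].lower()


if __name__ == "__main__":
    for name in ("Bacillus subtilis", "Escherichia coli", "Xylella fastidiosa"):
        print(name, "->", organism_shorthand(name))
```

Under these assumptions, two organisms whose names share a genus initial and the first three species letters would receive the same label, so a real labeling scheme would need a rule for breaking such ties.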
TABLE 2. The family of HPr-like proteins
FIG. 3. Phylogenetic tree of the HPr-like proteins. The tree containing a subset of 38 typical HPr-like sequences was constructed as described in the legend to Fig. 2. A typical sequence (the bold sequences in Table 2) represents a group of sequences with pairwise sequence identities of at least 55%. XPrα, NPr, XPrβ,γ,δ, SPr, and Crh represent HPr-like proteins that are explained in the text. Organisms are indicated by a shorthand following the protein that consists of the first letter of the genus followed by the first three letters of the species.
The eight HPr-like proteins in the complete set that do not contain the active-site histidine and are, therefore, potential Crh molecules are all found in the Firmicutes (Table 2). Five of these proteins are found in bacteria (B. subtilis, Bacillus halodurans, Bacillus anthracis, Oceanobacillus iheyensis, and Thermoanaerobacter tengcongensis) that, in addition, contain an HPr-like protein with the active-site histidine. Clostridium thermocellum contains three HPr-like molecules, one with and two without the active-site histidine. Finally, Ureaplasma urealyticum is the only bacterium that contains a single HPr-like molecule without the active-site histidine. In B. subtilis Crh, the active-site histidine is replaced by a glutamine while the adjacent residues are still very similar to those in HPr sequences of similar organisms (see also Table 5, region A). A multiple sequence alignment of the HPr-like molecules revealed the same sequence motif in the three HPr-like proteins of the other bacilli, which therefore contain Crh in addition to HPr. An investigation of the unfinished genome sequence data of Geobacillus stearothermophilus also revealed a second HPr-like molecule in addition to HPr, with the same characteristics as B. subtilis Crh (www.genome.ou.edu/bstearo.html) (not in Table 2). T. tengcongensis, which belongs to the clostridia, contains two HPr-like molecules, one with the active-site histidine (FRUB) and one in which the histidine is deleted (TTE0115). Apart from the different active-site substitution, the latter molecule differs from the bacillus Crhs in that the conserved sequence motif around the site is not retained (see Table 5). Nevertheless, the missing histidine suggests a function other than in PTS-mediated transport, possibly in Crh-mediated signal transduction. C. thermocellum is the only bacterium in the phylum Firmicutes with three HPr-like molecules, one containing the active-site histidine (CHTEP203) and two without (CHTEP180 and CHTEP182).
CHTEP180 is closely related to TTE0115 of T. tengcongensis and also shows a deletion at the position of the active-site histidine. These two sequences are the most distant from the other members of the family (Fig. 3). In the second sequence of C. thermocellum without the active-site histidine (CHTEP182), His is replaced by Glu and, also, the surrounding region is not conserved as in the bacillus Crhs. Remarkably, the two HPr-like proteins of T. tengcongensis and C. thermocellum, FRUB and CHTEP203, respectively, that contain the active-site histidine residues are the closest relatives of the Crh proteins of the Bacillus species (Fig. 3).
TABLE 5. Consensus sequence around the active-site histidine (region A) and the regulatory-site serine (region B) in the HPr-like proteins
The genes coding for HPr of the bacilli are organized in an operon together with the gene coding for PTS enzyme I, and the two usually are transcribed polycistronically as a single mRNA. In B. subtilis and B. anthracis, the pair of genes is preceded by the gene coding for the sugar-specific PTS enzyme IIABC (Tables 2 and 3). The gene coding for Crh is clustered on the chromosome of the four Bacillus species with three genes (yvcJ, yvcK, and yvcL in B. subtilis) that code for proteins of unknown function. We will name the genes j, k, and l, respectively. Except for the Mollicutes, the three genes are also present as a cluster in the Firmicutes that do not contain a Crh protein. In that case, the cluster is not associated with any of the PTS proteins. The close phylogenetic relationship mentioned above between the HPr proteins of T. tengcongensis and C. thermocellum, FRUB and CHTEP203, respectively, and the bacillus Crh proteins is also supported by the location of the genes on the genome. Both are located close to the j, k, and l genes. Remarkably, CHTEP182 of C. thermocellum, which does not contain the active-site histidine, is found in an operon together with PTS enzyme I, which is responsible for phosphorylation of the histidine residue.
TABLE 3. Typical gene clusters containing HPr-like proteins in the phyla Firmicutes and Proteobacteria
γ subdivision, which contains the typical gram-negative bacteria such as Escherichia coli (Table 2). A number of these, Klebsiella pneumoniae, E. coli, Yersinia pestis, Vibrio cholerae, and Salmonella enterica, possess more than one HPr-like molecule but no HPrK/P. These organisms contain, in addition to HPr, a homologue termed NPr, which is a component of the Ntr regulatory system involved in the regulation of nitrogen metabolism, among others (26). The NPrs form a separate cluster in the phylogenetic analysis of the HPr-like molecules (the ptsO genes [Fig. 3]). HPr and NPr molecules are distinguished by the clustering of the coding genes with other genes on the chromosome (Tables 2 and 3). The gene coding for HPr (ptsH) is organized together with genes coding for the PTS enzymes EI and glucose-specific IIAGlc in the pts operon in the order H-I-IIA. The three proteins are the PTS components involved in CRP-dependent carbon catabolite repression. The gene coding for NPr (ptsO) is located elsewhere on the chromosome in the Ntr cluster, together with four other genes that are involved in the Ntr regulatory pathway (N-h-IIA-j-H). The cluster starts with an RNA polymerase σ54 factor (N) followed by a putative σ54 modulation protein (h), a PTS IIA homologue termed IIANtr, a P-loop-containing protein (j), and, finally, the gene coding for NPr. The P-loop-containing protein j is homologous to the j protein in the j-k-l-Crh cluster found in the Firmicutes (Table 3). The IIANtr protein and NPr, together with a third protein termed INtr, form an independent phosphoryl group transfer chain that uses phosphoenolpyruvate as the donor (Fig. 1C). INtr is a multidomain protein consisting of a homologue of the PTS EI and a domain also found in NifA activator proteins (30).
The whole-genome sequences of Haemophilus influenzae, Pasteurella multocida, Buchnera sp., and Buchnera aphidicola in the γ subdivision contain a single HPr-like molecule embedded in an HPr-like gene cluster (Table 2), suggesting that these organisms do not use NPr-mediated regulation. For sequence similarity reasons, the same is likely to be true for Serratia marcescens and Haemophilus somnus. In contrast, the genome of Pseudomonas aeruginosa contains a single HPr-like molecule embedded in an NPr-like gene cluster that, therefore, is likely to be an NPr species. The strictly aerobic Pseudomonas and also the Azotobacter species were thought not to use the PTS for uptake of sugars, except for fructose (32). Analysis of the P. aeruginosa genome revealed two complete sugar-specific PTSs, one for fructose and another for N-acetylglucosamine, in which HPr moieties are present as components of multidomain proteins (29). These HPr domains are not likely to play a role in sugar uptake as general PTS components. The Pseudomonas and Azotobacter genera (and the same may be true for the Shewanella, Proteus, and Microbulbifer genera [Table 2]) use an HPr-like protein only for regulatory purposes.
E. coli and S. enterica (and also S. enterica serovar Typhimurium) both have, in addition to HPr and NPr, a third HPr-like molecule, Z4879 and STY4004, respectively. In E. coli, the protein is present only in the enterohemorrhagic strains O157:H7 and O157:H7 EDL933 and not in the K-12 strain. The proteins in the Escherichia and Salmonella strains are closely related (Fig. 3) and are located in a similar gene cluster on the genome (Table 2). Upstream of the gene coding for the HPr-like protein, a gene annotated as a sugar kinase (sk) and a complete set of PTS enzyme II proteins (IIA-IIB-IIC) are located. The HPr-like protein in the cluster may represent a sugar-specific HPr (SPr, for "sugar-specific HPr" [Table 3]).
Three bacteria in the γ subdivision contain an HPrK homologue: Xanthomonas campestris, Xanthomonas axonopodis, and Xylella fastidiosa (Table 2). They also contain a single HPr-like protein; remarkably, both are in the same gene cluster that consists of seven or eight genes (N-h-IIA-K-j-IIA-H-I). We will term this cluster and the single HPr-like protein the X-cluster and XPr, respectively (for "Xanthomonas cluster" and "Xanthomonas HPr"). The first genes in the X-cluster form an Ntr gene cluster from which the gene coding for NPr is missing and in which the gene coding for HPrK is inserted upstream of the j gene (Table 3). The IIA protein encoded in this part of the cluster is of the IIANtr type and is not found in the X. fastidiosa cluster (Table 2). A second IIA gene and the genes coding for XPr and PTS enzyme I follow the Ntr-like cluster. The second IIA protein is of the IIAMan type, a homologue of the IIA domain of the mannose-specific IIABMan that is part of the mannose uptake system in E. coli (34). The two Xanthomonas species and X. fastidiosa lack both NPr and INtr, so it is not clear how the IIANtr protein encoded in the X-cluster is phosphorylated. An exact copy of the X-cluster found in X. fastidiosa, without IIANtr, is also found in Geobacter metallireducens in the δ subdivision of the Proteobacteria.
With the exceptions of Mesorhizobium loti and Magnetospirillum magnetotacticum in the α subdivision, the bacteria in the α and β subdivisions of the Proteobacteria contain a single HPr-like molecule (Table 2). The complete genome sequences and the data from unfinished genomes suggest that the presence of HPrK in both subdivisions is common. The HPrKs of the α subdivision are all of the short-version type (see above) (14), and the HPr-like proteins are in the same gene cluster that, in addition, contains at least a IIA molecule of the IIAMan type. The cluster resembles the last part of the X-cluster found in the γ and δ subdivisions, but the genes coding for protein j and enzyme I are not always present (Table 3). The Rhodospirillum rubrum and M. magnetotacticum clusters are an exact match of this part of the X-cluster (Table 2). The genes in the remaining part of the X-cluster, N-h-IIANtr, are also found clustered on the genome of the bacteria in the α subdivision but are located distantly from the XPr/HPrK part of the cluster (Table 3). All bacteria in the α subdivision contain the gene coding for INtr, while only M. loti, M. magnetotacticum, and R. rubrum contain the classical enzyme I. The presence of enzyme I correlates with the presence of a second HPr-like protein in M. loti and M. magnetotacticum.
The situation observed in the β subdivision is similar, but the X-cluster is broken up in different parts (Table 3). The single gene coding for the HPr-like molecule (XPr) clusters with the genes coding for enzyme I and IIAMan in the same order as in the X-cluster of the γ and δ subdivisions. The gene coding for HPrK clusters elsewhere on the genome, together with IIANtr and protein j in the order IIA-K-j. In the complete genome sequences of Ralstonia solanacearum and Neisseria meningitidis, the three genes are preceded only in the latter organism by the N and h genes (Tables 2 and 3). A difference with the α subdivision is that in the β subdivision, all bacteria contain the PTS enzyme I but not INtr.
In summary, HPrK is found in all four subdivisions of the Proteobacteria and seems to cluster on the genomes together with the genes coding for components involved in the Ntr type of gene regulation. The X-cluster in the Xanthomonas species in the γ subdivision represents the most complete cluster, while fission of the cluster is observed in the α and β subdivisions.
α subdivision are organized, together with the HPr-like protein XPr, in a similar gene cluster (the X-cluster) to that of the full-length versions in the β, γ, and δ subdivisions of the Proteobacteria, strongly suggesting that they serve the same function, putatively as HPr kinases/phosphatases. HPrK is known to consist of two domains. The three-dimensional structure of HPrK of Staphylococcus xylosus resolved at 1.95 Å shows a hexameric arrangement (a dimer of trimers) in which the N-terminal and C-terminal domains are well separated, with no apparent intramolecular contacts (20). While the function of the N-terminal domain is not clear, the C-terminal domain is the catalytic domain. Deletion of the N-terminal 127 residues of Lactobacillus casei HPrK, more or less corresponding to the N-terminal domain, yielded an active entity whose three-dimensional structure was resolved separately at 2.8 Å (8). The structures of the C-terminal catalytic domains (Fig. 4A) from the two organisms closely matched each other. Multiple sequence alignment of the full-length and short-version HPrK homologues suggests that the latter corresponds to the catalytic domain; the beginning of the short versions correlates more or less with the beginning of the C-terminal domain of the full-length HPrKs. However, the length of the short versions and the C-terminal domains of the full-length proteins differ by roughly 50 residues; they are about 150 and 200 residues, respectively. The C-terminal domain of the full-length HPrK contains four conserved sequence motifs (A, B, C, and D in Fig. 4B), the first of which contains the so-called Walker A motif typical for the binding of the phosphate groups of ATP (43). Motifs A, B, and C are also found in the short-version homologues, but motif D is missing (Fig. 4B and C). In fact, the sequence similarity of the two versions covers only approximately the first 100 residues of the short version, the part that contains sequence motifs A, B, and C. The C-terminal 50 residues of the short versions do not seem to be related to the corresponding area in the full-length proteins.
It follows that the short versions correspond to the top part in the structure depicted in Fig. 4A, up to strand βJ. The C-terminal part of the full-length proteins, consisting of βJ, βK, α3, and α4, would be missing. In the crystal structures (8, 20), the loop between βK and α3 in conserved domain D (the K3 loop) and the two α-helices α3 and α4 are involved in intimate contacts within two pairs of trimers that form the overall hexameric structure of the complex. The short versions may not form a multimeric structure and may instead exist as monomers.
FIG. 4. Sequence signatures in full-length and short-version HPrK homologues. (A) Structural model of the C-terminal domain of the full-length HPrK/P. Ovals indicate conserved regions A, B, C, and D. (B) Pairwise sequence identity in the multiple sequence alignments of the short-version (top) and full-length (bottom) HPr kinases. The "cluster" function gives the fraction of identities at each position in the alignment in an all-against-all comparison. The scores were averaged over a sliding window of nine positions. The plot of the short-version HPrKs was "aligned" with the plot of the full-length versions based on the multiple sequence alignment. The set of full-length sequences used in the alignment is described in the legend to Fig. 2, and the set of short versions is taken from Table 2 (Proteobacteria, α subdivision). The bars in the top half of the plots indicate positions where a gap occurs in any of the sequences in the alignment. The four conserved regions in the full-length HPrK homologues are indicated as A to D. (C) Sequence motifs corresponding to regions A to D in panels A and B. The top and bottom sequences in the boxes correspond to the full-length and short-version HPr kinases, respectively. Conserved motif D is not present in the short-version HPr kinases. (Panel A reprinted from reference 8 with permission from the publisher.)
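The "cluster" score described in the legend (fraction of identities per alignment position in an all-against-all comparison, smoothed over a window of nine positions) can be sketched in Python as follows. This is a minimal illustration under stated assumptions and not the program used for the figure: the function names `column_identity` and `sliding_mean` are hypothetical, a pair of gap characters is counted as a non-identity, and the window is simply truncated at the ends of the alignment.

```python
from itertools import combinations


def column_identity(alignment):
    """Fraction of identical residue pairs per column (all-against-all).

    `alignment` is a list of equal-length aligned sequences (at least two).
    Assumption: a pair involving a gap ('-') never counts as an identity.
    """
    pairs = list(combinations(range(len(alignment)), 2))
    scores = []
    for col in range(len(alignment[0])):
        same = sum(
            1
            for i, j in pairs
            if alignment[i][col] == alignment[j][col] and alignment[i][col] != "-"
        )
        scores.append(same / len(pairs))
    return scores


def sliding_mean(values, window=9):
    """Average each position over a centered window of `window` positions.

    Assumption: near the ends, only the positions that exist are averaged.
    """
    half = window // 2
    smoothed = []
    for k in range(len(values)):
        chunk = values[max(0, k - half): k + half + 1]
        smoothed.append(sum(chunk) / len(chunk))
    return smoothed


if __name__ == "__main__":
    toy = ["GSGKSTLA", "GSGKSELA", "GAGKS-LA"]  # toy alignment, not real data
    print(sliding_mean(column_identity(toy), window=3))
```

Plotting the smoothed scores for the short-version and full-length alignments on a shared alignment axis would give a profile of the kind described for panel B.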
α4 on the neighboring monomer. This interaction is absent in the short versions. The analysis supports the conclusion of the genetic analysis, i.e., that the short-version HPrK homologues actually may function as HPrK/phosphatases.
TABLE 4. Pairwise sequence identity of the HPr and Crh sequences of Bacillus species
subdivision of the Proteobacteria (Fig. 3). Consistent with the phylogenetic distance between Crh and HPr of the bacillus species, PTSH of U. urealyticum is quite distant from HPr of the related mollicutes and TTE0115 and CHTEP180 are distant from the clostridium branch. The conservation that is observed in the region around the mutated active-site histidine residue in the bacillus Crh proteins is completely absent in the four putative Crh proteins (see Table 5). Apart from the mutated active-site histidine, the latter do not seem to have much in common with the former. The gram-negative bacteria of the α, β, and δ subdivisions of the Proteobacteria are likely to have CcpA-dependent CCR since they possess HPrK. The HPr-like proteins of the α
subdivision (XPrα) all cluster in one branch of the tree (Fig. 3). The HPr-like proteins of the β and δ subdivisions are on the same branch as the XPr proteins of the bacteria in the γ subdivision of the Proteobacteria that possess HPrK (Xanthomonas and Xylella species; XPrβ,γ,δ). The branch is distant from HPr of the gram-negative bacteria in the γ subdivision that do not contain HPrK (e.g., Escherichia and Klebsiella species). Both the XPrα and XPrβ,γ,δ proteins are distant from the gram-positive HPr proteins. Importantly, the XPrβ,γ,δ proteins and, especially, the XPrα proteins are loosely associated with the NPr proteins from the γ subdivision. Moreover, the XPr proteins of the β, γ, and δ subdivisions have >55% sequence identity to the HPr-like proteins of the Pseudomonas species in the γ subdivision that are likely to function in Ntr-type regulation (see above).
α-helices (a, b, and c [Fig. 5B]) on top of a four-stranded β-sheet (strands β1 to β4 [Fig. 5B]). In a linear representation, the order of the secondary-structure elements would be β1aβ2β3bβ4c. Conserved region A contains the active-site histidine, which is positioned just in front of α-helix a. The region comprises the interface of the loop preceding helix a and helix a. The region is involved in the interaction of HPr and the PTS enzymes EI and IIA. Region B contains the regulatory-site serine, which is positioned at the interface of the loop between β3 and α-helix b. The region covers most of helix b, which is only two turns long. Region C comprises the loop between β4 and α-helix c plus the first turn at the N-terminal end of helix c. Region C is quite distant from the other two conserved regions on the surface of the protein. The HPr-like molecules from the Actinobacteria members Corynebacterium glutamicum and Streptomyces coelicolor, as well as from Chloroflexus aurantiacus, contain an insertion of 4 residues in the loop connecting strands β2 and β3, where it is not likely to disturb the folding of the protein (not shown).
FIG. 5. Conserved regions in HPr-like molecules. (A) Clustering of pairwise sequence identity. The plot is explained in the legend to Fig. 4. The "spike" at position 20 is an artifact caused by the presence of two sequences with an N-terminal extension. The bars in the top part of the plot indicate positions where a gap was found in any of the sequences. (B) Structural model of HPr-like proteins. Conserved regions A, B, and C are indicated. (C) Sequence analysis of conserved region C. The bars indicate the frequency of the negatively charged residues D and E at the indicated positions. The dotted line indicates the average frequency in the whole set. At positions with low frequencies, the dominant residue(s) is indicated. Position numbering is according to Crh of B. subtilis. (Panel B reprinted from reference 22 with permission from the publisher.)
The regulatory-site serine in conserved region B is the best-conserved residue in the whole family. Only three HPr-like molecules in the family do not contain a serine at this position: STY4004 of Salmonella enterica, CHLOP311 of Chloroflexus aurantiacus, and PTSH of Bifidobacterium longum (Table 2). None of these three organisms contains an HPrK homologue. The missing regulatory-site serine in the S. enterica protein is most probably an alignment artifact since the sequence SILG, which is similar to the consensus sequence (see below), is located 4 residues downstream. In contrast, the serine residue in B. longum is replaced by arginine while the surrounding residues are still conserved. In the C. aurantiacus protein, the serine residue is deleted and the adjacent residues are not conserved. The high conservation of the serine residue throughout the whole set makes it difficult to accept that the serine, or at least the conserved region, has no function in those bacteria that lack HPr kinase.
Conserved region B shows more clustering of the sequences following the phylogeny of the bacteria (Tables 1 and 2) than may be detected in the case of conserved region A (Table 5). Even though the HPrs and Crhs of (lacto)bacilli are in separate branches of the phylogenetic tree (Fig. 3), the regions around the regulatory-site serines are almost identical. The consensus sequence is VN(AL)KSIMG(VL)MSL(AG), in which the regulatory-site serine is the S preceding the IMG triad. The HPr and Crh sequences differ only at the third, ninth, and last positions, as indicated. The putative Crh proteins TTE0115 of T. tengcongensis and CHTEP180 of C. thermocellum have very similar sequences in region B, while CHTEP182 of C. thermocellum and PTSH of U. urealyticum are more divergent (Table 5). The HPr proteins of the gram-negative bacteria of the γ subdivision of the Proteobacteria are strongly conserved in region B, but the motif is quite different from that of HPr and Crh of the Firmicutes. Consistent with the presence of HPrK in these bacteria, the XPr proteins of the bacteria in the α, β, and δ subdivisions of the Proteobacteria contain B-motifs that resemble the motif observed in gram-positive bacteria. The triad IMG following the regulatory-site serine residue seems to be typical for the HPr-like proteins associated with HPrK. Remarkably, although XPr and NPr cluster on the same side of the phylogenetic tree, regions B are not conserved between the two types of HPr-like molecules. Region B of NPr is the least conserved among the HPr-like proteins (Table 5).
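The region B consensus of the (lacto)bacilli, VN(AL)KSIMG(VL)MSL(AG), can be expressed as a regular expression to scan a protein sequence for the motif. The sketch below is illustrative only: it reads each parenthesized pair as "either residue", the function name `find_region_b` is hypothetical, and the test sequence is a made-up string, not a database entry. The regulatory-site serine is taken to be the S that precedes the IMG triad, as stated in the text.

```python
import re

# Consensus VN(AL)KSIMG(VL)MSL(AG); each parenthesized pair is read as an
# either/or position (assumption about the notation).
REGION_B = re.compile(r"VN[AL]KSIMG[VL]MSL[AG]")

# Offset of the regulatory-site serine (the S before IMG) inside the motif.
SERINE_OFFSET = 4


def find_region_b(sequence: str):
    """Return (motif_start, serine_index), both 0-based, or None if absent."""
    match = REGION_B.search(sequence)
    if match is None:
        return None
    return match.start(), match.start() + SERINE_OFFSET


if __name__ == "__main__":
    toy_sequence = "MAQKVNAKSIMGVMSLGTT"  # invented example, not a real protein
    print(find_region_b(toy_sequence))
```

An exact-match pattern like this one would miss the more divergent B-motifs described for the Proteobacteria; a position-specific scoring scheme would be a more tolerant alternative.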
Conserved region C contains an unusually high fraction of the negatively charged residues aspartate and glutamate. Eight positions in this region contain 30% of all the D and E residues in the set. Figure 5C shows the frequency per position. In between the positions with high D-plus-E content are positions with conserved small or hydrophobic residues. Region C represents a negative patch on the surface of the protein. No specific function for the region is known.
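The per-position frequency of negatively charged residues plotted in Fig. 5C, and the share of all D and E residues that falls within a set of positions, can be computed from an alignment along the following lines. This is a sketch under assumptions: the function names `acidic_frequency` and `acidic_share` are hypothetical, columns are 0-based alignment indices, and the toy alignment is invented for illustration.

```python
def acidic_frequency(alignment):
    """Fraction of sequences with D or E at each alignment column."""
    n_seqs = len(alignment)
    return [
        sum(seq[col] in "DE" for seq in alignment) / n_seqs
        for col in range(len(alignment[0]))
    ]


def acidic_share(alignment, columns):
    """Share of all D and E residues in the set that lie in `columns`."""
    total = sum(seq.count("D") + seq.count("E") for seq in alignment)
    if total == 0:
        return 0.0
    inside = sum(seq[col] in "DE" for seq in alignment for col in columns)
    return inside / total


if __name__ == "__main__":
    toy = ["ADE", "ADK", "AEE"]  # toy alignment, not real data
    print(acidic_frequency(toy))
    print(acidic_share(toy, columns=[1]))
```

Comparing `acidic_frequency` against the mean D-plus-E frequency over all columns corresponds to the dotted reference line described in the legend to Fig. 5C.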
In the α and ß subdivisions of the Proteobacteria, HPrK even appears to be common, while in the γ subdivision it is more of an exception and is found only in Xanthomonas and Xylella species.
HPrK/P homologues come in two versions: the full-length version, consisting of about 325 residues, is the common one and is found in all phyla, whereas the short version, consisting of about 150 residues, is unique to the α subdivision of the Proteobacteria. The short version corresponds to an internal fragment of the full version. The full-length HPrKs may consist of three rather than two domains: an N-terminal domain of unknown function, a catalytic domain consisting of the next 100 residues, and a C-terminal domain that is responsible for the association of the protein into a multimeric structure. The short version corresponds to the internal catalytic domain.
Three lines of evidence suggest that the HPrK homologues in the phyla other than the Firmicutes function as HPrKs: (i) the interaction site with HPr on the catalytic domain is conserved (motifs A and B [Fig. 4]); (ii) the genes coding for the HPrK homologues and the HPr-like proteins of the same organisms (XPrs) are clustered on the genome (the X cluster [Table 3]); and (iii) the amino acid sequence around the regulatory-site serine in XPr (region B) is similar to that observed in the HPr-like proteins (HPr and Crh) of the Firmicutes that are phosphorylated by HPrK and different from the HPr-like proteins (HPr and NPr) from organisms that do not contain HPrK (Table 5). Overall, the XPr proteins do not cluster on the same branch of the phylogenetic tree as the HPr and Crh proteins of the Firmicutes (Fig. 3). A number of organisms in the α subdivision of the Proteobacteria do contain an HPrK homologue but no enzyme I, the PTS enzyme responsible for the phosphorylation of HPr at the active-site histidine (e.g., R. sphaeroides). XPr of these organisms seems to function only in CcpA-dependent gene regulation and not in PTS-mediated uptake, a situation similar to that observed in the strictly aerobic bacteria of the Pseudomonas species with respect to the Ntr regulatory system.
Crh-mediated signal transduction is observed only in the phylum Firmicutes. Uncoupling of the PTS transport function and the CCR function by mutation of the active-site histidine, yielding Crh proteins, allows the coexistence of independent uptake and regulatory functions in these bacteria. Again, a parallel may be drawn to the Ntr regulatory system involving NPr in the γ subdivision of the Proteobacteria, which is also thought to operate independently of the PTS uptake system. Close homologues of Crh of B. subtilis are found only in other bacilli. The putative Crh proteins of the Clostridia and U. urealyticum differ from the bacillus Crh in that the amino acid sequence of conserved region A containing the mutated active-site histidine is not conserved. Regions B of C. thermocellum CHTEP180 and of T. tengcongensis TTE0115 resemble the serine phosphorylation motif in the Firmicutes HPr and Crh proteins, and these two proteins are therefore the best candidates for a Crh function (Table 5). The presence of a second putative Crh protein in C. thermocellum with a divergent region B suggests a more differentiated regulatory system in this Clostridium species.
The most remarkable finding of the database searches presented here is a possible relationship between CcpA-dependent gene regulation and Ntr found in the γ subdivision of the Proteobacteria. The relationship is suggested by (i) the clustering of CCR and Ntr components on the genomes of members of the Proteobacteria (the X-cluster), (ii) the phylogenetic relationship between XPr and NPr (Fig. 3), and (iii) the presence of the j protein in the Ntr cluster and Crh cluster of the Firmicutes. The X-cluster in the Xanthomonas species in the γ subdivision represents the most complete cluster of genes. It contains the genes for IIANtr, HPrK, IIAMan, XPr, and enzyme I (Table 3). Since NPr and INtr are missing in the Xanthomonas species, their roles may have been taken over by XPr and enzyme I, respectively. This would place XPr at the center of a complex regulatory network; it would be phosphorylated by HPrK in CcpA-dependent CCR and would be intermediate between IIANtr and enzyme I in the Ntr system. The X-clusters of X. fastidiosa and of G. metallireducens in the δ subdivision lack the gene for IIANtr. If the N, h, and j genes are indicative of Ntr regulation, the function of IIANtr may have been taken over by IIAMan. In the α and ß subdivisions of the Proteobacteria, the genes of the X-cluster are organized in two subclusters. In the α subdivision, the gene coding for HPrK is found together with the genes for IIAMan and XPr, while in the ß subdivision, the HPrK gene is clustered with the Ntr-associated genes. In both subdivisions, IIANtr and IIAMan are present. Most of the bacteria from the α subdivision do not possess enzyme I, but all of them contain INtr. The components in a bacterium like Caulobacter crescentus in the α subdivision provide the best evidence for XPr being an intermediate in both CcpA-dependent CCR and Ntr-type regulation. It contains the putative phosphotransfer chain INtr > XPr > IIANtr, and XPr contains the serine phosphorylation motif (region B), which allows phosphorylation by (the short version of) HPrK (Fig. 6). The mechanism could serve as a coupling between carbon and nitrogen metabolism of the cell.
FIG. 6. Putative signal transduction network in the α subdivision of Proteobacteria. HPr-like protein XPr would connect two signal transduction pathways, CcpA-mediated gene regulation on the left and the Ntr-regulation system on the right.
α, ß, γ, and δ subdivisions of the phylum Proteobacteria may have to be studied. For each question asked, we may have to find the best organism to study. With the ongoing genome-sequencing projects, this may soon become reality.
Present address: Medical Biology, Department of Pathology and Laboratory Medicine, University of Groningen, The Netherlands.