Previous Article | Next Article ![]()
Microbiology and Molecular Biology Reviews, March 2000, p. 202-236, Vol. 64, No. 1
Department of Microbiology, University of
Illinois, Urbana, Illinois 618011;
Center for Biomolecular Recognition, Department of Medical
Biochemistry and Genetics, Laboratory B, The Panum Institute, DK-2200
Copenhagen N, Denmark2; and Department
of Molecular Biophysics and Biochemistry and Department of
Molecular, Cellular and Developmental Biology, Yale University, New
Haven, Connecticut 06520-81143
1092-2172/00/$04.00+0
Copyright © 2000, American Society for Microbiology. All rights reserved.
Aminoacyl-tRNA Synthetases, the Genetic Code, and
the Evolutionary Process
SUMMARY
INTRODUCTION
BIOCHEMICAL SKETCH OF AMINOACYL-TRNA
SYNTHESIS
EVOLUTIONARY OVERVIEW OF AMINOACYL-TRNA
SYNTHETASES
ORDER IN THE GENETIC CODE
AMINOACYL-TRNA SYNTHETASE EVOLUTION: RECURRING
GENE TRANSFER
EVOLUTIONARY PROFILES OF THE INDIVIDUAL
AMINOACYL-TRNA SYNTHETASES
Synthetases for the NUN-Encoded Amino Acids
Phe; UUY; class II; tetramer of
- and
-subunits.
Leu; UUR and CUN; class I; monomer.
Ile; AUH; class I; monomer.
Met; AUG; class I; homodimer.
Val; GUN; class I; monomer.
Synthetases for the NCN-Encoded Amino Acids
Ser; UCN and AGY; class II; homodimer.
Pro; CCN; class II; homodimer.
Thr; ACN; class II; homodimer.
Ala; GCN; class II; homotetramer.
Synthetases for the NAN-Encoded Amino Acids
Tyr; UAY; class I; homodimer.
His; CAY; class II; homodimer.
Gln; CAR; class I; monomer.
Asn; AAY; class II; homodimer.
Lys; AAR; class I (monomer) and class II (homodimer).
Asp; GAY; class II; homodimer.
Glu; GAR; class I; monomer.
Synthetases for the NGN-Encoded Amino Acids
Cys; UGY; class I; monomer.
Trp; UGG; class I; homodimer.
Arg; CGN and AGR; class I; homodimer.
Gly; GGN; class II; heterotetramer or homodimer.
Recruitment of Aminoacyl-tRNA Synthetases into Other
Roles
Summary of Aminoacyl-tRNA Synthetase Evolutionary
Profiles
PHYLOGENETIC PICTURE EMERGING FROM THE
AMINOACYL-TRNA SYNTHETASES
Spirochetes
Cytophaga-Chlorobium Kingdom
Deinococcus-Thermus Division
Proteobacteria (Purple Bacteria)
Gram-Positive Bacteria
Rickettsias, Mycoplasmas, and Certain Other Bacteria
Archaea
EVOLUTIONARY SYNTHESIS
Evolutionary Significance of the Canonical Pattern
Some General Evolutionary Matters
CONCLUSION
ACKNOWLEDGMENTS
REFERENCES
SUMMARY
|
|
|---|
The aminoacyl-tRNA synthetases (AARSs) and their relationship to the genetic code are examined from the evolutionary perspective. Despite a loose correlation between codon assignments and AARS evolutionary relationships, the code is far too highly structured to have been ordered merely through the evolutionary wanderings of these enzymes. Nevertheless, the AARSs are very informative about the evolutionary process. Examination of the phylogenetic trees for each of the AARSs reveals the following. (i) Their evolutionary relationships mostly conform to established organismal phylogeny: a strong distinction exists between bacterial- and archaeal-type AARSs. (ii) Although the evolutionary profiles of the individual AARSs might be expected to be similar in general respects, they are not. It is argued that these differences in profiles reflect the stages in the evolutionary process when the taxonomic distributions of the individual AARSs became fixed, not the nature of the individual enzymes. (iii) Horizontal transfer of AARS genes between Bacteria and Archaea is asymmetric: transfer of archaeal AARSs to the Bacteria is more prevalent than the reverse, which is seen only for the "gemini group." (iv) The most far-ranging transfers of AARS genes have tended to occur in the distant evolutionary past, before or during formation of the primary organismal domains. These findings are also used to refine the theory that at the evolutionary stage represented by the root of the universal phylogenetic tree, cells were far more primitive than their modern counterparts and thus exchanged genetic material in far less restricted ways, in effect evolving in a communal sense.
INTRODUCTION
|
|
|---|
The aminoacyl-tRNA synthetases (AARSs) have long fascinated
biologists. They are the linchpin of translation, the link between the
worlds of protein and nucleic acid. Their structures and functions, which have both practical and basic significance, are deserving of and
have received much attention. However, it is not only the structure-function aspect of these enzymes that has captured the biologist's imagination; it is also the possibility that they could
tell us the secrets of the genetic code. To understand these enzymes in
standard molecular terms is to add one more piece, a most important
one, to the puzzle of what the cell is, how it works. But to understand
them in evolutionary terms is to ask what the cell is in a deeper
sense, how it evolved, how life came to be
the biologist's ultimate
question. Reading the history written into the AARSs was not possible
previously for the simple reason that doing so requires molecular
sequences from a large number of these molecules, and the necessary
body of data was lacking. The progress of genomics in the late 1990s is
now providing the needed data, and a picture of AARS evolution is
beginning to emerge (5, 7, 15-18, 45, 75). In the present
review we examine the still murky image of synthetase evolution from a
slightly different perspective and bring forth more of its rich detail and evolutionary depth.
BIOCHEMICAL SKETCH OF AMINOACYL-TRNA
SYNTHESIS
|
|
|---|
In a departure from the long-accepted view (48) that
every cell harbors 20 aminoacyl-tRNA synthetases responsible for the synthesis of the set of 20 canonical aminoacyl-tRNA families, it is now
clearly established that there are at least two ways of forming
aminoacyl-tRNA (12, 33). The direct acylation of tRNA by
aminoacyl-tRNA synthetases is well understood; the ATP-dependent reaction (Fig. 1) is carried out by
enzymes which, in general, are exceedingly specific in selecting their
substrates, i.e., amino acid and tRNA. They fall into two classes of 10 based on the topology of their ATP binding domain; class I proteins
contain a Rossmann fold (characterized by the HIGH and KMSK motifs),
while class II enzymes possess an unrelated
-sheet arrangement and are characterized by three degenerate sequence motifs (3, 10, 14,
20). Examples of most of the aminoacyl-tRNA synthetases have been
structurally characterized, and it is expected that in the near future
the crystal structure of at least one enzyme from all these families
will be known (42, 43). There is also an indirect pathway of
aminoacyl-tRNA synthesis, tRNA-dependent amino acid modification (Fig.
1). This pathway relies on the acylation of tRNA with a "precursor"
amino acid by a nondiscriminating AARS (33). Currently our
knowledge of the discriminating versus nondiscriminating AARS is not
advanced enough to deduce this property from their amino acid sequence
alone. This "precursor" amino acid is then converted, while bound
to tRNA, to the correct amino acid (matching the tRNA specificity) by a
second, nonsynthetase enzyme, which recognizes only such a mischarged
aminoacyl-tRNA species. Our current knowledge about the number and
nature of these enzymes is still far from complete, but it is clear
that in many organisms this is the essential and only way to form
Asn-tRNA and Gln-tRNA (12, 13, 63).
|
EVOLUTIONARY OVERVIEW OF AMINOACYL-TRNA
SYNTHETASES
|
|
|---|
The assumption that translationally produced protein was a part of
the very first translation mechanism raises the chicken-and-egg paradox. However, there can be little doubt that once translation did
exist, proteins that facilitated tRNA charging would be among the first
proteins to evolve, the selective advantage of their specificity being
great. Thus, the evolutionary history of the current aminoacyl-tRNA
synthetases must go deep into translation's past, to the emergence of
the modern genetic code. The central role played by the AARSs in
translation would suggest that their histories and that of the genetic
code are somehow intertwined. This then raises the question of whether
the AARSs in their evolution have contributed to the code's present
structure; put another way, are the codon assignments simply
reflections of AARS evolutionary wanderings? It is important that
conjectures of this sort be examined in detail
and in a genomic era
this can be done.
In an evolutionary sense, the most striking thing about the synthetases is the existence of the two distinct classes (3, 10, 14, 20). Common characteristic domain structures and sequence homologies define each class, but the two have nothing in common except the biochemistry of the reactions they catalyze (22): between the two classes, proteins show no structural resemblance, have almost no common motifs (see reference 49 for a possible exception), encounter the tRNA from different angles, and acylate the amino acid to different hydroxyl groups of the terminal ribose of the tRNA (57). This has been widely assumed to suggest that the tRNA-charging function evolved at least twice. In the origin of these two classes of tRNA-charging enzymes lies a clue to one of biology's deepest mysteries (45). Perhaps the two reflect a dichotomous origin of translation itself, in some sort of fusion between two different primitive processes, each associated with its own set of amino acids. Perhaps the two classes are the surviving traces of an ancient evolutionary battle between emerging tRNA-charging mechanisms as biology evolved beyond the RNA world. In any case, the existence of unrelated tRNA-charging systems must be considered a most telling evolutionary relic (45).
The aminoacyl-tRNA synthetases are distributed between the classes according to specific rules. Each class encompasses 10 of the amino acids, and all examples of a given amino acid's synthetase are of the same class, the so-called "class rule." Within a class, all synthetases associated with a given amino acid are specifically related to one another to the exclusion of the AARSs associated with any other amino acids, the "monophyly rule." A third, class-independent generalization is that for each organism, all tRNAs assigned to a given amino acid (so-called isoacceptors [50]) can be charged by a single synthetase, a rule that holds even for amino acids such as serine with two distinct sets of codons, UCN and AGY (reviewed in reference 39). Except possibly for the last, these rules have exceptions. The class rule and hence the monophyly rule are violated by lysine; in some organisms its synthetase is class I, while in others it is class II (32, 34, 35). Four more exceptions to the monophyly rule but not the class rule exist, involving glycine, serine, glutamic acid, and aspartic acid. For glycine and serine, each amino acid is associated with two synthetases (for both amino acids they are class II [37, 44]). However, the two enzymes in each case are not specifically related to one another, as the monophyly rule demands. This is most obvious for glycine, where the overall structures of the two enzymes are completely dissimilar, with one being a homodimer and the other being a heterotetramer. For the glutamyl- and aspartyl-tRNA synthetases, the violation of the monophyly rule is of a different nature. In each of these cases, all the synthetases associated with a particular amino acid constitute a related group. However, the synthetase for the amidated form of the amino acid (i.e., glutamine or asparagine) arises from within the same group, which then renders the parent grouping paraphyletic (4, 38, 46, 52, 53). Mention should also be made here of the charging system for cysteine, which breaks the class and monophyly rules in another way. In at least two organisms, the methanogens Methanococcus jannaschii and Methanobacterium thermoautotrophicum, neither a class I nor a class II cysteinyl-tRNA synthetase can be found in the genome, and the exact mechanism of Cys-tRNA formation (direct or indirect) has long remained a mystery. These exceptions to the class and monophyly rules do not rob the rules of their potential evolutionary significance. Erosion of the historical trace is the hallmark of evolution. The exceptions merely restrict what kinds of explanations can be given the rules.
The common general structure and sequence motifs shared by all members of a given synthetase class demand common ancestry and Darwinian descent. The later stages of this descent are captured in the sequence similarities among existing synthetases; importantly, their branching patterns recall structural similarities among the amino acids and patterns in the genetic code. To give examples: the ValRs and IleRS (class I) are impressively similar in sequence; this is not simply a matter of a sequence motif here and there (7). These sequences in turn are somewhat less similar to those of the LeuRS; the MetRSs then join the group at a still lower level of similarity (45). All four corresponding amino acids are nonpolar and aliphatic, and their codons all conform to the general composition NUN. Similarly, the class II enzymes for serine, threonine, proline, histidine, and glycine group phylogenetically and structurally (43). (Only one of the two unrelated forms of the GlyRS shows this specific relatedness, however [17].) The amino acids serine and threonine are obviously related structurally, and both are capable of forming an internal hydrogen-bonded five-membered ring structure that mimics the ring structure of the imino acid proline. However, histidine and glycine appear structurally unrelated to these three (and to one another). In their codons, the first three amino acids are also related; all conform to the general composition NCN, but the codons for histidine (CAY) and glycine (GGN) are not related to the others except, of course, to the CCY codons of proline in the former case.
The third major AARS grouping involves the class II synthetases for lysine, aspartic acid, and asparagine (14, 17, 52), all of which are closely related in structure (4). The amino acids aspartic acid and asparagine are obviously related, but lysine stands apart. In their codons, the three exhibit an overlapping kind of relatedness, with the Asn codons (AAY) being close to both their Asp (GAY) and Lys (AAR) counterparts whereas the last two sets are not closely related.
The close evolutionary relationship between the (class I) synthetases for glutamic acid and glutamine (mentioned above; see also reference 46) is mirrored in the obvious structural relationship between the corresponding amino acids and the relationship between their codons (GAR-Glu versus CAR-Gln). Finally, a pronounced similarity exists between the synthetases for the two aromatic amino acids tyrosine and tryptophan (6, 19), but their codons (UAY and UGG, respectively) are not closely related.
Although the existence of correlations between the genetic code and the evolutionary patterns of the AARSs is clear, their significance is not. Does the fact that the valine, isoleucine, leucine, and methionine enzymes came from a common ancestor mean that this ancestor itself could not distinguish among these amino acids or that the ancestor was able to specifically charge four separate aminoacyl-tRNAs? That seems absurd in the context of modern translation (45). A more acceptable explanation would seem to be that the AARS relationships reflect evolutionary replacement of one tRNA-charging enzyme or acylation system by another. Indeed, what we see here may be only the latest in a series of such evolutionary replacements, a series that traces far back into the code's past and an evolutionary process that still goes on today in a less radical form, involving replacements within the confines of a given amino acid type (see below).
ORDER IN THE GENETIC CODE
|
|
|---|
The significance of AARS evolution vis-à-vis that of the
genetic code cannot be properly assessed without some appreciation of
the nature and extent of the code's order. Within the last decade,
significant strides have been made in this area. The so-called synonym
order in the code, i.e., the degeneracy in codon assignment, which
manifests itself almost exclusively in the third codon position, has
never been in doubt, except as regards what caused it in the first
place. However, such is not the case for the ordering that pertains to
related amino acids. Although most biologists accept the existence of
such an order, they have disagreed about its exact form, its extent,
and its cause. Some have argued that the related amino acid order
evolved to ameliorate the phenotypic consequences of mutations, an
evolutionary scenario that would produce both synonym and related amino
acid orderings (59). An alternative but conceptually related
explanation is that the assignments have somehow been adjusted to
minimize the consequences of errors in a primitive translation
mechanism that was highly inaccurate (66). Seemingly, both
error minimization models would lead to a very similar type of order in
the code. However, a computer simulation study (26) showed
that the assumptions of the first model are unlikely to lead to a
synonym order in the code that is almost entirely confined to a single
codon position
a type of order that is consistent with, if not
predicted by, the second model (which also suggests the third codon
position should be the degenerate one [66]). It has
also been proposed that the form of the code was predetermined, at
least in part, by specific interactions between amino acids and nucleic
acids (reference 76 and references therein).
Perhaps the main difficulty in comprehending the code's related amino acid ordering is that amino acid relatedness is context dependent; amino acids that appear similar in one context can be unrelated in another. The amino acid replacement spectra of proteins prove the point: the replacement pattern can differ from position to position in a protein sequence for any amino acid. However, nobody knows what property or properties of the amino acids the code actually reflects.
One important advance in this area was the definition of an amino acid
property called the polar requirement, which is a number derived from
the paper chromatographic mobility of an amino acid in pyridine-water
mixtures of various ratios (71). Simply plotting these
numbers on a codon table (Table 1)
reveals the existence of a remarkable degree of order, much of which
would be unexpected on the basis of amino acid properties as normally
understood. For example, codons of the form NUN define a set of five
amino acids, all of which have very similar polar requirements.
Likewise, the set of amino acids defined by the NCN codons all have
nearly the same unique polar requirement. The codon couplets CAY-CAR, AAY-AAR, and GAY-GAR each define a pair of amino acids
(histidine-glutamine, asparagine-lysine, and aspartic acid-glutamic
acid, respectively) that has a unique polar requirement. Only for the
last of these (aspartic and glutamic acids), however, would the two
amino acids be judged highly similar by more conventional criteria.
Perhaps the most remarkable thing about polar requirement is that
although it is only a unidimensional characterization of the amino
acids, it still seems to capture the essence of the way in which amino acids, all of which are capable of reacting in varied ways with their
surroundings, are related in the context of the genetic code. Also of
note is the fact that the context in which polar requirement is
defined, i.e., the interaction of amino acids with heterocyclic
aromatic compounds in an aqueous environment, is more suggestive of a
similarity in the way amino acids might interact with nucleic acids
than of any similarity in the way they would behave in a proteinaceous
environment (70).
|
More recently, computer simulation studies have been used to try to assess the merit of polar requirement as an indicator of the code's related amino acid order is compared to other amino acid properties, how well ordered the code actually is, and the nature of the code's order. An appealingly straightforward approach to the problem was explored by Hurst and his colleagues (23, 28). In summary, they compared the natural code to a series of synthetic codes generated by randomly reassigning the 20 amino acids to the set of synonym codon categories that are defined by the natural code. Each code is then measured for how conservative it is with regard to a given amino acid property under "mutation"; i.e., each codon in a given code is compared to all other codons that are 1 base change removed from it, the numerical difference in that property between the amino acids corresponding to the original and the "mutated" codon is measured, and the squared differences are summed over the code as a whole or over each of the three codon positions individually. For all amino acid properties tested except one, the natural code was not notably superior to the random codes. That exception, polar requirement, revealed a natural code superior to all but 0.01% of the random codes (28). A subsequent, more refined simulation of this sort, which took transition-transversion ratios into account, showed the natural code was "one in a million" (23). There can be no doubt that when viewed in terms of amino acid polar requirements, the genetic code is a highly structured array. It would also seem that it has somehow been optimized to reduce the consequences of translational errors. However, the evolutionary dynamic that shaped the code remains a mystery.
While it must be admitted that the evolutionary relationships among the AARSs bear some resemblance to the related amino acid order of the code, it seems unlikely that they are responsible for that order (45): the evolutionary wanderings of these enzymes alone simply could not produce a code so highly ordered, in both degree and kind, as we now know the genetic code to be. These enzymes could at best be the agents through which other constraints acted to shape the code. However, even in such a capacity they would not be alone: the tRNAs offer a simple and facile alternative mechanism for changing codon assignments (65). It would seem, therefore, that the evolutionary patterns among the aminoacyl-tRNA synthetases do not imply a role for these enzymes in structuring the genetic code (45). The resemblance between their evolutionary patterns and the patterns seen in the code are a loose convergence, forced by the fact that both evolutions independently reflect somewhat similar properties of the amino acids. The evolutionary patterns in the AARSs do seem to represent evolutionary replacements that occurred against the background of an already established, or otherwise fashioned, code (45).
AMINOACYL-TRNA SYNTHETASE EVOLUTION: RECURRING
GENE TRANSFER
|
|
|---|
If the AARSs do not reveal the code's evolution, what do their evolutionary relationships tell us? The answer is clear. Aminoacyl-tRNA synthetase evolution is a superb indicator of the evolutionary dynamic in general.
It should be noted that the AARSs are unique among components of the
translation system in their evolutionary behavior. Starting with the
rRNAs and continuing through the ribosomal proteins and the translation
initiation and elongation factors runs one dominant evolutionary
theme
molecules tend to show the same evolutionary history; i.e.,
their molecular phylogenies are consistent with the accepted overall
organismal phylogeny. At the highest level, they tend to yield what we
herein call the canonical phylogenetic pattern, which is basically a
division of all life into the three primary groupings
Bacteria, Archaea, and eukaryotes, with the closest relationship being between the Archaea and
eukaryotes (73) (see below). The evolutionary picture
painted by the synthetases, however, is a world apart from this
canonical pattern. Not only do the phylogenies fail to yield the
canonical pattern in a number of cases, but also they typically violate
the accepted taxonomic structure within the organismal domains.
Furthermore, the molecular phylogenies inferred from the synthetases of
different amino acid types tend not to agree with one another
but this
is the telling point.
Why should the synthetases show such atypical and disparate evolutionary pictures? The answer again is clear. The AARSs are in essence modular components of the cell; they function in isolation from the rest of the translation apparatus and from the rest of the cell, except for their individual contacts in each case with a small subset of the tRNAs (58). Because of this and because of their universality, the AARSs can function in a wide spectrum of cellular environments, often without disadvantage to the host. In other words, the AARSs are ideal candidates for widespread horizontal gene transfer, and the evidence certainly indicates this, since quite a few examples are known in which two different AARSs for the same amino acid coexist in the same organism. Versions of a given enzyme characteristic of the Archaea can be seen scattered among the bacterial taxa (see below). Versions characteristic of the eukaryotes have been seen in the Bacteria or in the Archaea. Within the Bacteria alone, the different bacterial subtypes of a given enzyme intermix among and within the taxa. There is no set pattern to all this; there is merely evidence consistent with frequent, widespread, indiscriminate horizontal gene transfer.
As suggested above, it is tempting to view the evolution of aminoacyl-tRNA synthesis as a study in horizontal gene transfer from top to bottom: at the deepest level, horizontal replacements involving the ancestors of the two synthetase classes, then replacements that gave rise to the phylogenetic structure within each class, and, finally, the replacements involving the different (modern) synthetases that use the same amino acid.
EVOLUTIONARY PROFILES OF THE INDIVIDUAL
AMINOACYL-TRNA SYNTHETASES
|
|
|---|
We now examine in some detail the evolutionary profiles for each of the 20 aminoacyl-tRNA synthetases, with the principal objective of determining the extent to which each conforms to the canonical phylogenetic pattern (defined below) and asking what, if anything, the exceptions to canonical pattern tell us about these enzymes and about stages in the evolution of the cell.
The organisms mentioned in the figures and tables are listed in Table
2.
|
The analysis presented is a synthesis of four approaches: (i) conventional phylogenetic trees (see Fig. 2 caption); (ii) visual inspection of alignments to reveal qualitative differences not apparent from the other analyses; (iii) dipeptide similarity matrices (see Table 3 footnotes); and (iv) signature analysis. Signatures are defined in terms of positions in the alignment wherein at least 80% of the members of a given group show a constant composition but one that is found elsewhere in the alignment no more that once within some larger phylogenetic taxonomic context. For example, spirochete signatures would usually be relative to all other bacterial groups but not relative to the more distantly related archaeal and eukaryotic versions of the enzyme.
Because the canonical evolutionary pattern is central to our thesis and
because the tRNA-charging enzymes exhibit different partial forms of
that pattern, it is necessary to begin by explaining clearly what we
mean by the phrase. In its essence (which we will call the basal
canonical pattern), the canonical pattern is defined by the
relationship between the bacterial and archaeal versions of a given
molecule. For the basal canonical pattern to hold, regardless of how
many subtypes of a given protein exist, it must be possible to
distinguish strongly between characteristic bacterial and archaeal
versions of the molecule. This distinction should be a pronounced
quantitative one (on the level of sequence similarities) and/or a
qualitative one (evident in terms of gross areas in a sequence
alignment wherein homology between the two is only weakly evident or
nonexistent). In other words, for these two organismal domains, the
interdomain differences between the characteristic archaeal and
bacterial proteins must far outweigh any intradomain differences: the
two must appear to differ in genre. For the full canonical pattern to
hold, there must also then exist a characteristic eukaryotic version(s)
of the molecule that is distinguishable from both the archaeal and the
bacterial versions but which is clearly of the archaeal genre. Tables
3 and 4 are
representative dipeptide similarity matrices for two aminoacyl-tRNA
synthetases typical of those showing canonical pattern (PheRS and
TyrRS), while Tables 5 and
6 are matrices for enzymes (SerRS and
CysRS) that do not show canonical pattern.
|
|
|
|
The aminoacyl-tRNA synthetases are considered individually below in an order defined by their corresponding codons. We have not included most of the mitochondrial data in the analysis, because doing so would add nothing to our conclusions and would needlessly complicate an already complex picture (27, 29).
Synthetases for the NUN-Encoded Amino Acids
Phe; UUY; class II; tetramer of
- and
-subunits.
PheRS is the only class II synthetase in the NUN codon group, and
it has no close relatives within that class. Not surprisingly, both the
- and
-subunits present the same evolutionary picture; their
sequences are combined to produce Fig. 2.
PheRS shows the classical full canonical pattern, the only exception
being the spirochete PheRSs, which are of the archaeal, not the
bacterial genre, and which seem to be specifically related to the
Pyrococcus PheRS within that grouping, as sequence signature
analysis suggests and Fig. 2 confirms.
|
- and
-subunits of PheRS, significant length
differences distinguish the bacterial subunits from their archaeal counterpart. The bacterial
-subunit is about 120 amino acids shorter
than the archaeal/eukaryotic
-subunit at its N terminus, and the
first 90 amino acids of the bacterial sequence show little or no
similarity to the archaeal/eukaryotic counterpart. However, for the
-subunit, the bacterial version is the longer, by approximately 250 amino acids. At both termini the bacterial version of the
-subunit
extends beyond the archaeal/eukaryotic version by about 100 amino
acids; in the N-terminal ~50 amino acids, the archaeal version of the
-subunit shows no recognizable similarity to its bacterial
counterpart. In addition, large sequence gaps distinguish the two
genres in the interior of the
-subunit.
Leu; UUR and CUN; class I; monomer. LeuRS conforms to the full canonical pattern as well, in this case without exception. A striking lack of similarity in various regions of the molecule distinguishes the bacterial and archaeal genres of LeuRS, and a number of sizable insertion and deletion differences distinguish the two genres throughout the alignment. A nearly total lack of sequence similarity between the two is seen in the C-terminal (KMSK) section of the molecule.
Within the Bacteria, however, the accepted phylogenetic relationships are not all preserved
at least two distinct bacterial subtypes of the molecule exist and have obviously migrated
horizontally. The best-defined bacterial subtype (by all methods of
analysis) is that common to the majority of gram-positive bacteria (and relatives), the spirochetes, chlamydias, and the
Cytophaga-Chlorobium grouping (represented by
Chlorobium tepidum and Porphyromonas gingivalis).
However, this grouping fails to include Clostridium acetobutylicum, a gram-positive species whose LeuRS groups with that of Deinococcus in Fig. 3
(a relationship supported by sequence signature). On the other hand,
the proteobacteria (Escherichia coli and relatives) do form
a grouping quite consistent with their established phylogeny (Fig. 3).
|
Ile; AUH; class I; monomer.
IleRS also shows the full
canonical pattern. As with LeuRS, this fact is obvious upon visual
inspection of the alignment, especially its C-terminal section, wherein
the bacterial and archaeal genres exhibit very little sequence
similarity and show major alignment gaps relative to one another.
However, as all methods of analysis clearly show, a sizable minority of
bacterial taxa possess an IleRS of the archaeal rather than the
bacterial genre (Fig. 4). All of these
bacterial examples are specifically related to their eukaryotic
counterparts, with the closest relationship being between the
eukaryotes and a bacterial subgroup comprising the spirochetes,
chlamydias, Mycobacterium, and Rickettsia. Note in Fig. 4 the specific relationship between the IleRS of
Mycobacterium and that of Rickettsia, which is
strongly suggested by sequence signature as well. Also note the
relationship between the C. acetobutylicum IleRS and the
plasmid-borne IleRS found in mupirocin-resistant strains of
Staphylococcus aureus; this relationship is also supported by sequence signature.
|
Met; AUG; class I; homodimer.
Methionine presents one
of the more complex evolutionary profiles among the aminoacyl-tRNA
synthetases. The enzyme marginally shows the canonical picture: the
majority of bacterial examples
the group represented by
Helicobacter in Fig. 5
define
a bacterial genre, while the archaea, eukaryotes and a number of
bacterial MetRSs constitute the archaeal genre (Fig. 5). However, there is another bacterial grouping, confined to the
and
proteobacteria, which is of the archaeal genre (Fig. 5). The difference
between the bacterial and archaeal genres of MetRS is not as extreme as that seen for the other members of the NUN codon group. However, one
large alignment gap (~25 amino acids) separates the bacterial genre
from all others (the latter appear to contain a metal binding region at
this point, the consensus sequence of which is
CP . C . . . . . a . gD . C . . C . . . . . . . . . . L
(where lowercase signifies its presence in only four of the five
groupings involved). A strong signature distinguishes the bacterial
genre from the others, and its distinctiveness is also evident in a
dipeptide similarity matrix.
|
proteobacterial representatives.)
The C-terminal domain of MetRS, about 150 amino acids in length, can
take one of three forms: (i) it can be covalently linked to the rest of
the molecule, as in most bacteria and most archaea; (ii) it can be
completely missing, as in a number of bacteria, e.g., cyanobacteria and
mycoplasmas; or (iii) it can be present but not covalently linked to
the rest of the molecule, as in all eukaryotes (except
Caenorhabditis elegans, where it is covalently linked), in
Aquifex, and in the Crenarchaeota. In eukaryotes, this
separate protein, known as Arc1p, occurs as a part of some higher-order
complexes involving eukaryotic synthetases, wherein it is involved in
amino acid recognition (54). The C termini of these proteins
extend about 60 amino acids beyond the normal C-terminus of MetRS. It
is interesting that the spirochete MetRSs (in which the C terminus is a
covalently linked part of the molecule) also extend beyond the normal C
terminus of MetRS, and in this extension, they show homology to
sequences in the Arc1p family, providing further support for a specific
relationship between the spirochete and eukaryote enzymes (Fig. 5).
Arc1p-like domains can be seen in a few other aminoacyl-tRNA
synthetases as well. Approximately 100 residues of the N terminus of
the
-subunit of bacterial PheRS is homologous to a portion of Arc1p.
Mammalian TyrRS (only) has appended to its C terminus a more extensive
homolog, which is impressively similar to the MetRS extensions just
discussed. Also, it has been demonstrated in the mammalian TyrRS case
that the extension functions not as Arc1p (i.e., in amino acid
recognition) but as a cytokine (64).
Val; GUN; class I; monomer.
The valine-charging enzyme
conforms only to the basal canonical pattern; the eukaryotic ValRSs are
not archaeal in nature but obviously bacterial (Fig.
6). Also, within the bacterial group a
37-amino-acid insertion in the alignment found only in the eukaryotes and
,
, and
proteobacteria suggests a specific relationship among them. The distinction between the archaeal and bacterial genres
of ValRS is again a strong one and is manifested most strongly in the
C-terminal (KMSK) portion of the molecule. The rickettsial ValRS, alone
among the bacterial examples, is of the archaeal genre, seemingly
specifically related therein to the ValRS of the crenarchaeon
Pyrobaculum aerophilum; this relationship is supported by
sequence signature.
|
Synthetases for the NCN-Encoded Amino Acids
Serine, threonine, and proline have related structures, codons, and aminoacyl-tRNA synthetases; in this last respect, the group also encompasses histidine and glycine. (However, as mentioned above, only one of the two unrelated GlyRS forms shows the relationship.)
Ser; UCN and AGY; class II; homodimer.
The seryl-tRNA
synthetase is of particular interest for two reasons: (i) it
clearly fails to conform to the canonical pattern, and (ii) there are
two distinct serine-charging enzymes, a very rare form that has been
found so far only in M. thermoautotrophicum and the two
Methanococcus species examined and a major form that has
been found in all other organisms. Although both the major and minor
forms of SerRS belong to the above-mentioned Ser-Thr-Pro supercluster,
it is unclear whether the two are specifically related to one another
therein. (The minor form is not included in Fig. 7.)
|
all clearly related by signature sequence
seem to be
mitochondrial, their relationship to spirochetes rather than
proteobacteria becomes of interest.
Pro; CCN; class II; homodimer. ProRS exhibits the full canonical pattern but again with exceptions. The bacterial genre is distinguished from the archaeal by having an insertion of about 180 amino acid residues not seen in the latter at approximate (E. coli) position 190, while the latter extends at the C terminus of the molecule for about 70 residues beyond the former. Dipeptide similarities between the two genre are remarkably low.
The ProRSs of a few bacterial taxa, i.e., the mycoplasmas, Deinococcus, Chlorobium, Porphyromonas, and Borrelia (but not Treponema), are of the archaeal genre, and the eukaryotic enzymes (with the exception of that from Giardia) are included in this phylogenetic grouping; sequence signature analysis shows a sister relationship therein to the genera Borrelia, Chlorobium, and Porphyromonas, which the Fig. 8 tree confirms.
|
Thr; ACN; class II; homodimer.
Like its valyl
counterpart, ThrRS exhibits only the basal canonical pattern, with the
eukaryotic versions of the enzyme being bacterial rather than archaeal
in nature. The bacterial and archaeal genres are readily distinguished
by sizable additions/deletions in the N-terminal ~250 amino acids or
so of the alignment, and evidence of similarity between the two in this
portion of the molecule is minimal (Fig.
9). Two of the three available
crenarchaeal ThrRSs add a further complication to the picture (see
below).
|
Ala; GCN; class II; homotetramer. Although a class II enzyme, AlaRS is not a member of the supercluster that contains the other NCN-associated synthetases. The archaeal and bacterial forms of the enzyme are clearly distinguished by dipeptide similarities, sequence signature, and a few small but significant insertions and deletions in the alignment; the N terminus of the archaeal form also begins some 50 amino acids before the bacterial one does (Fig. 10).
Although the canonical pattern holds for the AlaRS, it is only the basal canonical pattern, since the eukaryotic AlaRSs (except for that of Giardia) cluster with the bacterial AlaRSs; and within that grouping they appear to be specifically related to the Chlorobium-Porphyromonas cluster (Fig. 10), a relationship that is supported by sequence signature. The Giardia AlaRS, however, is of the archaeal genre. This is confirmed by a strong sequence signature, which is also consistent with Giardia's position in Fig. 10 as an outgroup to the archaeal clade. The spirochete AlaRSs, although clearly of the bacterial genre, are highly derived. They both show two characteristic large deletions, one interior and the other C-terminal.
|
Synthetases for the NAN-Encoded Amino Acids
Tyr; UAY; class I; homodimer.
The TyrRS makes a strong
canonical distinction (Fig. 11 and
Table 4). In the C-terminal (KMSK) section of the molecule there is
very little similarity between the TyrRSs of the bacterial and archaeal
genres, and a number of insertion-deletion differences distinguish the
two throughout the molecule as well.
|
proteobacteria, E. coli, Salmonella, and Yersinia exhibit the first type
while Haemophilus, Actinobacillus, and
Vibrio exhibit the second. Among the
proteobacteria, Neisseria exhibits the first type while
Bordetella and Thiobacillus exhibit the second.
Porphyromonas and its relative Chlorobium are
phylogenetically split in this way too. B. subtilis and
C. acetobutylicum each contain TyrRSs of both subtypes.
Within the archaeal genre, the eukaryotic and archaeal TyrRSs are
intermixed. The euryarchaeal enzymes (except for those of the
pyrococci) cluster specifically with the animal and fungal TyrRSs,
while the three crenarchaeal TyrRSs (and those of the pyrococci) group
with the two plant examples (Arabidopsis and tobacco).
Sequence signatures strongly support this entire phylogenetic arrangement.
His; CAY; class II; homodimer.
HisRS also shows the
full canonical pattern. However, as signature analysis indicates and
Fig. 12 confirms, a small group of bacterial taxa
spirochetes, Helicobacter, C. acetobutylicum, Caulobacter, and
Porphyromonas
have HisRSs of the archaeal genre. This
bacterial grouping in turn encompasses the eukaryotic HisRSs, which
shows a specific relationship to Porphyromonas HisRS
therein, a relationship supported by sequence signature.
|
Gln; CAR; class I; monomer.
It has been convincingly
demonstrated that GlnRS stems specifically from the eukaryotic lineage
of GluRSs (53). Not only is this evident at the sequence
level, but also it has been demonstrated in terms of the overall
structure of the molecule (46). In its N-terminal (HIGH)
region, the GlnRS sequence is decidedly more similar to eukaryotic than
to archaeal GluRSs (and least similar of all to bacterial GluRS). In
the C-terminal (KMSK) region, the similarities of GlnRS to the
eukaryotic and archaeal versions of GluRS are roughly comparable but
sequence similarity to the bacterial GluRS is effectively nonexistent
(Fig. 13).
|
and
subdivisions of the
Proteobacteria, the Deinococcus-Thermus division,
and Porphyromonas. In other words, among bacteria known not
to contain GlnRS are representatives of the
and
subdivisions of
the Proteobacteria, the gram-positive bacteria, the
cyanobacteria, the spirochetes, the chlamydias, and the genera
Aquifex and Thermotoga. The only specific
phylogenetic relationship apparent among the bacterial versions of the
GlnRS is the proteobacterial grouping, but the proteobacterial
representatives are not strongly distinguished from the other bacterial
GlnRSs. Indeed, one of the
proteobacteria, Bordetella
pertussis, has a GlnRS that appears specifically related to that
found in Porphyromonas, a relationship reinforced by a sequence signature.
Asn; AAY; class II; homodimer.
Although they represent
different synthetase classes (II and I respectively), in their
evolutions the AsnRS and GlnRS families have much in common (Fig.
14). Both arose from within the cluster of the synthetases for their corresponding diacid. For glutamine, it
was from the eukaryotic lineage per se that the enzyme arose, while for
asparagine, the origin is localized only to the archaeal genre of AspRS
in general. In both instances it is in the C-terminal portion of the
molecule that the origin of the synthetase for the amidated amino acid
is most strikingly seen. As is the case for GluRSs (see below), BLAST
sequence similarity searches based upon the C-terminal 40% of the
archaeal and eukaryotic AspRS have much higher scores with one another
and with the AsnRSs. The root of the AsnRS tree itself separates the
eukaryotic AsnRSs from their counterparts (see Fig. 16). The root of a
combined phylogenetic tree for the AsnRS and AspRS enzymes (rooted
by LysRS) occurs between the bacterial AspRSs and the grouping of
archaeal and eukaryotic AspRSs with the AsnRSs.
|
Lys; AAR; class I (monomer) and class II (homodimer).
LysRS represents the only known violation of the class rule: a
class II LysRS is found in eukaryotes, most bacteria, and a few archaea
(i.e., Sulfolobus and Pyrobaculum) (Fig. 15).
However, a class I LysRS is found in the euryarchaeotes, two other
members of the Crenarchaeota (Cenarchaeum and
Aeropyrum), and a scattering of bacteria (34).
The class II LysRSs clearly had a common ancestor with the AspRSs and
AsnRSs in the deep past, but the class I enzyme stands essentially
alone phylogenetically within its
class.
|
proteobacteria, except for Rhizobium meliloti, whose LysRS is class II; however, Aeropyrum, also a crenarchaeon,
is not a member of this group. The other known bacterial examples, the
spirochete and Streptomyces LysRSs, as a group show a
specific relationship to the Pyrococcus enzyme, while the
remaining (eury)archaeal examples appear in an outgroup relationship to
all those just discussed (Fig. 15).
Asp; GAY; class II; homodimer.
The AspRSs strongly
exhibit the full canonical pattern: a single bacterial type exists,
which differs dramatically from the AspRSs of the archaeal genre. In
the interior of the AspRS sequence alignment, a stretch of about 220 amino acids in the bacterial genre (starting at ca. position 250 in the
E. coli sequence) shows almost no similarity to the
corresponding (~100-amino-acid) section in the archaeal genre.
Sequence similarity resumes thereafter at ca. bacterial position 470 and continues to the C terminus of the molecule, slightly more than 100 amino acids distant (Fig. 16). Because
the AsnRS has arisen from within the grouping of the AspRSs (see
above), the latter must be considered paraphyletic, which breaks the
monophyly rule.
|
proteobacteria, while the Porphyromonas enzyme shows no
clear specific relationships to any other bacterial AspRS.
Glu; GAR; class I; monomer.
Again, the full canonical
pattern is strongly evident; the difference between the bacterial genre
and its archaeal counterpart is striking. Not only are the bacterial
examples about 100 amino acids shorter than the archaeal sequences at
the N terminus, but also in the C-terminal (KMSK) section of the
molecule the difference between them is extreme: the two show no
resemblance, in either sequence or overall structure (46).
(BLAST searches based on the archaeal and eukaryotic examples of this
region readily detect one another and also the comparable region of all
GlnRSs but never detect their bacterial counterparts.) Because the
GlnRS has arisen from within the GluRS cluster, the latter breaks the
monophyly rule. The bacteria show at least two subtypes of GluRS, which are specifically related to one another (to the exclusion of the GluRSs
of the archaeal genre), and a number of bacterial species contain two
GluRSs as well (24), all of which makes for a somewhat confusing phylogenetic picture (Fig. 17). It is worth noting that a
rather clear grouping emerges that includes the spirochetes, the
Cytophaga-Chlorobium group, the
Deinococcus-Thermus division, the chlamydias, and two
proteobacteria, i.e., Pseudomonas (
division) and
Rhizobium (
division).
|
Synthetases for the NGN-Encoded Amino Acids
Cys; UGY; class I; monomer. The mechanism of Cys-tRNA formation in M. jannaschii and M. thermoautotrophicum has until now been a mystery. Nothing identifiable as a CysRS was seen in their (complete) genomes. However, a normal functioning CysRS has been identified in Methanococcus maripaludis, a close relative of M. jannaschii (30, 40). Did a third, unrecognized synthetase class exist in these cases, or could the cysteine tRNA be charged indirectly, as in the case of selenocysteinyl-tRNA (11, 33)? The possibility that the highly aberrant SerRS found in M. jannaschii and M. thermoautotrophicum is somehow related to the lack of recognizable CysRS in these organisms was considered (37), the rationale being that such a SerRS might form Ser-tRNACys, which would be a key intermediate in Cys-tRNA formation by a tRNA-mediated amino acid transformation pathway (33). However, in vitro data did not support this view (37). Instead, biochemical and genetic approaches have now revealed that in M. jannaschii and M. thermoautotrophicum, ProRS is able to specifically synthesize both Cys-tRNACys and Pro-tRNAPro (60). This unprecedented dual functionality in an AARS is not reflected in any distinguishing features of these ProRSs at the sequence level. Interpretation of the evolutionary significance of this unexpected versatility among AARSs must now await more detailed biochemical description of its phylogenetic distribution.
As can be inferred from Table 6, the CysRSs do not exhibit the canonical pattern. There is also considerable evidence of interdomain horizontal gene transfer, particularly involving the archaeal CysRSs: In Fig. 18, four of the archaeal CysRSs do cluster. However, the M. maripaludis enzyme (see above) is disturbingly similar to that from Pyrococcus, with pair showing 65% sequence identity (40). Three other archaeal CysRSs, from Methanosarcina, Archaeoglobus, and Cenarchaeum, group among the bacterial examples of the enzyme but show no phylogenetic relationship to one another therein. By contrast, the relationships among the bacterial CysRSs in Fig. 18 are not particularly out of kilter with established bacterial taxonomy, which might suggest that the horizontal gene transfers have been mainly from the Bacteria to the Archaea.
|
Trp; UGG; class I; homodimer.
TrpRS is an obvious
relative of TyrRS (19), although, as mentioned above, their
corresponding codons are not related. The tryptophan enzyme conforms to
the full canonical pattern, which can be inferred from Fig.
19, dipeptide similarity matrices, and striking sequence signatures. The TrpRSs of the archaeal genre show a
substantial N-terminal extension relative to those of the bacterial
genre. Within the bacterial genre, a number of subtypes can be
recognized, and two organisms possess two TrpRSs, each of a different
bacterial subtype (Fig. 19). By signature analysis, five bacterial
subtypes can be identified, designated as such in Fig. 19, with
P. gingivalis probably representing a sixth subtype. Within
the archaeal genre, neither the archaeal-eukaryotic distinction nor the
customary crenarchaeal-euryarchaeal divide holds. Most euryarchaea form
a phylogenetic unit, but this unit also includes the crenarchaeon
Aeropyrum. The remaining (two) crenarchaeal representatives (Pyrobaculum and Sulfolobus) plus
Pyrococcus have a TrpRS of the eukaryotic type (or vice
versa). The TyrRSs of Sulfolobus and Pyrococcus
are exceptionally close (~60% sequence similarity). Interestingly,
one of the two TrpRSs found in Deinococcus is chimeric. The
N-terminal (HIGH) section of the molecule clearly belongs to the
bacterial subtype represented by E. coli, while the
C-terminal (KMSK) section belongs to the bacterial subtype represented
by Aquifex, as shown by sequence signature.
|
Arg; CGN and AGR; class I; homodimer.
In its
evolutionary profile, ArgRS is arguably the most complex of all the
aminoacyl-tRNA synthetases: there are at least four bacterial subtypes
identifiable by sequence signatures (Fig. 20) and two each for the eukaryotes and
archaea. The relationships among them are similarly complex and
definitely violate the full canonical pattern. The most distinctive,
largest, and most taxonomically diverse of the ArgRS groups is that
represented by the mycoplasmas in Fig. 20; it has the strongest
sequence signature, and upon gross examination of the alignment it
appears the most unusual. It is possible that this group represents a
bacterial genre. However, the remaining bacterial subtypes do not
appear specifically related to it, and the ArgRS tree cannot be
reliably rooted. Given this and its complexity, one cannot confidently
state at this point whether ArgRS exhibits any canonical pattern.
|
Gly; GGN; class II; heterotetramer or homodimer.
As
previously noted, glycine is one of the tRNA-charging enzymes that
violates the monophyly rule: two class II GlyRSs exist that are
unrelated in both sequence and overall structure. One of the two,
confined so far to the bacteria, is a tetramer of two
- and two
-subunits. The other, characteristic of the archaea and eukaryotes
but also found in some bacteria, is a homodimer. This latter GlyRS has
specific relatives among other members of the class II synthetases (as
described above in the section of NCN-associated synthetases), but the
former has none. The former cannot follow the canonical pattern, and
the latter does not (which is seen clearly by the lack of a strong
archaeal-bacterial distinction in a dipeptide similarity matrix) (Fig.
21).
|
-subunit of the
strictly bacterial GlyRS is remarkable for its high degree of sequence
conservation, far greater than is seen in the
-subunit or other
aminoacyl-tRNA synthetases. However, the two subunits, as might be
expected, yield the same phylogeny, which (except for those bacteria
having the archaeal type of GlyRS) more or less conforms to accepted
organismal taxonomy. The so-called archaeal GlyRS type shows three main
groupings, i.e., archaeal, eukaryotic, and bacterial. All three
groupings appear monophyletic, but the eukaryotic GlyRS cluster lies
within the archaeal group in Fig. 21, making the latter paraphyletic.
Recruitment of Aminoacyl-tRNA Synthetases into Other Roles
A number of proteins exist that have obviously derived from
particular aminoacyl-tRNA synthetases but are known or suspected for
obvious reasons not to function in translation. The pseudo-synthetases in question belong to the glutamate, lysine, histidine, phenylalanine, alanine, and asparagine synthetase families. The glutamate homolog (yadB in E. coli), which is only about 60% the
length of a normal GluRS, seems confined to the
and
proteobacteria and appears marginally specifically related to the
normal proteobacterial GluRSs. The class II lysine homolog, which lacks
the N-terminal ~170 amino acids of the normal enzyme, is found in the
enteric-vibrio group, Aquifex, and T. pallidum.
It bears no specific relationship to any of the normal class II LysRS
types. The histidinyl homolog appears to be a normal full-length HisRS.
It is found in some gram-positive bacteria, a few proteobacteria,
cyanobacteria, and Aquifex. This pseudo-synthetase is
marginally of the archaeal HisRS genre, yet it does not occur in the
archaea. Recently, it was shown that this gene (hisZ) lacks
aminoacylation activity but is a required component of histidine
biosynthesis in a number of bacteria in which the hisG gene
is truncated (56). In three of the four euryarchaea
(Pyrococcus excluded), there is a protein that resembles the
chain of PheRS. The molecule lacks ~180 amino acids at the N
terminus relative to normal archaeal PheA and extends past the normal C
terminus by about the same number (63), while other areas
exist within the molecule that also show little or no similarity to the
normal PheRS. A highly truncated version(s) of the AlaRS exists in some
Archaea and a few Bacteria. Finally, Pyrococcus species contain a highly truncated form of their
normal AsnRS (e.g., accession no. BAA29342).
Summary of Aminoacyl-tRNA Synthetase Evolutionary Profiles
The aminoacyl-tRNA synthetases have obviously been
subject to a great deal of horizontal gene transfer over the
evolutionary course covered by the universal phylogenetic tree. Given
this, it is somewhat surprising how much of the ancient evolutionary trace these enzymes have retained. The full canonical pattern is shown
almost without exception by four of the enzymes (aspartic and glutamic
acids, phenylalanine, and leucine) and arguably by two others (tyrosine
and tryptophan). In some of these, the bacterial genre contains two or
more dramatically dissimilar subtypes, the most extreme example perhaps
being TrpRS, where no fewer than six distinct bacterial subtypes can be
discerned (Fig. 19). For another four synthetases, i.e., isoleucine,
histidine, proline, and probably methionine, the full canonical pattern
holds, although a significant minority of bacterial taxa employ a
synthetase of the archaeal rather than the bacterial genre. Three more
of the amino acids
valine, threonine, and alanine
and perhaps a
fourth, arginine, show only the basal canonical pattern; the eukaryotic enzymes in these cases are obviously of the bacterial genre. Almost certainly this pattern results from the horizontal displacement deep in
the eukaryotic stem of an original eukaryotic synthetase of the
archaeal genre by one of the bacterial genre. Indeed, the Giardia
lamblia AlaRS (Fig. 10) lends support to this notion, for that
enzyme, unlike its eukaryotic counterparts, is clearly of the archaeal
genre (see the alanine section above) but, as expected, is
phylogenetically external to the archaeal group per se. The remaining 6 of the 20 amino acids, i.e., cysteine, serine, lysine, glycine,
asparagine, and glutamine, do not conform to the canonical pattern.
These six we call the "gemini group."
The gemini group derives its name from the fact that for each of its six members (and only these six), two unrelated charging systems exist. For glutamine and asparagine, the difference between the two systems is of one kind: one system is a normal direct charging, while the other is indirect, with a "precursor" amino acid being placed on the tRNA and then biochemically converted in situ to the appropriate amino acid by amidation (see above) (12, 33). For cysteine, the alternative mode of charging, which involves ProRS (60), is completely novel (see the cysteine section above).
The other three members of the gemini group, i.e., serine, lysine, and glycine, each have two normal but not (specifically) related charging systems, thereby violating the monophyly rule. The second (minor) charging enzyme for serine, which so far has been found only in two genera of methanogens, belongs to the same class II supercluster as does the dominant (normal) SerRS but is probably not specifically related to the dominant SerRS therein (see the serine section above). The two LysRS enzymes belong to different structural classes; the class I enzyme occurs in most archaea and a small number of bacterial taxa, while its class II counterpart dominates the bacteria and is seen in a few archaea and all of the eukaryotes so far examined. The heterotetrameric form of GlyRS is confined to and dominates the bacteria, while its unrelated homodimeric counterpart occurs in all characterized archaea and eukaryotes as well as in a minority of bacterial taxa.
We do not consider it coincidental that the members of the gemini group
share these two characteristics, i.e., lack of canonical pattern and
twin charging systems, because in our opinion it reflects an
evolutionary circumstance unique to this group
one that may well shed
light on the meaning of the canonical pattern and the long-term dynamic
of horizontal gene transfer (see the discussion below).
PHYLOGENETIC PICTURE EMERGING FROM THE
AMINOACYL-TRNA SYNTHETASES
|
|
|---|
Much has been said and written about aminoacyl-tRNA synthetases
contradicting established organismal phylogenies. From the above
analysis, one can see that in many cases they do so
sometimes in
spectacular ways. In at least as many other cases, however, they do
not. In this section, we discuss the question of phylogenetic congruence between the aminoacyl-tRNA synthetases and established organismal phylogenies. The main framework for the discussion is
Table 7, a tabulation of the extents to
which the various aminoacyl-tRNA synthetase relationships confirm
certain established major taxonomic groupings. As we shall see, none of
the established organismal relationships are disproven, most are
confirmed (by a majority of the synthetases), and the horizontal gene
transfer characteristics of these enzymes suggests other evolutionary
relationships that were not detected by the phylogenetic analyses of
vertically inherited genes.
|
The aminoacyl-tRNA synthetases most strikingly confirm the profound evolutionary divide that separates the Bacteria from the Archaea and Eucarya. Over and above this, these enzymes support most of the accepted major taxa (kingdoms, divisions, etc.) within each of the three domains (in cases where a sufficient number of species have been characterized to permit making reliable assessments). The aminoacyl-tRNA synthetases do not confirm the branching orders among the major taxa within each domain. Here they present only a confused, and so unreliable, picture.
Spirochetes
The spirochetes constitute a major bacterial grouping that by
multicellular eukaryotic standards would be considered at least a
division or phylum (47, 68). The two sequenced spirochete genomes
from Borrelia burgdorferi and Treponema
pallidum
do not cover the full phylogenetic breadth of the
division, but they are well separated phylogenetically within it. The
two spirochetes possess 19 of the aminoacyl-tRNA synthetases (they have
no GlnRS). In only two cases, i.e., threonine, and proline, are the
spirochete enzymes clearly not specifically related to one another. In
the first case, the Treponema ThrRS groups with those of the
proteobacteria, quite distant from the grouping that encompasses the
Borrelia ThrRS (Fig. 9). The T. pallidum ProRS
clusters within the main bacterial group, while the B. burgdorferi ProRS is of the archaeal genre and is specifically
related to the eukaryotic ProRS (Fig. 8). In a few other trees (Fig. 3
and 14), the two spirochetes do not appear to be specifically related
but nevertheless are clustered by sequence signature analysis.
Moreover, in these trees the two genera lie close enough that the
conclusion that they are not specifically related is by no means certain.
The spirochetes are notable for the number of cases in which their
aminoacyl-tRNA synthetases are of the archaeal and eukaryotic genre: 7 of their 19 synthetases are so (Table 8).
In four of these, i.e., isoleucine, methionine, proline, and histidine,
the group to which a spirochete enzyme belongs not only lies within the
general archaeal and eukaryotic grouping but therein is a sister
lineage to the eukaryotic synthetases. (For proline, only the
Borrelia enzyme is of this nature.) These four examples
cover all those in which a bacterial subtype is of the archaeal and eukaryotic genre and specifically related to its eukaryotic
counterparts (Table 8).
|
For two others of the seven spirochete exceptions (phenylalanine and
lysine), the synthetase is precisely of the archaeal type. The
spirochete PheRS is the only bacterial PheRS of the archaeal genre and
is most similar to its Pyrococcus counterpart (Fig. 2) (see
the phenylalanine section above). The spirochetes are one of three
bacterial taxa known to use the class I LysRSs. While the spirochete
enzyme again appears Pyrococcus-like, the other bacterial
class I LysRSs (found in the
proteobacteria and streptomycetes)
resemble the class I LysRSs of the crenarchaeal branch of the
Archaea. The final (seventh) example of a "nonbacterial" synthetase in the spirochetes is GlyRS. Here the spirochete enzyme belongs to a minor but significant clade of bacterial GlyRSs that are
of the archaeal and eukaryotic type and almost certainly of archaeal
origin (see the glycine section above).
ArgRS, the most evolutionarily complex of the 20 synthetases, should
also be mentioned here. The spirochete ArgRS is unusual in that it (and
the Porphyromonas enzyme) forms a phylogenetically coherent
grouping peripherally related to the two bacterial groupings in which
the eukaryotic ArgRSs reside
predicted by sequence signature (Fig.
20). This entire combined unit appears not to be specifically related
to the "main" bacterial type (genre) and can at best be only
marginally related to the archaeal genre (see the arginine section above).
Cytophaga-Chlorobium Kingdom
While the two divisions that comprise the Cytophaga-Chlorobium kingdom were each solidly defined by 16S rRNA analysis (47, 68), their sister relationship was only weakly suggested thereby. The addition of 23S rRNA data made the case for the kingdom convincing (74). Porphyromonas, on the bacteroides branch of the Cytophaga division, and Chlorobium tepidum together span the full phylogenetic breadth of this kingdom. The aminoacyl-tRNA synthetases from these two organisms further reinforce the specific relationship between the Cytophaga and Chlorobium (green sulfur) divisions.
As Table 7 shows, for 13 of the amino acids, the Porphyromonas and Chlorobium synthetases are of the same bacterial subtype and specifically related to one another. (In one of these, leucine, the relationship is seen only in sequence signatures, not in the phylogenetic trees. However, the tree in Fig. 3 does put the two taxa in close enough proximity that the branching order shown cannot be considered definitive.) For two of the seven (confirmed) exceptions to the specific relationship, i.e., asparagine and glutamine, no synthetase has been detected in the currently available Chlorobium sequence data. For the remaining five exceptions, i.e., aspartic acid, arginine, histidine, tryptophan, and tyrosine, the Porphyromonas and Chlorobium enzymes clearly belong to different bacterial subtypes (Fig. 11, 12, 16, 19, and 20). Table 8 shows that a total of four of the synthetases in both taxa, i.e., isoleucine, methionine, proline and glycine, are of the archaeal genre, while a fifth (histidine) is so in Porphyromonas only.
Deinococcus-Thermus Division
The Deinococcus-Thermus bacterial division was recognized initially through 16S rRNA analysis. (The kingdom-level unit to which it belongs includes a second division, represented by Chloroflexus and Thermomicrobium, which has yet to be characterized in genomic sequence terms [47, 68].) Although no extensive genomic data are publically available for Thermus species, sequences exist in the public databases for 11 different Thermus aminoacyl-tRNA synthetases. In all 11 cases, the Thermus enzyme is specifically related to its Deinococcus counterpart. (Using the BLAST score server provided by the Göttingen Genomics Laboratory [www.g2l.bio.uni-goettingen.de], one can infer that another seven Thermus enzymes at least, i.e., CysRS, GlnRS, IleRS, ThrRS, TyrRS, TrpRS, and ArgRS, also are specifically related to their Deinococcus counterparts.) These 18 examples give solid support to the predicted specific relationship between the genera Deinococcus and Thermus. For 2 of the 20 synthetases, i.e., isoleucine and proline, the Deinococcus (and Thermus) enzymes are known to cluster with their spirochete and Cytophaga-Chlorobium counterparts in a unit that also contains the eukaryotic synthetases (Fig. 4 and 8). In all, 4 of the 20 Deinococcus-Thermus synthetases are of archaeal character (Table 8).
The extent to which the three major bacterial divisions or kingdoms just discussed, i.e., the spirochetes, the Cytophaga-Chlorobium kingdom, and the Deinococcus-Thermus division, show the same minor bacterial synthetase type is noteworthy. For the amino acids isoleucine, proline, glutamic acid, cysteine, and glycine (Fig. 4, 8, 17, 18, and 21), all three bacterial taxa are represented in the same minor bacterial synthetase group and, except for glutamic acid and cysteine, that synthetase is of archaeal character. For another two or three amino acids, i.e., methionine, histidine, and possibly serine, two of the three taxa are represented in the same minor bacterial synthetase group (Tables 7 and 8). One wonders whether this correlation is suggestive of some evolutionary relationship among these three divisions and kingdoms that is too subtle to have been detected by analyses of rRNA or other vertically inherited genes.
Proteobacteria (Purple Bacteria)
As defined by rRNA analysis, the kingdom Proteobacteria
comprises five divisions (originally designated subdivisions),
through
(47, 68). The
division is actually a highly
divergent branch within
, and these two together show a sister
relationship to
. The remaining two divisions,
and
, appear
specifically related to one another and together specifically relate to
the
-
-
grouping (47, 68). Except for
, all the
proteobacterial divisions have genomically characterized representatives.
For over half of the 20 amino acids, the aminoacyl-tRNA synthetases of
the major bacterial subtype encompass all four characterized proteobacterial divisions. Otherwise, the divisions, or various taxa
within them, show different subtypes, usually bacterial. For asparagine
and glutamine (as in many other bacterial taxa), some proteobacteria
possess no synthetases for one or both of these amino acids. The
phylogenetic unit that comprises the
,
, and
divisions is
seen in at least nine of the synthetases; the higher-level relationship
that then includes the
division as well can be seen in about eight
cases (Table 7). Only one of the proteobacterial synthetases, MetRS, is
of the archaeal genre, with the exception of four cases in which some
or all
proteobacteria are so (Table 8).
Gram-Positive Bacteria
The gram-positive bacteria as defined by rRNA comprise a number of major lineages, not merely the two conventionally recognized high- and low-G+C types (47, 68). The kingdom is phylogenetically sufficiently disperse that its coherence is only minimally suggested even by rRNA analyses. It is not unexpected, therefore, that the aminoacyl-tRNA synthetases do not give evidence supporting the phylogenetic coherence of the kingdom as a whole. At this point in time, three of the gram-positive lineages are represented in genomic terms, the high-G+C gram-positive lineage (mycobacteria and some streptomycetes), the Bacillus-Lactobacillus lineage (genomes of Bacillus, Staphylococcus, Streptococcus, Enterococcus, and two mycoplasmas), and a third lineage represented by C. acetobutylicum.
The aminoacyl-tRNA synthetases of B. subtilis are never of the archaeal genre, and almost without exception the organism has a synthetase of the main bacterial subtype. For two of the amino acids, i.e., threonine and tyrosine, B. subtilis carries two synthetases, each of a different bacterial subtype. Almost without exception, the synthetases of Streptococcus and Enterococcus are of the same bacterial subtype as those of B. subtilis and are specifically related therein to their B. subtilis counterpart. The corresponding mycoplasma synthetases are somewhat more idiosyncratic (discussed below).
The synthetases of C. acetobutylicum (representing a distinct gram-positive division) are of the same bacterial subtype as those of Bacillus in the majority of cases, at least six of which show a specific relationship to the Bacillus "family" (Table 7).
The synthetases representing the high-G+C division of the gram-positive bacteria give no definitive indication that this division is related to the others in the gram-positive kingdom. In most cases, however, the mycobacterial version of a synthetase belongs to the same bacterial subtype as does its Bacillus counterpart. Although the representative mycobacterial synthetases do not confirm the organism's relationship to other gram-positive bacteria, neither do they contradict it, since the mycobacterial enzymes show no consistent specific relationship to synthetases from other taxa.
Bacterial taxa represented by a single genome only are not discussed in this context, since the data are too few to draw meaningful conclusions and little in the way of solid relationships to other taxa are seen (however, see Table 7).
Rickettsias, Mycoplasmas, and Certain Other Bacteria
Paleontologists have known for the better part of this century that different lineages evolve at different rates or tempos (55). Molecular evolutionists encountered this phenomenon early in the comparative study of molecular sequences. What paleontologists also understood was that this quantitative measure, tempo, was associated with a qualitative one, "mode," i.e., the nature of the evolutionary outcomes (55). The question then became whether evolutionary tempo at the molecular level also has some kind of mode associated with it. Comparative analysis of rRNA sequences showed that this was indeed the case, in turn suggesting that the tempo-mode relationship was a fundamental characteristic of the evolutionary process (68).
Molecular mode can be explained as follows. If changes (residue replacements and the like) at the sequence level were purely a matter of tempo, the frequency distribution for residue replacements (as a function of position in a given sequence) would have the same relative shape for rapidly and slowly evolving lineages. However, it has been observed that the shape of this distribution is different in the two cases: rapidly evolving lineages show a disproportionate increase in the replacement rate for the more highly conserved residues in the molecule (68). This in principle then allows the detection of rapidly (or slowly) evolving lineages without resort to outgroup rooting of a tree (68).
Three good examples of rapidly evolving lineages defined by rRNA criteria are represented by full genomic sequences: those of the mycoplasmas, Rickettsia prowazekii, and Helicobacter pylori (47, 68). Mycoplasmas are considered the most rapidly evolving lineage of all (68). The mycoplasmas have been shown by rRNA and other analyses to form a phylogenetic grouping that is specifically related to the major gram-positive taxon built around the genera Bacillus, Lactobacillus, Streptococcus, and relatives (47, 68); a specific relationship may even exist between the mycoplasmas and streptococci. In the aminoacyl-tRNA synthetase trees, the mycoplasmas cluster specifically with Bacillus and the others in only two cases, isoleucine and glutamic acid (Table 7). In two others, the mycoplasmas have a synthetase of the archaeal genre (which Bacillus never has). In the remaining cases, the mycoplasmas belong to the same bacterial synthetase subtype as does Bacillus, but they do not appear specifically related to the latter group. Indeed, in the majority of trees the mycoplasmas (alone or in combination with a small number of other bacterial taxa) appear to form the deepest-branching lineage in their bacterial synthetase subtype grouping. In contrast, a weak to strong sequence signature suggests specific relationship between the mycoplasmas and Bacillus and relatives in at least 10 cases (Table 7).
Failure of the mycoplasmas to tree reliably is a common experience. Mycoplasma proteins are generally quite idiosyncratic in sequence (with a number of positions therein that break with what otherwise is the universal composition). Because of their highly derived nature (manifested also in impressively low dipeptide similarities), the parsimony treeing algorithm at least tends to place them improperly (21). The conclusion of nonrelationship of the mycoplasma synthetases to their Bacillus group counterparts implied by the above phylogenetic trees is best held in abeyance until sequence data are available from closer relatives of the mycoplasmas, such as their walled relatives (68); as might be expected, experience has shown that the sequences from rapidly evolving species are "better behaved" when treed in the presence of specific close relatives that are not so rapidly evolving (68).
Helicobacter is another case in which aminoacyl-tRNA
synthetase sequences appear somewhat highly derived. Although in the above trees the organism groups with the other proteobacteria in some
cases, in others it tends to wander among the taxa, often branching
near the bottom of a given major bacterial group and sometimes
"specifically" with the mycoplasmas, both of which are signs of
rapidly evolving, highly derived sequences. As with the mycoplasmas,
the treeing of Helicobacter sequences should benefit from
sequence data from more closely related but slower-evolving relatives,
for example, members of the
division of the
Proteobacteria.
The R. prowazekii aminoacyl-tRNA synthetase sequences appear
less highly derived than those of the two taxa just discussed. The
above trees show that 14 of the 18 rickettsial aminoacyl-tRNA synthetases cluster with their
proteobacterial counterparts, as do
rickettsial proteins in general (2). In the four remaining cases, the rickettsial synthetases are of a different genre from their
proteobacterial counterparts. It is too early to draw firm
conclusions about rapidly evolving lineages from these three organisms
(mycoplasmas, rickettsias, and Helicobacter), but the data
appear to suggest that at least some types of rapidly evolving lineages
might be more prone to horizontal gene transfers than are their more
slowly evolving relatives.
Archaea
Too few archaeal aminoacyl-tRNA synthetases have been sequenced to permit detailed evolutionary conclusions to be drawn. However, some loose general conclusions can be made at this point. (i) The archaeal synthetases usually form a monophyletic (though sometimes paraphyletic) grouping. (ii) The fundamental euryarchaeal versus crenarchaeal divide within the Archaea is respected in most but not in all cases; it is seen in 11 cases (Table 8) and in several more in which the crenarchaeal cluster is joined by a Pyrococcus synthetase. (iii) Horizontal gene transfers of aminoacyl-tRNA synthetases have occurred both within the archaea and between the archaea and eukaryotes. (The TyrRSs and TrpRS give especially clear indications of this [see the tyrosine and tryptophan sections above].) (iv) While synthetases of the archaeal genre seem often to have migrated into the bacteria, the opposite seems not to occur, with the exception of the gemini group of synthetases (discussed above), which show a unique evolutionary dynamic.
EVOLUTIONARY SYNTHESIS
|
|
|---|
Understanding in an evolutionary context is not like understanding in a general (modern) biological context. It is nature, not the biologist, that does the evolutionary experiment, and essentially all such experiments have been done in the distant past and certainly cannot be repeated in the laboratory. Such limitations require that evolutionary understanding draw heavily upon inference and inspired conjecture. Therefore, if we are to reconstruct the history of life on this planet and understand the process by which it arose, theories capable of relating and interpreting the available facts in broad, meaningful ways, capable of defining and focusing scientific thinking on particular ideas, will have to be developed. In this concluding section, we try to do exactly this, i.e., to use the aminoacyl-tRNA synthetases to help construct a tentative picture of what the primary lineages are and what the evolutionary process that produced them is. The goal is to develop a clearer concept of how modern cells evolved and to identify various stages in that process.
Because of their modular nature (see above) (51), the aminoacyl-tRNA synthetases readily undergo horizontal gene transfer. Also, because their functions are ancient and universal, these transfers are expected to be broad in scope (which they are), to occur throughout the recorded evolutionary course (which they appear to do), and to be largely selectively neutral in character (which is certainly the simplest explanation, for example, for the fact that the same organism can harbor genes for two very different but functionally equivalent subtypes of a given synthetase). Because the 20 charging enzymes are in essence alike in function (and presumably alike in their relationships to the cell as a whole), one would expect the horizontal gene transfer profiles of the 20 synthetases all to be similar in some general respect (although in their details, of course, each would be unique). However, this is clearly not the case: the horizontal gene transfer profiles of the different synthetases can be qualitatively different. Nevertheless, these profiles seem to be of several general types, which are crudely distinguished by the extent to which and ways in which they manifest the canonical pattern.
Evolutionary Significance of the Canonical Pattern
What is the significance of the canonical pattern; at what stage in the evolution of the cell did the canonical pattern arise? The horizontal gene transfer patterns of the aminoacyl-tRNA synthetases begin to provide answers here, but far too few data now exist to treat these answers as anything but theoretical conjectures requiring more thorough testing in the future.
It is essentially self-evident that, other factors being equal, most horizontal gene transfers will be taxonomically local (the donor and recipient would be close neighbors on the phylogenetic tree), for the simple reason that in such cases, horizontal displacement of an indigenous gene by a foreign equivalent would be minimally perturbing of cellular function, in that the indigenous and foreign genes are relatively similar in sequence. With the caveat that the more taxonomically local the horizontal gene replacement, the harder it is to recognize (the more data are needed), we nevertheless think that the data available at present suggest that horizontal gene transfers of the aminoacyl-tRNA synthetases are not predominantly local. The horizontal displacements that have occurred deep in the phylogenetic tree, involving transfers from one major taxon to another major taxon, appear to be unexpectedly prevalent.
The majority of transfers that introduce synthetases of the archaeal genre into the bacteria appear to involve a gene that arose deep in the common archaeal/eukaryotic lineage and transferred to the ancestor of some major bacterial taxon. These can only have been very ancient evolutionary events. Also, the fact that they appear to dominate the landscape of aminoacyl-tRNA relationships suggests that the dynamic of horizontal gene transfer has not remained constant over the evolutionary course covered by the universal phylogenetic tree.
We assume that the primary determinant of the nature of horizontal transfer is the nature of the recipient cell. If this is so, a dynamic of horizontal gene transfer that changes over the evolutionary course means that the nature of cells has changed over that course. Our theory, then, is that the deeper branching in the universal phylogenetic tree corresponds to evolutionary stages when the evolution of the cell was not yet complete, when the modern cell(s) had yet to emerge. In other words, the primary branchings in the phylogenetic tree (the canonical pattern) and the deep branchings in each of the domains involved primitive entities that were in the process of evolving to become modern cells. This would easily explain why the sequences of bacterial and archaeal aminoacyl-tRNA synthetases (different genres) differ so dramatically from one another while the sequences within each of the domains differ in relatively trivial ways (reference 68 offers a more detailed discussion of this matter).
The complex, highly refined cells of today obviously came from
primitive counterparts that were far simpler and more rudimentary. Not
only were the subsystems of the cell
the translation and transcription mechanisms, the genome and its replication apparatus, and so
on
simpler and less accurate in their functions than today's versions
thereof, but also the primitive cell as a whole was more loosely and
less hierarchically organized and its states were fewer and less well defined (72). Such simple systems, loosely and simply
organized and defined, can undergo changes of a more dramatic sort than would be permissible in complex, highly integrated modern cells. Horizontal transfer of genes would be less disruptive of these primitive cells and their subsystems, and so the process of horizontal gene transfer would be of a very different nature, i.e., more pervasive, widespread, and spectacular, than it now is. Indeed, at very
early times, horizontal gene transfer could have encompassed all
aspects of the cell, all its genes (67). The process would dominate the primitive evolutionary dynamic; most of the evolutionary innovation would be horizontally acquired. At such early evolutionary stages, life (no matter how varied at the level of the individual organism) can be looked at as communal in the evolutionary sense, for
it is only through horizontally shared innovation that the evolution
leading to modern life was possible; i.e., the community evolved as a
whole, not as individual organismal lineages (69; see also discussions of earlier forms of the idea in references 36 and 67).
Gradually, then, as the subsystems of the cell over time became
increasingly complex and refined and as they became more intricately interwoven into the evolving fabric of the cell, horizontal gene transfers would become increasingly restricted: foreign parts tend not
to be compatible with complex, precisely defined machines. Of course,
the first systems in the cell to become so refined, which probably were
the information-processing mechanisms
in particular translation
(72)
would be the first to become refractory to horizontal
gene transfer.
What the basal canonical pattern then represents is an early stage when the first (the more complex) subsystems of the cell became more or less refractory to horizontal gene transfer and the universal ancestor had differentiated into the communities that would spawn the primary organismal groupings. At that early time, the eukaryotic and archaeal lineages were still communally unified (at least in terms of the information processing systems) and all aspects of the cell had long evolutionary developments ahead of them.
The aminoacyl-tRNA synthetases that exhibit the canonical phylogenetic pattern are therefore those whose organismal taxonomic distributions became fixed at this early stage in cellular evolution. In their evolutionary profiles, these enzymes retain some record of the evolutions of the three basic cell types. Conversely, the aminoacyl-tRNA synthetases whose evolutionary profiles show little or no canonical pattern were still in evolutionary flux at this early stage; their organismal distributions would not stabilize until much later in the evolutionary course, after modern cells had evolved in some cases. The last synthetases to achieve their present taxonomic distribution, we think, are the members of the gemini group, to which we now return our attention.
The gemini group exemplifies the evolution of the amino acid charging
systems. Multiple ways of associating amino acids with their tRNAs have
always existed
as the existence of the two main synthetase structural
classes implies. A considerable body of experimental data now shows
that structures derived from RNA alone can both chemically discriminate
between amino acids and catalyze aminoacylation with a high degree of
substrate specificity (77). This ability of RNA to mimic
some of the essential characteristics of contemporary aminoacyl-tRNA
synthetases strongly supports older ideas that direct interactions
between nucleic acids and amino acids (or peptides) contributed to the
origin of translation (25, 71).
The evolutionary field is strewn with the relics of apparent takeover (replacement) battles among the tRNA-charging systems. Long before the universal ancestor gave rise to the primary organismal groupings, the ancestor of the LeuRS, IleRS, and ValRS, say, spawned some variant that came to displace earlier versions of the leucine, isoleucine, and valine-charging systems. We know these to be early events because the LeuRS, IleRS, and ValRS evolutionary profiles retain semblances of the canonical pattern. At that early time or before, the two unrelated enzymes of the glycine-charging system as well as the two lysine-charging enzyme also existed, but from their evolutionary profiles (i.e., from the lack of canonical pattern shown), it would appear that the takeover battles in these cases continued to some extent into the modern evolutionary era.
Serine and cysteine represent takeovers in which one of the two systems has achieved almost complete dominance. There seems to have been an archaeal type for both systems, quite different from the corresponding bacterial type. However, these archaeal types have now largely been displaced by their bacterial counterparts. The lack of canonical pattern in these cases is prima facie evidence that the spread of the bacterial types occurred late in the evolutionary course, probably after the major branchings in each of the primary organismal groupings had begun to form.
The final two aminoacyl-tRNA synthetases in the gemini group, glutamine and asparagine, add a solid time point to the developing picture of the evolutionary course. For each of these amino acids, the aminoacyl-tRNA synthetase has arisen from within the cluster defined by the synthetases of the corresponding dicarboxylic acid, and for both, the ancestral source has been a GluRS or AspRS of the archaeal/eukaryotic genre. (As seen above, the GluRS and AspRS enzymes exhibit strongly canonical pattern in their own right; i.e., they are ancient in both origin and taxonomic distribution.)
GlnRS arises specifically from the eukaryotic GluRS lineage (see the glutamine section above). Although AsnRS shows clear affinity with the archaeal and eukaryotic AspRSs (to the exclusion of bacterial AsnRSs), no convincing specific affinity to either the eukaryotic or archaeal AspRS is evident. This could be explained by the AsnRS arising somewhat prior to the GlnRS, which is also consistent with the wider taxonomic distribution of the former. Given its ancestry, the modern GlnRS, unlike its parental GluRS, had to have come into being after the three organismal domains themselves arose; the enzyme achieved its taxonomic distribution subsequently. We think that the lack of canonical pattern shown by all synthetases in the gemini group has a similar explanation, i.e., that all their taxonomic distributions, like that of GlnRS, were established relatively late in the evolutionary course, well after the domains themselves had arisen and their major branchings had begun to coalesce.
Some General Evolutionary Matters
It is one thing to note that certain bacterial taxa contain one or more aminoacyl-tRNA synthetases of the archaeal genre or that there are two very dissimilar types of a bacterial synthetase that are nevertheless of the same genre; e.g., TyrRS. It is another to understand how these came to be. In some cases, the source of the enzyme can be satisfactorily localized. Several examples exist (see above) in which the bacterial enzyme is not only of the archaeal genre but distinctly Pyrococcus-like. Such localizations will refine with the sequencing of additional archaeal genomes, just as will the precise source of the (plasmid-borne) mupirocin-resistant IleRS recently acquired by some Staphylococcus strains (9) (currently localized phylogenetically only to the general vicinity of the major gram-positive bacterial taxon represented by C. acetobutylicum [see the isoleucine section above]).
However, in a number of cases a bacterial synthetase is obviously of the general archaeal genre but resembles neither the archaeal nor the eukaryotic version specifically. This immediately raises a nontrivial question about the source of the synthetase. The same general question arises in the context of the bacteria alone when two very disparate versions of a synthetase exist, both bacterial in genre. The reflex answer to such questions, of course, is that undiscovered deeply branching lineages in either the Archaea or the Bacteria are the sources. Such explanations serve only to trivialize the questions. What are these hypothetical lineages? In the archaeal cases, some of them would need to branch not from the archaeal stem itself but from the common stem shared by the archaea and eukaryotes, in which case they would represent undiscovered organismal domains! Obviously, microbiologists have yet to uncover a great deal of the diversity in the microbial world, but the idea that in this day and age they have missed entire domains stretches credulity somewhat.
A more satisfying explanation may lie in the classical evolutionists' concept of evolutionary radiations, which again involves the tempo-mode relationship (55). The fossil record holds key evidence that otherwise would not be available, i.e., evidence of extinct lineages. In the world of macroscopic organisms, an evolution radiation (which is a period of rapid evolution) has three important characteristics: (i) accelerated evolutionary tempo, (ii) remarkable and remarkably diverse evolutionary invention, and (iii) creation of new lineages, most of which are short-lived (on the evolutionary timescale).
This last characteristic offers a conceptually challenging explanation for the point under discussion: the evolutions that underlay the development of the Bacteria and the Archaea (and Eucarya) were not the simple straightforward courses one might naively infer from the ancestral stems on phylogenetic trees. Rather, these ostensibly bare linear stems actually correspond to periods of great evolutionary turmoil, invention, and radiation. However, the evidence for almost all of this is gone (especially on the organismal level); the lineages are extinct. However, what the aminoacyl-tRNA synthetases may be telling us is that not all traces of extinct lineages are necessarily erased. Horizontal gene transfer makes it possible for some of the genes, some of the evolutionary inventions that occurred in these extinct lineages, to survive today, preserved by their transfer to lineages that have persisted.
As mentioned above, the horizontal transfer of aminoacyl-tRNA synthetase genes between Archaea and Bacteria is asymmetric in the sense that transfers of archaeal-type synthetases into the bacteria are common, both early on and after the organismal domains coalesced (and their major taxa had begun to emerge). Transfers of bacterial synthetases into the Archaea appear not to have occurred (at least early on [see the discussion of the gemini group above]). This asymmetry has no satisfactory explanation right now. It is not because archaea are refractory to the transfer of aminoacyl-tRNA synthetase (or any other) genes. We have seen ample evidence above that intra-archaeal exchanges occur. The explanation could be that the archaeal cell type and the bacterial cell type are of different natures, the one more permissive as regards accepting foreign genetic material than the other. Alternatively, the archaeal and bacterial cell types may not have matured evolutionarily (become modern cells) at the same time, so that transfer of genes occurred from a relatively mature archaeal type of cell to a relatively immature bacterial type of cell, which would therefore be more receptive to foreign genetic material.
It is not intended that the reader accept the above hypothetical scheme as truth; nor do we wish him or her to view it as idle and so useless speculation. The ideas we have put forth in this section are simply part of our attempt to paint a picture of the course of cellular evolution and to generate a theory that has the consistency, explanatory power, and conceptual power to inspire and inform studies of deep evolutionary questions.
CONCLUSION
|
|
|---|
It is unlikely that the aminoacyl-tRNA synthetases played any specific role in the evolution of the genetic code; their evolutions did not shape the codon assignments. Far from diminishing the evolutionary significance of the aminoacyl-tRNA synthetases, this realization, in a sense, enhances it. The separation of the two problems allows biologists to see the evolution of the code for the profound problem that it is and to focus on what the evolutions of these synthetases do tell us. Aminoacyl-tRNA synthetase genes are among the more frequently exchanged of the functionally important molecules in the cell. However, this does not diminish, but enhances, the value of the AARSs as evolutionary indicators. Admittedly, horizontal transfer introduces noise into organismal genealogical records, but evolution has dimensions, qualities, that are not captured in vertically inherited genes; it is precisely because the AARSs are horizontally transferred that they provide information about some of these otherwise inaccessible dimensions. The aminoacyl-tRNA synthetases appear to tell of the existence of deeply branching lineages in both the Bacteria and Archaea that do not have known (or extant) representatives. In that the evolutionary dynamic of horizontal gene transfer is in the first instance a function of the nature of the recipient cell, the fact that this dynamic appears increasingly to change (becoming more extreme in nature) the further back in time evolution is traced implies that even after the phylogenetic structure of the domains began to form, cells may not have been as they are today; they had yet to evolve, to mature into modern cells.
In all this is the suggestion that the evolutionary process is not what it might seem when attention is confined to the genome sequence level, i.e., a more or less continuous progression leading from very simple primitive forms to the modern ones. Rather, evolution may proceed through a series of discontinuous stages, with the quality of the evolving systems changing dramatically from stage to stage, and these stages can be defined by the type of horizontal gene transfer each exhibits. If we can infer the dynamics of horizontal gene transfer finely enough at early enough evolutionary times, the stages should reveal themselves. In the present review, we have used the AARSs merely for what they can tell about the later evolutionary stages, those associated with the emergence of the primary lines of descent. It is obvious that the synthetases collectively carry a record of far earlier times, earlier and more genetically "violent" evolutionary stages. These were stages that preceded those represented by the root of the universal (rRNA) phylogenetic tree, and they can be glimpsed (and so identified) by the evolutionary relationships among aminoacyl-tRNA synthetases of different amino acid types, in other words, by synthetase takeovers that are not confined within a given amino acid type.
Individually, the synthetases represent universal, constant, and
ancient functions, which are defined mainly by the tRNAs
by all
accounts among the most ancient and structurally constant of molecules
in the cell. The selective pressures against the horizontal transfer of
these enzymes seem minimal and general. In that the 20 aminoacyl-tRNA
synthetases come as close as possible to being identical in
evolutionary respects, their evolutions collectively serve to
"interpret" one another; comparisons among them tend to sort out
the trivial, and the idiosyncratic in each of their evolutionary
records from the telling aspects of their histories. This is what makes
the aminoacyl-tRNA synthetases so valuable a tool in attempts to
reconstruct the history of life on this planet.
ACKNOWLEDGMENTS
|
|
|---|
Work in our laboratories on this subject was supported by grants from NASA and DOE (to C.R.W. and G.J.O.) and the National Institute of General Medical Sciences (to D.S.). M.I. is the recipient of an Investigator award from the Alfred Benzon Foundation. The computational work used resources at the W. M. Keck Center for Comparative and Functional Genomics.
We are extremely grateful to S. T. Fitz-Gibbon, W. Metcalf, and M. L. Sogin, who have provided data prior to publication. We thank D. Graham for help with the manuscript.
FOOTNOTES
* Corresponding author. Mailing address: Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave., P.O. Box 208114, New Haven, CT 06520-8114. Phone: (203) 432-6200. Fax: (203) 432-6202. E-mail: soll{at}tma.chem.yale.edu.
REFERENCES
|
|
|---|
| 1. | Adduce, J., and M. Hoosegow. 1992. MOLPHY: programs for maximum likelihood inference of protein phylogeny. Computer science monographs, no. 27. Institute of Statistical Mathematics, Tokyo, Japan. |
| 2. | Andersson, S. G., A. Zomorodipour, J. O. Andersson, T. Sicheritz-Ponten, U. C. Alsmark, R. M. Podowski, A. K. Naslund, A. S. Eriksson, H. H. Winkler, and C. G. Kurland. 1998. The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature 396:133-140[CrossRef][Medline]. |
| 3. | Arnez, J. G., and D. Moras. 1997. Structural and functional considerations of the aminoacylation reaction. Trends Biochem. Sci. 22:211-216[CrossRef][Medline]. |
| 4. | Berthet-Colominas, C., L. Seignovert, M. Härtlein, M. Grotli, S. Cusack, and R. Leberman. 1998. The crystal structure of asparaginyl-tRNA synthetase from Thermus thermophilus and its complexes with ATP and asparaginyl-adenylate: the mechanism of discrimination between asparagine and aspartic acid. EMBO J. 17:2947-2960[CrossRef][Medline]. |
| 5. | Brown, J. R. 1998. Aminoacyl-tRNA synthetases: evolution of a troubled family, p. 217-230. In J. Wiegel, and M. H. W. Adams (ed.), Thermophiles: the keys to molecular evolution and the origin of life? Taylor & Francis, London, United Kingdom. |
| 6. | Brown, J. R., F. T. Robb, R. Weiss, and W. F. Doolittle. 1997. Evidence for the early divergence of tryptophanyl- and tyrosyl-tRNA synthetases. J. Mol. Evol. 45:9-16[CrossRef][Medline]. |
| 7. |
Brown, J. R., and W. F. Doolittle.
1995.
Root of the universal tree of life based on ancient aminoacyl-tRNA synthetase gene duplications.
Proc. Natl. Acad. Sci. USA
92:2441-2445 |
| 8. | Brown, J. R., and W. F. Doolittle. 1999. Gene descent, duplication, and horizontal transfer in the evolution of glutamyl- and glutaminyl-tRNA synthetases. J. Mol. Evol. 49:485-495[CrossRef][Medline]. |
| 9. | Brown, J. R., J. Zhang, and J. E. Hodgson. 1998. A bacterial antibiotic resistance gene with eukaryotic origins. Curr. Biol. 8:R365-R367[CrossRef][Medline]. |
| 10. |
Burbaum, J. J., and P. Schimmel.
1991.
Structural relationships and the classification of aminoacyl-tRNA synthetases.
J. Biol. Chem.
266:16965-16968 |
| 11. | Commans, S., and A. Böck. 1999. Selenocysteine inserting tRNAs: an overview. FEMS Microbiol. Rev. 23:335-351[CrossRef][Medline]. |
| 12. |
Curnow, A. W.,
K.-W. Hong,
R. Yuan,
S.-I. Kim,
O. Martins,
W. Winkler,
T. M. Henkin, and D. Söll.
1997.
Glu-tRNAGln amidotransferase: a novel heterotrimeric enzyme required for correct decoding of glutamine codons during translation.
Proc. Natl. Acad. Sci. USA
94:11819-11826 |
| 13. |
Curnow, A. W.,
D. L. Tumbula,
J. T. Pelaschier,
B. Min, and D. Söll.
1998.
Glutamyl-tRNAGln amidotransferase in Deinococcus radiodurans may be confined to asparagine biosynthesis.
Proc. Natl. Acad. Sci. USA
95:12838-12843 |
| 14. | Cusack, S. 1997. Aminoacyl-tRNA synthetases. Curr. Opin. Struct. Biol. 7:881-889[CrossRef][Medline]. |
| 15. | Dennis, P. P. 1997. Ancient ciphers: translation in Archaea. Cell 89:1007-1010[CrossRef][Medline]. |
| 16. |
Diaz-Lazcoz, Y.,
J. C. Aude,
P. Nitschke,
H. Chiapello, and C. Landes-Devauchelle.
1998.
Evolution of genes, evolution of species: the case of aminoacyl-tRNA synthetases.
Mol. Biol. Evol.
15:1548-1561 |
| 17. | Doolittle, R. F., and J. Handy. 1998. Evolutionary anomalies among the aminoacyl-tRNA synthetases. Curr. Opin. Genet. Dev. 8:630-636[CrossRef][Medline]. |
| 18. |
Doolittle, W. F.
1999.
Phylogenetic classification and the universal tree.
Science
284:2124-2129 |
| 19. | Doublié, S., G. Bricogne, C. Gilmore, and C. W. Carter, Jr. 1995. Tryptophanyl-tRNA synthetase crystal structure reveals an unexpected homology to tyrosyl-tRNA synthetase. Structure 3:17-31[CrossRef][Medline]. |
| 20. | Eriani, G., M. Delarue, O. Poch, J. Gangloff, and D. Moras. 1990. Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature 347:203-206[CrossRef][Medline]. |
| 21. | Felsenstein, J. 1978. Cases in which parsimony or compatibility methods will be positively misleading. Syst. Zool. 27:401-410[CrossRef]. |
| 22. | First, E. A. 1998. Catalysis of aminoacylation by class I and class II aminoacyl-tRNA synthetases, p. 573-607. In M. L. Sinnott (ed.), Comprehensive biological catalysis. Academic Press, Ltd., London, United Kingdom. |
| 23. | Freeland, S. J., and L. D. Hurst. 1998. The genetic code is one in a million. J. Mol. Evol. 47:238-248[CrossRef][Medline]. |
| 24. |
Gagnon, Y.,
L. Lacoste,
N. Champagne, and J. Lapointe.
1996.
Widespread use of the Glu-tRNAGln transamidation pathway among bacteria. A member of the purple bacteria lacks glutaminyl-tRNA synthetase.
J. Biol. Chem.
271:14856-14863 |
| 25. | Gamow, G. A. 1954. Possible relation between deoxyribonucleic acid and protein structures. Nature 173:318[CrossRef]. |
| 26. | Goldman, N. 1993. Further results on error minimization in the genetic code. J. Mol. Evol. 37:662-664[Medline]. |
| 27. |
Gray, M. W.,
B. F. Lang,
R. Cedergren,
G. B. Golding,
C. Lemieux,
D. Sankoff,
M. Turmel,
N. Brossard,
E. Delage,
T. G. Littlejohn,
I. Plante,
P. Rioux,
D. Saint-Louis,
Y. Zhu, and G. Burger.
1998.
Genome structure and gene content in protist mitochondrial DNAs.
Nucleic Acids Res.
26:865-878 |
| 28. | Haig, D., and L. D. Hurst. 1991. A quantitative measure of error minimization in the genetic code. J. Mol. Evol. 33:412-417[CrossRef][Medline]. |
| 29. |
Hashimoto, T.,
L. B. Sanchez,
T. Shirakura,
M. Muller, and M. Hasegawa.
1998.
Secondary absence of mitochondria in Giardia lamblia and Trichomonas vaginalis revealed by valyl-tRNA synthetase phylogeny.
Proc. Natl. Acad. Sci. USA
95:6860-6865 |
| 30. |
Hamann, C. S.,
K. R. Sowers,
R. S. Lipman, and Y. M. Hou.
1999.
An archaeal aminoacyl-tRNA synthetase missing from genomic analysis.
J. Bacteriol.
181:5880-5884 |
| 31. |
Henikoff, S., and J. G. Henikoff.
1992.
Amino acid substitution matrices from protein blocks.
Proc. Natl. Acad. Sci. USA
89:10915-10919 |
| 32. |
Ibba, M.,
J. L. Bono,
P. A. Rosa, and D. Söll.
1997.
Archaeal-type lysyl-tRNA synthetase in the Lyme disease spirochete, Borrelia burgdorferi.
Proc. Natl. Acad. Sci. USA
94:14383-14388 |
| 33. | Ibba, M., A. W. Curnow, and D. Söll. 1997. Aminoacyl-tRNA synthesis: divergent routes to a common goal. Trends Biochem. Sci. 22:39-42[CrossRef][Medline]. |
| 34. |
Ibba, M.,
C. H. Losey,
Y. Kawarabayasi,
H. Kikuchi,
S. Bunjun, and D. Söll.
1999.
Substrate recognition by class I lysyl-tRNA synthetases: a molecular basis for gene displacement.
Proc. Natl. Acad. Sci. USA
96:418-423 |
| 35. |
Ibba, M.,
S. Morgan,
A. W. Curnow,
D. R. Pridmore,
U. C. Vothknecht,
W. Gardner,
W. Lin,
C. R. Woese, and D. Söll.
1997.
A euryarchaeal lysyl-tRNA synthetase: resemblance to class I synthetases.
Science
278:1119-1122 |
| 36. | Kandler, O. 1994. Cell wall biochemistry in Archaea and its phylogenetic implications. J. Biol. Phys. 20:165-169. [CrossRef] |
| 37. |
Kim, H. S.,
U. C. Vothknecht,
R. Hedderich,
I. Celic, and D. Söll.
1998.
Sequence divergence of seryl-tRNA synthetases in Archaea.
J. Bacteriol.
180:6446-6449 |
| 38. |
Lamour, V.,
S. Quevillon,
S. Diriong,
V. C. N'Guyen,
M. Lipinski, and M. Mirande.
1994.
Evolution of the Glx-tRNA synthetase family: the glutaminyl enzyme as a case of horizontal gene transfer.
Proc. Natl. Acad. Sci. USA
91:8670-8674 |
| 39. |
Lenhard, B.,
O. Orellana,
M. Ibba, and I. Weygand-Durasevic.
1999.
tRNA recognition and evolution of determinants in seryl-tRNA synthesis.
Nucleic Acids Res.
27:721-729 |
| 40. | Li, T., D. Graham, C. Stathopoulos, P. Haney, H.-S. Kim, U. Vothknecht, M. Kitabatake, K.-W. Hong, G. Eggertsson, A. W. Curnow, W. Lin, I. Celic, W. Whitman, and D. Söll. 1999. Cysteinyl-tRNA formation: the last puzzle of aminoacyl-tRNA synthesis. FEBS Lett. 462:302-306[CrossRef][Medline]. |
| 41. | Marsh, T. L., C. I. Reich, R. B. Whitelock, and G. J. Olsen. 1994. Transcription factor IID in the Archaea: Sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes. Proc. Natl. Acad. Sci. USA 81:4180-4184[CrossRef]. |
| 42. | Martinis, S. A., P. Plateau, J. Cavarelli, and C. Florentz. 1999. Aminoacyl-tRNA synthetases: a family of expanding functions. Mittelwihr, France, October 10-15. EMBO J. 18:4591-4596[CrossRef][Medline]. |
| 43. | Martinis, S. A., P. Plateau, J. Cavarelli, and C. Florentz. 1999. Aminoacyl-tRNA synthetases: a new image for a classical family. Biochimie 81:683-700[CrossRef][Medline]. |
| 44. | Mazauric, M. H., G. Keith, D. Logan, R. Kreutzer, R. Giegé, and D. Kern. 1998. Glycyl-tRNA synthetase from Thermus thermophilus-wide structural divergence with other prokaryotic glycyl-tRNA synthetases and functional inter-relation with prokaryotic and eukaryotic glycylation systems. Eur. J. Biochem. 251:744-757[Abstract]. |
| 45. | Nagel, G. M., and R. F. Doolittle. 1995. Phylogenetic analysis of the aminoacyl-tRNA synthetases. J. Mol. Evol. 40:487-498[CrossRef][Medline]. |
| 46. |
Nureki, O.,
D. G. Vassylyev,
K. Katayanagi,
T. Shimizu,
S. Sekine,
T. Kigawa,
T. Miyazawa,
S. Yokoyama, and K. Morikawa.
1995.
Architectures of class-defining and specific domains of glutamyl-tRNA synthetase.
Science
267:1958-1965 |
| 47. |
Olsen, G. J.,
C. R. Woese, and R. Overbeek.
1994.
The winds of (evolutionary) change: breathing new life into microbiology.
J. Bacteriol.
176:1-6 |
| 48. |
RajBhandary, U. L.
1997.
Once there were twenty.
Proc. Natl. Acad. Sci. USA
94:11761-11763 |
| 49. | Ribas de Pouplana, L., D. Buechter, N. Y. Sardesai, and P. Schimmel. 1998. Functional analysis of peptide motif for RNA microhelix binding suggests new family of RNA-binding domains. EMBO J. 17:5449-5457[CrossRef][Medline]. |
| 50. |
Saks, M. E.,
J. R. Sampson, and J. Abelson.
1998.
Evolution of a transfer RNA gene through a point mutation in the anticodon.
Science
279:1665-1670 |
| 51. | Schimmel, P., and T. Ripmaster. 1995. Modular design of components of the operational RNA code for alanine in evolution. Trends Biochem. Sci. 20:333-334[CrossRef][Medline]. |
| 52. |
Shiba, K.,
H. Motegi,
M. Yoshida, and T. Noda.
1998.
Human asparaginyl-tRNA synthetase: molecular cloning and the inference of the evolutionary history of Asx-tRNA synthetase family.
Nucleic Acids Res.
26:5045-5051 |
| 53. |
Siatecka, M.,
M. Rozek,
J. Barciszewski, and M. Mirande.
1998.
Modular evolution of the Glx-tRNA synthetase family rooting of the evolutionary tree between the bacteria and archaea/eukarya branches.
Eur. J. Biochem.
256:80-87[Abstract].
|
| 54. | Simos, G., A. Sauer, F. Fasiolo, and E. C. Hurt. 1998. A conserved domain within Arc1p delivers tRNA to aminoacyl-tRNA synthetases. Mol. Cell 1:235-242[CrossRef][Medline]. |
| 55. | Simpson, G. G. 1944. Tempo and mode in evolution. Columbia University Press, New York, N.Y. |
| 56. |
Sissler, M.,
C. Delorme,
J. Bond,
S. D. Elrlich,
P. Renault, and C. Francklyn.
1999.
An aminoacyl-tRNA synthetase paralog with a catalytic role in histidine biosynthesis.
Proc. Natl. Acad. Sci. USA
96:8985-8990 |
| 57. | Söll, D., and M. Ibba. Aminoacyl-tRNA synthesis. Annu. Rev. Biochem., in press. |
| 58. | Söll, D., and U. L. RajBhandary. 1995. tRNA: structure, biosynthesis and function. ASM Press, Washington, D.C. |
| 59. | Sonneborn, T. M. 1965. Degeneracy of the genetic code: extent, nature and genetic implications, p. 377-397. In V. Bryson, and J. H. Vogel (ed.), Evolving genes and proteins. Academic Press, Inc., New York, N.Y. |
| 60. |
Stathopoulos, C.,
T. Li,
R. Longman,
U. C. Vothknecht,
H. Becker,
M. Ibba, and D. Söll.
2000.
One polypeptide with two aminoacyl-tRNA synthetase activities.
Science
287:479-482 |
| 61. | Swofford, D. L. 1999. PAUP 4.0: phylogenetic analysis using parsimony (and other methods). Sinauer Associates, Sunderland, Mass. |
| 62. | Taupin, C. M., and R. Leberman. 1999. Archaebacterial seryl-tRNA synthetases: adaptation to extreme environments and evolutionary analysis. J. Mol. Evol. 48:408-420[CrossRef][Medline]. |
| 63. |
Tumbula, D.,
U. C. Vothknecht,
H. S. Kim,
M. Ibba,
B. Min,
T. Li,
J. T. Pelaschier,
C. Stathopoulos,
H. Becker, and D. Söll.
1999.
Archaeal aminoacyl-tRNA synthesis: diversity replaces dogma.
Genetics
152:1269-1276 |
| 64. |
Wakasugi, K., and P. Schimmel.
1999.
Two distinct cytokines released from a human aminoacyl-tRNA synthetase.
Science
284:147-151 |
| 65. | Watanabe, K., and S. Osawa. 1995. In D. Söll, and U. RajBhandary (ed.), tRNA: structure, biosynthesis, and function. ASM Press, Washington, D.C. |
| 66. |
Woese, C. R.
1965.
On the evolution of the genetic code.
Proc. Natl. Acad. Sci. USA
54:1546-1552 |
| 67. | Woese, C. R. 1982. Archaebacteria and cellular origins: an overview. Zentbl. Bakteriol. Mikrobiol. Hyg. I Abt. Orig. C 3:1-17. |
| 68. |
Woese, C. R.
1987.
Bacterial evolution.
Microbiol. Rev.
51:221-271 |
| 69. |
Woese, C. R.
1998.
The universal ancestor.
Proc. Natl. Acad. Sci. USA
95:6854-6859 |
| 70. | Woese, C. R., D. H. Dugre, S. A. Dugre, M. Kondo, and W. C. Saxinger. 1966. On the fundamental nature and evolution of the genetic code. Cold Spring Harbor Symp. Quant. Biol. 31:723-736[Medline]. |
| 71. |
Woese, C. R.,
D. H. Dugre,
W. C. Saxinger, and S. A. Dugre.
1966.
The molecular basis for the genetic code.
Proc. Natl. Acad. Sci. USA
55:966-974 |
| 72. | Woese, C. R., and G. E. Fox. 1977. The concept of cellular evolution. J. Mol. Evol. 10:1-6[CrossRef][Medline]. |
| 73. |
Woese, C. R.,
O. Kandler, and M. L. Wheelis.
1990.
Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya.
Proc. Natl. Acad. Sci. USA
87:4576-4579 |
| 74. | Woese, C. R., L. Mandelco, D. Yang, R. Gherna, and M. T. Madigan. 1990. The case for relationship of the flavobacteria and their relatives to the green sulfur bacteria. Syst. Appl. Microbiol. 13:258-262[Medline]. |
| 75. |
Wolf, Y. I.,
L. Aravind,
N. V. Grishin, and E. V. Koonin.
1999.
Evolution of aminoacyl-tRNA synthetases-analysis of unique domain architectures and phylogenetic trees reveals a complex history of horizontal gene transfer events.
Genome Res.
9:689-710 |
| 76. | Yarus, M. 1998. Amino acids as RNA ligands: a direct-RNA-template theory for the code's origin. J. Mol. Evol. 47:109-117[CrossRef][Medline]. |
| 77. | Yarus, M., and M. Illangasekare. 1999. Aminoacyl-tRNA synthetases and self acylating ribozymes, p. 183-196. In R. F. Gesteland, T. R. Cech, and J. F. Atkins (ed.), The RNA world. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. |
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»