Department of Medical Genetics and Microbiology, University of Toronto, Toronto M5S 1A8, Canada
SUMMARY INTRODUCTION VIRION PROTEOMICS Nonenveloped Viruses Enveloped RNA Viruses Herpesviruses Poxviruses Other Enveloped DNA Viruses Bacteriophages Summary STRUCTURAL PROTEOMICS PROTEIN INTERACTIONS Virus-Virus Interactions Screens with individual viral proteins. Genome-wide screens. Virus-Host Interactions VIRUS-INDUCED CHANGES IN THE CELLULAR PROTEOME CONCLUDING REMARKS ACKNOWLEDGMENTS REFERENCES
| SUMMARY |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The development of proteomic methods has revolutionized our ability to assess protein interactions and cellular changes on a global scale, allowing the discovery of previously unknown connections. For example, proteomic methods used to generate genome-wide protein interaction maps for budding yeast uncovered a number of novel protein complexes (38, 58). Viruses are suitable targets for genome-wide analyses since their relatively small size makes them a readily tractable system and because there is a large number of fully sequenced genomes available (including many adenovirus, ichnovirus, influenza virus, herpesvirus, papillomavirus, rotavirus, and reovirus isolates). In addition, many are amenable to genetic manipulation. Indeed, the first completely sequenced genome from any organism was that of bacteriophage
X174, reported in 1977 (88). Sequences of larger viral genomes began to enter the public databases several years later, with the 172-kb Epstein-Barr virus (EBV) genome in 1984 and the 230-kb human cytomegalovirus genome in 1990 (4, 18). There has since been an explosion in the number of sequenced viral genomes, totaling 2,323 eukaryotic viruses and 411 phages in the NCBI genome database (http://www.ncbi.nlm.nih.gov/genomes/static/vis.html) at the time of writing of this review. Although most viral genomes are considerably smaller than cellular genomes, there is a growing list of remarkable "giant" viruses whose genomes are similar in size and coding potential to some cellular organisms (23). The giant viruses are from a diverse mix of viral families including bacteriophage, phycodnavirus, nimavirus, and poxvirus, which infect bacteria, algae, invertebrates, and vertebrates, respectively. The recently discovered mimivirus challenges our views of the boundaries separating viruses and cellular organisms, since its 1.2-megabase genome surpasses the size of several bacterial genomes, and its 1,262 potential proteins include a translation system not previously found in viruses (81). Our understanding of the large viruses and their host cell interactions would benefit enormously from the use of proteomic methods to analyze the contributions of the numerous encoded viral proteins in an unbiased way.
In this review, we discuss the work that has been done to date using proteomic approaches to study viruses from any viridae infecting eukaryotic or prokaryotic cells. This includes determining the protein composition of virions, the structure and protein interactions of viral proteins, and the effects of viral infection and individual viral proteins on the cellular proteome.
| VIRION PROTEOMICS |
|---|
|
|
|---|
|
Two groups used LC-MS/MS to identify viral and host proteins in HIV-1 virions. Saphire et al. (89) compared tryptic digests of whole virions produced in Jurkat and 293T cells. In both cell types, the expected HIV proteins along with several human proteins that were previously reported to be incorporated into HIV-1 were identified. The human CD48 protein and histones H1, H2A, H3, and H4 were also identified as virion components. Chertova et al. (20) analyzed HIV-1 virions derived from monocyte-derived macrophages by SDS-PAGE followed by tryptic digestion of proteins in individual gel slices. They found 253 different human proteins, 33 of which were already known to be virion components. The numerous newly identified human proteins included histones, in keeping with the results described by Saphire et al., and several proteins of the endosomal compartment that may reflect the use of the late endosomal pathway for virion budding from monocytes.
The large discrepancy in the numbers of host proteins identified by Saphire et al. and Chertova et al. likely reflects a combination of the different sensitivities of the methods used for sample preparation for mass spectrometry as well as genuine differences in virions produced in the different cell types. It has been reported that preparations of HIV-1 virions can also contain cellular vesicles, which would complicate any analysis of cellular proteins incorporated into the virions (10, 39). However, Chertova et al. used CD45 immunoaffinity depletion to remove these vesicles from their virion preparations prior to density gradient purification, thereby limiting the number of cellular proteins that would be present in the virion preparations but not actually incorporated into the virions. While many of the 253 cellular proteins may be incorporated into the HIV-1 virions, it seems unlikely that all of these proteins are of functional significance, and some may be packaged accidentally due to their abundance and cellular localization. Detailed functional studies will be necessary to determine which of the packaged cellular proteins actually contribute to HIV-1 infection. It is also worth noting that despite the very high number of human proteins identified in HIV-1 virions, some human proteins reported to be HIV-1 virion components by other methods, such as Staufen and tRNA synthetase, were not found (for a review of previously identified HIV virion components, see reference 16). Thus, mass spectrometry-based analyses are not completely comprehensive and, like most methods, will have some false-positive and false-negative results.
A proteomic analysis of purified HCMV virions was initially conducted by Baldick and Shenk in 1996 (5), where protein components were identified by SDS-PAGE followed by N-terminal sequencing. In this way, six viral proteins and one actin-like cellular protein (not previously known to be packaged), in addition to known virion components, were identified. These proteins were also found in noninfectious enveloped particles and dense bodies, two types of aberrant viral particles generated in HCMV infection. Recently, Varnum et al. (113) used the more sensitive LC-MS/MS and LC-Fourier transform ion cyclotron resonance mass spectrometry techniques to determine the composition of the HCMV proteome. As expected, they identified the set of proteins described previously by Baldick and Shenk as well as less abundant viral and cellular proteins, totaling 71 HCMV and 71 host proteins. Twenty-four of the viral proteins were not previously identified in virions, and 12 of these proteins were from open reading frames (ORFs) not previously shown to express proteins. The LC-Fourier transform ion cyclotron resonance-mass spectrometry approach allowed for the relative abundance of the virion proteins to be determined, indicating that the pp65 (UL83) tegument protein was the most prevalent viral component and that gM was the most abundant glycoprotein. Approximately 50% of the viral proteins in the virion were found to be tegument proteins, compared to 30% capsid proteins, 13% envelope proteins, and 7% proteins of undefined localization. Dense HCMV bodies were also analyzed as part of this study and, consistent with previous reports, were found to contain considerably fewer proteins than did the virions, with 29 viral and 13 cellular proteins identified. It is important that the HCMV virions were extensively purified from culture media on both sedimentation and density gradients and verified to be free of cellular organelles by electron microscopy. Therefore, the cellular proteins in the HCMV virion preparations are likely to be inside the virion. While the localization of the cellular proteins within the virion was not determined, the numerous cytoplasmic proteins identified are expected to be in the tegument compartment, picked up as the capsids move through the cytoplasm from the endoplasmic reticulum (ER) to the Golgi apparatus.
Highly purified murine CMV virions have also been analyzed by LC-MS/MS, leading to the identification of 58 viral proteins, 20 of which were uncharacterized, and 7 cellular proteins (actin, annexin, cofilin, histone H2A, elongation factor 1
, glyceraldehyde-3-phosphate dehydrogenase, and rho GDP dissociation inhibitor) (55). All of the cellular proteins except histone H2A were also found by Varnum et al. in HCMV virions.
The virion compositions of several gammaherpesviruses have been determined. For EBV and KSHV, this involved treatments to induce lytic infection (since these viruses are usually in a latent state) followed by extensive gradient purifications of the virions. Using LC-MS/MS analyses, Johannsen et al. (51) identified 35 viral proteins that largely comprised homologues of proteins previously found in other herpesvirus virions, while none of the EBV latency proteins were detected. Similar to results obtained for CMV, the most numerous viral proteins were tegument components; however, for EBV, these included five proteins unique to gammaherpesviruses. The six most prevalent cellular proteins identified (actin, hsp70, hsp90, cofilin, ß-tubulin, and enolase) were found in the tegument, and these six proteins were also identified in HCMV virions.
Purified virions from KSHV, the human virus most closely related to EBV, have also been the subject of proteomic analyses by two groups. Zhu et al. (123) identified 24 viral and multiple cellular proteins using LC-MS/MS, while Bechtel et al. (9) identified 10 viral and nine cellular proteins using MALDI. All but one of the viral proteins identified by Bechtel et al. were also found by Zhu et al., and 19 out of the 24 viral proteins identified by Zhu et al. were found by Johannsen et al. to have corresponding proteins in EBV virions. As for EBV, actin, hsp70, hsp90, tubulin, and enolase were identified as being cellular components of the KSHV virions in both analyses. In addition, Zhu et al. further characterized the form of actin identified as nonmuscle ß-actin and showed that the actin and myosin proteins in their virion preparation were resistant to trypsin digestion, indicating that they were indeed inside the virions. Similarly, Bechtel et al. showed that six of the cellular proteins that they identified (enolase, Hsc70, tubulin, moesin, ezrin, and EF-2b) were trypsin resistant in virion preparations and therefore likely to be packaged inside the virion (the remaining four cellular proteins were not tested).
The virion compositions of gammaherpesviruses that infect mice (MHV68) and rhesus monkeys (RRV) have also been studied. Bortz et al. (13) identified 15 viral proteins in MHV68; these overlap considerably but not entirely with those identified in KSHV. O'Connor and Kedes (74) used two different mass spectrometry approaches to identify 33 viral proteins in RRV; MS/MS of protein bands excised from polyacrylamide gels and multidimensional protein identification technology in which proteolytic peptides from the entire virion were analyzed en masse by 2D LC-MS/MS. The identified proteins included three proteins not previously found in any herpesvirus virions. As for other herpesviruses, the majority of the virion components were from the tegument.
, hsp70, and hsp90), but a few abundant cellular proteins found in virions from other viruses were not detected (e.g., myosin, moesin, and cofilin). Since IMVs are prepared from cell lysates, it is possible that some of the cellular proteins copurifying with these particles bind to the outside of the virions as opposed to being incorporated into the virion. This important point remains to be addressed. In addition to vaccinia virus, IMV particles from the myxoma leporipoxvirus have been subjected to proteomic analysis (120), leading to the identification of 17 viral proteins. These proteins are homologues of the viral proteins found in vaccinia virus IMVs, with the exception of M151R (B13R in vaccinia virus), which has not been found in vaccinia virus virions, and M093L, which lacks a vaccinia virus homologue.
White spot syndrome virus (WSSV) of the Nimaviridae is another large enveloped DNA virus that infects shrimp and other crustaceans. Its 300-kb circular DNA genome encodes about 180 proteins, and the rod-shaped virions have been studied by several groups. In 2002, Huang et al. (45) analyzed WSSV virions by SDS-PAGE and identified 18 viral proteins from excised bands by MALDI-TOF and MS/MS. All but one of those proteins were shown to be late proteins. A similar approach described by Tsai et al. (109) led to the identification of 33 viral proteins, 20 of which had not been previously identified. In a subsequent paper, that same group used detergent and salt extractions to fractionate purified WSSV virions in order to categorize the localization of the major virion proteins as nucleocapsid (four proteins reported), tegument (four proteins reported), or envelope (seven proteins reported) components (110). Xie et al. (117) conducted a similar analysis of fractionated WSSV virions; of the 30 viral proteins identified, they classified 23 as being envelope proteins and 8 as being nucleocapsids. These proteins included the envelope and nucleocapsid proteins described by Tsai et al. (110), with the exception of two envelope proteins reported by Tsai et al. (VP36B and VP53A), which were not identified.
A proteomic analysis of the virions from the gigantic mimivirus has recently been conducted, resulting in the identification of 114 viral proteins (83). In addition to major structural proteins, mimivirus virions were found to contain 12 proteins predicted to be involved in transcription, 9 proteins from oxidative pathways, 3 DNA repair proteins, 2 topoisomerases, and 45 proteins with no sequence similarity to other proteins in the database. It is also remarkable that no amoeba host proteins were found despite the fact that two amoebal genome sequences are present in the database. This suggests that mimivirus packages few or no host proteins, in marked contrast to HIV, vaccinia virus, and herpesvirus virions.
The bacteriophage
11b, which infects Flavobacterium, was also examined in a study reported by Borriss et al. (12). SDS-PAGE analysis of the phage particle resolved 11 proteins, and 5 were identified through LC-mass spectrometry, including the major capsid protein (gp21), minor head protein (gp18), portal protein (gp16), and two hypothetical proteins. Lavigne et al. (60) studied the T7-like psuedomonal bacteriophage pKMV using two approaches: separation of the phage particle proteins by SDS-PAGE coupled with LC-electrospray ionization-MS/MS and a whole-phage shotgun analysis using trypsin-digested phage particles separated by reversed-phase high-performance liquid chromatography followed by MS/MS. The data gathered from these two approaches allowed the identification of 12 virion proteins. Six of these proteins were previously annotated as being structural proteins due to sequence similarity with known phage structural proteins. Five additional proteins that had little or no sequence similarity to known phage proteins were successfully classified as structural proteins.
Naryshkina et al. (73) examined the proteome of pYS40, a lytic tailed bacteriophage of Thermus thermophilus. This phage has a large genome, with a total of 170 potential ORFs. Mass spectrometry was performed on proteins resolved by SDS-PAGE, and two major virion components were identified. They are believed to correspond to the major head and tail proteins. Their functions could not have been predicted by sequence comparisons because of a lack of homologues in the database. Purified phage was also examined by a shotgun proteomic approach using multidimensional protein identification technology. Peptides matching 33 pYS40 proteins were detected, including the tail sheath protein (gp69), baseplate assembly protein (gp150), fibritin neck whisker (gp152), and a DEAD box helicase (gp79). The remaining proteins identified were novel proteins with no database homologues, including a group of 13 proteins detected at high levels that may correspond to the late gene cluster.
Studies examining the composition of phage particles provide valuable information that can be used to aid in the correct annotation of phage genomes and allow comparative genomics studies. Phage genomes are highly mosaic as a result of horizontal gene exchange through both homologous and nonhomologous recombination. Although it is often difficult to detect sequence similarity between genes that are likely homologous due to the extreme divergence of phage genomes, strong conservation of gene order can facilitate identification. Thus, the identification of major capsid and tail proteins in phages with no identifiable sequence similarity to phages already in the genome database can allow the annotation of the morphogenetic region.
, elongation factor 2, hsp70, and hsp90 have been found in HIV, vaccinia virus, and two or more types of herpesvirus virions. It is not yet known whether these cellular proteins play a conserved functional role in these virions, are passenger proteins that are accidentally packed due to their location and high abundance, or are contaminants that copurify with the virions. The latter possibility seems unlikely, since the different types of virions analyzed were prepared using different methods and, in each case, checked for purity prior to analysis. Whether or not the above-described host proteins are purposefully packaged in the virions will require more extensive investigation. However, it is not hard to imagine that host chaperone proteins like hsp70 and hsp90 may be required to chaperone various viral proteins during packaging and/or release from the virions and hence are packaged along with their viral partners. | STRUCTURAL PROTEOMICS |
|---|
|
|
|---|
A systematic comparison of the expression and solubility of a large number of viral proteins in E. coli and insect cells was conducted by Gao et al. (37). This involved the use of a high-throughput multiwell plate system for isolating bacmids, generating baculoviruses, and screening the expression, solubility, and recovery of hexahistidine-tagged proteins on nickel resin. A total of 337 proteins from HSV-1, EBV, CMV, and vaccinia viruses were scored for expression level and solubility in insect cells. A subset of 152 of these proteins was also tested for expression and solubility in an E. coli strain augmented with tRNA genes for codons rarely used in E. coli but common in mammalian proteins. For all three herpesviruses, expression in insect cells was superior to that in E. coli, with only 27 to 29% of the proteins scored as being expressed in E. coli and 90% of HSV and EBV proteins and 50% of CMV proteins expressed in insect cells. For vaccinia virus, however, expression rates were similar, at about 80%. For all four viruses, solubility of the expressed proteins in either system was a major issue, with approximately 20% of proteins expressed in soluble form. Similarly, the Structural Proteomics in Europe (SPINE) group reported problems with the expression and solubility of herpesvirus proteins in E. coli, with improved results using baculovirus (35). Like Gao et al., they had higher success rates with poxvirus proteins in E. coli expression. Tarbouriech et al. (102) monitored the expression of 23 EBV proteins in E. coli and found only 8 to be produced in soluble form. This outcome was not improved by modifying the constructs by N- or C-terminal truncations, changing the E. coli strain, or changing the expression temperature. The refolding of insoluble proteins also failed to result in soluble protein for the 12 EBV proteins tested (102) but was more successful (four out of nine proteins) for secreted vaccinia virus immunomodulators (35).
Despite the difficulties mentioned above, structures have been determined for numerous eukaryotic viral proteins or protein domains, totaling approximately 300 different structures at the time of writing of this review. As expected, the number of viral protein structures solved has been rapidly increasing in recent years (Fig. 2). For example, less than 10 viral protein structures were determined in 1995, including the first herpesvirus structure (11), compared to 44 unique structures deposited in the Protein Data Bank (PDB) in 2005. The number of structures determined varies considerably for different viruses. This likely represents a combination of the amount of effort a virus receives as well as the ease of production of soluble proteins and the tendency of these proteins to crystallize, which may be inherently different for different classes of viruses. An example of a virus that has moved relatively quickly in structural terms is SARS coronavirus. Since its genome sequence was determined in 2003 (65, 86), structures for 13 different proteins/domains have been deposited in the PDB from the 28 encoded proteins (see http://sars.scripps.edu/ProtSummary1.htm for a summary of these structures). This surpasses the number of unique structures for any single herpesvirus (currently 12 for HSV-1 and 11 for EBV) or for vaccinia virus (nine structures), each of which encodes four to nine times more proteins than SARS. Another example of a virus that has proven to be very amenable to structural studies is hepatitis C virus (HCV). Protein structures corresponding to most of the HCV coding sequences have been determined, providing insight into the mechanisms of action of the nonstructural proteins and facilitating the development of inhibitors for these proteins (57, 61, 63, 64, 106, 118).
|
Viral protein structures have come largely from individual laboratories with interests in specific viral proteins or subsets of viral proteins, a complete discussion of which is beyond the scope of this review. However, in recent years, organized groups have used high-throughput approaches to tackle multiple viral protein targets. For example, the Consortium for Functional and Structural Proteomics of SARS Coronavirus-Related Proteins (http://sars.scripps.edu) studies the structure and function of many SARS proteins, which has led to four high-resolution structures (53, 79, 82, 87). The European Vizier group (http://www.vizier-europe.org/) studies the replication proteins of several RNA viruses, including SARS, measles virus, influenza virus, HCV, dengue virus, and viruses that cause gastroenteritis and encephalitis, with the aim of identifying new drug targets against these viruses. To date, they have reported structures for the VP9 outer coat protein of Banna virus (46) and the ORF-9b SARS protein (72), both in conjunction with SPINE. SPINE focuses on the development and implementation of methods suitable for high-throughput structure determination of proteins that impact human health, including proteins from vaccinia virus (nine proteins targeted by the Oxford group), EBV (studied by the Grenoble group), and SARS (targeted by groups at Oxford and Marseille) (http://www.spineurope.org) (35). To date, they reported one measles virus protein structure (52), four SARS protein structures (although only two for NSP9 have been published) (32, 99), four vaccinia virus protein structures, including A41L (none have been made publicly available), and four EBV protein structures (102). Structures were determined for EBV BARF1, dUTPase, uracil DNA glycosylase, and the EBV protease, with the latter three in complex with inhibitors (15, 103, 104). Finally, the Ontario Center for Structural Proteomics, a center affiliated with the Midwest Center for Structural Genomics, has implemented a systematic approach to solving bacteriophage proteins using both nuclear magnetic resonance spectroscopy and X-ray crystallography. To date, they have published five structures from bacteriophage lambda (gpU, CII, NinB, FII, and W) (30, 48, 67-69) as well as one from the related phage HK97 (PDB accession number 2OB9).
| PROTEIN INTERACTIONS |
|---|
|
|
|---|
The most common method used to screen for protein interactions is a yeast two-hybrid screen using GAL4 and/or LexA fusion proteins (33) (Fig. 3A). Results with this method are very dependent on the quality of the library being screened and the relative representation of each cDNA. They also depend on the expression level of individual proteins and their ability to localize to the yeast nuclei, where the reporter constructs are located. As a result, many bona fide protein interactions are missed by this method (31). Tandem affinity purification (TAP) tagging approaches are becoming increasingly common for the detection of protein complexes (84, 121). This involves expressing the target protein fused to two affinity tags in host cells and purifying the tagged protein on two affinity columns under conditions that do not disrupt protein interactions (Fig. 3C). Copurifying proteins are then identified by mass spectrometry. While this method is excellent for identifying stable complexes, it tends to miss more transient interactions, which dissociate during the multistep purification. A more comprehensive method for detecting protein interactions of various strengths is an affinity column approach, where the purified target protein is immobilized on resin (97) (Fig. 3B). Cell extracts are passed through the column, and the profile of retained proteins is determined by mass spectrometry and compared to those proteins retained on negative control columns. The ability to detect specific interactions by this method depends on the concentration of individual proteins in the soluble lysate and hence is also not completely comprehensive. In addition, this approach requires a highly purified target protein, making it less popular than other methods.
|
Each of the above-described proteomic methods has the potential to detect unanticipated interactions, making them suitable for exploring poorly understood systems. In the long term, the information gained from these types of studies will increase our understanding of viral infections and may lead to improved strategies for treating them. In this review, we will focus on studies using unbiased screening approaches to identify binding partners, as opposed to approaches designed to verify already known or suspected interactions.
Several different proteins from herpesviruses have been screened for viral protein interactions. Taylor and Knipe (105) isolated protein complexes formed with the ICP8 single-stranded DNA binding protein of HSV-1 by immunoprecipitation and mass spectrometry, identifying six viral proteins in addition to many cellular proteins. Vittone et al. (115) screened 13 HSV-1 tegument proteins for interactions with each other in a LexA-based two-hybrid system. This led to the identification of nine interactions, including three self-interactions, five of which were not previously known. Schierling et al. (91) generated a random genomic library of HCMV fused to the GAL4 activation domain and screened it for interactions with HCMV protein UL82 (pp71), involved in the activation of immediate-early transcription. The screen identified interactions with UL32 (pp150) and the UL35 tegument protein, the latter of which was verified by other assays. Lake and Hutt-Fletcher (59) also used a two-hybrid approach to screen an EBV cDNA library for interactions with the EBV BFRF1 protein (a membrane protein homologous to the UL34 alphaherpesvirus protein), identifying an interaction with the UL31 homologue BFLF2.
The viral protein interactions of a few different vaccinia virus proteins have also been determined. Szajner et al. (100) investigated the viral protein interactions of the vaccinia virus F10 kinase. Epitope-tagged F10 was isolated from vaccinia virus-infected cells by immunoaffinity chromatography, and interacting proteins were analyzed by SDS-PAGE and mass spectrometry. Interactions were seen for the previously identified F10 targets A30 and G7 as well as for A15, D2, D3, and J1. The same method was used by this laboratory to investigate the viral protein interactions of four vaccinia virus membrane proteins, A21, A28, H2, and L5. The results revealed that these four proteins form a complex along with four additional uncharacterized proteins (93).
Genome-wide screens. A small number of genome-wide screens for viral protein-viral protein interactions have been conducted. The first genome-wide protein-protein interaction study was carried out on bacteriophage T7 using a yeast two-hybrid system (8). A random library of T7 DNA fragments was cloned into DNA binding domain and activation domain vectors, and the fragments were tested against one another. There were 25 interactions detected, including 9 interactions for which no previous genetic or biochemical evidence existed. These interactions suggested cross-connections between pathways and provided information about the potential functions of these proteins. For example, the interactions of proteins 4.7 and 6.5 with DNA polymerase suggested that these previously unannotated proteins affect some aspect of DNA replication.
Viruses in which mature proteins are generated from the proteolytic processing of polyproteins have also been subjected to genome-wide screens for viral protein interactions. Flajolet et al. (34) conducted a two-hybrid screen of a random genomic HCV library against itself, which proved to be more successful in detecting interactions than screening with mature HCV proteins. The screen of 200 bait clones identified NS3-E2, NS5A-E1, NS4A-NS2, and NS4A-NS3 interactions, the latter of which was previously known. A similar two-hybrid screen was performed with random genomic fragments of wheat streak mosaic virus, which, like HCV, uses the proteolytic processing of polyproteins to generate mature viral proteins (21). The screen identified multiple interactions between protein domains that were not detected in two-hybrid assays with the mature proteins. However, the lack of two-hybrid interactions of mature proteins generated from processed polyproteins is not a general rule, as interactions were detected between mature proteins from the potyviruses potato virus A and pea seed-borne mosaic virus in two-hybrid screens (41).
There have been two publications that reported genome-wide screens of large DNA viruses. McCraith et al. (71) conducted a two-hybrid screen of 266 predicted vaccinia virus proteins. This resulted in the identification of 37 protein-protein interactions, 28 of which were previously known. Uetz et al. (111) performed genome-wide two-hybrid screens for two herpesviruses, KSHV and varicella-zoster virus (VZV). For VZV, approximately 10,000 potential interactions were tested for the 69 proteins encoded by this virus as well as protein fragments, resulting in 173 nonredundant protein pairs. Tests of 12,000 potential interactions of the 89 KSHV full-length proteins, and fragments thereof, identified 123 nonredundant protein pairs, including five of the seven previously known KSHV interactions. Fifty percent of the newly identified interactions were confirmed by coimmunoprecipitation, and several others have known orthologous interactions in other herpesviruses, suggesting that many of these interactions may be biologically significant. The KSHV data set was used to predict orthologous protein interactions in other herpesviruses (HSV-1, VZV, EBV, and CMV), and of the 112 predicted interactions, 72 were confirmed by coimmunoprecipitation. However, only 5 of the 19 interactions predicted for VZV were actually detected in the two-hybrid screen, which is in keeping with other reports of a high degree of false-negative results in two-hybrid assays (31).
A small but increasing number of studies have used proteomics methods to reveal additional virus-host interactions. Affinity column approaches, coupled to either mass spectrometry or N-terminal sequencing, have been used by several groups to identify proteins from cell extracts that are retained by an immobilized viral protein. Using an HIV-1 Nef affinity column and N-terminal sequencing, Cohen et al. (24) identified human thioesterase II as a Nef binding partner, an interaction which had also been observed in a two-hybrid screen. Kang et al. (54) analyzed hepatocyte proteins retained on an HCV core protein column by two-dimensional gel electrophoresis and MALDI-TOF mass spectrometry of individual spots, leading to the identification of four different intermediate filament proteins as core binding partners. Holowaty et al. (44) identified several human proteins retained on an Epstein-Barr nuclear antigen 1 (EBNA1) column by MALDI-TOF mass spectrometry of protein bands after SDS-PAGE. The strength and specificity of these interactions were further assessed by repeated passage of the eluted proteins through the EBNA1 column and by performing the same experiment with a charge-matched protein affinity column. A TAP tagging approach was also used to assess stable host cell interactions formed by EBNA1 when expressed in human cells (44). This identified a subset of cellular protein interactions found by the affinity column approach, including one with USP7, which was later shown to enable EBNA1 to modulate the p53 pathway (90).
In general, simple immunoprecipitation approaches are not amenable to proteomic analyses due to the high degree of background proteins in the samples and interference from the antibody bands. However, this approach can been used successfully for proteomic analyses if the antibodies are of high specificity and affinity and particularly if they are covalently coupled to beads, reducing the amount of antibody in the final sample. For example, immunoprecipitation has been used to identify interactions with HSV-1 proteins ICP8 and ICP27 (36, 105). Immunoprecipitation of ICP8 from HSV-1-infected cells led to the identification of over 50 cellular proteins by LC-MS/MS in addition to the viral proteins mentioned above (105). The large number of proteins associated with DNA processes is consistent with the known function of ICP0 as a single-stranded DNA binding protein; however, at least some of these interactions are likely to be indirect, mediated by nucleic acid or other proteins. Similar studies with the multifunctional ICP27 protein identified several cellular partners, including three translation initiation factors (36). Finally, the host interactions of the nsP3 protein of Sindbis alphavirus during the course of infection have been explored using a different immunoaffinity purification approach, which involved expressing nsP3 fused to a green fluorescent protein tag and recovering the protein on magnetic beads coupled to green fluorescent protein antibody (27, 28). Recovered proteins were then identified by SDS-PAGE and MALDI-TOF mass spectrometry, revealing an interaction of nsP3 with host G3BP throughout the course of infection and an interaction with 14-3-3 late in infection.
| VIRUS-INDUCED CHANGES IN THE CELLULAR PROTEOME |
|---|
|
|
|---|
2D gel electrophoresis has been used to determine changes in the cellular proteome upon infection by several different viruses, where protein spots that differ before and after infection are excised and identified by mass spectrometry. The host interactions of three different plant viruses have been studied by this approach. Ventelon-Debout et al. (114) used this method to study infection by rice yellow mottle virus and found changes in 24 to 40 cellular proteins depending on the cultivar of rice infected. These proteins included several stress response proteins such as salt-induced protein, heat shock proteins, and superoxide dismutase (SOD). A similar study compared changes in tomato protein expression after infection with tobacco mosaic virus and also found changes in antioxidant enzymes (including SOD) as well as peptidases, chitinases, and proteins in the ascorbate-glutathione cycle (17). In a somewhat different application, Cooper et al. (26) conducted 2D gel analyses on tobacco plants infected with an unknown virus and identified the proteins expressed after infection as a means of identifying the viral proteins and hence the nature of the virus itself (identified as potato virus X).
The host interactions of a small number of animal viruses have also been studied by 2D gel approaches. Alfonso et al. (1) used 2D electrophoresis to examine changes in Vero cells after infection with African swine fever virus and identified 12 induced cellular proteins, which included SOD and two heat shock proteins (hsp70 and hsp27). Brasier et al. (14) reported changes in 24 nuclear proteins upon infection with respiratory syncytial virus, and, like the above-described viruses, these proteins included heat shock proteins and SOD. 2D gel analysis was also used to compare changes in cellular protein levels before and after EBV infection as well as before and after the expression of a single EBV protein, EBNA2 (92). Changes were identified in 20 cellular proteins upon EBNA2 expression, several of which showed similar changes upon EBV infection, suggesting that EBNA2 has a major role in affecting the cellular proteome in an EBV infection.
Jiang et al. (50) studied the cellular response to SARS infection by both a 2D gel approach and mass spectrometry utilizing isotope-coded affinity tag (ICAT) technology. The 2D gel approach used in that study differed from those described above in using different fluorescent dyes to uniformly label the protein samples, which could then be mixed together and analyzed on a single 2D gel, as opposed to analyzing the two samples on separate 2D gels (Fig. 4). This approach, called difference gel electrophoresis (DIGE) (112), has greatly facilitated the comparison of two (or three) samples by removing the gel-to-gel variability factor and by using dyes with a greater dynamic range than the traditionally used silver or Coomassie stains (reviewed in reference 78). However, like all 2D gel electrophoresis methods, DIGE is limited in its ability to detect very large proteins or proteins with extreme isoelectric points and is not useful for studying membrane proteins. The ICAT adaptation of mass spectrometry used by Jiang et al. (50) involves the labeling of two protein samples with isotopically light and heavy ICAT reagents, which react with the thiol group in the cysteine residues of proteins (42, 78) (Fig. 4). The heavy- and light-labeled samples are then combined, trypsinized, and analyzed by LC-mass spectrometry, enabling the quantification of the relative abundance of each protein in the two samples. This method is more sensitive than DIGE for the detection of low-abundant proteins but cannot detect proteins lacking cysteine or changes in posttranslational modifications. Using these two methods, Jiang et al. (50) identified quantitative changes of 1.5-fold or more upon SARS infection for 186 cellular proteins (out of 355 identified proteins), but only 15 proteins were identified as being differentially expressed by both methods. DIGE identified 48 unique proteins, 30 of which were altered in abundance upon SARS infection, while ICAT identified 322 proteins, 167 of which were altered. This reflected inherent differences of the two methods to detect specific proteins as discussed above, where ICAT was more sensitive in general and better for the identification of higher-molecular-weight proteins, while 2D electrophoresis favored lower-molecular-weight proteins.
|
Finally, Bartee et al. (7) examined the effects of expressing the KSHV membrane-associated K5 ubiquitin ligase protein in human cells using stable isotope labeling with amino acids in cell culture (SILAC) to differentially label samples with light or heavy isotopes of C and N before and after K5 expression, respectively (Fig. 4). This method is similar to ICAT in allowing quantitative comparisons of individual proteins in combined samples but gives more uniform labeling of proteins (provided that long labeling times are used) and, unlike ICAT, does not require the purification of the tryptic peptides (to remove excess ICAT reagent) prior to mass spectrometry (75, 76). As the name implies, SILAC can be used only in tissue culture cells and not on primary cells or tissues. Bartee et al. (7) compared the ratios of heavy and light proteins in plasma, Golgi apparatus, and ER membrane fractions in order to assess the effect of K5 expression on human membrane proteins. Over 100 cellular proteins were repeatedly identified for each of these compartments. While the only consistent change in protein expression in the Golgi apparatus and ER was the K5 protein itself, four cellular proteins were found to be consistently down-regulated in the plasma membrane (including the expected major histocompatibility complex class I), and one (leukocyte adhesion molecule) was confirmed by other methods to be a target of K5 ubiquitination.
| CONCLUDING REMARKS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
L.F. is a Canada Research Chair in Molecular Virology.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
| 1. | Alfonso, P., J. Rivera, B. Hernaez, C. Alonso, and J. M. Escribano. 2004. Identification of cellular proteins modified in response to African swine fever virus infection by proteomics. Proteomics 4:2037-2046.[CrossRef][Medline] |
| 2. | Anderson, W. F., D. H. Ohlendorf, Y. Takeda, and B. W. Matthews. 1981. Structure of the cro repressor from bacteriophage lambda and its interaction with DNA. Nature 290:754-758.[CrossRef][Medline] |
| 3. | Baas, T., C. R. Baskin, D. L. Diamond, A. Garcia-Sastre, H. Bielefeldt-Ohmann, T. M. Tumpey, M. J. Thomas, V. S. Carter, T. H. Teal, N. Van Hoeven, S. Proll, J. M. Jacobs, Z. R. Caldwell, M. A. Gritsenko, R. R. Hukkanen, D. G. Camp II, R. D. Smith, and M. G. Katze. 2006. Integrated molecular signature of disease: analysis of influenza virus-infected macaques through functional genomics and proteomics. J. Virol. 80:10813-10828. |
| 4. | Baer, R., A. T. Bankier, M. D. Biggin, P. L. Deininger, P. J. Farrell, T. J. Gibson, G. Hatfull, G. S. Hudson, S. C. Satchwell, C. Seguin, P. S. Tuffnell, and B. G. Barrell. 1984. DNA sequence and expression of the B95-8 Epstein-Barr virus genome. Nature 310:207-211.[CrossRef][Medline] |
| 5. | Baldick, C. J., Jr., and T. Shenk. 1996. Proteins associated with purified human cytomegalovirus particles. J. Virol. 70:6097-6105.[Abstract] |
| 6. | Barry, M., and K. Fruh. 2006. Viral modulators of cullin RING ubiquitin ligases: culling the host defense. Sci. STKE 2006:pe21. |
| 7. | Bartee, E., A. L. McCormack, and K. Fruh. 2006. Quantitative membrane proteomics reveals new cellular targets of viral immune modulators. PLoS Pathog. 2:e107.[CrossRef][Medline] |
| 8. | Bartel, P. L., J. A. Roecklein, D. SenGupta, and S. Fields. 1996. A protein linkage map of Escherichia coli bacteriophage T7. Nat. Genet. 12:72-77.[CrossRef][Medline] |
| 9. | Bechtel, J. T., R. C. Winant, and D. Ganem. 2005. Host and viral proteins in the virion of Kaposi's sarcoma-associated herpesvirus. J. Virol. 79:4952-4964. |
| 10. | Bess, J. W., Jr., R. J. Gorelick, W. J. Bosche, L. E. Henderson, and L. O. Arthur. 1997. Microvesicles are a source of contaminating cellular proteins found in purified HIV-1 preparations. Virology 230:134-144.[CrossRef][Medline] |
| 11. | Bochkarev, A., J. Barwell, R. Pfuetzner, W. Furey, A. Edwards, and L. Frappier. 1995. Crystal structure of the DNA binding domain of the Epstein-Barr virus origin binding protein EBNA1. Cell 83:39-46.[CrossRef][Medline] |
| 12. | Borriss, M., T. Lombardot, F. O. Glockner, D. Becher, D. Albrecht, and T. Schweder. 2007. Genome and proteome characterization of the psychrophilic Flavobacterium bacteriophage 11b. Extremophiles 11:95-104.[CrossRef][Medline] |
| 13. | Bortz, E., J. P. Whitelegge, Q. Jia, Z. H. Zhou, J. P. Stewart, T. T. Wu, and R. Sun. 2003. Identification of proteins associated with murine gammaherpesvirus 68 virions. J. Virol. 77:13425-13432. |
| 14. | Brasier, A. R., H. Spratt, Z. Wu, I. Boldogh, Y. Zhang, R. P. Garofalo, A. Casola, J. Pashmi, A. Haag, B. Luxon, and A. Kurosky. 2004. Nuclear heat shock response and novel nuclear domain 10 reorganization in respiratory syncytial virus-infected a549 cells identified by high-resolution two-dimensional gel electrophoresis. J. Virol. 78:11461-11476. |
| 15. | Buisson, M., J. F. Hernandez, D. Lascoux, G. Schoehn, E. Forest, G. Arlaud, J. M. Seigneurin, R. W. Ruigrok, and W. P. Burmeister. 2002. The crystal structure of the Epstein-Barr virus protease shows rearrangement of the processed C terminus. J. Mol. Biol. 324:89-103.[CrossRef][Medline] |
| 16. | Cantin, R., S. Methot, and M. J. Tremblay. 2005. Plunder and stowaways: incorporation of cellular proteins by enveloped viruses. J. Virol. 79:6577-6587. |
| 17. | Casado-Vela, J., S. Selles, and R. B. Martinez. 2006. Proteomic analysis of tobacco mosaic virus-infected tomato (Lycopersicon esculentum M.) fruits and detection of viral coat protein. Proteomics 6(Suppl. 1):S196-S206.[CrossRef][Medline] |
| 18. | Chee, M. S., A. T. Bankier, S. Beck, R. Bohni, C. M. Brown, R. Cerny, T. Horsnell, C. A. Hutchison III, T. Kouzarides, J. A. Martignetti, et al. 1990. Analysis of the protein-coding content of the sequence of human cytomegalovirus strain AD169. Curr. Top. Microbiol. Immunol. 154:125-169.[Medline] |
| 19. | Chelius, D., A. F. Huhmer, C. H. Shieh, E. Lehmberg, J. A. Traina, T. K. Slattery, and E. Pungor, Jr. 2002. Analysis of the adenovirus type 5 proteome by liquid chromatography and tandem mass spectrometry methods. J. Proteome Res. 1:501-513.[CrossRef][Medline] |
| 20. | Chertova, E., O. Chertov, L. V. Coren, J. D. Roser, C. M. Trubey, J. W. Bess, Jr., R. C. Sowder II, E. Barsov, B. L. Hood, R. J. Fisher, K. Nagashima, T. P. Conrads, T. D. Veenstra, J. D. Lifson, and D. E. Ott. 2006. Proteomic and biochemical analysis of purified human immunodeficiency virus type 1 produced from infected monocyte-derived macrophages. J. Virol. 80:9039-9052. |
| 21. | Choi, I. R., D. C. Stenger, and R. French. 2000. Multiple interactions among proteins encoded by the mite-transmitted wheat streak mosaic tritimovirus. Virology 267:185-198.[CrossRef][Medline] |
| 22. | Chung, C. S., C. H. Chen, M. Y. Ho, C. Y. Huang, C. L. Liao, and W. Chang. 2006. Vaccinia virus proteome: identification of proteins in vaccinia virus intracellular mature virion particles. J. Virol. 80:2127-2140. |
| 23. | Claverie, J. M., H. Ogata, S. Audic, C. Abergel, K. Suhre, and P. E. Fournier. 2006. Mimivirus and the emerging concept of "giant" virus. Virus Res. 117:133-144.[CrossRef][Medline] |
| 24. | Cohen, G. B., V. S. Rangan, B. K. Chen, S. Smith, and D. Baltimore. 2000. The human thioesterase II protein binds to a site on HIV-1 Nef critical for CD4 down-regulation. J. Biol. Chem. 275:23097-23105. |
| 25. | Condit, R. C., N. Moussatche, and P. Traktman. 2006. In a nutshell: structure and assembly of the vaccinia virion. Adv. Virus Res. 66:31-124.[CrossRef][Medline] |
| 26. | Cooper, B., D. Eckert, N. L. Andon, J. R. Yates, and P. A. Haynes. 2003. Investigative proteomics: identification of an unknown plant virus from infected plants using mass spectrometry. J. Am. Soc. Mass Spectrom. 14:736-741.[CrossRef][Medline] |
| 27. | Cristea, I. M., J. W. Carroll, M. P. Rout, C. M. Rice, B. T. Chait, and M. R. MacDonald. 2006. Tracking and elucidating alphavirus-host protein interactions. J. Biol. Chem. 281:30269-30278. |
| 28. | Cristea, I. M., R. Williams, B. T. Chait, and M. P. Rout. 2005. Fluorescent proteins as proteomic probes. Mol. Cell. Proteomics 4:1933-1941. |
| 29. | DeFilippis, V., C. Raggo, A. Moses, and K. Fruh. 2003. Functional genomics in virology and antiviral drug discovery. Trends Biotechnol. 21:452-457.[CrossRef][Medline] |
| 30. | Edmonds, L., A. Liu, J. J. Kwan, A. Avanessy, M. Caracoglia, I. Yang, K. L. Maxwell, J. Rubenstein, A. R. Davidson, and L. W. Donaldson. 2007. The NMR structure of the gpU tail-terminator protein from bacteriophage lambda: identification of sites contributing to Mg(II)-mediated oligomerization and biological function. J. Mol. Biol. 365:175-186.[CrossRef][Medline] |
| 31. | Edwards, A. M., B. Kus, R. Jansen, D. Greenbaum, J. Greenblatt, and M. Gerstein. 2002. Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet. 18:529-536.[CrossRef][Medline] |
| 32. | Egloff, M. P., F. Ferron, V. Campanacci, S. Longhi, C. Rancurel, H. Dutartre, E. J. Snijder, A. E. Gorbalenya, C. Cambillau, and B. Canard. 2004. The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world. Proc. Natl. Acad. Sci. USA 101:3792-3796. |
| 33. | Fields, S., and O. Song. 1989. A novel genetic system to detect protein-protein interactions. Nature 340:245.[CrossRef][Medline] |
| 34. | Flajolet, M., G. Rotondo, L. Daviet, F. Bergametti, G. Inchauspe, P. Tiollais, C. Transy, and P. Legrain. 2000. A genomic approach of the hepatitis C virus generates a protein interaction map. Gene 242:369-379.[CrossRef][Medline] |
| 35. | Fogg, M. J., P. Alzari, M. Bahar, I. Bertini, J. M. Betton, W. P. Burmeister, C. Cambillau, B. Canard, M. Carrondo, M. Coll, S. Daenke, O. Dym, M. P. Egloff, F. J. Enguita, A. Geerlof, A. Haouz, T. A. Jones, Q. Ma, S. N. Manicka, M. Migliardi, P. Nordlund, R. J. Owens, Y. Peleg, G. Schneider, R. Schnell, D. I. Stuart, N. Tarbouriech, T. Unge, A. J. Wilkinson, M. Wilmanns, K. S. Wilson, O. Zimhony, and J. M. Grimes. 2006. Application of the use of high-throughput technologies to the determination of protein structures of bacterial and viral pathogens. Acta Crystallogr. D Biol. Crystallogr. 62:1196-1207.[CrossRef][Medline] |
| 36. | Fontaine-Rodriguez, E. C., T. J. Taylor, M. Olesky, and D. M. Knipe. 2004. Proteomics of herpes simplex virus infected cell protein 27: association with translation initiation factors. Virology 330:487-492.[CrossRef][Medline] |
| 37. | Gao, M., N. Brufatto, T. Chen, L. L. Murley, R. Thalakada, M. Domagala, B. Beattie, D. Mamelak, V. Athanasopoulos, D. Johnson, G. McFadden, C. Burks, and L. Frappier. 2005. Expression profiling of herpesvirus and vaccinia virus proteins using a high-throughput baculovirus screening system. J. Proteome Res. 4:2225-2235.[CrossRef][Medline] |
| 38. | Gavin, A. C., P. Aloy, P. Grandi, R. Krause, M. Boesche, M. Marzioch, C. Rau, L. J. Jensen, S. Bastuck, B. Dumpelfeld, A. Edelmann, M. A. Heurtier, V. Hoffman, C. Hoefert, K. Klein, M. Hudak, A. M. Michon, M. Schelder, M. Schirle, M. Remor, T. Rudi, S. Hooper, A. Bauer, T. Bouwmeester, G. Casari, G. Drewes, G. Neubauer, J. M. Rick, B. Kuster, P. Bork, R. B. Russell, and G. Superti-Furga. 2006. Proteome survey reveals modularity of the yeast cell machinery. Nature 440:631-636.[CrossRef][Medline] |
| 39. | Gluschankof, P., I. Mondor, H. R. Gelderblom, and Q. J. Sattentau. 1997. Cell membrane vesicles are a major contaminant of gradient-enriched human immunodeficiency virus type-1 preparations. Virology 230:125-133.[CrossRef][Medline] |
| 40. | Go, E. P., W. R. Wikoff, Z. Shen, G. O'Maille, H. Morita, T. P. Conrads, A. Nordstrom, S. A. Trauger, W. Uritboonthai, D. A. Lucas, K. C. Chan, T. D. Veenstra, H. Lewicki, M. B. Oldstone, A. Schneemann, and G. Siuzdak. 2006. Mass spectrometry reveals specific and global molecular transformations during viral infection. J. Proteome Res. 5:2405-2416.[CrossRef][Medline] |
| 41. | Guo, D., M. L. Rajamaki, M. Saarma, and J. P. Valkonen. 2001. Towards a protein interaction map of potyviruses: protein interaction matrixes of two potyviruses based on the yeast two-hybrid system. J. Gen. Virol. 82:935-939. |
| 42. | Gygi, S. P., B. Rist, S. A. Gerber, F. Turecek, M. H. Gelb, and R. Aebersold. 1999. Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat. Biotechnol. 17:994-999.[CrossRef][Medline] |
| 43. | Gygi, S. P., Y. Rochon, B. R. Franza, and R. Aebersold. 1999. Correlation between protein and mRNA abundance in yeast. Mol. Cell. Biol. 19:1720-1730. |
| 44. | Holowaty, M. N., M. Zeghouf, H. Wu, J. Tellam, V. Athanasopoulos, J. Greenblatt, and L. Frappier. 2003. Protein profiling with Epstein-Barr nuclear antigen 1 reveals an interaction with the herpesvirus-associated ubiquitin-specific protease HAUSP/USP7. J. Biol. Chem. 278:29987-29994. |
| 45. | Huang, C., X. Zhang, Q. Lin, X. Xu, |