Comparative metagenomic analyses reveal viral-induced shifts of host metabolism towards nucleotide biosynthesis
© Enav et al.; licensee BioMed Central Ltd. 2014
Received: 4 November 2013
Accepted: 13 February 2014
Published: 26 March 2014
Viral genomes often contain metabolic genes that were acquired from host genomes (auxiliary genes). It is assumed that these genes are fixed in viral genomes as a result of a selective force, favoring viruses that acquire specific metabolic functions. While many individual auxiliary genes were observed in viral genomes and metagenomes, there is great importance in investigating the abundance of auxiliary genes and metabolic functions in the marine environment towards a better understanding of their role in promoting viral reproduction.
In this study, we searched for enriched viral auxiliary genes and mapped them to metabolic pathways. To initially identify enriched auxiliary genes, we analyzed metagenomic microbial reads from the Global Ocean Survey (GOS) dataset that were characterized as viral, as well as marine virome and microbiome datasets from the Line Islands. Viral-enriched genes were mapped to a “global metabolism network” that comprises all KEGG metabolic pathways. Our analysis of the viral-enriched pathways revealed that purine and pyrimidine metabolism pathways are among the most enriched pathways. Moreover, many other viral-enriched metabolic pathways were found to be closely associated with the purine and pyrimidine metabolism pathways. Furthermore, we observed that sequential reactions are promoted in pathways having a high proportion of enriched genes. In addition, these enriched genes were found to be of modular nature, participating in several pathways.
Our naïve metagenomic analyses strongly support the well-established notion that viral auxiliary genes promote viral replication via both degradation of host DNA and RNA as well as a shift of the host metabolism towards nucleotide biosynthesis, clearly indicating that comparative metagenomics can be used to understand different environments and systems without prior knowledge of pathways involved.
KeywordsMetabolic networks Metabolism Nucleotide biosynthesis Phage Virus
Viruses are the most abundant biological entities in the marine environment [1–3], with observations of millions of viral particles per single drop of water . The high abundance of viruses combined with evidence that viral particles are a major source of planktonic mortality [5–7] and horizontal gene transfer  demonstrate the immense influence of the virosphere on the ecology and biology of the marine environment.
One of the key aspects of viral evolution is the acquisition of host genes and their fixation in the viral genomes. It was suggested that such genes improve viral fitness [9, 10] by supporting key steps in host metabolism and were therefore termed ‘auxiliary metabolic genes’ . For example, the contribution of the viral psbA gene, which is a key factor in host photosynthesis, was shown to increase the viral genome number produced during the lytic cycle of cyanophage P-SSP7 . The identification of auxiliary metabolic genes found in viral genomes is enabling a deeper understanding of the functions that increase viral fitness and the mechanism in which these functions are promoted. Recent studies have provided evidence of viral acquisition of host genes encoding enzymes in key metabolic pathways, such as phosphate metabolism [13–15], nitrogen metabolism , photosynthesis and pigment biosynthesis [9, 10, 13, 17–22], pentose phosphate pathway [23–25], nucleic acid synthesis  and others . It was suggested that these genes play a role in overcoming biochemical bottlenecks in metabolic processes , thus increasing viral fitness by enhancing specific metabolic processes during infection. Notably, while increasing viral fitness, newly acquired genes are fixed in the viral genomes at the expense of larger genome sizes, and thus only the most essential genes are expected to be enriched in the viral genomes.
In this work, two viral metagenomic datasets originating from the Global Ocean Survey (GOS) project  and the Line Islands metagenomic biomes project  were analyzed to detect statistically-enriched host-like metabolic genes in viral reads. To investigate the potential contribution of the enriched genes and pathways found to viral fitness, we mapped the enriched ortholog groups of genes onto a “global metabolism network.” The global metabolism network was constructed by connecting metabolic pathways sharing the same metabolic compounds. Our analysis demonstrates that the viral-enriched pathways have higher connectivity values and are closely associated with the purine and pyrimidine metabolism pathways, which are among the most enriched pathways in the viral metagenome. Furthermore, we show that the enriched ortholog groups are often clustered in specific parts of individual metabolic maps and enhance consecutive reactions. Overall, our data strongly support the hypothesis that the closely associated metabolic pathways play a role in shifting the host metabolism towards purine and pyrimidine biosynthesis to increase viral fitness.
Two marine viral metagenomic datasets were used in this study. The first dataset is based on data generated in the VirMic project  (http://www.cs.technion.ac.il/~itaish/VirMic/) and includes a collection of GOS scaffolds that are considered to be of viral origin (and contain microbial genes). Briefly, the metagenomic scaffolds included in the VirMic dataset were identified using a reciprocal blast approach against the Refseq viral and microbial databases . In the final screening phase of the VirMic pipeline, only scaffolds containing either three genes, out of which the two edge genes are considered to be of viral origin, or at least four genes, out of which three or more are considered to be of viral origin, were included. The original viral metagenome was comprised of metagenomic scaffolds that were converted to the corresponding reads using the “VirMic.scaffolds.gff” file and further filtered to remove possible contaminations.
The second set of data originates from the Line Islands metagenomic biomes project . These metagenomes were chosen as they allow a comparative analysis of the Line Islands viromes and microbiomes. This comparison is possible as each geographical site was sampled twice using different filtering approaches, resulting in viral and microbial datasets using the same sequencing method. Line Islands viromes analyzed were: Line Is Kingman, Line Is Christmas, Line Is Palmyra, and Line Is Tabuaeran (SEED accession: 4440036.3, 4440038.3, 4440040.3, and 4440280.3). Line Islands microbiomes analyzed were: Line Is Kingman, Line Is Christmas, Line Is Palmyra, and Line Is Tabuaeran (SEED accession: 4440037.3, 4440039.3, 4440041.3, and 4440279.3).
Advantages and limitations of GOS-based datasets and viral-size fraction metagenomes
The major analysis is based on the classification of GOS metagenomic scaffolds, as was performed by Sharon et al. . Analysis of GOS scaffolds presents several problems characteristic to this type of data. The main drawback of the GOS scaffolds results from the assembly procedure that was chosen to map the DNA reads; reads from different GOS stations (i.e., different geographical locations) were often assembled onto a single scaffold. While this tactic results in longer scaffolds and higher assembly rates, it might induce biases resulting from misassembled scaffolds. Another disadvantage of the dataset is the limited knowledge of viral diversity in oceans. The VirMic dataset was generated in 2011 and is therefore based on annotations updated in the same year. Recent discoveries, such as the identification of viruses infecting SAR11 and SAR116 bacteria [30, 31] and the following updates to existing databases, might alter the results of the classification performed by Sharon et al. . The pipeline described in this study could be used reliably to detect dominating trends in the viral metagenome such as pathways with high enrichment scores. Nevertheless, with regard to specific genes, a conservative approach should be taken, including a manual examination of the relevant GOS scaffolds and reads, in order to verify, once again, the viral origin of the specific gene. An important advantage of GOS-based data is the length of DNA reads, ~900–1000 bp/read. The length of these reads allows accurate functional annotation of the dataset.
Datasets derived from viral-size fraction metagenomes, such as the Line Islands dataset used in this project, introduce some biases and difficulties. The most prominent problem is the contamination with bacterial DNA , presumably derived from Gene Transfer Agents  or bacterial vesicles . Moreover, the short length of the reads in those datasets (~100 bp) often results in poor functional annotations and a high rate of false positive results.
Annotation, enrichment calculation, and pathway mapping
Updated annotated files (August 27, 2012) of the entire GOS reads against the KEGG database were downloaded from the MG-RAST website  (http://metagenomics.anl.gov/). In the files downloaded from MG-RAST, some reads were clustered and appeared in the following format: aa90_xxx. We converted these clusters to their corresponding reads to be employed in the downstream analysis using the 550.cluster.aa90.mapping files for each GOS station. Only annotations with e-values <1e-5 were used for downstream analysis. Annotation files for the Line Islands viromes were also downloaded from the MG-RAST website. For the Line Islands metagenomes, we considered only annotations with e-values <1e-5 and sequence identity >75%, as the shorter read length of these datasets might result in miss-annotations.
where N is the number of KO annotations found in the GOS project, n is the number of KO annotations in the VirMic dataset, K is the number of reads annotated as a specific KO in the GOS project, and k is the number of the reads annotated as the same specific KO in the VirMic dataset. KOs with a P <1 × 10-4 were considered enriched (Additional file 1: Table S1); these KOs were mapped to KEGG metabolic maps using publically available data for each KO in the KEGG website. In order to analyze the Line Islands virome, we modified our pipeline using the Fisher exact test instead of the hyper geometric test, as the Line Islands metagenomes are divided into two corresponding sets (microbiome and virome) and not into set and subset, as in the case of the GOS and VirMic.
where ES is the pathway enrichment score, N is the number of KOs that participated in the pathway and were found enriched in the viral metagenome, Pn is the hypergeometric P value for each enriched KO, and T is the total number of KOs in the specific KEGG pathway.
Global metabolism network generation and analysis
To generate the global metabolism network, we used the 148 KEGG metabolic pathways that comprise the “metabolic pathways” in the KEGG database (KO01100). We treated each metabolic pathway as a node in the network; if one of the products of pathway A is compound c1 and the substrate of pathway B is compound c1, we created an edge connecting the two nodes, employing a similar approach to the one performed by Kreimer et al. . The edges in this network were defined as having equal weights and are non-directional. Network visualization and analysis was performed using the cytoscape software platform [36, 37].
Metabolic motif identification and examination
where P is the probability of finding a reaction promoted by a viral-enriched KO in a specific pathway, n is the total number of reactions containing compound c (excluding the previous reaction of the metabolic motif), and y is the number of reactions promoted by enriched KOs that contain compound c.
Results and discussion
Detection of viral-enriched host-like metabolic genes
Metabolic pathways found to be enriched in VirMic
KO’s in the pathway
K00525, K00526, K00762, K01493, K02323, K02335, K03006, K03465, K10807, K10808
K02314, K02335, K03111, K10755
K00525, K00526, K00860, K00939, K01768, K02323, K02335, K03006, K10807, K10808
Fructose and mannose metabolism
K00971, K01623, K01711, K01809, K02377
One carbon pool by folate
Carbon fixation in photosynthetic organisms
Amino sugar and nucleotide sugar metabolism
K00971, K01711, K01809, K02377
Nucleotide excision repair
K00124, K01079, K01623, K08350
Pentose phosphate pathway
Fatty acid biosynthesis
Pantothenate and CoA biosynthesis
Glyoxylate and dicarboxylate metabolism
Meiosis - yeast
Glycine, serine and threonine metabolism
Sulfur relay system
Cell cycle – Caulobacter
Fatty acid metabolism
Bacterial secretion system
Biosynthesis of vancomycin group antibiotics
Polyketide sugar unit biosynthesis
Arginine and proline metabolism
Cysteine and methionine metabolism
Base excision repair
Cell cycle – yeast
Vibrio cholerae pathogenic cycle
p53 signaling pathway
Protein processing in endoplasmic reticulum
Herpes simplex infection
Epstein-Barr virus infection
While we do expect contamination with cellular sequences in the viral-size fraction metagenomic set , we observed a significant overlap between the two enriched groups of pathways (39 pathways that were found enriched in the VirMic dataset were found enriched in the Line Islands virome, P value = 1.6 × 10-20, Additional file 2: Figure S1; overlapping pathways are italicized in Table 1). The Line Islands metagenomes consist of DNA reads having an average length of ~100 bp that yield less reliable KO annotations compared to the VirMic annotations (read lengths are 900–1000 bp). Therefore, considering the high overlap between the groups of enriched pathways, we decided to use only the VirMic annotations for further analysis.
Global metabolism map and network
Viral-enriched pathways play a central role in the global metabolism network
Pathways enriched in the viral metagenome are associated with purine and pyrimidine metabolism pathways
Two pathways having the highest enrichment scores were purine and pyrimidine metabolism pathways (Table 1). The importance of these pathways to viral replication has been demonstrated previously . The presence of genes enhancing these functions was reported in both the genomes of cultured marine viruses [13, 16] and in viromes . We examined the association of purine and pyrimidine metabolism pathways with other pathways in the global metabolism network, especially other viral-enriched pathways. We hypothesized that a high association between purine and pyrimidine metabolism pathways and other enriched pathways might be the result of the enhancement of downstream metabolic processes that lead to purine and pyrimidine synthesis, while a low association can indicate that these pathways are independent.
Overall, as can be observed in the global metabolism network, most of the viral-enriched metabolic pathways are closely associated with purine and pyrimidine metabolism pathways. We hypothesize that the close association between enriched pathways and the relatively short path length connecting the enriched pathways to the purine and pyrimidine metabolism pathways results in a shift in cellular metabolic flux towards nucleotide synthesis. Therefore, it is possible that the fixation of metabolic pathways other than purine and pyrimidine metabolism also enhances nucleotide synthesis, and as a result, higher burst sizes and increased viral fitness.
While the existence of enzymes involved in purine and pyrimidine metabolism pathways in viral genomes and metagenomes has been reported previously [13, 16, 28], by employing a naïve pathway enrichment test, we were able to show quantitatively that these are the most significantly enriched pathways in viral metagenomes. The high enrichment score of these two pathways and the relatively high proportion of enriched KOs in the pathways (Table 1) reinforce that these two metabolic functions are highly important for viral replication and are fixed at the expense of increasing genome size.
As aforementioned, we found 13 viral-enriched KOs that belong to purine (Additional file 3: Figure S2) and pyrimidine (Additional file 4: Figure S3) metabolism pathways. An examination of the distribution of the enriched KOs within the purine and pyrimidine metabolism pathway graphs revealed that most of the reactions that are enhanced are located around the two end-products of these pathways, DNA and RNA. While this distribution can be explained by shifting the flux of these pathways towards the end-products, an appealing explanation is that the enhanced reactions promoting the incorporation of nucleic acids into DNA and RNA are bidirectional. Therefore, the enriched KOs might promote the degradation of the host genome and transcriptome in order to reuse its building blocks to replicate and transcribe the viral genome. This hypothesis is supported by the enrichment of enzymes promoting the conversion of nucleoside diphosphate (NDP) to deoxynucleoside diphosphate (dNDP) (ribonucleoside-diphosphate reductase alpha and beta chains (K00525, K00526), ribonucleoside-diphosphate reductase subunits M1 and M2 (K10807, K10808)). Genes enhancing the conversion of NDP to dNDP were observed previously in T4 phage  as well as in cyanophage genomes (S-PM2, P60), and it was assumed that they are potentially responsible for the scavenging of ribonucleotides for DNA synthesis [42, 43].
Pathways enriched in the viral metagenome
One of the viral-enriched metabolic pathways found to be directly associated with both purine and pyrimidine metabolism pathways (using the global metabolism network graph) is the pentose phosphate pathway. Genes belonging to the pentose phosphate pathway were detected previously in viruses infecting cultured cyanobacteria [13, 23, 24], including glucose-6-phosphate dehydrogenase (K00036), 6-phosphogluconate dehydorgenase (K00033), and transaldolase (K01623). Moreover, a recent study demonstrated that cyanophages carry a gene (cp12) that inhibits the Calvin cycle and therefore creates a shift in metabolic flux, and as a result enhances pentose phosphate pathway . Two statistically-enriched KOs that are part of the pentose phosphate pathway were found enriched in this study: K01623 (transaldolase) and K01808 (RpiB). RpiB catalyzes the bidirectional conversion of D-ribulose 5-phosphate to D-ribose 5-phosphate; 6-phosphate dehydrogenase and 6-phosphogluconate dehydrogenase promote two of the three unidirectional reactions that result in elevated concentrations of D-ribulose 5-phosphate. Therefore, the substrate/products ratio for the reaction promoted by RpiB could result in the conversion of D-ribulose 5-phosphate to D-ribose 5-phosphate. Higher levels of D-ribose 5-phosphate can facilitate its conversion to 5-phosphoribosyl diphosphate, which is a compound used in purine and pyrimidine metabolism pathways, where it is converted to orothidine-5p by orotate phosphoribosyltransferase (pyrE, K00762), which was also enriched in the viral metagenome (Additional file 5: Figure S4). Therefore, we suggest that the viral-enriched enzymes of the pentose phosphate pathway promote nucleotide synthesis not only by increasing the generation of reducing power (NADPH), but also by higher synthesis rates of metabolic precursors to be processed in the purine and pyrimidine metabolism pathways.
We examined in detail the VirMic scaffolds containing the rpiB gene (JCVI_SCAF_1096627104658/1096628255974/1096627055600), and determined that they originate from cyanophage genomes based on their gene content (blastn against database containing the abundant SAR11  and SAR116  phages, updated in December 2012) and genomic composition. The presence of the rpiB gene in scaffolds associated with phages infecting cyanobacteria is of great interest as there is no evidence for the existence of rpiB in cyanobacteria. Nevertheless, rpiA, whose product promotes the same reaction, is found frequently in cyanobacteria genomes. Two main differences arise when inspecting these two paralogs, which surprisingly do not share a sequence or structural similarity : the rpiB gene is significantly shorter than the rpiA gene (e.g., 450 nt compared to 660 nt in Escherichia coli), and its product was shown to be involved in the metabolism of the rare sugar allose in addition to ribose . Based on the enrichment of rpiB in the viral subset, we suggest that the fixation of the rpiB gene in cyanophage genomes (and not rpiA) is preferred due to its shorter length, which allows replication, translation, and transcription at lower energy costs. This observation is consistent with previous studies, which demonstrated that viral gene length is about 2/3 of their bacterial homologs [23, 46].
Among the enriched KOs in the viral metagenome, we also detected two enriched KOs that belong to two-component system pathways: K02040, the pstS gene related to phosphate sensing detected previously in marine cyanophage genomes ; and marine metagenomes  and K08350, the fdnI gene related to nitrate sensing . Two-component systems are fundamental prokaryotic stimulus–response mechanisms that allow modifications in cellular gene expression and metabolism as a response to environmental conditions . Cyanophage pstS was recently shown to be up-regulated in phosphate starvation conditions , and it was therefore suggested that fixation of this gene in phage genomes allows higher viral fitness through modification of the host P-acquisition. The formate-dehydrogenase-N gene (fdnI, K08350) is a member of the two-component system pathway that is regulated apparently by extra cellular nitrite/nitrate sensing . We postulate that this member of the two-component system allows viral modification of the host metabolism in a similar manner to the pstS gene. While pstS genes are detected both in cultured and environmental marine cyanophage genomes, to the best of our knowledge, this is the first detection of viral enrichment of fdnI genes.
KOs enriched in the viral metagenome promote sequential reactions within pathways
It has been hypothesized previously that auxiliary genes overcome metabolic bottlenecks within specific metabolic pathways . To test this hypothesis, we examined in detail metabolic pathways that included the highest number of enriched KOs. Among them were four metabolic pathways that contained ≥4 enriched KOs: purine metabolism (KO00230); pyrimidine metabolism (KO00240); fructose and mannose metabolism (KO00051); and the amino sugar and nucleotide sugar metabolism (KO00520). Interestingly, we found that the enriched KOs tend to be clustered in specific regions of the metabolic pathways and often promote sequential reactions. Sequential reactions comprised of at least four reactions in which at most one KO is not enriched in the viral metagenome were defined as “metabolic motifs.” We calculated the probability of randomly finding such motifs within the enriched pathways assuming a binomial distribution. Employing successive Bernoulli experiments (see Methods), we identified 12 occurrences of significant motifs (Additional file 6: Table S2). The presence of viral-enriched metabolic motifs suggests that viral host-like genes do not only enhance reactions that are “metabolic bottlenecks” within pathways, but also promote metabolic processes with a higher degree of complexity.
Analysis of significant metabolic motifs in the different pathways revealed an interesting overlap between them. For example, the metabolic motif found in the fructose and mannose metabolism pathway (Additional file 7: Figure S5) is composed of the same KOs (in the same order) as the metabolic motif found in the amino sugar and nucleotide sugar metabolism pathway (Additional file 8: Figure S6). Such modularity was also found in the purine metabolism pathways (Additional file 3: Figure S2), where two metabolic motifs comprised of the same KOs were identified in the same order. We hypothesize that viral host-like genes are under the selection of two complementary forces: the first selective force favors the fixation of functional genes with a reduced gene length in order to minimize energy costs, as was previously suggested [23, 46]; the second selective force favors the fixation of genes that promote critical reactions coordinately in a number of pathways, therefore allowing for the enhancement of two (or more) metabolic processes at the expense of fixing a single gene.
Modifications of host metabolism by viruses, which result in elevated nucleotide biosynthesis, were suggested previously based on several cultured model viruses [26, 51, 52]. Our study, based on viral metagenomes representing the marine viral gene pool, demonstrates the importance of this metabolic function to viral fitness, as reflected by the high enrichment of the purine and pyrimidine metabolism pathways across the marine viral gene pool. Moreover, our analysis, based on the global metabolism network, suggests that many other viral-enriched pathways are closely associated with the purine and pyrimidine metabolism pathways. We also observed the enrichment of viral genes responsible for the degradation of cellular DNA and RNA, as well as for the conversion of ribonucleotides to deoxyribonucleotides. Therefore, we hypothesize that metabolic genes carried in marine viral genomes expand the nucleotide pool in infected hosts using two combined strategies: i) recycling the building blocks of the cellular genome and transcriptome; and ii) shifting the host metabolism in order to provide substrates for de novo synthesis of purine and pyrimidine.
Global Ocean Survey
We would like to thank I. Paz for her graphical assistance. This work was supported by grant 580/10 of the Israel Science Foundation (OB).
- Suttle CA: Marine viruses – major players in the global ecosystem. Nat Rev Microbiol. 2007, 5: 801-812. 10.1038/nrmicro1750.View ArticlePubMedGoogle Scholar
- Breitbart M: Marine viruses: truth or dare. Ann Rev Mar Sci. 2012, 4: 425-448. 10.1146/annurev-marine-120709-142805.View ArticlePubMedGoogle Scholar
- Fuhrman JA: Marine viruses and their biogeochemical and ecological effects. Nature. 1999, 399: 541-548. 10.1038/21119.View ArticlePubMedGoogle Scholar
- Bergh O, Borsheim KY, Bratbak G, Heldal M: High abundance of viruses found in aquatic environments. Nature. 1989, 340: 467-468. 10.1038/340467a0.View ArticlePubMedGoogle Scholar
- Proctor LM, Fuhrman JA: Viral mortality of marine bacteria and cyanobacteria. Nature. 1990, 343: 60-62. 10.1038/343060a0.View ArticleGoogle Scholar
- Suttle CA, Chan AM, Cottrell MT: Infection of phytoplankton by viruses and reduction of primary productivity. Nature. 1990, 347: 467-469. 10.1038/347467a0.View ArticleGoogle Scholar
- Nagasaki K: Dinoflagellates, diatoms, and their viruses. J Microbiol. 2008, 46: 235-243. 10.1007/s12275-008-0098-y.View ArticlePubMedGoogle Scholar
- McDaniel LD, Young E, Delaney J, Ruhnau F, Ritchie KB, Paul JH: High frequency of horizontal gene transfer in the oceans. Science. 2010, 330: 50-10.1126/science.1192243.View ArticlePubMedGoogle Scholar
- Lindell D, Jaffe JD, Johnson ZI, Church GM, Chisholm SW: Photosynthesis genes in marine viruses yield proteins during host infection. Nature. 2005, 438: 86-89. 10.1038/nature04111.View ArticlePubMedGoogle Scholar
- Dammeyer T, Bagby SC, Sullivan MB, Chisholm SW, Frankenberg-Dinkel N: Efficient phage-mediated pigment biosynthesis in oceanic cyanobacteria. Curr Biol. 2008, 18: 442-448.View ArticlePubMedGoogle Scholar
- Breitbart M, Thompson LR, Suttle CA, Sullivan MB: Exploring the vast diversity of marine viruses. Oceanography. 2007, 20: 135-139.View ArticleGoogle Scholar
- Bragg JG, Chisholm SW: Modeling the fitness consequences of a cyanophage-encoded photosynthesis gene. PLoS ONE. 2008, 3: e3550-10.1371/journal.pone.0003550.View ArticlePubMedPubMed CentralGoogle Scholar
- Sullivan MB, Coleman ML, Weigele P, Rohwer F, Chisholm SW: Three Prochlorococcus cyanophage genomes: signature features and ecological interpretations. PLoS Biol. 2005, 3: e144-10.1371/journal.pbio.0030144.View ArticlePubMedPubMed CentralGoogle Scholar
- Martiny AC, Huang Y, Li W: Occurrence of phosphate acquisition genes in Prochlorococcus cells from different ocean regions. Environ Microbiol. 2009, 11: 1340-1347. 10.1111/j.1462-2920.2009.01860.x.View ArticlePubMedGoogle Scholar
- Kathuria S, Martiny AC: Prevalence of a calcium-based alkaline phosphatase associated with the marine cyanobacterium Prochlorococcus and other ocean bacteria. Environ Microbiol. 2011, 13: 74-83. 10.1111/j.1462-2920.2010.02310.x.View ArticlePubMedGoogle Scholar
- Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, Weigele PR, Defrancesco AS, Kern SE, Thompson LR, Young S, Yandava C, Fu R, Krastins B, Chase M, Sarracino D, Osburne MS, Henn MR, Chisholm SW: Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments. Environ Microbiol. 2010, 12: 3035-3056. 10.1111/j.1462-2920.2010.02280.x.View ArticlePubMedPubMed CentralGoogle Scholar
- Lindell D, Sullivan MB, Johnson ZI, Tolonen AC, Rohwer F, Chisholm SW: Transfer of photosynthesis genes to and from Prochlorococcus viruses. Proc Natl Acad Sci USA. 2004, 101: 11013-11018. 10.1073/pnas.0401526101.View ArticlePubMedPubMed CentralGoogle Scholar
- Loy A, Duller S, Baranyi C, Mussmann M, Ott J, Sharon I, Béjà O, Le Paslier D, Dahl C, Wagner M: Reverse dissimilatory sulfite reductase as phylogenetic marker for a subgroup of sulfur-oxidizing prokaryotes. Environ Microbiol. 2009, 11: 289-299. 10.1111/j.1462-2920.2008.01760.x.View ArticlePubMedPubMed CentralGoogle Scholar
- Alperovitch-Lavy A, Sharon I, Rohwer F, Aro EM, Milo R, Nelson N, Beja O: Reconstructing a puzzle: existence of cyanophages containing both photosystem-I & photosystem-II gene-suites inferred from oceanic metagenomic datasets. Environ Microbiol. 2011, 13: 24-32. 10.1111/j.1462-2920.2010.02304.x.View ArticlePubMedGoogle Scholar
- Sharon I, Alperovitch A, Rohwer F, Haynes M, Glaser F, Atamna-Ismaeel N, Pinter RY, Partensky F, Koonin EV, Wolf YI, Nelson N, Béjà O: Photosystem I gene cassettes are present in marine virus genomes. Nature. 2009, 461: 258-262. 10.1038/nature08284.View ArticlePubMedPubMed CentralGoogle Scholar
- Zeidner G, Bielawski JP, Shmoish M, Scanlan DJ, Sabehi G, Béjà O: Potential photosynthesis gene recombination between Prochlorococcus and Synechococcus via viral intermediates. Environ Microbiol. 2005, 7: 1505-1513. 10.1111/j.1462-2920.2005.00833.x.View ArticlePubMedGoogle Scholar
- Sullivan MB, Lindell D, Lee JA, Thompson LR, Bielawski JP, Chisholm SW: Prevalence and evolution of core photosystem II genes in marine cyanobacterial viruses and their hosts. PLoS Biol. 2006, 4: e234-10.1371/journal.pbio.0040234.View ArticlePubMedPubMed CentralGoogle Scholar
- Thompson LR, Zeng Q, Kelly L, Huang KH, Singer AU, Stubbe J, Chisholm SW: Phage auxiliary metabolic genes and the redirection of cyanobacterial host carbon metabolism. Proc Natl Acad Sci U S A. 2011, 108: e757-e764. 10.1073/pnas.1102164108.View ArticlePubMedPubMed CentralGoogle Scholar
- Millard A, Clokie MRJ, Shub DA, Mann NH: Genetic organization of the psbAD region in phages infecting marine Synechococcus strains. Proc Natl Acad Sci U S A. 2004, 101: 11007-11012. 10.1073/pnas.0401478101.View ArticlePubMedPubMed CentralGoogle Scholar
- Sharon I, Battchikova N, Aro E-M, Giglione C, Meinnel T, Glaser F, Pinter RY, Breitbart M, Rohwer F, Béjà O: Comparative metagenomics of microbial traits within oceanic viral communities. ISME J. 2011, 5: 1178-1190. 10.1038/ismej.2011.2.View ArticlePubMedPubMed CentralGoogle Scholar
- Miller ES, Kutter E, Mosig G, Arisaka F, Kunisawa T, Ruger W: Bacteriophage T4 genome. Microbiol Mol Biol Rev. 2003, 67: 86-156. 10.1128/MMBR.67.1.86-156.2003.View ArticlePubMedPubMed CentralGoogle Scholar
- Rusch DB, Halpern AL, Heidelberg KB, Sutton G, Williamson SJ, Yooseph S, Wu D, Eisen JA, Hoffman J, Remington K, Beeson K, Tran B, Smith H, Baden-Tillson H, Stewart C, Thorpe J, Freeman J, Andrews-Pfannkoch C, Venter JE, Li K, Kravitz S, Heidelberg JF, Utterback T, Rogers YH, Falcón LI, Souza V, Bonilla-Rosso G, Eguiarte LE, Karl DM, Sathyendranath S: The sorcerer II global ocean sampling expedition: I, the northwest Atlantic through the eastern tropical Pacific. PLoS Biol. 2007, 5: e77-10.1371/journal.pbio.0050077.View ArticlePubMedPubMed CentralGoogle Scholar
- Dinsdale EA, Edwards RA, Hall D, Angly F, Breitbart M, Brulc JM, Furlan M, Desnues C, Haynes M, Li L, McDaniel L, Moran MA, Nelson KE, Nilsson C, Olson R, Paul J, Brito BR, Ruan Y, Swan BK, Stevens R, Valentine DL, Thurber RV, Wegley L, White BA, Rohwer F: Functional metagenomic profiling of nine biomes. Nature. 2008, 452: 629-632. 10.1038/nature06810.View ArticlePubMedGoogle Scholar
- Tatusova T, Ciufo S, Fedorov B, O’Neill K, Tolstoy I: RefSeq microbial genomes database: new representation and annotation strategy. Nucleic Acids Res. 2014, 42: D553-D559. 10.1093/nar/gkt1274.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhao Y, Temperton B, Thrash JC, Schwalbach MS, Vergin KL, Landry ZC, Ellisman M, Deerinck T, Sullivan MB, Giovannoni SJ: Abundant SAR11 viruses in the ocean. Nature. 2013, 494: 357-360. 10.1038/nature11921.View ArticlePubMedGoogle Scholar
- Kang I, Oh H-M, Kang D, Cho J-C: Genome of a SAR116 bacteriophage shows the prevalence of this phage type in the oceans. Proc Natl Acad Sci U S A. 2013, 110: 12343-12348. 10.1073/pnas.1219930110.View ArticlePubMedPubMed CentralGoogle Scholar
- Roux S, Krupovic M, Debroas D, Forterre P, Enault F: Assessment of viral community functional potential from viral metagenomes may be hampered by contamination with cellular sequences. Open Biol. 2013, 3: 130160-10.1098/rsob.130160.View ArticlePubMedPubMed CentralGoogle Scholar
- Biller SJ, Schubotz F, Roggensack SE, Thompson AW, Summons RE, Chisholm SW: Bacterial vesicles in marine ecosystems. Science. 2014, 343: 183-186. 10.1126/science.1243457.View ArticlePubMedGoogle Scholar
- Meyer F, Paarmann D, D’Souza M, Olson R, Glass EM, Kubal M, Paczian T, Rodriguez A, Stevens R, Wilke A, Wilkening J, Edwards RA: The Metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinforma. 2008, 9: 386-10.1186/1471-2105-9-386.View ArticleGoogle Scholar
- Kreimer A, Borenstein E, Gophna U, Ruppin E: The evolution of modularity in bacterial metabolic networks. Proc Natl Acad Sci U S A. 2008, 105: 6976-6981. 10.1073/pnas.0712149105.View ArticlePubMedPubMed CentralGoogle Scholar
- Cline MS, Smoot M, Cerami E, Kuchinsky A, Landys N, Workman C, Christmas R, Avila-Campilo I, Creech M, Gross B, Hanspers K, Isserlin R, Kelley R, Killcoyne S, Lotia S, Maere S, Morris J, Ono K, Pavlovic V, Pico AR, Vailaya A, Wang PL, Adler A, Conklin BR, Hood L, Kuiper M, Sander C, Schmulevich I, Schwikowski B, Warner GJ: Integration of biological networks and gene expression data using cytoscape. Nat Protocols. 2007, 2: 2366-2382. 10.1038/nprot.2007.324.View ArticlePubMedGoogle Scholar
- Smoot ME, Ono K, Ruscheinski J, Wang P-L, Ideker T: Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011, 27: 431-432. 10.1093/bioinformatics/btq675.View ArticlePubMedPubMed CentralGoogle Scholar
- Kanehisa M, Goto S: KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28: 27-30. 10.1093/nar/28.1.27.View ArticlePubMedPubMed CentralGoogle Scholar
- Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M: KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012, 40: D109-D114. 10.1093/nar/gkr988.View ArticlePubMedPubMed CentralGoogle Scholar
- Eden E, Navon R, Steinfeld I, Lipson D, Yakhini Z: GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinforma. 2009, 10: 48-10.1186/1471-2105-10-48.View ArticleGoogle Scholar
- Berglund O, Karlstrom O, Reichard P: A new ribonucleotide reductase system after infection with phage T4. Proc Natl Acad Sci U S A. 1969, 62: 829-835. 10.1073/pnas.62.3.829.View ArticlePubMedPubMed CentralGoogle Scholar
- Mann NH, Clokie MRJ, Millard A, Cook A, Wilson WH, Wheatley PG, Letarov A, Krisch HM: The genome of S-PM2, a “photosynthetic” T4-type bacteriophage that infects marine Synechococcus strains. J Bacteriol. 2005, 187: 3188-3200. 10.1128/JB.187.9.3188-3200.2005.View ArticlePubMedPubMed CentralGoogle Scholar
- Chen F, Lu J: Genomic sequence and evolution of marine cyanophage P60: a new insight on lytic and lysogenic phages. Appl Environ Microbiol. 2002, 68: 2589-2594. 10.1128/AEM.68.5.2589-2594.2002.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhang RG, Andersson CE, Skarina T, Evdokimova E, Edwards AM, Joachimiak A, Savchenko A, Mowbray SL: The 2.2 Å resolution structure of RpiB/AlsB from Escherichia coli illustrates a new approach to the ribose-5-phosphate isomerase reaction. J Mol Biol. 2003, 332: 1083-1094. 10.1016/j.jmb.2003.08.009.View ArticlePubMedPubMed CentralGoogle Scholar
- Poulsen TS, Chang YY, Hove-Jensen B: D-Allose catabolism of Escherichia coli: involvement of alsI and regulation of als regulon expression by allose and ribose. J Bacteriol. 1999, 181: 7126-7130.PubMedPubMed CentralGoogle Scholar
- Rappoport N, Linial M: Viral proteins acquired from a host converge to simplified domain architectures. PLoS Comput Biol. 2012, 8: e1002364-10.1371/journal.pcbi.1002364.View ArticlePubMedPubMed CentralGoogle Scholar
- Williamson SJ, Rusch DB, Yooseph S, Halpern AL, Heidelberg KB, Glass JI, Andrews-Pfannkoch C, Fadrosh D, Miller CS, Sutton G, Frazier M, Venter JC: The sorcerer II global ocean sampling expedition: metagenomic characterization of viruses within aquatic microbial samples. PLoS ONE. 2008, 3: e1456-10.1371/journal.pone.0001456.View ArticlePubMedPubMed CentralGoogle Scholar
- Berg BL, Li J, Heider J, Stewart V: Nitrate-inducible formate dehydrogenase in Escherichia coli K-12. I. Nucleotide sequence of the fdnGHI operon and evidence that opal (UGA) encodes selenocysteine. J Biol Chem. 1991, 266: 22380-22385.PubMedGoogle Scholar
- Stock AM, Robinson VL, Goudreau PN: Two-component signal transduction. Annu Rev Biochem. 2000, 69: 183-215. 10.1146/annurev.biochem.69.1.183.View ArticlePubMedGoogle Scholar
- Zeng Q, Chisholm SW: Marine viruses exploit their host’s two-component regulatory system in response to resource limitation. Curr Biol. 2012, 22: 124-128. 10.1016/j.cub.2011.11.055.View ArticlePubMedGoogle Scholar
- Qimron U, Tabor S, Richardson CC: New details about bacteriophage T7-host interactions. Microbe. 2010, 5: 117-122.Google Scholar
- Molineux IJ: The T7 Group. The Bacteriophages. Edited by: Calendar R. 2006, New York, NY: Oxford University Press, 277-301.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.