Biogeographic traits of dimethyl sulfide and dimethylsulfoniopropionate cycling in polar oceans

Dimethyl sulfide (DMS) is the dominant volatile organic sulfur in global oceans. The predominant source of oceanic DMS is the cleavage of dimethylsulfoniopropionate (DMSP), which can be produced by marine bacteria and phytoplankton. Polar oceans, which represent about one fifth of Earth’s surface, contribute significantly to the global oceanic DMS sea-air flux. However, a global overview of DMS and DMSP cycling in polar oceans is still lacking and the key genes and the microbial assemblages involved in DMSP/DMS transformation remain to be fully unveiled. Here, we systematically investigated the biogeographic traits of 16 key microbial enzymes involved in DMS/DMSP cycling in 60 metagenomic samples from polar waters, together with 174 metagenome and 151 metatranscriptomes from non-polar Tara Ocean dataset. Our analyses suggest that intense DMS/DMSP cycling occurs in the polar oceans. DMSP demethylase (DmdA), DMSP lyases (DddD, DddP, and DddK), and trimethylamine monooxygenase (Tmm, which oxidizes DMS to dimethylsulfoxide) were the most prevalent bacterial genes involved in global DMS/DMSP cycling. Alphaproteobacteria (Pelagibacterales) and Gammaproteobacteria appear to play prominent roles in DMS/DMSP cycling in polar oceans. The phenomenon that multiple DMS/DMSP cycling genes co-occurred in the same bacterial genome was also observed in metagenome assembled genomes (MAGs) from polar oceans. The microbial assemblages from the polar oceans were significantly correlated with water depth rather than geographic distance, suggesting the differences of habitats between surface and deep waters rather than dispersal limitation are the key factors shaping microbial assemblages involved in DMS/DMSP cycling in polar oceans. Overall, this study provides a global overview of the biogeographic traits of known bacterial genes involved in DMS/DMSP cycling from the Arctic and Antarctic oceans, laying a solid foundation for further studies of DMS/DMSP cycling in polar ocean microbiome at the enzymatic, metabolic, and processual levels. 6bJ8nkA7sq-T64bgHw5GYL Video Abstract Video Abstract


Introduction
The volatile organosulfur compound dimethyl sulfide (DMS) is the main source of marine sulfate aerosols [1], a key player in the global sulfur cycle [2], and an important nutrient for many organisms (e.g., marine algae [3], coral reefs [4], and heterotrophic bacteria [5]). Although DMS can be produced and removed by a variety of abiotic processes, biological transformations, particularly bacterial production and consumption, exert great influence on the oceanic DMS budget [6].
The predominant source of oceanic DMS is bacterial cleavage of dimethylsulfoniopropionate (DMSP). DMS can also be produced by direct cleavage of intracellular DMSP in DMSP-producing phytoplankton [7]. DMSP cleavage is mediated via several known DMSP lyases, including an algal DMSP lyase (Alma1) [7] and 7 bacterial DMSP lyases (DddD, DddL, DddY, DddQ, DddK, DddW, and DddP) ( Table 1, Fig. 1) [6]. Dimethylsulfoxide (DMSO) is another precursor of DMS, which is ubiquitous in surface ocean waters, the sea-ice zone and sediments [25][26][27]. DMSO can be reduced to DMS in marine algae although the enzymes involved remain to be identified [3]. In bacteria, DMSO reduction to DMS is carried out by the DMSO reductase, DMSOR ( Fig. 1) [28]. Moreover, DMS production can also be mediated by the microbial transmethylation of methanethiol (MeSH) via a methyltransferase MddA (Fig. 1) [24]. Similar to DMS, MeSH is also a volatile organic sulfur compound [29]. The transformation of MeSH to DMS plays a role in DMS production in both marine and terrestrial environments [24,30].
Bacterial oxidation is the primary process for DMS removal in the marine environment [31]. Microbial oxidation of DMS to DMSO represents a major sink of DMS in surface seawater [32]. Three enzymes capable of DMS oxidation have been identified, the multicomponent monooxygenase DsoABCDEF [21], the DMS dehydrogenase DdhABC [22], and the flavin-containing trimethylamine (TMA) monooxygenase Tmm [20]. DMS can also be converted to MeSH by the two-component DMS monooxygenase DmoAB (Fig. 1) [23].
The Arctic and Antarctic are two of the most geographically separated bioregions on Earth with extreme environmental conditions. High concentrations of DMS/ DMSP have been detected in both the Arctic Ocean and the Southern Ocean (Table 2). Indeed, the world's highest concentration of DMS in marine surface water was recorded in the Southern Ocean, which contributes significantly to the global oceanic DMS sea-air flux [60,61]. Previous studies on Arctic and Antarctic DMS/ DMSP cycling mainly focused on quantifying the spatial and temporal concentrations as well as the turnover rates of these compounds [44,55]. Investigations on the abundance and diversity of potential genes involved in DMS/DMSP cycling in polar oceans are limited to a few selected genes involved in DMSP degradation, e.g., DmdA, DddD, DddL, and DddP, via metagenomics [62,63], qPCR [37], or gene clone library analyses [64,65]. With the global warming threat, the polar regions are experiencing rapid changes including sea ice melting [66,67] that is known to correlate with the reduced production of DMS/DMSP [61,68], and this, in turn, may feedback to the global climate. Thus, interpreting the biogeographic traits of DMS/DMSP cycling in Arctic and Antarctic oceans is an urgent task. We postulate that the biogeographic traits of DMS/DMSP cycling in polar oceans may be similar and are less affected by dispersal limitation since similar microbial community structure was observed in these regions [69]. Moreover, considering high concentrations and fast turnover rates of DMS/DMSP have been recorded in polar oceans [26,27], we hypothesize that genes involved in DMS/DMSP cycling are common in polar ocean microbiome. In this study, we set out to systematically uncover the distribution and abundance of 16 functional microbial enzymes involved in DMS/DMSP cycling (Table 1) in the Arctic and Antarctic oceans via metagenomic and metatranscriptomic analyses in order to gain a global overview of microbial transformation of DMS/DMSP in polar oceans.
The functionally ratified protein sequences (Table 3), namely MmtN, DsyB, DddD, DddK, DddP, DddQ, DddW, DddL, DddY, DmdA, DMSOR, Tmm, DsoB (a key catalytic subunit of monooxygenase DsoABCDEF), DdhA (the catalytic subunit of DMS dehydrogenase DdhABC), MddA, and DmoA (the catalytic subunit of DMS monooxygenase DmoAB), were obtained from the National Center for Biotechnology Information (NCBI) database (https://www.ncbi.nlm.nih.gov/) or the IMG/M database [71]. Homologues of DsoB and DmoA in metagenomes/metatranscriptomes were obtained using BLASTP, since both of which only had one biochemically characterized protein (Table 3). For the other 14 proteins involved in DMS/DMSP cycling, hidden Markov models (HMM) were created for each enzyme using protein sequences that are biochemically or structurally characterized and their homologues from metagenomes/ metatranscriptomes were obtained using hmmsearch (http://hmmer.org). The cutoff values used were selected based on established stringency cutoff values from previous reports ( Table 3). The sequences retrieved from our bioinformatics pipeline were further scrutinized for the presence of key residues involved in substrate binding or catalysis and/or validated through protein purification and further biochemical characterization (see below). The amino acid sequences of 10 conserved bacterial marker genes [75] were retrieved from the NCBI database, and the average abundance of these marker genes was used to normalize the abundance of the genes involved in DMS/DMSP cycling in metagenomic and metatranscriptomic datasets as described previously [10].
Finally, to validate the function of predicted hits of the top 5 most abundant enzymes from our datasets, we randomly selected several environmental sequences from each group, chemically synthesized these genes, and overexpressed them in recombinant Escherichia coli for functional characterization of their enzyme activities. These included DddD (2 sequences), DddP (2 sequences), DddK (2 sequences), DmdA (1 sequence), and Tmm (2 sequences) ( Table S6). The nucleotide sequences of these 9 hits were synthesized by BGI (Beijing, China), cloned, and overexpressed using the pET22b plasmid in Escherichia coli BL21 (DE3). These proteins were purification as described previously [84] and their activities were measured following the protocols from previous reports (Table S6) [12,32,41,72,80]. The newly identified DddX was not analysed in this study [85]. DddX homologs returned from Tara Oceans datasets are usually short and do not always contain the full open reading frame, making it difficult for gene synthesis and overexpression in E. coli for functional validation.

Taxonomic profiling
The amino acid sequences of predicted DMS/DMSP cycling-related genes from these metagenomes/metatranscriptomes were extracted using scripts compiled in Python code and aligned against the non-redundant protein sequences (nr) database using BLASTP [86]. The best hit of each query sequence was retrieved, and its taxon was recorded. Taxonomic classification of the assembled MAGs was performed with GTDB-Tk v0.3.2 (the script classify wf was used) [87] using the Genome Taxonomy Database (GTDB) [88].

Data analysis and visualization via bioinformatics tools
The geographical distribution of sampling locations was constructed by Ocean Data View [89]. DMS/DMSP-related protein homologs retrieved from these marine metagenomes/metatranscriptomes were analysed and visualized using the R software package [90] with the following descriptions. Relative abundance and phylogenetic diversities of DMS/DMSP cycling-related genes in polar metagenomic samples were visualized using the 'gplots' and the 'ggplot2' package [91], respectively. The Sankey diagram of the taxonomic profiling of DMS/DMSP cycling-related genes was built using the 'ggalluvial' package [92]. For principal coordinates analysis (PCoA), gross relative abundance in each metagenomic sample was normalized to 1, and Bray-Curtis distances were generated using the 'vegan' packages [93], based on the percentages of DMS/DMSP-related genes. Redundancy analysis (RDA) was performed based on the relative abundance of DMS/DMSP-related genes using the 'vegan' package. Geographical distance was generated using the 'geosphere' package (https://cran.rproject.org/web/packages/geosphere/index.html).
The relationship between Bray-Curtis dissimilarity of microbial communities [94] involved in DMSP/DMS cycling and geographic distance or water depth were analysed using the Mantel test. Alpha-diversity analysis was performed on polar microbiota involved in DMS/DMSP cycling. Shannon and Simpson index was calculated  using the 'vegan' package and plotted via Origin 2018 (https://www.originlab.com/). The average abundance of DMS/DMSP-related genes in metagenomic and metatranscriptomic samples from polar and non-polar oceans was used for Pearson correlation analysis. Pearson correlation coefficients and P values were calculated using 'ggcorrplot' packages [91]. Data processing was performed via scripts compiled in Python code. All graphs were combined via Adobe Illustrator CS5.

Results
Curation of the environmental sequences obtained from polar and non-polar oceans and abundance of genes involved in DMS/DMSP cycling Wherever feasible, we built hidden Markov models (HMM) for each protein involved in DMSP/DMS cycling using ratified sequences obtained from literature (Table 1). These HMM models were then used to search the polar metagenomes and metagenomes/metatranscriptomes from the Tara Ocean datasets (Table 3). Homologs of all currently known bacterial enzymes in DMS/DMSP cycling (Table 1) were found in the Arctic and Antarctic seawater samples (Fig. 3a) although majority of the samples were dominated by five putative enzymes, i.e., DddD, DddP, DddK, DmdA, and Tmm. Most of these putative enzymes involved in DMSP/DMS cycling exhibited wide geographical distributions, several of which (e.g., DddD, DddP, DmdA, Tmm) were detected in all 60 polar ocean samples (Fig. 3a, Table S1).
To evaluate the validity of our approach, we used three complementary methods to curate these sequences. First, we analysed predicted hits for the occurrence of conserved amino acid resides involved in substrate coordination and catalysis guided by biochemical data and/ or available protein structures ( Figure S1). Our analyses suggest that the HMM model can successfully retrieve environmental sequences that largely retained the conserved sites necessary for performing corresponding enzyme activity ( Figure S1). This is supported by further phylogenetic analyses performed for the top five most abundant genes in our datasets (i.e., DddD, DddP, DddK, DmdA, and Tmm), showing that the majority of the predicted hits are affiliated with ratified enzymes ( Figure  S2). To validate the function of these predicted proteins, we then randomly selected 9 environmental sequences from the aforementioned five protein groups and tested their corresponding enzyme activities using purified proteins from recombinant E. coli. Indeed, these proteins retrieved from environmental samples were functional (Table S6). Taken together, our approach appears capable of retrieving bona fide sequences involved in DMS/ DMSP cycling from these polar and no-polar marine omics datasets. In contrast to proteins involved in DMSP catabolism, bacterial DMSP biosynthesis pathway (e.g., dsyB, mmtN) did not appear to be prevalent in these polar samples (Table S1). In contrast, the DmdA-mediated DMSP demethylation pathway was more prevalent, consistent with previous reports of high abundance of DmdA from other oceans [37,62,95]. The DMSP cleavage pathway was also numerically abundant in polar oceans, and DddD, DddP, and DddK were more frequently observed than DddW/DddQ/DddL/DddY. Moreover, the potential genes involved in the transformation between DMS and DMSO were more abundant than those between DMS and MeSH (Fig. 3a). To compare the geographic distribution of DMS/DMSP cycling between the polar and non-polar oceans, the 174 non-polar metagenome samples and the 151 metatranscriptome samples from the Tara Oceans project were analysed. Among the 16 proteins analysed, DmdA, DddD, and DddP were also the most abundant genes involved in DMS/DMSP cycling in non-polar metagenomic samples (Fig. 3b). DMS/DMSP cycling in non-polar oceans appears to be primarily driven by the DMSP demethylation pathway (DmdA), and DddD and DddP mediated DMSP cleavage pathways (Fig. 3b). In addition, the relative abundance of potential transcripts involved in DMS/DMSP cycling in non-polar and polar metatranscriptomic samples were significantly correlated with the relative abundance of potential genes in non-polar (Pearson correlation coefficient = 0.84, P value < 0.0001) and polar metagenomic samples (Pearson correlation coefficient = 0.94, P value < 0.0001), respectively (Fig. 3).

Geographic distribution traits of DMS/DMSP cycling in polar and non-polar oceans
In the metagenomic samples from the Arctic Ocean, the average relative abundance of DMS/DMSP cyclingrelated genes in surface waters was higher than that in deep waters. However, the opposite appears to hold true in the Southern Ocean metagenomic samples (Fig. 4a,  Table S1), which may be explained by the so-called 'high nutrient, low chlorophyll' paradox likely caused by iron limitation in the surface layer of the Southern Ocean [96,97]. In addition, it is noticeable that a high relative abundance of DMS/DMSP cycling-related genes, especially DMSP lyases, was found in deep seawaters over 3000 m (Fig. 4a, Table S1), implying an important role of DMS/DMSP cycling in deep ocean sulfur cycle.
To determine the distribution characteristics of DMS/ DMSP-related genes in polar and non-polar oceans, principal coordinates analysis (PCoA) and redundancy analysis (RDA) were performed. These metagenomic samples were broadly grouped into three independent coordinates: polar surface waters, Tara surface waters, and deep waters (Fig. 4b). Polar and non-polar surface waters were less similar from their gene abundance. In contrast, deep waters in polar and non-polar oceans were more similar and displayed different distribution patterns compared with the surface waters (Fig. 4c, d). Hence, the distributions of DMS/DMSP-related genes were clustered primarily based on water depth rather than geographic distance.
Further RDA analysis demonstrated that the divergence of the ordinations is mostly driven by the differences of relative abundance of certain genes in DMS/ DMSP cycling in surface and deep waters (Table S7). DddK was relatively more prevalent in polar surface waters, while DddD and DddP were more common in polar deep waters (Fig. 4a, e). In non-polar oceans, DmdA and DddK were the principal elements that influenced the distribution traits of surface DMS/DMSP cycling, whereas DddD and DdhA were more influential in deep waters (Fig. 4f). The high relative abundance (Fig. 4a) and wide distribution (Fig. 4e, f)  waters were consistent with the fact that it is primarily originated from the SAR11 clade (Pelagibacterales) which is numerically dominant in the surface ocean [43,95], and the broad dispersion of DddD in deep waters suggests its importance in DMS/DMSP cycling in deep waters.

Phylogenetic diversity of DMS/DMSP cycling-related genes in polar oceans
To reveal the taxonomic diversity of DMS/DMSP cycling-related proteins in polar oceans, 17,189 protein sequences from polar oceans obtained through our pipeline were aligned against the NCBI-nr database, and the taxon of each best hit with the highest accuracy to species level was extracted. Thirty phyla (26 phyla from Bacteria domain, 2 phyla from Eukaryota domain and 2 phyla from Archaea domain) spanning over 38 classes, 72 orders, and 107 families were involved in polar DMS/ DMSP cycling (Table S8). Among the phyla affiliated to Bacteria, Proteobacteria accounted for 84% of the total sequences, of which the dominant classes were Alphaproteobacteria (58%) and Gammaproteobacteria (23%) (Fig. 5a, b). Sequences of DddY, DsoB, DdhA, and DmoA were dominated by Gammaproteobacteria whereas the other 12 proteins were mainly affiliated with Alphaproteobacteria (Fig. 5b). In Alphaproteobacteria (9917 sequences), the Pelagibacterales (5016 sequences) were the most abundant (Fig. 5c), in which members of DmdA, DddK, and Tmm made great contributions. Indeed, Alphaproteobacteria participated in all 7 DMS/ DMSP cycling pathways (Fig. 5b), in which Pelagibacterales were involved in 5 pathways (i.e., DsyB, DddD/ DddK/DddP/DddQ, DmdA, DMSOR, and Tmm) indicating their role as generalists in DMS/DMSP cycling. Regardless of the abundance of the potential genes, DddD, DddP, and MddA exhibited high phylogenetic diversities (Fig. 5d). In contrast, MmtN (100% from Sphingomonadales), DddW (100% from Rhodobacterales), DddY (100% from Alteromonadales), and DddK (99% from Pelagibacterales) were highly conserved at the order level (Table S8). Similarly, the biogeographic patterns of DMS/DMSP cycling in polar oceans were and Spearman's correlation P value (P) were indicated mainly driven by water depth (Fig. 5e) rather than geographical distance. In addition, the dissimilarity of community composition of DMS/DMSP-related genes among polar seawater samples was in line with a depth-decay relationship (Fig. 5f) instead of a distance-decay relationship ( Figure S3). Thus, environmental conditions were likely more important than dispersal limitation in determining community composition of DMS/DMSP-related genes.

DMS/DMSP cycling traits in MAGs obtained from polar oceans
In the majority of the metagenomic samples from both polar and non-polar oceans, the cumulative relative abundance of DMS/DMSP-related genes exceeded 1 (Fig. 3a, b), suggesting that some bacteria may harbour more than one key gene in one or more DMS/DMSP metabolic pathways. We thus carried out co-occurrence analyses of key genes involved in DMS/DMSP metabolic pathways using MAGs assembled from these polar ocean metagenomes. Two hundred and fourteen microbial MAGs (> 80% completeness and < 2% potential contamination) belonging to 23 classes (Table S3) were recovered from these 60 polar metagenomes [69]. One hundred and forty-three MAGs affiliated with 15 classes including 70 families (Table S3) were found to contain at least one gene involved in DMS/DMSP cycling (Fig.  6a). Of these 143 MAGs, 63 MAGs had more than one key gene in the DMS/DMSP metabolic pathways. Overall, at the gene level, these MAGs had 13 different genes (as indicated by the nodes) and 28 co-occurrence combinations (as indicated by the edges, Fig. 6b). At the pathway level, the genes in these MAGs contributed to 7 different DMS/ DMSP pathways with 12 co-occurrence combinations (Fig.  6c). According to the biological network analysis, the DMSP demethylation pathway (DmdA) and DMSP cleavage pathway (DddD) maintained the most frequent coexistence relationship (Fig. 6b, c), which also formed a close clustering relationship with genes responsible for the transformation between DMS and DMSO (Fig. 6b, c).
To uncover the co-occurrence of these genes in DMS/ DMSP metabolism, we carried out a comprehensive cooccurrence network analysis of all microbial genomes in  (Table S9). At the gene level, these combinations yielded 50 one-to-one gene configuration modes (Fig. 6d), with DddP being the most frequent enzyme present in these genome-sequenced microbial strains, while DddL being the most connected gene coexisting with other genes involved in DMS/ DMSP metabolism. At the pathway level, 14 different pathway co-occurrence patterns were observed (Fig. 6e). Interestingly, strong co-existence clustering among various DMSP-degradation pathways were observed in both MAGs from polar oceans and microbial genomes from the IMG/M, suggesting marine microbes likely employ multiple routes for DMSP catabolism. However, DMSP cleavage pathway and DMSP biosynthesis pathway showed stronger connection in microbial genomes from the IMG/M than MAGs from polar oceans metagenomes.

Discussion
Here, we investigated bacteria mediated DMS/DMSP cycling in 60 seawater metagenomes and 214 MAGs obtained from polar oceans and compared them with metagenomes and metatranscriptomes from the Tara Ocean datasets. The relative abundance and phylogenetic analyses of these potential genes involved in DMS/ DMSP cycling in polar oceans suggested that there appears to be an intense and integrated DMS/DMSP cycle in polar oceans (Fig. 7). DmdA, DddD, DddP DddK, and Tmm appear to be the dominant genes involved in DMS/DMSP cycling, and Alpha-and Gammaproteobacteria made the largest contributions. Globally, the geographic distribution of DMS/DMSP cycling was significantly influenced by water depth, which may be Fig. 7 The conceptual diagram of bacterial DMS/DMSP metabolism in polar and non-polar oceans based on the analysis of the relative abundance of the potential genes involved in DMS/DMSP cycles. The thickness of the edge represents the relative abundance of the potential genes in each pathway. The arrowheads indicate the flow directions of organic sulfur compounds. Potential genes contributing more than 20% of the total relative abundance in each pathway are shown due to the differences in microbial assemblages caused by environmental selections. Furthermore, the coexistence of DMS/DMSP-related proteins in marine bacterial genomes was not a rare trait in polar oceans. Met is the sulfocompound for the initiation of DMSP biosynthesis [34]. Given the presence of a low abundance of bacterial DMSP biosynthesis genes, DMSP in polar and non-polar oceans may largely be produced by phytoplankton in surface waters [27,48,58,98,99], which can then be transported to the deep ocean [100] through sinking particles.
Based on our analysis of the relative abundance of potential genes, a large proportion of DMSP may act as intermediates, while most of the sulfur from Met may ultimately be channelled into the production of DMS and especially MeSH. Considerable MeSH may thus accumulate in the polar oceans, which certainly warrants further investigation by measuring its in situ concentration in these polar environments. Our hypothesis is indeed supported by the high abundance and active transcription of DmdA in situ in metatranscriptomic samples (Fig. 3). Thus, the produced MeSH may provide a substantial budget for other physiological processes, such as MeSH oxidation to hydrogen sulfide by the MeSH oxidase (MTO) enzyme [74]. MTO was found to be abundant and widely distributed in both metagenomic and especially in metatranscriptomic samples in this study (Table S10).
Similarly, the relative abundance of Tmm and DMSOR in polar oceans suggested that the production DMS and DMSO were likely unbalanced, which may result in DMSO accumulation. Indeed, high concentrations of DMSO have been detected in both polar oceans waters and sea ice [25,27,47,101], where they may act as cryoprotectants, osmoregulants, or cellular anti-oxidants in bacteria to cope with the extreme environments of the polar regions [102]. Besides, Tmm is also responsible for TMA oxidation to trimethylamine N-oxide (TMAO) [20] and the Tmm-mediated DMS oxidation to DMSO is a methylamine-dependent process [32], which suggests the presence of an inter-connected nitrogen-sulfur cycle through Tmm-mediated DMS oxidation.
Overall, the relative abundance of genes involved in DMS/DMSP cycling in polar oceans appears to be higher than that in non-polar oceans (Fig. 6). Interestingly, this corroborates with the fact that higher concentrations of DMS/DMSP were recorded at poles ( Table 2) and turnover of DMS/DMSP at poles also appeared faster [26,27] according to previous studies. Our results suggested that the dissimilarity of biogeographic traits of DMS/DMSP cycling was barely affected by dispersal limitation [103]. Instead, the similarities of environment conditions (i.e., illumination, temperature and salinity) at the same water layers may play a leading role [104]. The biogeographic traits tended to be more similarity in polar oceans which is consistent with bipolar distribution of marine bacteria [105,106]. It is intriguing that biogeographic pattern of genes involved in DMS/DMSP cycling appears more similar in deep waters than surface waters. This may be due to the long-term stability and connectivity of deep waters [105]. However, there is still divergence between polar and non-polar surface waters, where the microbial communities suffered from shortterm changing environmental conditions (e.g., changes in illumination and weather), consistent with the ecological theory that states 'Everything is everywhere but the environment selects' [107]. Future work on standing concentrations and turnover rates of these organic sulfurs and their response to environmental changes may shed new light on our understanding of their cycling in a changing climate.

Conclusions
Overall, this study provides a global overview of the biogeographic traits of known bacterial genes involved in DMS/DMSP cycling from the Arctic and Antarctic oceans, laying a solid foundation for further studies of DMS/DMSP cycling in polar ocean microbiome at the enzymatic, metabolic, and processual levels.
Additional file 1: Figure S1. Analysis of conserved amino acid residues involved in substrate binding and catalysis of DddK (a), DddQ (b), DddY (c), Tmm (d), DddP (e), DmdA (f) and DMSOR (g) retrieved from polar metagenomic samples. Figure S2. Maximum likelihood trees of the predicted hits of the top five most abundant genes (DddP, DddK, DddK, DmdA, Tmm) involved in DMSP/DMS cycling which were retrieved from the polar metagenomes, Tara metagenomes/metatranscriptomes datasets. Figure S3. Correlation between dissimilarity of DMS/DMSP related bacterial community and water depth in polar oceans.
Additional file 2: Table S1. Raw abundances of DMS/DMSP related genes in Arctic and Antarctic seawater samples. Table S2. A list of enzymes selected as outgroups for phylogenetic analyses. Table S3. Raw abundances and taxonomic composition of DMS/DMSP related genes in MAGs. Table S4. Raw abundances of DMS/DMSP related genes in Tara Ocean samples. Table S5. Raw abundances of DMS/DMSP related genes in metatranscriptomes. Table S6. The predicted hits with enzymatic activity. Table S7. Biplot scores for constraining variables. Table S8. Taxonomy composition of DMS/DMSP related genes in polar ocean samples. Table S9. Occurrence of DMS/DMSP related genes in genomes from IMG/M database. Table S10. Raw abundances of MTO in metagenomic and metatranscriptomic samples.