Skip to main content

Genomic and metatranscriptomic analyses of carbon remineralization in an Antarctic polynya

A Correction to this article was published on 11 March 2019

This article has been updated



Polynyas in the Southern Ocean are regions of intense primary production, mainly by Phaeocystis antarctica. Carbon fixed by phytoplankton in the water column is transferred to higher trophic levels, and finally, to the deep ocean. However, in the Amundsen Sea, most of this organic carbon does not reach the sediment but is degraded in the water column due to high bacterial heterotrophic activity.


We reconstructed 12 key bacterial genomes from different phases of bloom and analyzed the expression of genes involved in organic carbon remineralization. A high correlation of gene expression between the peak and decline phases was observed in an individual genome bin-based pairwise comparison of gene expression. Polaribacter belonging to Bacteroidetes was found to be dominant in the peak phase, and its transcriptional activity was high (48.9% of the total mRNA reads). Two dominant Polaribacter bins had the potential to utilize major polymers in P. antarctica, chrysolaminarin and xylan, with a distinct set of glycosyl hydrolases. In the decline phase, Gammaproteobacteria (Ant4D3, SUP05, and SAR92), with the potential to utilize low molecular weight-dissolved organic matter (LMW-DOM) including compatible solutes, was increased. The versatility of Gammaproteobacteria may contribute to their abundance in organic carbon-rich polynya waters, while the SAR11 clade was found to be predominant in the sea ice-covered oligotrophic ocean. SAR92 clade showed transcriptional activity for utilization of both polysaccharides and LMW-DOM; this may account for their abundance both in the peak and decline phases. Ant4D3 clade was dominant in all phases of the polynya bloom, implicating the crucial roles of this clade in LMW-DOM remineralization in the Antarctic polynyas.


Genomic reconstruction and in situ gene expression analyses revealed the unique metabolic potential of dominant bacteria of the Antarctic polynya at a finer taxonomic level. The information can be used to predict temporal community succession linked to the availability of substrates derived from the P. antarctica bloom. Global warming has resulted in compositional changes in phytoplankton from P. antarctica to diatoms, and thus, repeated parallel studies in various polynyas are required to predict global warming-related changes in carbon remineralization.


Large polynyas developed in the Antarctic coast during the austral summer are ecological hotspots due to extremely productive phytoplankton bloom and intense biogeochemical cycling. In the spring, melting glaciers and sea ice stratify the water column and increase levels of light and nutrients for stimulating the growth of phytoplankton, especially Phaeocystis antarctica [1, 2]. Phaeocystis is one of the few microalgae that benefits strongly from eutrophication, resulting in almost uni-algal plankton blooms [3, 4]. This alga has a polymorphic life cycle (free-living cells and colonies composed of thousands of cells embedded in a polysaccharidic matrix). Organic matter is released from phytoplankton upon lysis by zooplankton grazing, virus infection, or senescence [5]. In fact, the highest chromophoric dissolved organic matter (DOM) concentrations are frequently observed near the peak of the phytoplankton bloom within polynyas of the Amundsen Sea (the ASPIRE cruise in 2010–2011) [6].

Carbon fixed by phytoplankton in the water column is transferred to higher trophic levels, and finally, is exported from the system to the deep ocean [7]. However, intriguingly, in the Amundsen Sea dominated by P. antarctica, most of the organic carbon from primary production does not reach the sediment; instead, it is actively recycled and mineralized in the water column via microbial loop [8,9,10,11]. Indeed, higher heterotrophic bacterial production, accounting for ca. 20% of the organic carbon produced by primary production, has been observed in the surface mixed layer of the polynya [9]. These reports indicate that the Amundsen Sea polynya (ASP) is highly inefficient at transporting carbon to the deep sea via biological pump.

Increases in the pCO2 concentration and water column stratification [12] by global warming may have shifted the phytoplankton community structure from P. antarctica to diatoms in the Southern Ocean [12,13,14]. Diatoms possess relatively faster sinking rate [15, 16]. This makes export flux of organic carbon formed by diatom blooms two times faster than that by Phaeocystis blooms [16], which dramatically increases the contribution of diatoms to total export below the photic zone [17].

Bacterial assemblages may be responsible for the enhanced assimilation and remineralization of organic matter from phytoplankton blooms in polynyas. Associations of bacteria with phytoplankton blooms have been reported in polar oceans based on conventional clone library analyses [18]. Using next-generation sequencing approaches with enhanced sequencing depth [19,20,21], distinct prokaryotic assemblages have been observed at the peak of the P. antarctica-dominated bloom in ASP [21, 22]. However, logistical constraints and safety concerns have limited the investigation of the temporal succession of microbial-bloom communities in Antarctic waters—particularly in the decline period (late February) following the peak in phytoplankton activity.

Furthermore, although bacterial community composition in various ocean provinces has been described, ecophysiological and biochemical properties of key bacteria are still hard to predict owing to their low culturability in artificial media. Since free-living bacterioplankton plays a pivotal role in the mineralization of algae-derived organic matter, free-living bacterioplankton was frequently used to explain bacterioplankton succession dynamics during the bloom [23]. Genomic reconstruction techniques from metagenomic sequences can be used to characterize uncultivated lineages of microorganisms. Physiological properties and biochemical pathways can be predicted using genomes reconstructed by de novo assembly and binning from random shotgun genomic libraries that are obtained directly from environmental samples [24,25,26]. Further, environmental mRNA sequence reads can be mapped to metabolic pathway genes of reconstructed genome bins to evaluate in situ activities [27]. In the present study, nucleic acids harvested from water samples gathered during three cruises were used to reconstruct genomes of key heterotrophic bacteria from a P. antarctica-dominant ASP. We also analyzed the expression of genes in the reconstructed genomes using metatranscriptomics. A comparative analysis of samples from different phases of the bloom (peak and decline) and from sea ice-covered ocean provides fundamental insights into the ecology of abundant bacteria associated with phytoplankton blooms in these Antarctic polynyas.


Sampling and geochemical analysis

During three cruises, four water samples from the Amundsen Sea (two each from the peak and decline phases) were gathered from the center of a polynya. The average bloom termination date is February 23 (± 5.38 days) [12]. Satellite-based surface chlorophyll data was obtained from the GlobColour webpage (, which has been constructed from multi ocean color sensors. In addition, a control ocean water sample was gathered from below sea ice by ice core drilling (Additional file 1: Table S1). A rosette water sampler equipped with ten Niskin bottles (10 L) and a conductivity-temperature-depth unit (sensors for Chl-a fluorescence, pressure, temperature, and salinity) was deployed. The concentrations of Chl-a and inorganic nutrients (nitrate + nitrite, ammonium, phosphate, and silicate) were measured aboard, as described by Kim et al. [1] Briefly, the concentration of Chl-a extracted in 90% acetone was measured using a Trilogy fluorometer (Turner Designs, San Jose, CA, USA). The inorganic nutrients were analyzed using a QuAAtro Auto Analyzer (SEAL Analytical, Southampton, UK). Bacterial 16S rRNA gene copies were evaluated by real-time quantitative PCR (qPCR) as described by Kim et al. [21] For qPCR, the Bac518F and Bac786R primer set was used. To evaluate the phytoplankton composition, sampled water was immediately fixed using glutaraldehyde. Filter-harvested phytoplankton cells were stained with a staining solution, and the composition was manually checked using a microscope. The detailed methods are described in Additional file 2: Supplementary methods according to Lee et al. [28]. The depth of the samples was set to the maximum chlorophyll layer.

Free-living bacterioplankton cells were harvested by sequential filtration of 10 L of water through a 1.2-μm pore-sized filter and 0.22-μm pore-sized filter (Supor polyethersulfone, Pall Life Sciences, Ann Arbor, MI, USA). Seawater samples were collected for approximately 1–2 h around midday during each sampling period. Within 30 min of collection, samples were filtered and maintained at under 10 °C. The filters were immediately stored at − 80 °C until further processing in the laboratory.

Nucleic acid extraction and pyrosequencing

The total nucleic acids were extracted from cells collected on filters with a 0.22-μm pore size using a previously described protocol [29]. Briefly, harvested cells in the filter were grazed with quartz and liquid nitrogen. Grazed cells were lysed with CTAB buffer, and nucleic acids were separated using chloroform. Ethanol precipitation was used to purify and concentrate total nucleic acids. DNA and RNA were separated using the AllPrep DNA/RNA Mini Kit (Qiagen, Valencia, CA, USA) following the manufacturer’s protocols.

For the community analysis, the V1 to V3 regions of the 16S rRNA gene were amplified with the barcoded primer set V1-9F (5′-GAGTTTGATCMTGGCTCAG-3′) and V3-541R (5′-WTTACCGCGGCTGCTGG-3′) [30]. PCR was conducted using the following conditions: 94 °C for 5 min, 30 cycles of denaturation at 94 °C for 30 s, annealing at 55 °C for 45 s, elongation at 72 °C for 60 s, and 72 °C for 5 min for the final extension. The amplified PCR products were purified using the PCR Purification Kit (Cosmogentech, Seoul, Republic of Korea). PCR products for each sample were evenly mixed (0.4 μg per sample), quantified using the NanoDrop Spectrophotometer (Wilmington, DE, USA), and then sequenced by ChunLab Inc. (Seoul, Republic of Korea) using a Roche/454 GS FLX Titanium sequencer. Pyrosequencing data were processed as follows. Chimeric sequences were removed using UCHIME (v. 4.2) [31]. OTUs at the 97% similarity level were obtained using uclust [31]. A taxonomic classification was assigned to each read using Greengenes as a reference database [32]. These steps were performed using the script in QIIME (ver. 1.9.1). Unassigned reads that showed no match to sequences in the Greengenes database were manually reassigned against NCBI Microbial 16S rRNA gene database using BLASTn (cutoff, e value 1 × E−5). Diversity and species richness indices were calculated using the script with 5456 reads subsampled from each sample. Neighbor-joining trees were constructed using representative sequences of each OTU selected by QIIME. These sequences were compared against the NCBI nucleotide database and the EzBioCloud database to select the nearest neighbors. Selected sequences were aligned and edited using BioEdit, and a tree was reconstructed using MEGA 7.0 [33] with the Kimura 2-parameter model with 1000 bootstrap replicates. Additionally, previous pyrosequencing data obtained during the 2010/2011 cruise [21] were reanalyzed using QIIME to compare bacterial communities.

Statistical analysis of environmental factors and the microbial community

To identify the relationships between major bacterial groups and the environmental parameters, a non-metric multidimensional scaling (NMDS) analysis was performed using R with the vegan package (v. 2.4).

Metagenome sequencing, assembly, and annotation

Three genomic DNAs obtained above (marked in Additional file 1: Table S1; where PK indicates the peak phase of the bloom, DC indicates the declining phase, and SI indicates the ocean sample under sea ice) were used to construct Illumina sequencing libraries with target insert sizes of 300 and 5000 bp using the TruSeq PCR Free Kit and Nextera Mate Pair Kit, respectively. Fragments of libraries were used for paired-end and mate-pair sequencing using the Illumina Hiseq2000 according to the manufacturer’s instructions at Macrogen (Seoul, Republic of Korea). Raw reads were trimmed using Sickle with default settings ( Each paired-end read set was assembled using the IDBA-UD (v. 1.1.1) assembler since it provided the best results. The quality of assembly was determined by using N50. Default settings in IDBA_UD were used since those are normally suitable for most metagenome datasets ( [34].Scaffolding was performed by SSPACE (v. 2.1) using preassembled contigs from each sample with mate-pair reads [35]. For taxonomic classification of metagenome and metatranscriptome reads, Centrifuge (v. 1.0.4) was used against NT database (supported by Centrifuge) [36].

For scaffolds of ≥ 1 kb, putative genes were predicted using MetaGeneAnnotator [37]. Functions of the protein sequences were predicted using BLASTp similarity searches against the NCBI NR database, KEGG, COG, and TIGR (e value cutoff; 1 × E−5). The rRNAs and tRNAs were identified using RNAmmer [38] and tRNAscan-SE [39], respectively. Domain information was obtained using the Pfam database (, default parameters). TonB-dependent transporters (TBDTs) were classified as described by Tang et al. [40]. For the classification of ABC transporters and peptidases, putative genes were compared against the TCDB [41] and MEROPS databases [42] using BLASTp (e value cutoff; 1 × E−5), respectively. Signal peptides were predicted for the determination of extracellular proteins using SignalP version 4.1 [43].

Binning and phylogenetic analysis

The binning was performed with SI metagenomic scaffolds using the differential coverage plotting method in combination with the tetranucleotide frequency, as described by Albertsen et al. [44] based on R script ( Bowtie2 (v. 2.1.0) and SAMtools (v. 1.9) were used to calculate scaffold coverages [45, 46]. The coverage value was calculated according to Lander and Waterman [47] as LN/G, where L is the read length, N is the number of mapped reads, and G is the scaffold length (excluding Ns). After binning, unmatched scaffolds to the bin were manually removed according to a taxonomic analysis of scaffolds. The taxonomy of scaffold was determined based on an assignment of the majority of its genes to a specific taxon using BLASTp against the NR database. To estimate the genome completeness of reconstructed bins, CheckM (v. 1.0.5) was used [48]. Using criteria (completeness and contamination) evaluated using CheckM, final bins were selected for further analysis. For the taxonomic classification of genomes, rRNA (rpoB; in case of GM5, recA was used because this bin lacked the rpoB gene) was used. Single marker genes were analyzed using BLASTp against the NR database. Hits for 16S rRNA genes were extracted from the total reads using BLASTn (alignment length ≥ 100 and e value ≤ 0.001) for comparisons of pyrosequencing data [49].

Metatranscriptomic analysis

The TruSeq RNA Kit was used to generate the cDNA template library from extracted RNA, and transcript reads were obtained by Illumina paired-end Hiseq2000 sequencing according to the manufacturer’s instructions at Macrogen. Raw transcript reads were preprocessed using the FASTX-Toolkit (v. 0.0.14) for quality filtering (minimum quality score, 20; minimum percent of bases that must have a quality score exceeding 20, 90; To remove rRNA reads, reads with matches in the SILVA database (SSU and LSU ver. 127) [50] and 5S rRNA database [51] were removed using Bowtie2 and SAMtools. Reads mapped to eukaryotic and viral genomes were removed using Centrifuge (v. 1.0.4). For the transcriptomic analysis, predicted bacterial scaffolds among assembled scaffolds exceeding 1 kb obtained from SI were also used as a custom nucleotide database. Non-rRNA cDNA reads from three samples were mapped against the predicted gene of the custom nucleotide database using Bowtie2 (under default parameters), and assigned reads were counted using SAMtools. Unmapped reads were aligned against UniRef90 using DIAMOND blastx search (query cover 90, e value 1, according to HUMAnN2) [52, 53]. Finally, mapped reads to metagenome and UniRef90 were used for mRNA normalization. Expression levels of individual genes were obtained using TPM value (transcripts per kilobase million).

Correlation between DNA-TPM and mRNA-TPM

The normality of the population distribution was evaluated using the Shapiro–Wilk test for samples of mRNA-TPM. As the data were not normally distributed, the Spearman rank correlation was determined using R (vegan package, v. 2.4) to determine correlations between PK-mRNA-TPM:DC-mRNA-TPM for each sample.


Oceanographic properties

Samples were obtained from phytoplankton blooms of ASP during three cruises (2010–2011, 2011–2012, and 2013–2014) in austral summer. During the 2010–2011 and 2013–2014 cruises, samples from the peak phase of the bloom were obtained. As indicated in Additional file 3: Figure S1, the expedition in 2011–2012 was scheduled to collect two samples in the decline phase of the phytoplankton bloom (Feb. 14 and 29 in 2012). A sample of sea ice-covered ocean water was included for comparison. P. antarctica was the predominant primary producer at the polynya stations, whereas diatoms predominated at the sea ice station (Table 1).

Table 1 Description of the location and oceanographic properties of sampling sites

Bacterial community composition and dynamics

Based on an analysis of 16S rRNA gene sequences, alpha diversity indices in the decline phase were greater than those during the peak phase of the bloom and in the sea ice-covered ocean (Additional file 1: Table S1). Bacteroidetes and Proteobacteria were the most abundant phyla in all samples (Additional file 1: Table S1). Reads of 16S rRNA gene sequences of Polaribacter, the gammaproteobacterium Ant4D3 (Oceanospirillaceae), and Pelagibacter were abundant at ASP. Polaribacter sequences were particularly dominant (37.4–50.5% of total sequences) in the peak phase of the bloom. Despite differences between bloom phases, the major OTUs assigned to Polaribacter, SAR92, Ant4D3, and SAR11 were nearly identical (Additional file 3: Figure S2). The taxonomic distributions are described in detail in Additional file 1: Table S1.

A NMDS analysis of bacterial communities showed that the communities at the peak phase clustered with each other but were distinct from those at the decline phase or in the sea ice-covered ocean, although they were collected in different years (Additional file 3: Figure S3). The bacterial communities in the two different decline-phase samples clustered strongly together, despite differences in various factors, such as P. antarctica abundance, total algal carbon, and total bacterial abundance. The bacterial communities in the sea ice-covered oceans were variable but were distinct from those in polynya samples. The major environmental parameters influencing bacterial community structure were Chl-a and nitrite + nitrate concentration and depth, and the samples of peak phase were positively related to the Chl-a concentration.

Genome reconstruction

To determine the metabolic potential and activities of key bacteria in the different bloom phases, samples at the peak phase (2013–2014; PK) and the decline phase (first sample in 2011–2012; DC) as well as a sample from the sea ice-covered ocean (SI) were selected for genomic reconstruction and transcriptomic analyses. Using the coverage plots of scaffolds assembled from raw sequence reads, genome binning was performed (Additional file 3: Figure S4). Clusters of scaffolds were selected with a threshold of > 100× average coverage in at least one dimension combined with the tetranucleotide frequency (Additional file 3: Figure S4). For genome reconstruction, 12 bins were used. A CheckM analysis indicated that the 12 reconstructed genomes ranged from partially to nearly complete (between 53.9 and 93.6%, except for BC5 (18%)). Using typical phylogenetic marker genes (Additional file 4: Table S2), the phylogenetic positions and contamination were evaluated for the 12 genomes, as shown in Table 2 (see Additional file 1: Table S1). Overall, representative bacterial genomes of dominant clades in the analysis of 16S rRNA gene could be obtained.

Table 2 Features of the twelve reconstructed genomes (genome bins) identified by an analysis of metagenomic sequences derived from DNA extracted from Antarctic waters

Rank abundances of reconstructed genomes

The coverage of metagenome-assembled genomes can be used to estimate genome abundance at each phase of the bloom (Fig. 1). As expected from the community composition analysis (Additional file 1: Table S1), DNA read coverages of genomes of Polaribacter (BC1_Pol and BC5_Pol) were highest at the peak bloom (ca. 2400× and 800×, respectively), indicating that BC1_Pol and BC5_Pol are the key ecotypes of Polaribacter for understanding carbon remineralization in the peak phase (Fig. 1a). The coverage values for BC1_Pol and BC5_Pol diminished during the transition to the decline phases of the bloom, while the coverages of other Bacteroidetes, such as BC2, BC3, and BC4, increased. In Gammaproteobacteria, the abundance of GM1_Ant and GM4_SAR92 decreased from the peak to the decline phase, while the abundance of GM2_Ant and GM3 increased in the decline phase. Eventually, most of the genomes belonging to Bacteroidetes, Gammaproteobacteria, and Alphaproteobacteria had comparable coverages from 100× to 270× in the decline phase (except GM5 and GM6_SUP05) (Fig. 1b). The highest coverage for AL1_Pel was observed in the sea ice-covered ocean sample (750×, Fig. 1c), with much lower coverages during the peak and decline phases (116× and 163×, respectively). To examine possible reasons for the difference in abundance between pyrosequencing and metagenomic reads-based coverage, read mapping was performed using the genomes of Pelagibacter (Pelagibacter ubique HTCC 1062, Pelagibacter sp. HTCC7211, and Pelagibacter sp. IMCC9063) as a database. However, the total mapping read count was lower than those of AL1_Pel (Additional file 1: Table S3). Further, as shown in Additional file 1: Table S4, rRNA gene reads of Alphaproteobacteria (including Pelagibacter) were lower than those in pyrosequencing results. Thus, we assume that the PCR-biased community analysis overestimated Pelagibacter in the Southern Ocean since the same DNA was used for both pyrosequencing and metagenome sequencing [54].

Fig. 1
figure 1

Ranked abundance of selected bins. The coverage values of DNA reads (ac) were determined by the average number of reads that align to scaffolds. DNA reads were used from each sample in PK (a), DC (b), and SI (c). A, BC1_Pol; B, GM1_Ant; C, BC5_Pol; D, GM4_SAR92; E, GM5; F, GM2_Ant; G, AL1_Pel; H, GM6_SUP05; I, BC2; J, BC3; K, BC4; L, GM3

mRNA transcript abundance

mRNA reads from PK, DC, and SI were classified as shown in Additional file 1: Table S5. Among non-rRNA (mRNA) reads, 66.2%, 42.1%, and 63.1% were matched to the total metagenomic scaffolds and uniref90 database for PK, DC, and SI, respectively (Additional file 1: Table S6). In the peak phase, approximately 25.1% and 6.4% of mRNA transcripts were assigned to genes of BC1_Pol and GM1_Ant, respectively, suggesting their high abundances. In SI, 3.4% of total bacterial mRNA reads matched to genes of AL1_Pel.

Pairwise comparisons of the expression levels of all genes (mRNA-TPM:mRNA-TPM) in PK and DC showed a low overall correlation (p = 0.16). However, in an individual genome bin-based pairwise comparison of gene expression between PK and DC, a high correlation of gene expression was observed, with regression slopes ranging from 0.44 to 0.90 (Additional file 3: Figure S5; Additional file 1: Table S7). The strong correlation was further supported by the low average fold changes in transcripts in most of the bins between PK and DC (Additional file 1: Table S8).

Traits overrepresented in each phase

To investigate the specialized metabolic potential for biogeochemical cycling, gene transcripts abundant during each phase of the bloom were analyzed using reconstructed genome-based approaches. Pairwise comparisons of the expression (mRNA-TPM: mRNA-TPM) of genes involved in biogeochemical processes indicated that gene transcripts overrepresented in PK and DC belong to the taxa with the highest abundance in PK and DC in the metagenome, respectively (Additional file 3: Figure S6a). The expression of distinct repertoires of genes for transporters, the glycoside hydrolase (GH) family, and the assimilation of compatible solutes (Additional file 3: Figure S6b–e) were overrepresented at each phase and were further investigated.


Transcripts assigned to putative transporter genes in selected bins were highly represented in our metatranscriptome (2.3%, 1.0%, and 1.8% of total mRNA obtained from PK, DC, and SI, respectively; Fig. 2a). The phylogenetic position of the metagenome-assembled genome corresponded to distinct transporter types (Additional file 1: Table S9). TBDT-like transcripts in BC1_Pol and BC5_Pol bins were prominent (Fig. 2a). Transcripts of putative TBDT genes involved in the uptake of various digested biopolymers (cluster 720 and 3090) were abundant in Polaribacter bins (BC1_Pol and BC5_Pol) (Fig. 2b). Putative genes encoding SusD, a key protein required for glycan binding and uptake by TBDT-mediated transporters [55], were enriched in Polaribacter bins (n = 10 and 5 in BC1_Pol and BC5_Pol, respectively (Table 2; Additional file 1: Table S9) and were highly expressed (Fig. 2a). Genes encoding transporters for vitamin B12 (cluster 973) and thiamin (cluster 180) in BC1_Pol were highly expressed at the peak phase (Fig. 2b). GM1_Ant and GM2_Ant may be potential sources of vitamin B12 (see Additional file 1: Table S10).

Fig. 2
figure 2

Relative gene expression levels based on mRNA-TPM of transporters. a TonB-dependent transporters (TBDT), SusD-family proteins (SusD), ABC (ATP-dependent binding cassette), tripartite ATP-independent periplasmic (TRAP) transporters, and tripartite tricarboxylate transporter (TTT) in selected bins at PK, DC, and SI. Functionally classified TBDT and ABC transporters in b PK (top), c DC (mid), and d SI (bottom). Transcript values of less than 200 TPM value in all bins are not shown

Although GM4_SAR92 belongs to Gammaproteobacteria, putative genes for TBDTs were enriched (Additional file 1: Table S9) and transcripts assigned to the putative genes were abundant. In particular, transcripts assigned to TBDT cluster 427 for the uptake of arabinose, a major component of the saccharide pools of Phaeocystis cells [15], were observed at the peak phase (Fig. 2b). In addition, putative genes for TBDTs of cluster 973 and 410 in GM4_SAR92 involved in the transport of vitamin B12 and various cofactors (especially iron), respectively, were similarly expressed in all three samples.

Putative genes encoding ABC transporters specific for the uptake of amino acids, branched amino acids, peptides, and monosaccharides were abundant in GM1_Ant and GM2_Ant (n = 73 and n = 97, respectively), and transcripts assigned to these putative genes were abundant in the peak phase (Table 2 and Fig. 2b). Transcripts of putative genes for the ABC transporter for glycerol uptake were also prominent in GM2_Ant (Fig. 2b). Further, putative TRAP genes (in particular, for mannitol uptake) were enriched (n = 12 and n = 16, respectively) in GM1_Ant and GM2_Ant, with high recruitment of transcripts in the peak phase (Fig. 2). Glycerol and mannitol are used as osmolytes [56] and might be widely available substrates in polynya blooms. Putative genes encoding mono- and disaccharide, peptide, and glycine betaine (GB) ABC transporters were abundant in GAM6_SUP05 and highly expressed in the decline phase (Fig. 2c). High abundances of transcripts for ABC and TRAP transporters in AL1_Pel for low molecular weight (LMW)-DOM, such as amino acids, taurine, mono- and disaccharides, were observed in the sea ice-covered ocean (Fig. 2d).

Utilization of polysaccharides

Putative genes in the GH family were abundant (n = 27) in BC1_Pol (Table 2), i.e., there were more than four copies of GH16, GH20, and GH92 for the degradation of laminarin, chitin, and mannose-containing polysaccharides, respectively. Many of these putative genes were co-localized with TBDTs and SusD genes in five putative polysaccharide utilization loci (PUL). In contrast to BC1_Pol, BC5_Pol contained six of ten putative sulfatase genes residing in PUL associated with the potential for sulfated polysaccharide assimilation.

Transcripts from putative genes encoding GHs (GH2, GH16, GH17, and GH30) in a PUL for laminarin degradation were abundant in BC1_Pol in the peak phase (Additional file 3: Figure S6c; Fig. 3). A xylan utilization-related PUL containing various putative genes for xylan degradation and assimilation was observed in BC5_Pol (Fig. 4). Among known Polaribacter genomes, the genetic potential for xylan degradation (GH10, endo-1,4-beta-xylanase) was observed only in BC5_Pol. Interestingly, PUL was not observed in BC3, and BC4 bins and the abundances of transcripts assigned to GHs in the BC3 and BC4 bins were very low0 (Additional file 3: Figure S6c). Although PUL was not observed, transcripts from putative genes encoding GHs, such as glucan β-1,4-glucosidase (GH3) and laminarinase (GH16), were detected in GM4_SAR92 (Additional file 3: Figure S7) in addition to the high representation of TBDT genes (Fig. 2).

Fig. 3
figure 3

Representative polysaccharide utilization loci (PUL) related to laminarin utilization in Polaribacter bins and reference Polaribacter genomes. a, b Expression levels of a gene in the PUL of BC1_Pol and BC5_Pol, respectively. Gene expression levels were determined by TPM value. For comparison, c shows similar PULs of Polaribacter spp. including the laminarin-induced PULs in the genomes of Polaribacter sp. Hel1_33_49 and Hel1_85 obtained from a diatom bloom [82]. TBDT, TonB-dependent transporter; SusD, SusD-family protein; CBM4; carbohydrate binding module; GH16, glycoside hydrolase 16 (laminarinase); GH2, glycoside hydrolase 2; GH3, glycoside hydrolase 3; GH17, glycoside hydrolase 17; GH30, glycoside hydrolase 30; Symp, sugar:H+ symporter

Fig. 4
figure 4

A PUL related to xylan utilization in BC5_Pol with expression levels of genes. A cluster for xylose assimilation in Polaribacter sp. Hel1_85 is shown for comparison. Normalization for the estimation of the relative abundance was conducted as described in the legend of Fig. 3. TonB, TonB-dependent transporter; SusD, SusD-family protein; sul, sulfatase; GH10, glycoside hydrolase 10 (endo-1,4-beta-xylanase); GH43, glycoside hydrolase 43 (xylan 1,4 xylosidase); xylP and xylE, xylose:H+ symporter; tran, putative transporter; xylA, xylose isomerase; xylB. xylulose kinase; GH30, glycoside hydrolase 30 (beta-xylosidase); GH29, glycoside hydrolase 29 (alpha-l-fucosidase); GH39, glycoside hydrolase 39 (xylan 1,4 xylosidase)

Assimilation of dimethylpropionate

Putative genes for dimethylpropionate(DMSP) demethylase (dmdA) were widely present and expressed in the major bacteria in the PK, DC, and SI samples, i.e., GM1_Ant and GM4_SAR92 (Additional file 3: Figure S8a). While dmdA of GM1_Ant was highly expressed at PK, that of GM4_SAR92 was highly expressed in all three samples. Putative genes related to the assimilation of 3-methylmercaptopropionate (MMPA) were co-localized with DMSP demethylase in GM1_Ant, GM2_Ant, and GM3 (Additional file 3: Figure S8b). Assimilation of glycine betaine and other compatible solutes is described in the Additional file 2: Supplementary results.


Phytoplankton blooms in Antarctic polynyas are accompanied by various bacterial taxa, as revealed by 16S rRNA gene analyses [21]. In order to investigate bacterial plankton succession dynamics during the bloom, we used samples for two phases, namely the peak and decline of bloom collected over different years. Due to the unique nature of blooms in ASP (i.e., a short period of opening in the sea ice, low temperature, the abundance of P. antarctica-derived organic matter), inter-annual variation in biodiversity may be limited, and thus, bacterioplankton dynamics likely exhibit recurrent patterns. Similarly, repeated patterns have been observed in the North Sea among specific bacterial clades of four consecutive blooms, despite some inter-annual variation in physicochemical conditions [23]. Moreover, the dominance of Polaribacter at the peak of the bloom has been observed evident in previous studies of ASP [21, 57]. The obligate psychrophilic nature of Polaribacter spp. among members of Bacteroidetes isolated from Southern Ocean including ASP [58] may contribute to their adaptation to blooms of ASP [59]. The high ANI value and conserved synteny between fosmid clone Ant4D3 [60] and GM1_Ant, which was dominant in all phases of the polynya bloom (see Additional file 3: Figure S9), support the recurring pattern of the most well-adapted clades in the Southern Ocean.

For adaptation to the polynya blooms, each dominant bacterial clade may harbor distinct metabolic properties. Our genome reconstruction and metatranscriptomic analysis provide relevant information on the major taxa responsible for DOM utilization in the ASP. Polaribacter and SAR92 clades had the potential to utilize HMW-DOC, while the Ant4D3, SAR92, and SUP05 clades had the potential to utilize LMW-DOC. Further fine-scale taxonomic (ecotype) differences could also be observed.

In contrast to diatom-dominated spring bloom in the North Sea, where the expression of GHs most likely allowed diverse members of Bacteroidetes to decompose algal biopolymers [61], Polaribacter was the single predominant clade associated with the degradation of organic matter derived from P. antarctica at the peak bloom in the ASP (Table 1). The direct release of LMW-DOM by growing P. antarctica cells is low [62,63,64], and most DOM from intact P. antarctica cells is composed of polysaccharides. This corresponds with the predominance of novel Polaribacter ecotypes, BC1_Pol and BC5_Pol.

Chrysolaminarin, a β-1,3-glucan, laminarinase (GH16)-specific substrate, is considered one of the most abundant polysaccharides in Phaeocystis blooms [15]. BC1_Pol might be specialized to degrade chrysolaminarin with high expression of a laminarinase (GH16) gene in PUL. Arabinose and xylose are predominant in the carbohydrate composition of Phaeocystis [15, 65], which is a crucial distinction from diatoms [66]. Thus, pentose-containing polysaccharides might be widely available substrates during P. antarctica blooms, and BC5_Pol with the genetic potential for xylan degradation (GH10, endo-1,4-beta-xylanase) may be specialized for the utilization of these polysaccharides. Despite the high average nucleotide identity (ANI 91.8%) between BC1_Pol and BC5_Pol, our results indicate differentiation in the utilization of polysaccharides (e.g., chrysolaminarin and xylan with GH16 and GH10, respectively; Fig. 3 and Fig. 4), which are significant polymers in P. antarctica [15], using a distinct set of GHs. Similar to observations by Teeling et al. [23], frequencies of CAZymes of BC1_Pol and BC5_Pol in metagenomes and metatranscriptomes were significantly decreased as blooms subsided in DC. Detection of transcripts from putative genes encoding GHs in addition to the high representation of TBDT genes in GM4_SAR92 indicates a potential role of GM4_SAR92 in the degradation and assimilation of algae-derived polysaccharides, as previously suggested in a diatom bloom [27], which is a key distinction from other members of Gammaproteobacteria.

Phaeocystis accumulates large amounts of dimethylpropionate (DMSP) (~ 150 mM in cells) as a compatible solute, which can be released from decaying P. antarctica cells [67, 68]. One of the common traits of the abundant Ant4D3 and SAR92 clades in the Amundsen Sea was the metabolic potential of C1 compounds associated with the assimilation of these compatible solutes. DMSP can be converted to a highly reactive volatile sulfur species, methanethiol (MeSH), by DMSP demethylase (DmdA) (see Additional file 3: Figure S8c). MeSH is likely to remain in the ocean surface and is readily used by marine bacteria as a carbon and sulfur source [69]. DMSP lyase (DddD) and CoA transferase (DddD and DddL, − W and − Y) catalyze key biochemical steps in the conversion of DMSP to dimethyl sulfide (DMS), an important climate change gas [70]. Remarkably, representative genes encoding DMSP lyase and CoA transferase were not observed in the total assembled scaffolds. Since significant concentrations of DMS were observed at these study sites [71], DMSP lyase of Phaeocystis [72] or an unknown pathway might be involved in DMS production in the polynya.

Unique LMW-DOM-assimilating bacterial assemblages in the decline phase in the Amundsen Sea are dictated by gene expression in the SUP05, SAR92, and Ant4D3 clades of Gammaproteobacteria (Additional file 3: Figure S6d), while the RCA (Roseobacter clade-affiliated) cluster of Alphaproteobacteria assimilates LMW-DOM in spring algal blooms in other oceans [73,74,75]. A comparative genomic analysis indicated that the metabolic properties of the abundant clades of Gammaproteobacteria largely overlap with those of the RCA cluster, e.g., amino acid and C1 compound (e.g., DMSP) assimilation and cobalamin synthesis (Additional file 5: Table S11). Additionally, the differentiation of the clades of Gammaproteobacteria from the RCA cluster can be observed in the glyoxylate shunt, proteorhodopsin synthesis, CO oxidation, and urea utilization (Additional file 1: Table S10). Isocitrate lyase of GM6_SUP05 and GM1_Ant was highly expressed (ranked 229th and 137th in the expression in PK and DC, respectively (Additional file 1: Table S12). At polynyas with iron limitation, cells with the glyoxylate shunt can use fewer hemeproteins and make fewer electron carriers to decrease oxidative stress [76]. Genes for CODH were only observed in RCA clades for which iron is required as an essential cofactor. Insufficient ATP or proton motive force can be supplemented by proteorhodopsin, which is an abundant gene transcript in the SUP05, SAR92, and Ant4D3 clades in both PK and DC (Additional file 3: Figure S6e). These genomic features may contribute to the dominance of gammaproteobacterial clades in Phaeocystis-dominant Antarctic blooms in which iron is depleted [77]. The possible depletion of iron was supported by the high expression of genes encoding TBDT and ABC transporters involved in iron transport in most of the genomes (Fig. 2).

The polynya is a biogeochemical hotspot where high carbon biomass reached 10 mg C 1−1is sequestered via biological pump [78]. Surprisingly, an exceptionally low amount of fixed carbon is exported to the ASP [8]. The low recalcitrance of the DOM derived from the slowly sinking Phaeocystis is likely to be associated with rapid mineralization of the organic matter in the water column [79, 80]. Laboratory studies have revealed that carbohydrates derived from Phaeocystis were readily degraded by heterotrophic bacteria [81]. Together, P. antarctica-dominated bloom may contribute relatively uniform and readily degradable organic matter, which can be efficiently degraded by a few specialized bacterial clades, contributing to high rates of remineralization of fixed carbon.


In this study, genomic reconstruction and in situ gene expression analyses were used to characterize the metabolic contributions of dominant bacteria native to the ASP to understand carbon remineralization processes. This analysis allowed us to evaluate the major findings of previous studies of the Southern Ocean at a finer taxonomic level. The metabolic potential of dominant bacterial clades of Bacteroidetes and Gammaproteobacteria can be used to predict temporal community succession linked to the availability of substrates derived from phases of the P. antarctica bloom. Global warming has resulted in compositional changes in phytoplankton from Phaeocystis to diatoms in Antarctic polynyas [12,13,14]. This change might be accompanied by bacterioplankton community changes, which may affect carbon remineralization and thus carbon sequestration. Repeated parallel studies of the carbon remineralization potential of bacterioplankton in diatom-dominated polynyas are required to predict global warming-related changes in biogeochemical cycles in polynyas.

Change history

  • 11 March 2019

    Following publication of the original article [1], the authors identified wrong citations in the maintext.


  1. Kim BK, Joo H, Song HJ, Yang EJ, Lee SH, Hahm D, Rhee TS, Lee SH. Large seasonal variation in phytoplankton production in the Amundsen Sea. Polar Biol. 2015;38(3):319–31.

    Article  Google Scholar 

  2. Lee SH, Kim BK, Yun MS, Joo H, Yang EJ, Kim YN, Shin HC, Lee S. Spatial distribution of phytoplankton productivity in the Amundsen Sea, Antarctica. Polar Biol. 2012;35(11):1721–33.

    Article  Google Scholar 

  3. Stefels J, Gieskes WW, Dijkhuizen L. Intriguing functionality of the production and conversion of DMSP in Phaeocystis sp, Biological and environmental chemistry of DMSP and related sulfonium compounds: Springer; 1996. p. 305–15.

  4. Riegman R, Noordeloos AA, Cadée GC. Phaeocystis blooms and eutrophication of the continental coastal zones of the North Sea. Mar Biol. 1992;112(3):479–84.

    Article  Google Scholar 

  5. Schoemann V, Becquevort S, Stefels J, Rousseau V, Lancelot C. Phaeocystis blooms in the global ocean and their controlling mechanisms: a review. J Sea Res. 2005;53(1):43–66.

    Article  CAS  Google Scholar 

  6. Yager PL, Sherrell L, Stammerjohn SE, Alderkamp A-C, Schofield O, Abrahamsen EP, Arrigo KR, Bertilsson S, Garay D, Guerrero R. ASPIRE: the Amundsen Sea Polynya international research expedition. Oceanography. 2012;25(3):40–53.

    Article  Google Scholar 

  7. Henson SA, Sanders R, Madsen E. Global patterns in efficiency of particulate organic carbon export and transfer to the deep ocean. Glob Biogeochem Cy. 2012;26(1):GB1028.

    Article  CAS  Google Scholar 

  8. Ducklow HW, Wilson SE, Post AF, Stammerjohn SE, Erickson M, Lee S, Lowry KE, Sherrell RM, Yager PL. Particle flux on the continental shelf in the Amundsen Sea Polynya and Western Antarctic Peninsula. Elem Sci Anthrepol. 2015;3(1):000046.

    Article  Google Scholar 

  9. Hyun JH, Kim SH, Yang EJ, Choi A, Lee SH. Biomass, production, and control of heterotrophic bacterioplankton during a late phytoplankton bloom in the Amundsen Sea Polynya, Antarctica. Deep Sea Res. 2015;123:102–12.

    Article  Google Scholar 

  10. Rinta-Kanto JM, Sun S, Sharma S, Kiene RP, Moran MA. Bacterial community transcription patterns during a marine phytoplankton bloom. Environ Microbiol. 2012;14(1):228–39.

    Article  CAS  PubMed  Google Scholar 

  11. Lee S, Hwang J, Ducklow HW, Hahm D, Lee SH, Kim D, Hyun JH, Park J, Ha HK, Kim TW. Evidence of minimal carbon sequestration in the productive Amundsen Sea polynya. Geophys Res Lett. 2017;44(15):7892–9.

    Article  CAS  Google Scholar 

  12. Arrigo KR, Lowry KE, van Dijken GL. Annual changes in sea ice and phytoplankton in polynyas of the Amundsen Sea, Antarctica. Deep Sea Res. 2012;71:5–15.

    Article  Google Scholar 

  13. Tortell PD, Payne CD, Li Y, Trimborn S, Rost B, Smith WO, Riesselman C, Dunbar RB, Sedwick P. DiTullio GR: CO2 sensitivity of Southern Ocean phytoplankton. Geophys Res Lett. 2008:35(4).

  14. Arrigo KR, Van Dijken GL. Phytoplankton dynamics within 37 Antarctic coastal polynya systems. J Geophys Res. 2003;108(C8):3271.

    Article  Google Scholar 

  15. Alderkamp A-C, Buma AG, van Rijssel M. The carbohydrates of Phaeocystis and their degradation in the microbial food web. In: Phaeocystis, major link in the biogeochemical cycling of climate-relevant elements: Springer; 2007. p. 99–118.

  16. DeJong HB, Dunbar RB, Koweek DA, Mucciarone DA, Bercovici SK, Hansell DA. Net community production and carbon export during the late summer in the Ross Sea, Antarctica. Global Biogeochem Cycles. 2017;31(3):473–91.

    Article  CAS  Google Scholar 

  17. Reigstad M, Wassmann P. Does Phaeocystis spp. contribute significantly to vertical export of organic carbon? Biogeochemistry. 2007;83(1–3):217–34.

    Article  Google Scholar 

  18. Claire Horner-Devine M, Leibold MA, Smith VH, Bohannan BJ. Bacterial diversity patterns along a gradient of primary productivity. Ecol Lett. 2003;6(7):613–22.

    Article  Google Scholar 

  19. Galand PE, Casamayor EO, Kirchman DL, Potvin M, Lovejoy C. Unique archaeal assemblages in the Arctic Ocean unveiled by massively parallel tag sequencing. ISME J. 2009;3(7):860–9.

    Article  CAS  PubMed  Google Scholar 

  20. Ghiglione JF, Murray AE. Pronounced summer to winter differences and higher wintertime richness in coastal Antarctic marine bacterioplankton. Environ Microbiol. 2012;14(3):617–29.

    Article  CAS  PubMed  Google Scholar 

  21. Kim JG, Park SJ, Quan ZX, Jung MY, Cha IT, Kim SJ, Kim KH, Yang EJ, Kim YN, Lee SH, et al. Unveiling abundance and distribution of planktonic bacteria and archaea in a polynya in Amundsen Sea, Antarctica. Environ Microbiol. 2014;16(6):1566–78.

    Article  PubMed  Google Scholar 

  22. Delmont TO, Eren AM, Vineis JH, Post AF. Genome reconstructions indicate the partitioning of ecological functions inside a phytoplankton bloom in the Amundsen Sea, Antarctica. Front Microbiol. 2015;6:1090.

    PubMed  PubMed Central  Google Scholar 

  23. Teeling H, Fuchs BM, Bennke CM, Kruger K, Chafee M, Kappelmann L, Reintjes G, Waldmann J, Quast C, Glockner FO, et al. Recurring patterns in bacterioplankton dynamics during coastal spring algae blooms. Elife. 2016;5:e11888.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Castelle CJ, Hug LA, Wrighton KC, Thomas BC, Williams KH, Wu D, Tringe SG, Singer SW, Eisen JA, Banfield JF. Extraordinary phylogenetic diversity and metabolic versatility in aquifer sediment. Nat Commun. 2013;4:2120.

    Article  PubMed  Google Scholar 

  25. Seitz KW, Lazar CS, Hinrichs KU, Teske AP, Baker BJ. Genomic reconstruction of a novel, deeply branched sediment archaeal phylum with pathways for acetogenesis and sulfur reduction. ISME J. 2016;10:1696–705.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Wrighton KC, Thomas BC, Sharon I, Miller CS, Castelle CJ, VerBerkmoes NC, Wilkins MJ, Hettich RL, Lipton MS, Williams KH, et al. Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla. Science. 2012;337(6102):1661–5.

    Article  CAS  PubMed  Google Scholar 

  27. Klindworth A, Mann AJ, Huang S, Wichels A, Quast C, Waldmann J, Teeling H, Glockner FO. Diversity and activity of marine bacterioplankton during a diatom bloom in the North Sea assessed by total RNA and pyrotag sequencing. Mar Genomics. 2014;18:185–92.

    Article  PubMed  Google Scholar 

  28. Lee Y, Yang EJ, Park J, Jung J, Kim TW, Lee S. Physical-biological coupling in the Amundsen Sea, Antarctica: influence of physical factors on phytoplankton community structure and biomass. Deep Sea Res I. 2016;117:51–60.

    Article  Google Scholar 

  29. Hurt RA, Qiu X, Wu L, Roh Y, Palumbo AV, Tiedje JM, Zhou J. Simultaneous recovery of RNA and DNA from soils and sediments. Appl Environ Microbiol. 2001;67(10):4495–503.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Hur M, Kim Y, Song HR, Kim JM, Choi YI, Yi H. Effect of genetically modified poplars on soil microbial communities during the phytoremediation of waste mine tailings. Appl Environ Microbiol. 2011;77(21):7611–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Edgar RC, Haas BJ, Clemente JC, Quince C, Knight R. UCHIME improves sensitivity and speed of chimera detection. Bioinformatics. 2011;27(16):2194–200.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Andersen GL. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006;72(7):5069–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Peng Y, Leung HC, Yiu SM, Chin FY. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics. 2012;28(11):1420–8.

    Article  CAS  PubMed  Google Scholar 

  35. Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011;27(4):578–9.

    Article  CAS  PubMed  Google Scholar 

  36. Kim D, Song L, Breitwieser FP, Salzberg SL. Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res. 2016.

  37. Noguchi H, Taniguchi T, Itoh T. MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes. DNA Res. 2008;15(6):387–96.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35(9):3100–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Tang K, Jiao N, Liu K, Zhang Y, Li S. Distribution and functions of TonB-dependent transporters in marine bacteria and environments: implications for dissolved organic matter utilization. PLoS One. 2012;7(7):e41204.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Saier MH Jr, Reddy VS, Tamang DG, Vastermark A. The transporter classification database. Nucleic Acids Res. 2014;42(Database issue):D251–8.

    Article  CAS  PubMed  Google Scholar 

  42. Rawlings ND, Barrett AJ, Finn R. Twenty years of the MEROPS database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2016;44(D1):D343–50.

    Article  CAS  PubMed  Google Scholar 

  43. Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8(10):785–6.

    Article  CAS  PubMed  Google Scholar 

  44. Albertsen M, Hugenholtz P, Skarshewski A, Nielsen KL, Tyson GW, Nielsen PH. Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nat Biotechnol. 2013;31(6):533–8.

    Article  CAS  PubMed  Google Scholar 

  45. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. Genome Project Data Processing S: the sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Lander ES, Waterman MS. Genomic mapping by fingerprinting random clones: a mathematical analysis. Genomics. 1988;2(3):231–9.

    Article  CAS  PubMed  Google Scholar 

  48. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25(7):1043–55.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Kim SJ, Park SJ, Cha IT, Min D, Kim JS, Chung WH, Chae JC, Jeon CO, Rhee SK. Metabolic versatility of toluene-degrading, iron-reducing bacteria in tidal flat sediment, characterized by stable isotope probing-based metagenomic analysis. Environ Microbiol. 2014;16(1):189–204.

    Article  CAS  PubMed  Google Scholar 

  50. Yilmaz P, Parfrey LW, Yarza P, Gerken J, Pruesse E, Quast C, Schweer T, Peplies J, Ludwig W, Glockner FO. The SILVA and “All-species Living Tree Project (LTP)” taxonomic frameworks. Nucleic Acids Res. 2014;42(Database issue):D643–8.

    Article  CAS  PubMed  Google Scholar 

  51. Szymanski M, Barciszewska MZ, Erdmann VA, Barciszewski J. 5S Ribosomal RNA Database. Nucleic Acids Res. 2002;30(1):176–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2014;12(1):59.

    Article  CAS  PubMed  Google Scholar 

  53. Franzosa EA, McIver LJ, Rahnavard G, Thompson LR, Schirmer M, Weingart G, Lipson KS, Knight R, Caporaso JG, Segata N. Species-level functional profiling of metagenomes and metatranscriptomes. Nat Methods. 2018;1.

  54. Klindworth A, Pruesse E, Schweer T, Peplies J, Quast C, Horn M, Glockner FO. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 2013;41(1):e1.

    Article  CAS  PubMed  Google Scholar 

  55. Martens EC, Koropatkin NM, Smith TJ, Gordon JI. Complex glycan catabolism by the human gut microbiota: the Bacteroidetes Sus-like paradigm. J Biol Chem. 2009;284(37):24673–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Yancey PH. Organic osmolytes as compatible, metabolic and counteracting cytoprotectants in high osmolarity and other stresses. J Exp Biol. 2005;208(15):2819–30.

    Article  CAS  PubMed  Google Scholar 

  57. Delmont TO, Hammar KM, Ducklow HW, Yager PL, Post AF. Phaeocystis antarctica blooms strongly influence bacterial community structures in the Amundsen Sea polynya. Front Microbiol. 2014;5:646.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Gosink JJ, Woese CR, Staley JT. Polaribacter gen. nov., with three new species, P. irgensii sp. nov., P. franzmannii sp. nov. and P. filamentus sp. nov., gas vacuolate polar marine bacteria of the Cytophaga-Flavobacterium-Bacteroides group and reclassification of ‘Flectobacillus glomerat’ as Polaribacter glomeratus comb. nov. Int J Syst Bacteriol. 1998;48:223–35.

    Article  PubMed  Google Scholar 

  59. Choi S-B, Kim J-G, Jung M-Y, Kim S-J, Min U-G, Si O-J, Park S-J, Hwang CY, Park J, Lee S. Cultivation and biochemical characterization of heterotrophic bacteria associated with phytoplankton bloom in the Amundsen Sea polynya, Antarctica. Deep Sea Res II. 2016;123:126–34.

    Article  CAS  Google Scholar 

  60. Grzymski JJ, Carter BJ, DeLong EF, Feldman RA, Ghadiri A, Murray AE. Comparative genomics of DNA fragments from six Antarctic marine planktonic bacteria. Appl Environ Microbiol. 2006;72(2):1532–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Teeling H, Fuchs BM, Becher D, Klockow C, Gardebrecht A, Bennke CM, Kassabgy M, Huang S, Mann AJ, Waldmann J, et al. Substrate-controlled succession of marine bacterioplankton populations induced by a phytoplankton bloom. Science. 2012;336(6081):608–11.

    Article  CAS  PubMed  Google Scholar 

  62. Van Boekel W, Hansen F, Riegman R, Bak R. Lysis-induced decline of a Phaeocystis spring bloom and coupling with the microbial foodweb. Mar Ecol Prog Ser. 1992:269–76.

  63. Brussaard C, Kuipers B, Veldhuis M. A mesocosm study of Phaeocystis globosa population dynamics: I. Regulatory role of viruses in bloom control. Harmful Algae. 2005;4(5):859–74.

    Article  Google Scholar 

  64. Brussaard C, Riegman R, Noordeloos A, Cadée G, Witte H, Kop A, Nieuwland G, Van Duyl F, Bak R. Effects of grazing, sedimentation and phytoplankton cell lysis on the structure of a coastal pelagic food web. Mar Ecol Prog Ser. 1995:259–71.

  65. Mann AJ, Hahnke RL, Huang S, Werner J, Xing P, Barbeyron T, Huettel B, Stuber K, Reinhardt R, Harder J, et al. The genome of the alga-associated marine flavobacterium Formosa agariphila KMM 3901T reveals a broad potential for degradation of algal polysaccharides. Appl Environ Microbiol. 2013;79(21):6813–22.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Gugi B, Le Costaouec T, Burel C, Lerouge P, Helbert W, Bardor M. Diatom-specific oligosaccharide and polysaccharide structures help to unravel biosynthetic capabilities in Diatoms. Mar Drugs. 2015;13(9):5993–6018.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  67. Keller MD, Bellows WK, Guillard RR. Dimethyl sulfide production in marine phytoplankton. In: Saltzman E, Copper W, editors. Biogenic sulfur in the environment, vol. 393. Washington, D.C: American Chemical society; 1989. p. 167–82.

    Chapter  Google Scholar 

  68. Stefels J, van Leeuwe MA. Effects of iron and light stress on the biochemical composition of Antarctic Phaeocystis sp.(Prymnesiophyceae). I. Intracellular DMSP concentrations. J Phycol. 1998;34(3):486–95.

    Article  CAS  Google Scholar 

  69. Buchan A, LeCleir GR, Gulvik CA, Gonzalez JM. Master recyclers: features and functions of bacteria associated with phytoplankton blooms. Nat Rev Microbiol. 2014;12(10):686–98.

    Article  CAS  PubMed  Google Scholar 

  70. Curson AR, Todd JD, Sullivan MJ, Johnston AW. Catabolism of dimethylsulphoniopropionate: microorganisms, enzymes and genes. Nat Rev Microbiol. 2011;9(12):849–59.

    Article  CAS  PubMed  Google Scholar 

  71. Tortell PD, Long MC, Payne CD, Alderkamp A-C, Dutrieux P, Arrigo KR. Spatial distribution of pCO2, ΔO2/Ar and dimethylsulfide (DMS) in polynya waters and the sea ice zone of the Amundsen Sea, Antarctica. Deep Sea Res. 2012;71:77–93.

    Article  CAS  Google Scholar 

  72. Stefels J, Dijkhuizen L. Characteristics of DMSP-lyase in Phaeocystis sp. (Prymnesiophyceae). Mar Ecol Prog Ser. 1996;131(1):307–13.

    Article  CAS  Google Scholar 

  73. Janse I, Zwart G, van der Maarel MJ, Gottschal JC. Composition of the bacterial community degrading Phaeocystis mucopolysaccharides in enrichment cultures. Aquat Microb Ecol. 2000;22(2):119–33.

    Article  Google Scholar 

  74. Voget S, Wemheuer B, Brinkhoff T, Vollmers J, Dietrich S, Giebel HA, Beardsley C, Sardemann C, Bakenhus I, Billerbeck S, et al. Adaptation of an abundant Roseobacter RCA organism to pelagic systems revealed by genomic and transcriptomic analyses. ISME J. 2015;9(2):371–84.

    Article  CAS  PubMed  Google Scholar 

  75. Buchan A, Gonzalez JM, Moran MA. Overview of the marine Roseobacter lineage. Appl Environ Microbiol. 2005;71(10):5665–77.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  76. Ahn S, Jung J, Jang IA, Madsen EL, Park W. Role of glyoxylate shunt in oxidative stress response. J Biol Chem. 2016;291(22):11928–38.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  77. Bender SJ, Moran DM, McIlvin MR, Zheng H, McCrow JP, Badger J, DiTullio GR, Allen AE, Saito MA. Iron triggers colony formation in Phaeocystis antarctica: connecting molecular mechanisms with iron biogeochemistry. In: Biogeosciences Discuss; 2018. in review.

    Google Scholar 

  78. Schofield O, Miles T, Alderkamp A-C, Lee S, Haskins C, Rogalsky E, Sipler R, Sherrell RM, Yager PL. In situ phytoplankton distributions in the Amundsen Sea Polynya measured by autonomous gliders. Elem Sci Anth. 2015;3:000073.

    Article  Google Scholar 

  79. Wassmann P. Significance of sedimentation for the termination of Phaeocystis blooms. J Mar Syst. 1994;5(1):81–100.

    Article  Google Scholar 

  80. Rousseau V, Becquevort S, Parent J-Y, Gasparini S, Daro M-H, Tackx M, Lancelot C. Trophic efficiency of the planktonic food web in a coastal ecosystem dominated by Phaeocystis colonies. J Sea Res. 2000;43(3–4):357–72.

    Article  Google Scholar 

  81. Janse I, van Rijssel M, Ottema A, Gottschal JC. Microbial breakdown of Phaeocystis mucopolysaccharides. Limnol Oceanogr. 1999;44(6):1447–57.

    Article  Google Scholar 

  82. Xing P, Hahnke RL, Unfried F, Markert S, Huang S, Barbeyron T, Harder J, Becher D, Schweder T, Glockner FO, et al. Niches of two polysaccharide-degrading Polaribacter isolates from the North Sea during a spring diatom bloom. ISME J. 2015;9(6):1410–22.

    Article  CAS  PubMed  Google Scholar 

Download references


We thank the captain and crew members of the Korean research icebreaker Araon, for their outstanding assistance during the cruises.


This work was supported by National Research Foundation of Korea grants (NRF-2015M3D3A1A01064881, NRF- 2015R1A4A1041869 and NRF-2018R1A2B6008861) funded by the Ministry of Science and ICT and a grant (PE19060) from the Korea Polar Research Institute. S.-J.K was supported by a grant from National Research Foundation of Korea grants (NRF-2016R1C1B1010946).

Availability of data and materials

Pyrosequencing data of 16S rRNA gene have been deposited in the NCBI Sequence Read Archive (SRA) under accession numbers SRX1887715-SRX1887719. Selected genome bins are available at IMG under accession number 3300009842. All of the cDNA reads are deposited in the NCBI SRA under accession numbers SRX1887523, SRX1887527, and SRX1887528.

Author information

Authors and Affiliations



S-JK, J-GK, and S-KR designed and performed the research. S-JK, J-GK, S-JP, J-HG, M-J, W-HC, YH, and S-KR analyzed the data. J-GK, S-HL, E-JY, JP, and JJ undertook the field work and processing of samples. J-CC, FR, and J-HH discussed the results and commented on the manuscript. S-JK, J-GK, ELM, and S-KR wrote the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Sung-Keun Rhee.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S1. Diversity and abundance of bacterial 16S rRNA gene sequences obtained by the pyrosequencing of PCR amplicons in this study. Taxa with frequencies of < 1% were omitted from all samples. N/D, not detected. Table S3. DNA reads mapped to AL1_Pel and three Pelagibacter genomes. Table S4. Abundance (phylum level) of 16S rRNA gene among DNA reads. tr, < 0.1%; N/D, not detected. Table S5. Raw read classification based on NT database using Centrifuge. Table S6. Summary of metagenome and metatranscriptome data. Table S7. Spearman correlation coefficients for genes in the PK-mRNA-TPM and DC-mRNA-TPM datasets. Table S8. Bloom phase-specific gene expression and average fold changes in expression in 12 genome bins. Table S9. Summary of genes encoding representative transporters (TBDT, ABC, and TRAP) and SusD from 12 genomes. Table S10. Comparison of selected genes and pathways in GM1_Ant, GM2_Ant, GM4_SAR92, GM6_SUP05, and Roseobacter clades. Table S12. Genes in the pathway for vitamin B12 biosynthesis in GM1_Ant, GM2_Ant, and GM4_SAR92. (DOCX 73 kb)

Additional file 2:

Supplementary Results and Methods. (DOCX 939 kb)

Additional file 3:

Figure S1. A) Satellite-based surface chlorophyll-a concentrations (mg/m3) and photosynthetically available radiance (PAR) in the polynya of the Amundsen Sea during the sampling cruises. B) Detailed satellite-based surface chlorophyll-a concentrations (mg/m3) and sampling day of this study. GPS position polynya center station, 73.25–73.75S, 114.25–113.75 W. The data constructed from multi ocean color sensors were obtained from the Globcolour webpage ( Figure S2. Neighbor-joining tree of the most abundant OTUs in a) Polaribacter, b) Oceanospirillaceae, c) SAR92, and d) SAR11 from PK, DC, and SI. The representative sequences of each OTU selected by QIIME are highlighted in bold and coded as follows: OTU N (=OTU number), site (PK, DC, or SI), and percentage of the total read number. Bootstrap values of ≥ 50% are shown. Figure S3. NMDS plot showing relationships between bacterial compositions and environmental variables. Bray–Curtis stress 0.117. *indicates p < 0.05 for the environmental parameter. Samples from station 1 to station 26 from Kim et al. [19] were reanalyzed. Samples from the polynya center in this study are represented as filled squares; green indicates the peak phase of the bloom, and red indicates the declining phase of the bloom. Filled triangles represent samples obtained under sea ice. Samples from the previous study are represented as empty squares for the polynya center (station 13), empty diamonds for the polynya margin (station 8), empty pentagons for the polynya ice shelf (station 11), empty circles for the open ocean (station 1), and empty triangles for water under sea ice (station 26). Figure S4. Differential coverage plot of scaffolds for DNA reads obtained from a) PK, b) DC, and c, d) SI. Samples used for coverage calculation are marked on the X-axis (decline phase of the bloom) and Y-axis (peak phase of the bloom). Summary of a principal component analysis of the tetranucleotide frequencies of scaffolds selected from differential coverage plot: e) BC1_Pol, f) BC2, g) BC3 and BC4, h) BC5_Pol, i) GM1_Ant, j) GM2_Ant, k) GM3, l) GM4_SAR92, m) GM5, n) GM6_SUP05, and o) AL1_Pel. Colored circles indicate taxonomic information for each scaffold obtained using phylogenetic marker genes of the scaffold: Bacteroidetes (red), Alphaproteobacteria (blue), Gammaproteobacteria (green), Betaproteobacteria (purple), unclassified Proteobacteria (orange), Cyanobacteria (yellow), Firmicutes (brown), and others (pink). Putative target bins are indicated using arrows. Figure S5. Correlation of gene expression between PK and DC (PK-mRNA-TPM and DC-mRNA-TPM) for selected bins. Figure S6. COG-based transcriptome analysis of 12 genomes using a scatter plot of mRNA-TPM. Each point represents the relative abundance of a transcript assigned to a COG category. (A) total COGs; (B) transporter-related COGs; (C) GH-related COGs; (D) DMSP-, GB-, and glyoxylate shunt-related COGs; and (E) proteorhodopsin-related COGs. Figure S7. Gene expression levels of representative glycoside hydrolase genes from GM4_SAR92. GH3, glycoside hydrolase 3; GH16, glycoside hydrolase 16; GH42, glycoside hydrolase 42. Figure S8. Metabolism of dimethylsulfoniopropionate (DMSP), gene expression levels, and gene clusters across the reconstructed genomes. a) Relative abundance of dmdA transcripts from each bin, b) organization of the clusters of genes involved in DMSP utilization, and c) pathway for the utilization of DMSP. 1, DMSP demethylase (dmdA); 2, O-acetylhomoserine aminocarboxypropyltransferase; 3, alpha/beta hydrolase fold protein; 4, 3-hydroxyacyl-CoA dehydrogenase (dmdC); 5, acyl-CoA synthetase (dmdB); MMPA, methylmercaptopropionate; MMPA-CoA, 3-methylmercaptopropionyl-CoA; MTA-CoA, methylthioacryloyl-CoA. Figure S9. Synteny of GM1_Ant, GM2_Ant, and fosmic clone Ant4D3. Colors are assigned based on the COG classification. (DOCX 8210 kb)

Additional file 4:

Table S2. List of single marker genes from 12 bins and their phylogenetic positions determined using the NCBI NR database (Excel). (XLSX 33 kb)

Additional file 5:

Table S5. Information for the 500 most abundant genes in 12 genomes based on metatranscriptome data (a, PK; b, DC; and c, SI). Genes related to ribosome and mitochondrial biogenesis classified by KEGG and hypothetical proteins are marked in gray. (Excel) (XLSX 109 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kim, SJ., Kim, JG., Lee, SH. et al. Genomic and metatranscriptomic analyses of carbon remineralization in an Antarctic polynya. Microbiome 7, 29 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: