Remarkably coherent population structure for a dominant Antarctic Chlorobium species
Microbiome volume 9, Article number: 231 (2021)
In Antarctica, summer sunlight enables phototrophic microorganisms to drive primary production, thereby “feeding” ecosystems to enable their persistence through the long, dark winter months. In Ace Lake, a stratified marine-derived system in the Vestfold Hills of East Antarctica, a Chlorobium species of green sulphur bacteria (GSB) is the dominant phototroph, although its seasonal abundance changes more than 100-fold. Here, we analysed 413 Gb of Antarctic metagenome data including 59 Chlorobium metagenome-assembled genomes (MAGs) from Ace Lake and nearby stratified marine basins to determine how genome variation and population structure across a 7-year period impacted ecosystem function.
A single species, Candidatus Chlorobium antarcticum (most similar to Chlorobium phaeovibrioides DSM265) prevails in all three aquatic systems and harbours very little genomic variation (≥ 99% average nucleotide identity). A notable feature of variation that did exist related to the genomic capacity to biosynthesize cobalamin. The abundance of phylotypes with this capacity changed seasonally ~ 2-fold, consistent with the population balancing the value of a bolstered photosynthetic capacity in summer against an energetic cost in winter. The very high GSB concentration (> 108 cells ml−1 in Ace Lake) and seasonal cycle of cell lysis likely make Ca. Chlorobium antarcticum a major provider of cobalamin to the food web. Analysis of Ca. Chlorobium antarcticum viruses revealed the species to be infected by generalist (rather than specialist) viruses with a broad host range (e.g., infecting Gammaproteobacteria) that were present in diverse Antarctic lakes. The marked seasonal decrease in Ca. Chlorobium antarcticum abundance may restrict specialist viruses from establishing effective lifecycles, whereas generalist viruses may augment their proliferation using other hosts.
The factors shaping Antarctic microbial communities are gradually being defined. In addition to the cold, the annual variation in sunlight hours dictates which phototrophic species can grow and the extent to which they contribute to ecosystem processes. The Chlorobium population studied was inferred to provide cobalamin, in addition to carbon, nitrogen, hydrogen, and sulphur cycling, as critical ecosystem services. The specific Antarctic environmental factors and major ecosystem benefits afforded by this GSB likely explain why such a coherent population structure has developed in this Chlorobium species.
The Chlorobiaceae family, including the Chlorobium genus, are green sulphur bacteria (GSB) that fix CO2 anaerobically using the reverse tricarboxylic acid cycle by performing anoxygenic photosynthesis using sulphide or other reduced sulphur compounds as electron donors [1,2,3]. GSB can perform primary production under conditions of low photosynthetically active radiation as they have very sensitive and efficient light-harvesting antennae in their photosynthetic apparatus [4,5,6]. Members of the Chlorobium genus have global representation, making important contributions to thermally diverse ecosystems, typically residing at the oxic-anoxic interface of the water column in stratified aquatic systems, and within benthic mats [7,8,9,10,11,12,13,14,15,16,17,18,19]. Their growth requirements, physiology, and ecology have been well studied [3, 5, 7,8,9, 11, 16, 18, 20,21,22,23], including the use of comparative genomics [24, 25] and metagenomics [13, 14, 19, 26] to study their roles in environmental communities.
In Antarctica, summer sunlight can persist for 24 hours and deliver intense photosynthetically active radiation to drive primary production by phototrophic communities of phytoplankton; levels as high as 1225 μE m−2 S−1 have been recorded [27, 28]. While photosynthetic algae are known to play key phototrophic roles in the Southern Ocean, as do cyanobacteria in continental aquatic systems, comparatively little is known about Antarctic GSB [7, 9, 19, 27]. The most well-characterized Antarctic GSB are Chlorobium from Ace Lake [13, 14, 19]. Ace Lake is one of many meromictic (stratified) lakes within East Antarctica, Vestfold Hills , a region that harbours Chlorobiaceae [8, 9] (Fig. 1). Using microscopy, growth and isolation approaches, Chlorobiaceae were identified in a number of lakes and fjords, including Ellis Fjord and Taynaya Bay [8, 9] (Fig. 1). Ellis Fjord and Taynaya Bay contain marine basins where shallow sills restrict water flow from the Southern Ocean thereby permitting stratification of the water column and the development of stable oxic-anoxic interfaces [29, 31, 32]. While Ace Lake is one of the most extensively studied systems in Antarctica in terms of microbiology [19, 30, 34], Ellis Fjord [8, 35, 36] and Taynaya Bay [8, 37] have had little study and no metagenomic assessments.
In Ace Lake, the Chlorobium abundance exhibits marked seasonal variation, with highest abundance in summer, numbers falling during winter, and lowest abundance in early spring (> 100-fold lower than summer) before a rebound back into summer . The seasonal fluctuation was attributed primarily to changes in light hours rather than to the possible controlling effects of viral predation . Despite the availability of very large metagenome datasets and associated metagenome-assembled genomes (MAGs) for Chlorobium from Ace Lake, genomic variation and population structure have not been examined.
Insight into Antarctic haloarchaea genomic variation has been gained from analyses of single nucleotide polymorphisms (SNPs), and low coverage regions (LCRs) generated from mapping metagenome reads to reference genomes or MAGs [38,39,40,41]. LCRs arise from phylotypes that do not possess the sequences or have sufficiently diverged sequences that do not recruit. Phylotypes that contain genes in LCRs possess a unique genomic capacity compared to phylotypes that lack the genes. Highly divergent genes within LCRs can also confer distinct functional traits by conferring altered protein functions such as specificity for substrates or substrate preference, altered specificity for viral attachment to cell surface proteins, and so forth. Examining the function of genes from variable regions can determine whether phylotypes represent ecotypes that may occupy distinct ecological niches within an ecosystem.
In this study, the MAGs of Chlorobium from Ace Lake, Ellis Fjord, and Taynaya Bay were compared to each other and to non-Antarctic Chlorobium species in order to determine the following: (i) which Chlorobium species characterize the individual Antarctic systems; (ii) whether the species are endemic to Antarctica; (iii) what genomic traits characterize phylotypes within and between the Antarctic systems, including seasonal populations in Ace Lake. Chlorobium phylotypes and Chlorobium viruses were examined to determine: (i) the biogeographic distribution of Chlorobium viruses in the Vestfold Hills; (ii) the types of viral defence systems possessed by the Chlorobium; (iii) the characteristics of virus-host dynamics in each system. As a result, we greatly expanded knowledge of Antarctic Chlorobiaceae and learned how the unique Antarctic environment controls the evolution of these primary producers.
Results and discussion
Overview of metagenomes and Chlorobium MAGs
Ace Lake, Ellis Fjord, and Taynaya Bay are herein referred to as AL, EF, and TB, respectively. Biomass was collected by filtration through a 20-μm pre-filter onto large format filters (3, 0.8, and 0.1 μm) for AL and EF, and into Sterivex cartridges (0.22 μm) for TB (see the “Methods” section). The filtered reads from 18 AL (~ 99 Gb), three EF (~48 Gb) and one TB (~ 12 Gb) oxic-anoxic interface metagenomes were used for fragment recruitment (FR) analyses (Additional file 1: Table S1); for these analyses, the AL and EF metagenome reads from the three filter fractions representing a single sample (date and depth) were pooled to form merged metagenomes (see the “Methods” section). The assembled contigs from individual AL (~ 6 Gb), EF (~ 7 Gb) and TB (~ 700 Mb) metagenomes (Additional file 1: Table S1) were used to determine the Chlorobium OTU abundance distribution in the three Vestfold Hills systems, and for viral analyses.
A total of 59 high or medium quality MAGs were analysed, of which 31 AL, five EF, and two TB high-quality MAGs had ≥ 99% genome completeness (Additional file 1: Table S2; Additional file 2: Dataset S1). The MAGs represented 67,265 genes on 1124 Chlorobium contigs, and both 16S rRNA gene and FmoA (Fenna-Matthews-Olson protein; bacteriochlorophyll A) protein sequences were used as phylogenetic markers . For FR analyses the AL_ref MAG (Dec 2014, 19 m depth, 0.1 μm-filter) contained 27 contigs and 1,797 genes and was 99% complete (1,812,610 bp), and the EF_ref MAG (Oct 2014, 45-m depth, 3-μm filter) contained 32 contigs and 1807 genes and was 99% complete (1,836,564 bp) (Additional file 1: Tables S2 and S3; Additional file 2: Dataset S1).
Chlorobium species present in EF and TB
Chlorobium OTUs were most abundant in EF (45 m) and TB (11 m) at depths where oxic-anoxic interfaces have previously been recorded [8, 29, 31], with a relative abundance (EF, ≤ 49%; TB, 6%) comparable to the range of abundances observed in AL (< 1–84%; Fig. 2) . In TB where Chlorobium had lower relative abundance than EF or AL, the Simpson’s index of diversity was higher (1 − λ′ > 0.9 compared to ≤ 0.7).
All 16S rRNA genes from AL, EF, and TB Chlorobium MAGs had identical sequences (1505 bp), as did all FmoA protein sequences (366 aa) (Additional file 1: Fig. S1). The pair-wise, average nucleotide identify (ANI) of all Chlorobium MAGs was ≥ 99.9% over ≥ 92% alignment fraction. FR of AL, EF, and TB metagenome reads to the Chlorobium 16S rRNA gene (EF_ref MAG) revealed a number of SNPs with variant frequency ≥ 0.01 (i.e., at least 1% of the aligned reads contained the SNP) (Additional file 3: Dataset S2). All of these SNPs, except one from the AL Dec 2014 merged metagenome, two from the EF merged metagenome, and four from the TB metagenome, had very low read depth (on average < 5) and could represent sequencing errors (Additional file 3: Dataset S2). In contrast, the read depth of the Chlorobium 16S rRNA gene sequence (lacking SNPs) was > 80 in all AL (except Oct 2014, read depth 31), EF and TB metagenomes, and > 11,000 in some metagenomes (Additional file 3: Dataset S2). These data indicate that the same species of Chlorobium was present in all three Vestfold Hills systems, representing at least 97% of AL, 97% of EF, and 98% of TB Chlorobium population, and was the only detectable Chlorobium species in AL throughout a seasonal cycle (also see below in “Ca. Chlorobium antarcticum population variation between AL, EF, and TB”).
IMG (Integrated Microbial Genomes) taxonomy denoted all MAGs as most closely related to Chlorobium phaeovibrioides DSM 265 (herein referred to as Cpv-DSM265). The 16S rRNA gene identity (99%; 17 nt mismatches; Additional file 1: Fig. S1a), FmoA protein identity (98%; six aa mutations; Additional file 1: Fig. S1b), ANI (85% over 80–86% alignment fraction), and average amino acid identity (AAI; 89%) distinguish the Antarctic species from Cpv-DSM265, and these differences are reflected in 16S rRNA gene and FmoA protein trees (Fig. 3) (also see below in “Comparison of Ca. Chlorobium antarcticum to Cpv-DSM265 and global representation”). In view of the genomic and phylogenetic differences we name the Antarctic species, Candidatus Chlorobium antarcticum sp. nov. (from ant.arc'ti.cum. L. neut. adj. antarcticum southern, Antarctic) (type MAG AL_ref MAG = 3300023061_2; 99% complete; 0.55% contamination) (Additional file 1: Table S2; Additional file 2: Dataset S1).
Ca. Chlorobium antarcticum population variation within AL
Aligning AL metagenome filtered-reads to the AL_ref MAG to identify SNPs determined that no fixed mutations (variant frequency ≥ 0.9) were present. However, seven LCRs were identified (Fig. 4; Additional file 1: Table S4). The LCRs encoded cell wall modification, cell defence, transport, DNA repair, protein modification, metabolism, mobile element, and hypothetical genes (Additional file 1: Tables S4, S5, and S6). Metabolic genes included: (i) a cluster of nine genes representing the N-type rotary ATPase (N-ATPase) operon (atpD, atpC, atpQ, atpR, atpB, atpE, atpF, atpA, atpG), which codes for ATPase subunits involved in ATP-dependent efflux of Na+ or H+ ions; (ii) a cluster of eight single-copy genes involved in the anaerobic pathway for cobalamin biosynthesis (cbiD, cbiJ, cbiL, cbiK, cysG, and bifunctional cbiFG, cbiET, cbiHC), plus a single copy gene involved in cobinamide salvaging (cbiZ); (iii) a gene cluster containing one cobaltochelatase (cobN) and three magnesium chelatase (bchD, bchH, bchI) genes; (iv) TonB-dependent and ABC transporter proteins involved in the import of iron, cobalt, and cobalamin across the outer membrane and inner membrane, respectively; (v) a gene cluster for export of proteases (Additional file 1: Tables S5 and S6).
A seasonal pattern was observed, with the proportion of the Ca. Chlorobium antarcticum population that possessed the genes within LCRs tending to be higher in summer than in winter or spring (Additional file 1: Tables S4, S5, and S6), most notably for genes associated with cobalamin synthesis and transport (also see below in “Population structure of cobalamin biosynthesis and transport genes”).
The range of transport genes present in the LCRs of Ca. Chlorobium antarcticum MAGs is indicative of the population supporting a diversity of transport abilities (Additional file 1: Table S4 and S5). For example, protease export systems with similarity to Pseudomonas aeruginosa AprDEF were present in at least 28% of the Ca. Chlorobium antarcticum population, and abundance did not vary with season (Group 7 in Additional file 1: Table S5). For GSB, iron is an essential trace element required for the photosynthetic reaction centre . The concentration of iron in AL increases with depth, being ~ 1 μM at the oxic-anoxic interface [30, 43]. TonB-dependent transporter and ABC transporter genes enable the uptake of both inorganic iron and organic forms of iron (siderophores, hemoproteins) . All Ca. Chlorobium antarcticum MAGs contained two sets of ferrous iron transporter genes (feoABC and feoAB), and three TonB-dependent transporter genes potentially involved in iron complex import across the outer membrane. However, the ABC transporters associated with the uptake of iron complexes were only identified in LCRs (Groups 1 and 2 in Additional file 1: Table S5), indicating an augmented capacity for these phylotypes to source exogenous iron (at least 56% of the Ca. Chlorobium antarcticum population).
An N-ATPase operon (atpDCQRBEFAG) was present in at least 61% of the Ca. Chlorobium antarcticum population, with abundance varying only marginally by season (Group 8 in Additional file 1: Table S5); in addition, F0F1 ATP synthase genes were present throughout the Ca. Chlorobium antarcticum population. N-ATPases utilize ATP to actively transport Na+ or H+ ions out of the bacterial cell [45,46,47]. The Ca. Chlorobium antarcticum ATPase subunit c amino acid sequence included the two glutamate residues in both of its C- and N-terminal helices that are diagnostic of Na+-binding [45,46,47], indicating it functions in Na+ export. N-ATPase genes have been identified in some Chlorobi, including Chlorobaculum parvum, Chlorobaculum tepidum (partial locus only), Pelodictyon luteolum, and Prosthecochloris aestuarii [48, 49].
Ca. Chlorobium antarcticum population variation between AL, EF, and TB
Similar to the analysis of SNPs within the AL population, no fixed SNPs were observed for EF metagenome reads against the EF_ref MAG. However, from 1807 genes in the EF_ref MAG, SNPs were identified in 68 genes only from AL, two only from TB, and 19 genes from both AL and TB (Fig. 5; Additional file 1: Table S7). Most SNPs occurred in genes involved in intracellular functions, with a smaller proportion in cell wall modification, substrate transport, and membrane protein genes. SNPs were present in regions of the EF_ref MAG that had even FR coverage, except for those in a hypothetical gene (contig E1, Additional file 1: Table S7), a precorrin-3B methylase/precorrin isomerase gene (contig E15, Additional file 1: Table S7), and gene for a receptor for the TonB-dependent uptake of iron-containing proteins (contig E17, Additional file 1: Table S7). This indicated that the AL and TB SNPs tended to occur within all Ca. Chlorobium antarcticum subpopulations, and were therefore characteristic of each system.
A total of 12 LCRs were identified from FR of AL, EF and TB metagenome reads to the EF_ref MAG (Fig. 5; Additional file 1: Table S4). Notably, five AL LCRs identified against the AL_ref MAG were also LCRs from FR of AL, EF, and TB reads to the EF_ref MAG (Additional file 1: Table S4) indicating that the main (detectable) Ca. Chlorobium antarcticum phylotypes existed in all three Vestfold Hills systems. The LCRs encoded cell wall modification, cell defence, transport, DNA repair, protein modification, Na+ or H+ ion efflux, anaerobic cobalamin biosynthesis, cobinamide salvaging, and cobalt/magnesium chelatase genes, similar to the gene functions of the AL_ref MAG LCRs. LCRs specific to the EF_ref MAG included cell wall modification, general function, and hypothetical genes.
To assess gene order of phylotypes, the contigs of AL, EF, and TB MAGs were aligned to AL_ref MAG (Additional file 3: Dataset S2). Most of the AL_ref MAG contigs that did not align to the contigs of the other MAGs were from AL_ref MAG LCRs, consistent with gene order varying in Ca. Chlorobium antarcticum phylotypes.
While the main phylotypes were shared amongst systems, some LCRs (e.g., contigs E29–E32) had very low read depth (≤ 2%) in all three systems (Additional file 1: Table S4) indicating that the genetic capacity represented by these contigs was rare within the overall Ca. Chlorobium antarcticum population. The relative coverage of some LCRs also varied considerably between systems indicative of different population structures for these specific genes (Fig. 6; Additional file 1: Table S4). For example, the 11-kb contig E1 represented 3% of the EF Ca. Chlorobium antarcticum population but 69% of the TB Ca. Chlorobium antarcticum population. Based on relative coverage, phylotypes represented by LCRs contributed more to the TB Ca. Chlorobium antarcticum population than to the AL or EF populations (Fig. 6; Additional file 1: Table S4). However, EF_ref MAG SNPs were more prevalent for AL than TB, indicating that SNP-based variation was more similar between EF and TB Ca. Chlorobium antarcticum populations than either were to the AL population. The apparent differences in contribution of LCRs and SNPs to the Ca. Chlorobium antarcticum population from each system may reflect the cellular mechanisms involved in generating variation (e.g., DNA repair) and/or environmental effects (e.g., selective forces), and determining the causes will require further investigation (also see Additional file 1: Supplementary text).
To determine if phylotypes from AL, EF, or TB existed with greater sequence divergence than the FR matching criteria permitted (≥ 95% identity), G + C content of metagenome contigs was plotted against read depth and the taxonomy of contig clusters assigned (Additional file 1: Fig. S2); this approach was previously used to identify phylotypes of Antarctic haloarchaea with significantly different genomes to known species . The contigs in the main cluster were from Ca. Chlorobium antarcticum (Additional file 1: Fig. S2). Aside from a number of contigs from some smaller clusters (see the “Methods” section), none of the OTUs of small clusters represented Ca. Chlorobium antarcticum, indicating that phylotypes with more divergence than the cutoffs used for assigning LCRs were not detectable in the metagenome data.
Collectively, the high ANI/AAI between MAGs (see above in “Chlorobium species present in EF and TB”), the small extent of variation represented by SNPs and LCRs, and the taxonomic findings of the analysis of GC/read-depth clusters, illustrate that the Ca. Chlorobium antarcticum population has remarkably little genomic variation.
Comparison of Ca. Chlorobium antarcticum to Cpv-DSM265 and global representation
The AL, EF, and TB contigs had overall low nucleotide identity (< 90%) when aligned to the Cpv-DSM265 genome, with many gaps and differences in gene content (Fig. 7). As described previously, Ca. Chlorobium antarcticum is green rather than brown in colour (unlike Cpv-DSM265); as well as possessing the biosynthetic pathway for chlorobactene (found in green-coloured GSB), Ca. Chlorobium antarcticum lacks the capacities to synthesize bacteriochlorophyll e and isorenieratene, both found in Cpv-DSM265 and other brown-coloured GSB .
Many of the Cpv-DSM265 genes that caused the alignment gaps were associated with transposases and hypothetical genes (Additional file 3: Dataset S2). However, some were genes involved in thiosulphate oxidation (sox gene cluster containing soxA, soxB, soxX, soxY, soxZ), assimilatory sulphate reduction (cysC, cysD, cysN), and pilus assembly, none of which were present in the Ca. Chlorobium antarcticum MAGs. GSB do not tend to have a genomic capacity to perform assimilatory sulphate reduction , and it has been speculated that Cpv-DSM265 acquired the sox gene cluster on a mobile element from another member of the Chlorobiaceae family that originated in Proteobacteria . Ca. Chlorobium antarcticum is therefore predicted to not be able to assimilate sulphate or to oxidise thiosulphate.
A number of Ca. Chlorobium antarcticum contigs did not align to the Cpv-DSM265 genome (Additional file 3: Dataset S2). These contigs contained anaerobic cobalamin biosynthesis, cobalt transport, cobalamin transport, cobalt/magnesium chelatase, and N-ATPase genes, all of which were absent from the Cpv-DSM265 genome. While cobalamin transport and magnesium chelatase genes were present in all Ca. Chlorobium antarcticum MAGs, all of the contigs that did not align with the Cpv-DSM265 genome represented LCRs of the AL_ref MAG and EF_ref MAG (Additional file 1: Tables S4, S5, and S6). It is therefore possible that Cpv-DSM265 represents a phylotype that lacks these genetic loci, or that the loci represent functions that are of particular importance to the Antarctic Ca. Chlorobium antarcticum population (also see below in “Population structure of cobalamin biosynthesis and transport genes”).
The Ca. Chlorobium antarcticum MAGs encoded multiple glycosyltransferase genes involved in cell wall biosynthesis that were not identified in the Cpv-DSM265 genome; the glycosyltransferases were represented throughout the Ca. Chlorobium antarcticum population, with only a few in LCRs (Additional file 1: Table S4), and are therefore characteristic of this Antarctic species. The glycosyltransferases may fulfil roles in cold adaptation through their function in biosynthesis and modification of cell walls [13, 52]. RNA helicases present in LCRs may also fulfil roles in cold adaptation through a potential functional capacity to unravel RNA secondary structures and influence rates of protein synthesis [53, 54]. The CRISPR-Cas defence systems  varied between the two Chlorobium species with Ca. Chlorobium antarcticum containing subtype I-E and Cpv-DSM265 containing subtype I-C (also see below in “Ca. Chlorobium antarcticum-virus interactions”). These genomic differences underscore specific metabolic and defence capabilities of the two Chlorobium species.
The global representation of Ca. Chlorobium antarcticum was assessed by matching the Ca. Chlorobium antarcticum 16S rRNA gene to all 16S rRNA genes from public metagenomes and genomes and the Ca. Chlorobium antarcticum FmoA protein sequence to all proteins from genomes (including MAGs and single-cell genomes) in IMG. All metagenome and genome matches were ≤ 99% 16S rRNA gene identity, and with the exception of Cpv-DSM265 (98% identity), all FmoA sequences had < 98% identity (Additional file 4: Dataset S3). The inability to identify Ca. Chlorobium antarcticum outside of Antarctica was in marked contrast to its representation in data from the three Vestfold Hills systems.
Population structure of cobalamin biosynthesis and transport genes
Cobalamin and cobamide analogues are cofactors that function in a variety of metabolic processes, and although most bacteria contain cobamide-dependent enzymes, most are incapable of synthesizing the cofactors and need to source if from the environment [56, 57]. Cobalamin is an organometallic compound containing a central corrin ring with chelated cobalt. The biologically active form of cobalamin, adenosylcobalamin, can be synthesized by an aerobic or anaerobic pathway, with part of the pathway shared by both (Additional file 1: Fig. S3).
All the genes in the anaerobic pathway for cobalamin biosynthesis have been reported for Chlorobaculum tepidum . However, a comparative genomics assessment of 11,000 bacterial species did not identify all cobamide biosynthesis genes in the 10 Chlorobi that were examined, including Cpv-DSM265, and categorized them as cobinamide salvagers . We determined that Ca. Chlorobium antarcticum encodes the anaerobic pathway, with the genes exclusive to the anaerobic pathway (green-coloured branch between precorrin-2 and cob(II)yrinate a,c-diamide in Fig. 8) located in a LCR. At least 29% of the AL Ca. Chlorobium antarcticum population from all time periods, and 8% and 72% of the EF and TB Ca. Chlorobium antarcticum populations, respectively, possessed the genes, although coverage was about 2-fold higher in AL in summer compared to winter (Additional file 1: Tables S4 and S6).
The anaerobic synthesis of 5,6-dimethylbenzimidazole (DMB), the lower axial ligand of adenosylcobalamin, involves enzymes from the bzaABCDE operon acting on 5-amino-1-(5-phospho-β-D-ribosyl)imidazole as substrate . While the Ca. Chlorobium antarcticum MAGs did not possess bzaABCDE or cobC it did encode the DMB activation and utilization genes (cobT, cobS). This indicates that similar to some other bacteria [68, 69], Ca. Chlorobium antarcticum may have a capacity to remodel exogenous DMB to produce cobalamin. The gene cobC can perform the final step in adenosylcobalamin synthesis, but Ca. Chlorobium antarcticum MAGs lacked this gene and may instead utilize alternative genes, cblZ or cblXY, which have been proposed to function in Actinobacteria and some Alphaproteobacteria, respectively .
The Ca. Chlorobium antarcticum LCRs also contained a colocalized cluster of genes annotated as cobaltochelatase subunit CobN and magnesium chelatase subunits BchH, BchI and BchD (Additional file 1: Table S6). CobN forms a complex with cobaltochelatase subunits CobS and CobT (which were not identified in the MAGs) and catalyses cobalt insertion during aerobic cobalamin biosynthesis [70, 71], and BchH, BchI and BchD can function in magnesium insertion during bacteriochlorophyll biosynthesis . However, sequence similarity exists between cobaltochelatase NST and magnesium chelatase HID [73, 74] and it has been speculated that BchI and BchD may function as CobS and CobT to form a functional cobaltochelatase complex . In Ca. Chlorobium antarcticum, these cobalt/magnesium chelatase genes were colocalized with potential cobalamin transport genes (LCR5 in Additional file 1: Table S4; Groups 4 and 5 in Additional file 1: Table S5) and therefore may function in cobalamin biosynthesis. In support of this inference, it was speculated that the colocalization of cobalt/magnesium chelatases beside a TonB-dependent receptor protein for cobalamin in Chlorobaculum tepidum may pertain to cobalt being inserted into exogenously acquired cobalamin . Moreover, additional magnesium chelatase genes, including three coding for BchH and one each for BchI and BchD, were present throughout the Ca. Chlorobium antarcticum population which likely function in bacteriochlorophyll synthesis rather than cobalamin production. Most GSB contain three homologues of BchH, denoted BchH, BchS, and BchT , which have been reported to be active magnesium chelatases that exhibit differences in their enzymatic properties .
Cobalamin biosynthesis genes can be colocalized with the cobalt transporter genes cbiMNQO [61, 62], and this was the case in Ca. Chlorobium antarcticum (LCR5 in Additional file 1: Table S4). Cobalt is relatively concentrated in AL, with ~6 nM at the oxic-anoxic interface which is ~ 300-times the concentration in sea water [30, 43]. The cbiMNQO gene cluster was present in a LCR (Group 6 in Additional file 1: Table S5) with the genes present in at least 41% of the Ca. Chlorobium antarcticum population from all time periods, although an approximately 1.5-fold higher coverage occurred in summer compared to winter; the minimum abundance (~ 30%) and seasonal change (~ 2-fold higher in summer) are similar to the phylotypes containing the cobalamin biosynthesis genes.
The Ca. Chlorobium antarcticum MAGs contained cobA, cobP/cobU, and cbiZ, representing all the genes known in bacteria and archaea to be involved in salvaging cobinamide [63,64,65,66]. cbiZ can also function in salvaging pseudocobalamin, and cbiZ was the only gene located in a LCR (Fig. 8; Additional file 1: Table S6). These data indicate that the whole lake population of Ca. Chlorobium antarcticum was likely adept at converting cobinamide into intermediates of cobalamin biosynthesis, and a subpopulation (at least 8% from all time periods) had the capacity to also salvage pseudocobalamin. The coverage of cbiZ was about 2-fold higher in summer, matching the seasonal abundance pattern of cobalt transporter and cobalamin biosynthesis genes (Additional file 1: Tables S5 and S6).
In Ca. Chlorobium antarcticum MAGs, the cbiZ and cobalamin transporter genes were colocalized (LCR5 in Additional file 1: Table S4), as is the case in many bacteria, including Chlorobium . It has been speculated that Rhodobacter sphaeroides may use cobalamin transporters to scavenge pseudocobalamin produced by cyanobacteria and convert it to cobalamin precursors using CbiZ [65, 66, 77,78,79,80]. AL supports a high abundance of Synechococcus that blooms in summer close to the oxic-anoxic interface [19, 81], indicating that it may be the source of pseudocobalamin that is imported and converted to cobalamin precursors by cbiZ.
The uptake of cobalamin itself requires TonB-dependent transport (BtuB) through the outer membrane and ABC transporters (e.g., BtuCDF) or energy-coupling factor (CbrT) through the inner membrane [82,83,84]. Ca. Chlorobium antarcticum contained two putative btuB TonB-dependent transporter genes, plus a set of ABC transporter genes (btuC, permease; btuD, ATP-binding; btuF, substrate-binding) throughout the population. Additional putative btuB and btuCDF genes were also present in LCRs (Groups 3, 4, and 5 in Additional file 1: Table S5) in at least 7% of the Ca. Chlorobium antarcticum population across all time periods, although the abundance was 2–3-fold higher in summer compared to winter (Groups 3, 4, and 5 in Additional file 1: Table S5).
The biosynthesis and transport of cobalamin has been shown to be regulated by cobalamin-binding riboswitches that are present in the 5′-untranslated region of genes, including btuB (cobalamin transporter), metE (5-methyltetrahydropteroyltriglutamate homocysteine methyltransferase), and nrdD (ribonucleoside-triphosphate reductase) [85,86,87,88,89,90,91,92,93]. A total of six cobalamin riboswitch sequences were identified in LCRs of Ca. Chlorobium antarcticum, one each upstream of btuB and btuF (both cobalamin transporters), metE, nrdD, and at the end of two contigs (Fig. 4b; Additional file 1: Table S6). Three additional cobalamin riboswitch sequences were identified throughout the Ca. Chlorobium antarcticum population, one each upstream of two btuB genes, and a hypothetical protein-coding gene. In Chlorobi, the genes with cobalamin riboswitch sequences are mainly translationally regulated; regulation has been shown to involve inhibition of translation initiation, where cobalamin (in the form of adenosylcobalamin) binds to the riboswitch RNA sequence of the regulated mRNA, leading to a perturbed mRNA structure that inhibits ribosome binding and subsequent translation [88, 89, 91].
Overall, the phylotype data for cobalamin-related biosynthesis, salvaging, and transport indicate that all of the Ca. Chlorobium antarcticum population is capable of importing cobalamin (Additional file 1: Tables S4, S5, and S6), although the proportion of the population with additional cobalamin transport genes varies with the system: EF, 7%; AL, 7% increasing to 25% in summer; TB, 78% (Additional file 1: Tables S4 and S5). Certain phylotypes are also capable of importing and salvaging cobinamide and pseudocobalamin, with this capacity also increasing in summer in AL.
Ca. Chlorobium antarcticum-virus interactions
The subtype I-E CRISPR-Cas system in Ca. Chlorobium antarcticum contained the core cas genes casA (or cse1) and casB (or cse2) with genes arranged cas3, casA, casB, casE, casC, casD, cas1, cas2, followed by a CRISPR spacer array, indicating the system could be functional. Analysis of NCBI gene annotation data showed CRISPR-Cas systems to be common in GSB, the subtypes to vary, and some species to contain multiple subtypes (Additional file 1: Table S8). No genes associated with BREX (bacteriophage exclusion) or DISARM (defence island system associated with restriction-modification) systems were identified. However, type I R-M (restriction-modification) methyltransferase and endonuclease and two type IV R-M endonuclease genes were identified (Additional file 1: Table S9), with the type I R-M genes present in a LCR (Additional file 1: Tables S4). Additionally, five genes associated with toxin-antitoxin (T-A) systems (parD, parE, relF, brnA, abiEi) were identified in Ca. Chlorobium antarcticum (Additional file 1: Table S9), with brnA in a LCR (Additional file 1: Table S4). The most likely system to contribute to the control of viral propagation is the AbiE type IV T-A system, an ABI (abortive infection) system that causes cell dormancy and prevents viral dissemination , but it is unclear if this system was functional as the antitoxin gene (abiEi) was identified but not the toxin gene (abiEii).
Potential Ca. Chlorobium antarcticum viruses were identified by aligning the Ca. Chlorobium antarcticum CRISPR-Cas spacers to an Antarctic virus catalogue, and a spacer database was used to identify additional potential hosts of the viruses (see the “Methods” section) . A total of 79 CRISPR spacers from EF Ca. Chlorobium antarcticum MAGs (Additional file 1: Table S10) mapped to potential viruses. Eight viral contigs had 97% identity to spacer Spc230 (Additional file 1: Table S11). The viral contigs were from AL metagenomes and belonged to viral cluster cl_248, a previously identified potential AL Chlorobium virus . No EF Ca. Chlorobium antarcticum spacers were mapped to EF viral contigs, which likely reflects the smaller size of the EF metagenome dataset compared to AL which resulted in 6,104 EF viral contigs compared to 30,897 AL viral contigs in the Antarctic virus catalogue.
As the TB metagenomes were not available when the Antarctic virus catalogue and IMG/VR spacer database were constructed , a slightly different approach was used to identify viral contigs matching to spacers in TB Ca. Chlorobium antarcticum MAGs (see the “Methods” section). A total of 58 TB Ca. Chlorobium antarcticum spacers were aligned against the Antarctic virus catalogue, resulting in nine spacers (Spc236, Spc238, Spc241, Spc243–Spc245, Spc249, Spc251, Spc252; Additional file 1: Table S10) matching to 23 viral contigs with ≥ 97% identity. Eighteen of the viral contigs were from AL metagenomes and belonged to viral cluster cl_1024 (14) and viral singletons sg_10581 (1), sg_14551 (1), sg_14796 (1), and sg_14959 (1); cl_1024 was previously identified as a potential AL Chlorobium virus . The remaining five viral contigs were from hypersaline Antarctic systems, Deep Lake and Rauer 13 Lake , and belonged to cl_9176 (1), sg_1370 (1), sg_1648 (1), sg_1649 (1), and sg_1677 (1). Similar to EF, no TB Ca. Chlorobium antarcticum spacers mapped to the 995 available TB viral contigs, likely reflecting the size of the metagenome dataset. It is noteworthy that the AL Ca. Chlorobium antarcticum spacers themselves had ≥ 97% identity matches to viral contigs from AL as well as Deep Lake, Club Lake, Organic Lake, and some Rauer Island lakes (Rauer 2, 3, 5, 6, 11, and 13 lakes) (Fig. 9; Additional file 1: Table S11).
The viral contigs representing potential EF and TB Ca. Chlorobium antarcticum viruses were matched (100% identity) to host spacers, identifying potential hosts to be primarily Gammaproteobacteria and Chlorobi (including Chlorobium OTUs from the Vestfold Hills), plus Actinobacteria, Bacteroidetes, Firmicutes, Betaproteobacteria, Deltaproteobacteria, and Verrucomicrobia (Additional file 1: Table S12). These host assignments were similar to previous findings for AL Chlorobium viruses  and point to Ca. Chlorobium antarcticum viruses from all three systems belonging to similar viral clusters (e.g., cl_1024 and cl_248). This host analysis indicates that the viruses likely prey on several different bacterial genera as a wide variety of hosts, and may therefore be considered generalist rather than specialist viruses [95,96,97].
The predicted Ca. Chlorobium antarcticum viruses also appeared to be widely distributed with spacer matches to viral contigs from hypersaline systems enriched in haloarchaea (Deep Lake, Club Lake, Rauer 3, 6, and 13 lakes) and diverse bacterial taxa (Organic Lake, Rauer 2, 5, and 11 lakes) (Fig. 9; Additional file 1: Table S11). Chlorobium has not been reported in these lake systems, and the microbial communities in Deep Lake [38, 41] and Organic Lake [98, 99] in particular, have been intensively studied. In contrast, the other potential hosts, notably Gammaproteobacteria, are prevalent in Organic Lake [98, 99] and have been identified in some of the other lakes [38, 41], further reinforcing that the potential Ca. Chlorobium antarcticum viruses have characteristics of generalist viruses infecting a broad host range [95,96,97].
We have shown that a single species of Chlorobium was detected in AL, EF, and TB that has distinct genomic traits to its closest relative Cpv-DSM265 (Additional file 1: Table S13) and is not identifiable in available metagenome data from elsewhere in the world. As such, we conclude that Ca. Chlorobium antarcticum is to the best of our knowledge, endemic to the stratified lakes and fjords of the Vestfold Hills of East Antarctica.
Variation present as SNPs and LCRs defined population variation of Ca. Chlorobium antarcticum, indicating the presence of phylotypes and ecotypes, with the population structure differing marginally amongst the three systems. Limited genomic variation of Ca. Chlorobium antarcticum in AL across a 7-year period illustrates that the population is currently stable. Seasonal changes in population structure were inferred to arise as a natural response to sunlight hours and growth of active populations. Population variation contributing to survivability was inferred for genes associated with cold adaptation, metabolism, and viral defence. In particular, cobalamin synthesis and transport stood out as a genomic facet of Ca. Chlorobium antarcticum that was subject to seasonal variation in population structure and was likely a trait relevant to effective ecosystem functioning.
Cobalamin deficiency can impair bacteriochlorophyll content and chlorosome formation, with cobalamin supplementation restoring bacteriochlorophyll content [100, 101]. The higher abundance in summer (cf. 2-fold higher than winter) of Ca. Chlorobium antarcticum phylotypes that possess a genomic capacity for cobalamin biosynthesis, cobinamide and pseudocobalamin salvaging, cobalt transport, and/or cobalamin transport, fits with the importance of cobalamin for supporting phototrophic processes and may help cells recuperate after a long, dark winter to regain the very high abundance they achieve in summer. Conversely, the involvement of ~ 30 genes and energetic cost associated with cobalamin biosynthesis  fits with the ecosystem supporting a reduced capacity in winter when sunlight is limited or absent. While bacteria rely on cobalamin for growth, most bacteria in microbial communities lack the biosynthetic capacity [56, 57]. Ca. Chlorobium antarcticum is the most abundant species in AL and is key to ecosystem function, being probably the single most important member of the food web . Its requirement for cobalamin for effective phototrophic growth likely generates positive selection within the Ca. Chlorobium antarcticum population for a biosynthetic capacity. As a result of its niche competitiveness, the species generates a very high level of biomass mid-water in the lake (>108 cells ml−1) . Therefore, in addition to its role in carbon, nitrogen, hydrogen, and sulphur cycling [13, 14, 19], Ca. Chlorobium antarcticum is also likely to be the main provider of exogenous cobalamin to the lake food web; this provision would be facilitated by the seasonal lysis and release of cellular contents of > 99% of the summer population of cells.
Partially based on Chlorobium-virus interactions in AL, it was proposed that some Antarctic viruses may persist by achieving less harmful interactions with their hosts than counterparts from warmer environments . However, Chlorobium-virus interactions are not well understood because very few GSB viruses have been described [17, 102]. Through this study and a previous study , a total of 59 viral contigs and 12 viral clusters or singletons were mapped to Ca. Chlorobium antarcticum CRISPR-spacers, resulting in the discovery of 12 potential Chlorobium viruses. These viruses are predicted to be generalists. It has been speculated that viruses can evolve into specialist viruses when they are exposed to a homogenous host population (e.g., composed of a single species) that does not change with time, whereas generalist viruses can evolve from viruses exposed to a heterogenous host population (e.g., composed of multiple species) that fluctuates with time . The adaptation of a specialist virus to effective replication in a single host may result in a cost to fitness when replicating in other potential hosts, whereas a generalist virus is not expected to suffer a fitness cost as it is adapted to replicate in different hosts . While Ca. Chlorobium antarcticum represents a remarkably dominant species with relatively subtle population variation and may therefore be expected to harbour specialist viruses, its seasonal abundance in AL changes by at least 100-fold . If as proposed, sunlight hours control seasonal abundance of AL Chlorobium , the marked change in host abundance may select against the establishment of specialist viruses, while still leaving Ca. Chlorobium antarcticum as a host for generalist viruses that have a capacity to propagate in other bacterial hosts. In this regard, a reliance on sunlight and seasonal die-off during winter and early spring may significantly benefit the long-term persistence of Ca. Chlorobium antarcticum in Antarctic aquatic systems.
The Antarctic continent is geographically isolated and Antarctic environmental conditions distinguish it from most other regions of the globe [27, 103, 104]. The remoteness and environmental conditions create major logistical challenges for performing scientific research, yet without adequate research, policy makers will be compromised when making decisions about Antarctica’s future . Metagenomic approaches have greatly enhanced the understanding of indigenous Antarctic microorganisms [27, 103]. For example, Antarctic soil bacteria were discovered that scavenge and oxidize atmospheric H2, which in association with CO and/or CO2, enables chemosynthetic growth . In the Vestfold Hills and Rauer Islands, three different genera have been found to dominate the haloarchaea population of hypersaline lakes, making photoheterotrophy the main microbial process occuring in these lakes [38, 40, 41, 106]. The species appear to be endemic to Antarctica, with one member, Halohasta litchfieldiae (tADL), constituting ≤ 45% of each lake’s microbial community [38, 41]. Relatively little genomic variation exists within and between the populations from the hypersaline systems, but both environment and distance effects have been inferred to contribute to biogeographical patterning of variation . A major phylotype of Hht. litchfieldiae with relatively low ANI (~ 0.8) has also been discovered [38, 40]. Based on our current research, we make the claim that Ca. Chlorobium antarcticum represents the Antarctic species with the least amount of known population-level, genomic variation. The capacity to state this is predicated on having a very large Ca. Chlorobium antarcticum metagenome dataset (~ 159 Gb) that provided a MAG read depth of up to ~ 11,000. The coherence of the population is particularly striking in view of it being retained across a 7-year time span, across the populations from three distinct water bodies, and throughout the population of a seasonal cycle, during which relative cellular abundance changed > 100-fold. Future efforts need to evaluate how distinct Antarctic species and communities are by canvassing the environmental and biogeographic diversity of Antarctica’s ecosystems and obtaining sufficient metagenomic depth to assemble MAGs and perform population-level studies. Achieving this will help to establish the extent of Antarctic microbial endemism, the uniqueness of contributions that Antarctic microbes make to global biogeochemical cycles, and the risks associated with anthropogenic impact, including climate change, on the Antarctic biome [27, 104, 107].
Sample collection, DNA sequencing, MAG generation, and abundance calculations
The sampling, sequencing, assembly, and annotation of AL metagenomes were described previously [19, 108]. Biomass from EF Basin 2 (Fig. 1) was collected from 5-, 18-, 45-, and 60-m depths by filtration through a 20-μm prefilter onto large (293 mm diameter) format filters (3, 0.8 and 0.1 μm) and DNA extracted as previously described [13, 38, 41]. The sequencing, assembly, and annotation of EF metagenomes were performed by the Joint Genome Institute as previously described , generating 12 EF metagenomes (three filter fractions from four depths) (Additional file 1: Table S1). The biomass from TB Basin 1 (Fig. 1) was collected from 5- and 11-m depths by filtration through a 20-μm prefilter into Sterivex cartridges (0.22 μm filter) and the DNA extracted and sequenced as previously described  (Additional file 1: Table S1). The QC filtered and error-corrected reads (BFC v181)  from the AL, EF, and TB metagenomes were assembled using metaSPAdes [110, 111] and annotated through IMG (Additional file 1: Table S1). The IMG pipeline generated Ca. Chlorobium antarcticum MAGs, of which we used 50 AL, seven EF, and two TB MAGs (one MAG per metagenome) that were medium to high quality and > 50% genome completeness; the MAGs with their respective metagenomes are available in IMG (see IMG Bin IDs in Additional file 1: Table S2; Additional file 2: Dataset S1). For MIMAG (minimum information about MAGs)  data preparation, MAG quality data and metadata were obtained from IMG, except MAG N50 and L50 contig statistics which were generated using Quast v5.0.2  (Additional file 2: Dataset S1). Chlorobium OTU abundances from AL were calculated previously . Contig taxonomy assignments, Chlorobium OTU bin refinement, abundance calculations, and alpha diversity (Simpson’s index of diversity) from EF and TB metagenomes were determined as previously described .
Ca. Chlorobium antarcticum genomic variation
The metagenome reads from the oxic-anoxic interface of AL, EF, and TB were used for FR analyses of Ca. Chlorobium antarcticum (Additional file 1: Table S14). The AL metagenomes used were all Illumina data and represented two sampling periods (2008 and 2013–2014), including different seasons: summer (Dec 2014), winter (Jul and Aug 2014), and spring (Nov 2008, Nov 2013, Oct 2014). AL metagenomes from 2006 were not included due to possible bias caused by differences in dataset size (2006, ≤ 500 million bases; 2008 and 2013/2014, ≥ 3 billion bases) and sequencing technology used (2006, Sanger and 454; 2008 and 2013/2014, Illumina). However, it is noteworthy that Chlorobium abundance in AL in 2006 was previously shown to be comparable to 2008 and 2013/2014 , so inferences from this study are likely to apply to the 2006 population.
The AL and EF reads from the three filters from a specific time period and depth were pooled and converted to multi-FASTA format using an in-house script, thereby facilitating comparative analyses between AL and EF metagenomes (biomass in the size range 0.1–20 μm) with TB metagenomes (0.22–20-μm biomass size range) (Additional file 1: Table S14). For the analysis of genomic variation within the AL Ca. Chlorobium antarcticum population, the MAG from Dec 2014, 19-m depth, 0.1-μm filter was used (AL_ref MAG). For analyses between AL, EF, and TB, the EF Ca. Chlorobium antarcticum MAG from 45-m depth, 3-μm filter was used (EF_ref MAG). The two MAGs were selected because they had the highest total base pair count and > 99% genome completeness. To determine the Ca. Chlorobium antarcticum MAG contig arrangement that best represents a draft genome, the AL_ref MAG and EF_ref MAG contigs were organised in Mauve v2.4.0  using Cpv-DSM265 as the reference genome with default parameters. Contigs were subsequently manually reordered by comparing nucleotide sequences from AL, EF, and TB using the blastn module of BLAST+ v2.9.0  and considering only ≥ 500-bp alignment length matches of 100% identity. Arising from this, MAG contigs were grouped into scaffolds (Additional file 1: Table S3).
The metagenome reads were aligned to AL_ref MAG or EF_ref MAF using BBMap v38.51  with 95% minimum alignment identity (minid = 0.95), generating SAM files. The BAM and BAI alignment and index files were created from SAM files using Samtools v1.10  and were used for SNP analysis in IGV . Only the SNPs with variant frequency ≥ 0.9 (i.e., at least 90% of the reads aligned at the position containing the SNP) were considered fixed mutations, similar to a previously described method . The total number of aligned reads and the base coverages of AL_ref MAG and EF_ref MAG were calculated using the “flagstat” and “depth” functions of Samtools, respectively. To identify LCRs, the base coverages of AL_ref MAG and EF_ref MAG in metagenomes from AL, EF, and TB were plotted on circos plots using R v4.0.2. The LCRs that spanned multiple adjacent contigs on a scaffold were considered a single LCR (Additional file 1: Table S4); for example, LCR5 spanned contigs A13–A17 from AL_ref MAG and contigs E14–E15 from EF_ref MAG. The IMG auto-annotated genes identified in LCRs were manually annotated by aligning the protein sequences to reference proteins from the UniProtKB/Swiss-Prot database using the ExPASy BLAST+ online service , and those with poor alignment or no hits were realigned to reference proteins in the UniProtKB database or RefSeq protein database using the NCBI blastp suite .
For comparison of gene order between AL_ref MAG and other AL, EF, and TB high-quality MAGs of ≥ 99% genome completeness, the MAG contigs were aligned using the blastn module of BLAST+ v2.9.0. The alignments were manually parsed to assess the gene order of MAGs compared to that of AL_ref MAG, and MAG contigs that did not align, had lower identity matches (< 80%) or short length matches (< 1 kb) were identified (Additional file 3: Dataset S2).
GC content vs read depth analysis
Based on an approach previously reported for analysing haloarchaea , metagenome contigs of length ≥ 1 kb, and 30–70% GC content, and Ca. Chlorobium antarcticum MAG contigs from AL, EF, and TB, were plotted in a GC content-read depth 2D space using Python v3.6.4. The metagenome contig clusters placed close to the Ca. Chlorobium antarcticum MAG contig cluster that had a GC content ranging from 35–65% and read depth up to 7500 and length ≥ 10 kb, were selected for taxonomic analysis. The contigs were aligned to the Ca. Chlorobium antarcticum MAGs and Cpv-DSM265 genome. The alignment files were manually parsed to identify cluster contigs with low identity and high query alignment fraction (≥ 5 kb), and their taxonomies were determined using the IMG Phylodist file-based contig taxonomies, as described previously . Some small clusters of metagenome contigs were from Ca. Chlorobium antarcticum (Additional file 1: Fig. S2c), with 100% identity to Ca. Chlorobium antarcticum MAG contigs. These metagenome contigs likely belonged to two incomplete Ca. Chlorobium antarcticum MAGs (60% and 66% bin completeness) generated from 0.8–3- and 0.1–0.8-μm filter Nov 2008 spring metagenomes, respectively, from AL oxic-anoxic interface.
Ca. Chlorobium antarcticum phylotype abundance
The Ca. Chlorobium antarcticum population containing a “region of interest” (specific LCR, gene, or gene cluster) was determined from the relative coverages of the corresponding region, calculated using the formula:
where Region is the region of interest and MAG is AL_ref MAG or EF_ref MAG. The numerator indicates the mean read depth of the region of interest in a metagenome and the denominator refers to the mean read depth of the MAG in the metagenome.
The mean read depths were calculated using the formula:
where Region is the region of interest and MAG is AL_ref MAG or EF_ref MAG. The numerator indicates the sum of the read depths of the bases in a region of interest or MAG, calculated in each metagenome. The denominator indicates the total number of bases in the region of interest or MAG.
The approximate percentage of the Ca. Chlorobium antarcticum population containing a region of interest, in a season (summer, winter, spring) or a system (AL, EF, TB) were determined by averaging the percentages calculated in metagenomes from a season or a system, respectively. To assess the significance of the differences in summer and winter coverages of LCR genes of AL_ref MAG, the DESeq2 R package  was used with gene read depths from all time periods. The result for summer and winter comparison was generated using the “contrast” option of DESeq2 result function. DESeq2 method uses Wald test to calculate the P-value for significance analysis and uses Benjamini-Hochberg adjustment to calculate adjusted P-value for assessing significance considering a specific false discovery rate (i.e., the fraction of false positives amongst the significant values). Here, P-values < 0.05 were considered significant at the 95% significance level. BH-adjusted P-values < 0.05 were regarded as significant, considering a 5% fraction of false positives as acceptable (Additional file 1: Tables S5 and S6).
Comparative analysis of Ca. Chlorobium antarcticum and Cpv-DSM265
A total of 31 AL, five EF and two TB Ca. Chlorobium antarcticum MAGs with ≥ 99% genome completeness were aligned to the Cpv-DSM265 genome (RefSeq ID: NC_009337.1) using the blastn module of BLAST+ v2.9.0 and Samtools v1.10, generating SAM, BAM, and BAI files. The alignments were analysed using IGV to assess the types of variations (indels or SNPs) in MAG sequences. The auto-annotated genes on MAG contigs or Cpv-DSM265 genome that showed no alignment were assessed. To identify cobalamin riboswitch sequences in Ca. Chlorobium antarcticum, four cobalamin riboswitch genes from the Cpv-DSM265 genome were aligned to AL_ref MAG contigs using the NCBI blastn suite . The Ca. Chlorobium antarcticum cobalamin riboswitch sequences were verified, and additional cobalamin riboswitch sequences were identified, using the Rfam database [122, 123] (Additional file 1: Table S6). The overall functional potential of Cpv-DSM265 and Ca. Chlorobium antarcticum MAGs from AL (AL_ref MAG), EF (EF_ref MAG), and TB (MAG from TB 11-m depth metagenome) were compared using COG number data generated by IMG. The COG numbers were categorized using COG reference data from IMG (database accessed on 21 December 2020). Genes with COG number assignments belonging to more than one COG category were assigned to multiple categories (Additional file 1: Fig. S4).
ANI, AAI, and phylogenetic analyses
The pair-wise ANI of Ca. Chlorobium antarcticum MAGs, as well as ANI against the Cpv-DSM265 genome were calculated using pyani . The AAI of MAGs was calculated using the AAI-profiler online service , which compared the input protein sequences with the proteins of species in the UniProt database . The phylogenetic analysis of Ca. Chlorobium antarcticum was performed using the 16S rRNA gene and FmoA protein sequences from AL, EF, and TB MAGs, as well as various members of the Chlorobiaceae family (Additional file 1: Table S15). The 16S rRNA genes were aligned using the ClustalW algorithm and FmoA proteins were aligned using the Neighbour Joining cluster method of the MUSCLE algorithm in MEGA X v10.1.7 . The alignments were used for generating maximum likelihood trees in MEGA using default parameters and 1000 bootstrap values.
The proportion of the Chlorobium population that was represented by Ca. Chlorobium antarcticum in the AL, EF, and TB oxic-anoxic interface metagenomes was estimated by aligning AL, EF and TB metagenome reads to the Ca. Chlorobium antarcticum 16S rRNA gene from EF_ref MAG using BBMap v38.51 and Samtools (see above in “Ca. Chlorobium antarcticum genomic variation”). The default minid was used for alignment with BBMap. SNPs with variant frequency ≥ 0.01 (i.e., at least 1% of the reads aligned at the position containing the SNP) were considered during analysis in IGV (Additional file 3: Dataset S2).
Assessment of the endemism of Ca. Chlorobium antarcticum to the Vestfold Hills was performed by comparing Ca. Chlorobium antarcticum marker (16S rRNA gene and FmoA protein) sequences to available metagenome and genome data in IMG. The Ca. Chlorobium antarcticum 16S rRNA gene was aligned to the IMG databases of 16S rRNA genes from public-assembled metagenomes (accessed on 14 Mar 2021) and public isolates (accessed on 30 Mar 2021) using the IMG RNA BLAST (blastn) online service with e-value 10−5. The Ca. Chlorobium antarcticum FmoA protein sequence was aligned to the IMG isolate protein database (including proteins from isolate genomes, MAGs, and single-amplified genomes; accessed on 14 Mar 2021) using the IMG RNA BLAST (blastp) online service with e-value 10−5.
Ca. Chlorobium antarcticum defence genes and associated viruses
The AL, EF, and TB Ca. Chlorobium antarcticum MAG genes were manually parsed to identify those associated with defence, such as R-M, DISARM, BREX, and T-A (specifically ABI mechanism) systems. The putative defence genes were manually annotated (see above in “Ca. Chlorobium antarcticum genomic variation”).
The potential viruses associated with EF and TB Ca. Chlorobium antarcticum were determined using the CRISPR spacers and repeats in metagenome IMG CRISPR annotation files, as well as the data in an Antarctic virus catalogue and IMG/VR spacer database, as described previously . The Antarctic virus catalogue contained a list of viral contigs identified in a range of Antarctic metagenomes, along with their viral cluster or singleton designations, and the IMG/VR spacer database contained a list of spacer sequences and their matches to host contigs ; the construction of these two databases was described previously . The databases did not include TB metagenome data as these metagenomes were not available at the time the databases were created. To identify TB viral contigs, all TB assembled contigs were aligned to the Antarctic virus catalogue using the blastn module of BLAST+ v2.9.0, with e-value 10−3 and ≥ 97% alignment identity. A total of 995 TB contigs with ≥ 1000-bp alignment length and 100% identity across the whole length of either the query contig or the reference viral contig were considered to be TB viral contigs; this approach to identifying TB viral contigs from matches to the Antarctic virus catalogue is not as rigorous as might be achieved using the virus identification pipeline .
The Ca. Chlorobium antarcticum CRISPR spacers in EF and TB metagenomes were identified from the Ca. Chlorobium antarcticum MAGs and Chlorobium OTU refined bins (Additional file 1: Table S10). CRISPR arrays tended to be present at the ends of contigs, possibly indicative of assembly constraints caused by sequence repeats. To potentially capture a greater number of spacers, TB MAGs derived from assembly of non-error corrected reads (IMG Genome IDs: 3300038786, 3300039186) were also analysed. The viral contigs potentially associated with EF and TB Ca. Chlorobium antarcticum were determined by aligning the Chlorobium spacer sequences to viral contigs in the Antarctic virus catalogue and to TB viral contigs using the ‘megablast’ option of BLAST+ v2.9.0, with e-value 10−3 and ≥ 97% alignment identity. The data in the Antarctic virus catalogue were used to assign viral cluster or singleton designations to the potential Ca. Chlorobium antarcticum viral contigs. This approach to assessing virus-host relationships was described previously .
Pfennig N, Trüper HG. Higher taxa of the phototrophic bacteria. Int J Syst Bacteriol. 1971;21:17–8.
Sakurai H, Ogawa T, Shiga M, Inoue K. Inorganic sulfur oxidizing system in green sulfur bacteria. Photosynth Res. 2010;104:163–76.
Tang KH, Blankenship RE. Both forward and reverse TCA cycles operate in green sulfur bacteria. J Biol Chem. 2010;285:35848–54.
Eisen JA, Nelson KE, Paulsen IT, Heidelberg JF, Wu M, Dodson RJ, et al. The complete genome sequence of Chlorobium tepidum TLS, a photosynthetic, anaerobic, green-sulfur bacterium. Proc Natl Acad Sci U S A. 2002;99:9509–14.
Blankenship RE, Matsuura K. Antenna complexes from green photosynthetic bacteria. In: Green BR, Parson WW, editors. Light-harvesting antennas in photosynthesis. Advances in Photosynthesis and Respiration, vol. 13; 2003. p. 195–217.
Chen JH, Wu H, Xu C, Liu XC, Huang Z, Chang S, et al. Architecture of the photosynthetic complex from a green sulfur bacterium. Science. 2020;370:eabb6350.
Herbert RA, Tanner AC. The isolation and some characteristics of photosynthetic bacteria (Chromatiaceae and Chlorobiaceae) from Antarctic marine sediments. J Appl Microbiol. 1977;43:437–45.
Burke CM, Burton HR. Photosynthetic bacteria in meromictic lakes and stratified fjords of the Vestfold Hills, Antarctica. Hydrobiologia. 1988;65:13–23.
Burke CM, Burton HR. The ecology of photosynthetic bacteria in Burton Lake, Vestfold Hills, Antarctica. Hydrobiologia. 1988;165:1–11.
Van Gemerden H, Mas J. Ecology of phototrophic sulfur bacteria. In: Blankenship RE, Madigan MT, Bauer CE, editors. Anoxygenic photosynthetic bacteria. The Netherlands: Kluwer Academic Publishers; 1995. p. 49–85.
Beatty JT, Overmann J, Lince MT, Manske AK, Lang AS, Blankenship RE, et al. An obligately photosynthetic bacterial anaerobe from a deep-sea hydrothermal vent. Proc Natl Acad Sci U S A. 2005;102:9306–10.
Roeselers G, Norris TB, Castenholz RW, Rysgaard S, Glud RN, Kühl M, et al. Diversity of phototrophic bacteria in microbial mats from Arctic hot springs (Greenland). Environ Microbiol. 2007;9:26–38.
Ng C, DeMaere MZ, Williams TJ, Lauro FM, Raftery M, Gibson JAE, et al. Metaproteogenomic analysis of a dominant green sulfur bacterium from Ace Lake, Antarctica. ISME J. 2010;4:1002–19.
Lauro FM, DeMaere MZ, Yau S, Brown MV, Ng C, Wilkins D, et al. An integrative study of a meromictic lake ecosystem in Antarctica. ISME J. 2011;5:879–95.
Comeau AM, Harding T, Galand PE, Vincent WF, Lovejoy C. Vertical distribution of microbial communities in a perennially stratified Arctic lake with saline, anoxic bottom waters. Sci Rep. 2012;2:604.
Imhoff JF. Biology of green sulfur bacteria. In: eLS. John Wiley & Sons, Ltd. Chichester. 2014. DOI: https://doi.org/10.1002/9780470015902.a0000458.pub2.
Llorens Marès T, Liu Z, Allen LZ, Rusch DB, Craig MT, Dupont CL, et al. Speciation and ecological success in dimly lit waters: horizontal gene transfer in a green sulfur bacteria bloom unveiled by metagenomic assembly. ISME J. 2017;11:201–11.
Grouzdev DS, Lunina ON, Gaisin VA, Krutkina MS, Baslerov RV, Savvichev AS, et al. Genome sequences of green- and brown-colored strains of Chlorobium phaeovibrioides with gas vesicles. Microbiol Resour Announc. 2019;8:e00711–9.
Panwar P, Allen MA, Williams TJ, Hancock AM, Brazendale S, Bevington J, et al. Influence of the polar light cycle on seasonal dynamics of an Antarctic lake microbial community. Microbiome. 2020;8:116.
Caumette P. Distribution and characterization of phototrophic bacteria isolated from the water of Bietri Bay (Ebrie Lagoon, Ivory Coast). Can J Microbiol. 1984;30:273–84.
Miracle MR, Vicente E. Phytoplankton and photosynthetic sulphur bacteria production in the meromictic coastal lagoon of Cullera (Valencia, Spain). Verhandlungen des Internationalen Verein Limnologie. 1985;22:2214–20.
Chapin B, Denoyelles F, Gaham DW, Smith VH. A deep maximum of green sulphur bacteria (‘Chlorochromatium aggregatum’) in a strongly stratified reservoir. Freshw Biol. 2004;49:1337–54.
Coolen MJL, Muyzer G, Schouten S, Volkman JK, Damsté JSS. Sulfur and methane cycling during the Holocene in Ace Lake (Antarctica) revealed by lipid and DNA stratigraphy. In: Neretin L, editor. Past and present water column anoxia. NATO Science Series: IV: Earth and Environmental Sciences, vol. 64. Dordrecht: Springer; 2006. p. 41–65.
Frigaard NU, Bryant DA. Seeing green bacteria in a new light: genomics-enabled studies of the photosynthetic apparatus in green sulfur bacteria and filamentous anoxygenic phototrophic bacteria. Arch Microbiol. 2004;182:265–76.
Bryant DA, Liu Z, Li T, Zhao F, Costas AMG, Klatt CG, et al. Comparative and functional genomics of anoxygenic green bacteria from the taxa Chlorobi, Chloroflexi, and Acidobacteria. In: Burnap RL, Vermaas WFJ, editors. Functional genomics and evolution of photosynthetic systems. Advances in Photosynthesis and Respiration, vol. 33. New York: Springer; 2012. p. 47–102.
Liu Z, Klatt CG, Ludwig M, Rusch DB, Jensen SI, Kühl M, et al. ‘Candidatus Thermochlorobacter aerophilum’: an aerobic chlorophotoheterotrophic member of the phylum Chlorobi defined by metagenomics and metatranscriptomics. ISME J. 2012;6:1869–82.
Cavicchioli R. Microbial ecology of Antarctic aquatic systems. Nat Rev Microbiol. 2015;13:691–706.
Burch MD. Annual cycle of phytoplankton in Ace Lake, an ice covered, saline meromictic lake. Hydrobiol. 1988;165:59–75.
Gibson JAE. The meromictic lakes and stratified marine basins of the Vestfold Hills, East Antarctica. Antarct Sci. 1999;11:175–92.
Rankin LM, Gibson JAE, Franzmann PD, Burton HR. The chemical stratification and microbial communities of Ace Lake: a review of the characteristics of a marine derived meromictic lake. Polarforschung. 1999;66:33–52.
Gallagher JB, Burton HR. Seasonal mixing of Ellis Fjord, Vestfold Hills, East Antarctica. Estuar Coast Shelf Sci. 1988;27:363–80.
Gallagher JB, Burton HR, Calf GE. Meromixis in an Antarctic fjord: a precursor to meromictic lakes on an isostatically rising coastline. Hydrobiologia. 1989;172:235–54.
USGS Antarctic Research Atlas. https://lima.usgs.gov/antarctic_research_atlas/ (2007). .
Laybourn-Parry J, Bell EM. Ace Lake: three decades of research on a meromictic, Antarctic lake. Polar Biol. 2014;37:1685–99.
Grey J, Laybourn-Parry J, Leakey RJG, McMinn A. Temporal patterns of protozooplankton abundance and their food in Ellis Fjord, Princess Elizabeth Land, Eastern Antarctica. Estuar Coast Shelf Sci. 1997;45:17–25.
Beaumont KL. Planktonic interactions and particulate flux in Ellis Fjord, East Antarctica. PhD thesis. Hobart: University of Tasmania; 2003.
McMinn A, Bleakley N, Steinburner K, Roberts D, Trenerry L. Effect of permanent sea ice cover and different nutrient regimes on the phytoplankton succession of fjords of the Vestfold Hills Oasis, Eastern Antarctica. J Plankton Res. 2000;22:287–303.
DeMaere MZ, Williams TJ, Allen MA, Brown MV, Gibson JAE, Rich J, et al. High level of intergenera gene exchange shapes the evolution of haloarchaea in an isolated Antarctic lake. Proc Natl Acad Sci U S A. 2013;110:16939–44.
Tschitschko B, Williams TJ, Allen MA, Páez-Espino D, Kyrpides N, Zhong L, et al. Antarctic archaea–virus interactions: metaproteome-led analysis of invasion, evasion and adaptation. ISME J. 2015;9:2094–107.
Tschitschko B, Williams TJ, Allen MA, Zhong L, Raftery MJ, Cavicchioli R. Ecophysiological distinctions of Haloarchaea from a hypersaline Antarctic lake as determined by metaproteomics. Appl Environ Microbiol. 2016;82:3165–73.
Tschitschko B, Erdmann S, DeMaere MZ, Roux S, Panwar P, Allen MA, et al. Genomic variation and biogeography of Antarctic haloarchaea. Microbiome. 2018;6:113.
Imhoff JF. Phylogenetic taxonomy of the family Chlorobiaceae on the basis of 16S rRNA and fmo (Fenna–Matthews–Olson protein) gene sequences. Int J Syst Evol Microbiol. 2003;53:941–51.
Masuda N, Nakaya S, Burton HR, Torii T. Trace element distribution in some saline lakes of the Vestfold Hills, Antarctica. Hydrobiologia. 1988;165:103–14.
Hogle SL, Thrash JC, Dupont CL, Barbeaua KA. Trace metal acquisition by marine heterotrophic bacterioplankton with contrasting trophic strategies. Appl Environ Microbiol. 2016;82:1613–24.
Von Ballmoos C, Cook GM, Dimroth P. Unique rotary ATP synthase and its biological diversity. Annu Rev Biophys. 2008;37:43–64.
Dibrova DV, Galperin MY, Mulkidjanian AY. Characterization of the N-ATPase, a distinct, laterally transferred Na+-translocating form of the bacterial F-type membrane ATPase. Bioinformatics. 2010;26:1473–6.
Schulz S, Wilkes M, Mills DJ, Kühlbrandt W, Meier T. Molecular architecture of the N-type ATPase rotor ring from Burkholderia pseudomallei. EMBO Rep. 2017;18:526–35.
Koumandou VL, Kossida S. Evolution of the F0F1 ATP synthase complex in light of the patchy distribution of different bioenergetic pathways across prokaryotes. PLoS Comput Biol. 2014;10:e1003821.
Niu Y, Moghimyfiroozabad S, Safaie S, Yang Y, Jonas EA, Alavian KN. Phylogenetic profiling of mitochondrial proteins and integration analysis of bacterial transcription units suggest evolution of F1F0 ATP synthase from multiple modules. J Mol Evol. 2017;85:219–33.
Frigaard NU, Bryant DA. Genomic insights into the sulfur metabolism of phototrophic green sulfur bacteria. In: Hell R, Dahl C, Knaff D, Leustek T, editors. Sulfur metabolism in phototrophic organisms. Advances in Photosynthesis and Respiration, vol. 27. Dordrecht: Springer; 2008. p. 337–55.
Gregersen LH, Bryant DA, Frigaard NU. Mechanisms and evolution of oxidative sulfur metabolism in green sulfur bacteria. Front Microbiol. 2011;2:116.
Allen MA, Lauro FM, Williams TJ, Burg D, Siddiqui KS, De Francisci D, et al. The genome sequence of the psychrophilic archaeon, Methanococcoides burtonii: the role of genome evolution in cold adaptation. ISME J. 2009;3:1012–35.
Lim J, Thomas T, Cavicchioli R. Low temperature regulated DEAD-box RNA helicase from the Antarctic archaeon, Methanococcoides burtonii. J Mol Biol. 2000;297:553–67.
Williams TJ, Lauro FM, Ertan H, Burg DW, Poljak A, Raftery MJ, et al. Defining the response of a microorganism to temperatures that span its complete growth temperature range (-2°C to 28°C) using multiplex quantitative proteomics. Environ Microbiol. 2011;13:2186–203.
Makarova KS, Wolf YI, Iranzo J, Shmakov JA, Alkhnbashi OS, Brouns SJJ, et al. Evolutionary classification of CRISPR–Cas systems: a burst of class 2 and derived variants. Nat Rev Microbiol. 2020;18:67–83.
Romine MF, Rodionov DA, Maezato Y, Anderson LN, Nandhikonda P, Rodionova IA, et al. Elucidation of roles for vitamin B12 in regulation of folate, ubiquinone, and methionine metabolism. Proc Natl Acad Sci U S A. 2017;114:E1205–14.
Shelton AN, Seth EC, Mok KC, Han AW, Jackson SN, Haft DR, et al. Uneven distribution of cobamide biosynthesis and dependence in bacteria predicted by comparative genomics. ISME J. 2019;13:789–804.
BioCyc. https://biocyc.org/ (2011). MetaCyc Pathway: adenosylcobalamin biosynthesis accessed between Dec 2020 and Jan 2021.
Karp PD, Billington R, Caspi R, Fulcher CA, Latendresse M, Kothari A, et al. The BioCyc collection of microbial genomes and metabolic pathways. Brief Bioinform. 2017. https://doi.org/10.1093/bib/bbx085.
Hazra AB, Han AW, Mehta AP, Mok KC, Osadchiy V, Begley TP, et al. Anaerobic biosynthesis of the lower ligand of vitamin B12. Proc Natl Acad Sci U S A. 2015;112:10792–7.
Rodionov DA, Vitreschak AG, Mironov AA, Gelfand MS. Comparative genomics of the vitamin B12 metabolism and regulation in prokaryotes. J Biol Chem. 2003;278:41148–59.
Rodionov DA, Hebbeln P, Gelfand MS, Eitinger T. Comparative and functional genomic analysis of prokaryotic nickel and cobalt uptake transporters: evidence for a novel group of ATP-binding cassette transporters. J Bacteriol. 2006;188:317–27.
Woodson JD, Zayas CL, Escalante-Semerena JC. A new pathway for salvaging the coenzyme B12 precursor cobinamide in archaea requires cobinamide-phosphate synthase (CbiB) enzyme activity. J Bacteriol. 2003;185:7193–201.
Woodson JD, Escalante-Semerena JC. CbiZ, an amidohydrolase enzyme required for salvaging the coenzyme B12 precursor cobinamide in archaea. Proc Natl Acad Sci U S A. 2004;101:3591–6.
Gray MJ, Tavares NK, Escalante-Semerena JC. The genome of Rhodobacter Sphaeroides strain 2.4.1 encodes functional cobinamide salvaging systems of archaeal and bacterial origins. Mol Microbiol. 2008;70:824–36.
Gray MJ, Escalante-Semerena JC. The cobinamide amidohydrolase (cobyric acid-forming) CbiZ enzyme: a critical activity of the cobamide remodeling system of Rhodobacter sphaeroides. Mol Microbiol. 2009;74:1198–210.
Taga ME, Larsen NA, Howard-Jones AR, Walsh CT, Walker GC. BluB cannibalizes flavin to form the lower ligand of vitamin B12. Nature. 2007;446:449–53.
Anderson PJ, Lango J, Carkeet C, Britten A, Kräutler B, Hammock BD, et al. One pathway can incorporate either adenine or dimethylbenzimidazole as an α-axial ligand of B12 cofactors in Salmonella enterica. J Bacteriol. 2008;190:1160–71.
Helliwell KA, Lawrence AD, Holzer A, Kudahl UJ, Sasso S, Kräutler B, et al. Cyanobacteria and eukaryotic algae use different chemical variants of Vitamin B12. Curr Biol. 2016;26:999–1008.
Crouzet J, Cameron B, Cauchois L, Rigault S, Blanche F, Guilhot C, et al. Genetic and sequence analyses of a Pseudomonas denitrificans DNA fragment containing two cob genes. J Bacteriol. 1991;173:6058–65.
Debussche L, Couder M, Thibaut D, Cameron B, Crouzet J, Blanche F. Assay, purification, and characterization of cobaltochelatase, a unique complex enzyme catalyzing cobalt insertion in hydrogenobyrinic acid a,c-diamide during coenzyme B12 biosynthesis in Pseudomonas denitrificans. J Bacteriol. 1992;174:7445–51.
Bollivar DW, Suzuki JY, Beatty JT, Dobrowolski JM, Bauer CE. Directed mutational analysis of bacteriochlorophyll A biosynthesis in Rhodobacter capsulatus. J Mol Biol. 1994;237:622–40.
Petersen BL, Jensen PE, Gibson LC, Stummann BM, Hunter CN, Henningsen KW. Reconstitution of an active magnesium chelatase enzyme complex from the bchI, -D, and -H gene products of the green sulfur bacterium Chlorobium vibrioforme in Escherichia coli. J Bacteriol. 1998;180:699–704.
Willows RD, Al-Karadaghi S, Hansson M, Fodje MN, Hansson A, Olsen JG, et al. Interplay between an AAA module and an integrin I domain may regulate the function of magnesium chelatase. J Mol Biol. 2001;311:111–22.
Chew AGM, Frigaard NU, Bryant DA. Mutational analysis of three bchH paralogs in (bacterio-)chlorophyll biosynthesis in Chlorobaculum tepidum. Photosynth Res. 2009;101:21–34.
Johnson ET, Dannert CS. Characterization of three homologs of the large subunit of the magnesium chelatase from Chlorobaculum tepidum and interaction with the magnesium protoporphyrin IX methyltransferase. J Biol Chem. 2008;283:27776–84.
Watanabe F, Katsura H, Takenaka S, Fujita T, Abe K, Tamura Y, et al. Pseudovitamin b12 is the predominant cobamide of an algal health food, spirulina tablets. J Agric Food Chem. 1999;47:4736–41.
Miyamoto E, Tanioka Y, Nakao T, Barla F, Inui H, Fujita T, et al. Purification and characterization of a corrinoid-compound in an edible cyanobacterium Aphanizomenon flos-aquae as a nutritional supplementary food. J Agric Food Chem. 2006;54:9604–7.
Watanabe F, Miyamoto E, Fujita T, Tanioka Y, Nakano Y. Characterization of a corrinoid compound in the edible (blue-green) alga, Suizenji-nori. Biosci Biotechnol Biochem. 2006;70:3066–8.
Watanabe F, Tanioka Y, Miyamoto E, Fujita T, Takenaka H, Nakano Y. Purification and characterization of corrinoid-compounds from the dried powder of an edible cyanobacterium, Nostoc commune (Ishikurage). J Nutr Sci Vitaminol (Tokyo). 2007;53:183–6.
Powell LM, Bowman JP, Skerratt JH, Franzamnn PD, Burton HR. Ecology of a novel Synechococcus clade occurring in dense populations in saline Antarctic lakes. Mar Ecol Prog Ser. 2005;291:65–80.
Cadieux N, Bradbeer C, Reeger-Schneider E, Köster W, Mohanty AK, Wiener MC, et al. Identification of the periplasmic cobalamin-binding protein BtuF of Escherichia coli. J Bacteriol. 2002;184:706–17.
Santos JA, Rempel S, Mous STM, Pereira CT, Ter Beek J, De Gier JW, et al. Functional and structural characterization of an ECF-type ABC transporter for vitamin B12. eLife. 2018;7:e35828.
Pieńko T, Trylska J. Extracellular loops of BtuB facilitate transport of vitamin B12 through the outer membrane of E. coli. PLoS Comput Biol. 2020;16:e1008024.
Urbanowski ML, Stauffer LT, Plamann LS, Stauffer GV. A new methionine locus, metR, that encodes a trans-Acting protein required for activation of metE and metH in Escherichia coli and Salmonella typhimurium. J Bacteriol. 1987;169:1391–7.
Franklund CV, Kadner RJ. Multiple transcribed elements control expression of the Escherichia coli btuB gene. J Bacteriol. 1997;179:4039–42.
Nou X, Kadner RJ. Adenosylcobalamin inhibits ribosome binding to btuB RNA. Proc Natl Acad Sci U S A. 2000;97:7190–5.
Nahvi A, Sudarsan N, Ebert MS, Zou X, Brown KL, Breaker RR. Genetic control by a metabolite binding mRNA. Chem Biol. 2002;9:1043–9.
Vitreschak AG, Rodionov DA, Mironov AA, Gelfand MS. Regulation of the vitamin B12 metabolism and transport in bacteria by a conserved RNA structural element. RNA. 2003;9:1084–97.
Borovok I, Gorovitz B, Schreiber R, Aharonowitz Y, Cohen G. Coenzyme B12 controls transcription of the Streptomyces Class Ia ribonucleotide reductase nrdABS operon via a riboswitch mechanism. J Bacteriol. 2006;188:2512–20.
Barrick JE, Breaker RR. The distributions, mechanisms, and structures of metabolite-binding riboswitches. Genome Biol. 2007;8:R239.
Warner DF, Savvi S, Mizrahi V, Dawes SS. A riboswitch regulates expression of the coenzyme B12-independent methionine synthase in Mycobacterium tuberculosis: Implications for differential methionine synthase function in strains H37Rv and CDC1551. J Bacteriol. 2007;189:3655–9.
Li J, Ge Y, Zadeh M, Curtiss R III, Mohamadzadeh M. Regulating vitamin B12 biosynthesis via the cbiMCbl riboswitch in Propionibacterium strain UF1. Proc Natl Acad Sci U S A. 2020;117:602–9.
Dy RL, Przybilski R, Semeijn K, Salmond GPC, Fineran PC. A widespread bacteriophage abortive infection system functions through a Type IV toxin–antitoxin mechanism. Nucleic Acids Res. 2014;42:4590–605.
Elena SF, Agudelo-Romero P, Lalić J. The evolution of viruses in multi-host fitness landscapes. Open Virol J. 2009;3:1–6.
Zborowskya S, Lindell D. Resistance in marine cyanobacteria differs against specialist and generalist cyanophages. Proc Natl Acad Sci U S A. 2019;116:16899–908.
Zhao L, Duffy S. Gauging genetic diversity of generalists: a test of genetic and ecological generalism with RNA virus experimental evolution. Virus Evol. 2019;5:vez019.
Bowman JP, McCammon SA, Rea SM, McMeekin TA. The microbial composition of three limnologically disparate hypersaline Antarctic lakes. FEMS Microbiol Lett. 2000;183:81–8.
Yau S, Lauro FM, Williams TJ, DeMaere MZ, Brown MV, Rich J, et al. Metagenomic insights into strategies of carbon conservation and unusual sulfur biogeochemistry in a hypersaline Antarctic lake. ISME J. 2013;7:1944–61.
Sato K, Ishida K, Kuno T, Mizuno A, Shimizu S. Regulation of vitamin B12 and bacteriochlorophyll biosynthesis in a facultative methylotroph, Protaminobacter ruber. J Nutr Sci Vitaminol (Tokyo). 1981;27:439–47.
Fuhrmann S, Overmann J, Pfennig N, Fischer U. Influence of vitamin B12 and light on the formation of chlorosomes in green- and brown-colored Chlorobium species. Arch Microbiol. 1993;160:193–8.
Berg M, Goudeau D, Olmsted C, McMahon KD, Thweatt J, Bryant D, et al. Host population diversity as a driver of viral infection cycle in wild populations of green sulfur bacteria with long standing virus-host interactions. ISME J. 2021. https://doi.org/10.1038/s41396-020-00870-1.
Cary SC, McDonald IR, Barrett JE, Cowan DA. On the rocks: the microbiology of Antarctic Dry Valley soils. Nat Rev Microbiol. 2010;8:129–38.
Rintoul SR, Chown SL, DeConto RM, England MH, Fricker HA, Masson-Delmotte V, et al. Choosing the future of Antarctica. Nature. 2018;558:233–41.
Ji M, Greening C, Vanwonterghem I, Carere CR, Bay SK, Steen JA, et al. Atmospheric trace gases support primary production in Antarctic desert surface soil. Nature. 2017;552:400–3.
Williams TJ, Allen MA, DeMaere MZ, Kyrpides NC, Tringe SG, Woyke T, et al. Microbial ecology of an Antarctic hypersaline lake: genomic assessment of ecophysiology among dominant haloarchaea. ISME J. 2014;8:1645–58.
Cavicchioli R, Ripple WJ, Timmis KN, Azam F, Bakken LR, Baylis M, et al. Scientists’ warning to humanity: microorganisms and climate change. Nat Rev Microbiol. 2019;17:569–86.
Williams TJ, Allen MA, Ivanova N, Huntemann M, Haque S, Hancock AM, et al. Genome analysis of a verrucomicrobial endosymbiont with a tiny genome discovered in an Antarctic lake. Front Microbiol. 2021;12:674758.
Li H. BFC: correcting Illumina sequencing errors. Bioinformatics. 2015;31:2885–7.
Nurk S, Bankevich A, Antipov D, Gurevich AA, Korobeynikov A, Lapidus A, et al. Assembling single-cell genomes and mini-metagenomes from chimeric MDA products. J Comput Biol. 2013;20:714–37.
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. MetaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27:824–34.
Bowers RM, Kyrpides NC, Stepanauskas R, Smith MH, Doud D, Reddy TBK, et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat Biotechnol. 2017;35:725–31.
Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072–5.
Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14:1394–403.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421.
JGI BBMap Guide. https://jgi.doe.gov/data-and-tools/bbtools/bb-tools-user-guide/bbmap-guide/ (2014). Accessed in Oct 2020.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map (SAM) format and SAMtools. Bioinformatics. 2009;25:2078–9.
Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative Genomics Viewer. Nat Biotechnol. 2011;29:24–6.
ExPASy BLAST. https://web.expasy.org/blast/ (1993). UniProtKB/Swiss-Prot database accessed between Oct 2020 and May 2021.
NCBI BLAST. https://blast.ncbi.nlm.nih.gov/Blast.cgi (1994). UniProtKB and RefSeq databases accessed between Oct 2020 and May 2021.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
Kalvari I, Nawrocki EP, Argasinska J, Olvera NQ, Finn RD, Bateman A, et al. Non-coding RNA analysis using the Rfam database. Curr Protoc Bioinformatics. 2018;62:e51.
Kalvari I, Nawrocki EP, Palacios NQ, Argasinska J, Lamkiewicz K, Marz M, et al. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids Res. 2021;49:D192–200.
Pritchard L, Glover RH, Humphris S, Elphinstone JG, Toth IK. Genomics and taxonomy in diagnostics for food security: soft-rotting enterobacterial plant pathogens. Anal Methods. 2016;8:12–24.
AAI-profiler: fast proteome-wide search reveals taxonomic outliers. http://ekhidna2.biocenter.helsinki.fi/AAI/ (2018). Accessed between September and November 2020.
Medlar AJ, Toronen P, Holm L. AAI-profiler: fast proteome-wide exploratory analysis reveals taxonomic identity, misclassification and contamination. Nucleic Acids Res. 2018;46:W479–85.
Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol. 2018;35:1547–9.
Paez-Espino D, Roux S, Chen IA, Palaniappan K, Ratner A, Chu K, et al. IMG/VR v.2.0: an integrated data management and analysis system for cultivated and environmental viral genomes. Nucleic Acids Res. 2019;47:D678–86.
Paez-Espino D, Eloe-Fadrosh EA, Pavlopoulos GA, Thomas AD, Huntemann M, Mikhailova N, et al. Uncovering Earth’s virome. Nature. 2016;536:425–30.
We thank Simon Roux for the discussion about virus-host interactions, and Emiley Eloe-Fadrosh and others at JGI for providing long-term support for the Antarctic metagenomics project. Computational analyses at UNSW Sydney were performed on the computational cluster Katana, supported by the Faculty of Science. We thank the expeditioners and the Helicopter Resources crew at Davis Station during the 2006, 2008, and 2013–2015 expeditions for their assistance in collecting samples; the Australian Antarctic Division for technical and logistical support during the expedition; and the Landsat Image Mosaic of Antarctica (LIMA) project for making satellite images available. We acknowledge the considerable value that the reviewers brought to this study during the review process.
This work was supported by the Australian Research Council (DP150100244) and the Australian Antarctic Science programme (project 4031). The work conducted by the US Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported by the Office of Science of the US Department of Energy under contract no. DE-AC02-05CH11231.
Ethics approval and consent to participate
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Data on Ca. Chlorobium antarcticum population structure, genomic variation, and viral analysis. Supplementary text, figures, and tables.
MIMAG data for Ca. Chlorobium antarcticum. Supplementary dataset.
Data from Ca. Chlorobium antarcticum 16S rRNA gene variation analysis and comparative analysis of AL, EF and TB MAGs and Cpv-DSM265 genome. Supplementary dataset.
Data for Ca. Chlorobium antarcticum marker gene and protein analyses. Supplementary dataset.
About this article
Cite this article
Panwar, P., Allen, M.A., Williams, T.J. et al. Remarkably coherent population structure for a dominant Antarctic Chlorobium species. Microbiome 9, 231 (2021). https://doi.org/10.1186/s40168-021-01173-z
- Antarctic microbiology
- Green sulphur bacteria
- Vitamin B12
- Metagenome-assembled genomes
- Population structure
- Host-virus interactions
- Generalist virus
- Meromictic lake
- Microbial food web