Quantifying fluorescent glycan uptake to elucidate strain-level variability in foraging behaviors of rumen bacteria

Gut microbiomes, such as the microbial community that colonizes the rumen, have vast catabolic potential and play a vital role in host health and nutrition. By expanding our understanding of metabolic pathways in these ecosystems, we will garner foundational information for manipulating microbiome structure and function to influence host physiology. Currently, our knowledge of metabolic pathways relies heavily on inferences derived from metagenomics or culturing bacteria in vitro. However, novel approaches targeting specific cell physiologies can illuminate the functional potential encoded within microbial (meta)genomes to provide accurate assessments of metabolic abilities. Using fluorescently labeled polysaccharides, we visualized carbohydrate metabolism performed by single bacterial cells in a complex rumen sample, enabling a rapid assessment of their metabolic phenotype. Specifically, we identified bovine-adapted strains of Bacteroides thetaiotaomicron that metabolized yeast mannan in the rumen microbiome ex vivo and discerned the mechanistic differences between two distinct carbohydrate foraging behaviors, referred to as “medium grower” and “high grower.” Using comparative whole-genome sequencing, RNA-seq, and carbohydrate-active enzyme fingerprinting, we could elucidate the strain-level variability in carbohydrate utilization systems of the two foraging behaviors to help predict individual strategies of nutrient acquisition. Here, we present a multi-faceted study using complimentary next-generation physiology and “omics” approaches to characterize microbial adaptation to a prebiotic in the rumen ecosystem. 49JS9nbEEXk7bV4NZX-91M Video abstract Video abstract


Background
Ruminants have evolved foregut digestive systems specialized in the bioconversion of recalcitrant, complex carbohydrates into energy. These catabolic processes rely on a core bacterial community composed predominantly of the genera Prevotella, Butyrivibrio, Fibrobacter, and Ruminococcus, families Lachnospiraceae and Ruminococcaceae, and orders Bacteroidales and Clostridiales [1,2]. The rumen microbiome is estimated to contain 69,000 carbohydrate-active enzyme (CAZyme) genes [3] that encode extensive catalytic activities. Despite this vast genetic repertoire and catalytic potential, the microbial conversion of plant fiber to host-accessible metabolites in the rumen is suboptimal and could be improved [4,5]. For example, supplementation with direct fed microorganisms, such as Bacteroides spp., or prebiotic carbohydrates that modify the rumen microbiome to enhance feed conversion may help address the emerging challenges associated with sustainable production of food animals [6].
In diverse animal symbioses and environmental ecosystems, Bacteroides spp. and other members of Bacteroidetes are thought to play central roles in glycan digestion because they encode highly specialized carbohydrate metabolic systems called polysaccharide utilization loci (PULs) [7,8]. The first described PUL was the starch utilization system of Bacteroides thetaiotaomicron (B. theta: BtVPI-5482) [9], and since its description, PULs that metabolize glycans with unique chemistries have been found in diverse ecosystems [8,[10][11][12]. PULs are distinguished by the presence of a TonB-dependent transporter coupled to a surface glycanbinding protein, known as the SusC/D-like complex, and other associated proteins that modify or bind the target glycan. These gene products function together in an orchestrated cascade to transport oligosaccharides into the periplasm where monosaccharides are released from polymeric substrates and used for primary metabolism. PULs can operate through a "distributive" mechanism, which releases products [13] to the microbial community or a "selfish" mechanism [14], which limits product loss by confining substrate depolymerization within the cell [8,15]. Recently, PUL-prediction [16] and whole-PUL characterization [14,17] have become common approaches for the discovery of new CAZyme families and catalytic activities at the species [18] and strain levels [19,20]. The most common enzymes encoded within PULs are glycoside hydrolases (GHs), which cleave glycosidic bonds by acidbase catalysis [21]. GHs are divided into sequence-related families that display conserved folds, mechanisms, and catalytic residues. However, these features are not necessarily representative of function as many different GH families are polyspecific [22].
In addition to microorganisms that improve the efficiency of digestion, prebiotic glycans, such as yeast αmannan (YM) and its derivative oligosaccharides (i.e., αmannanoligosaccharides), are known to provide beneficial physiological outcomes to animals, such as cattle and pigs [23][24][25]. Prebiotics can enhance feed digestion and cattle health by becoming selective nutrients for symbiotic gut bacteria, such as Bacteroides spp. The digestion of YM requires a collection of CAZymes targeting distinct linkages using different modes of activity, including α-mannanases and α-mannosidases [15]. CAZymes that possess these activities are commonly found in family GH38, GH76, GH92, GH99, and GH125 [14,26,27], and correspondingly, these enzymes are present in PULs that target YM (i.e., MAN-PULs).
YM-specific CAZymes and PULs are widely distributed in Bacteroidetes; however, individual species differ in their abilities to consume α-mannans depending on the structural complexity of the substrate [14]. For example, Bacteroides xylanisolvens NLAE-zl isolated from pigs reared on a diet infused with distillers' grains could only metabolize debranched YM [14]. The pathway responsible for YM catabolism in these strains (i.e., MAN-PUL1) was encoded on a transposable element, suggesting that aspects of YM metabolism can be exchanged between strains [14]. This finding is consistent with reports of specialized metabolic abilities being transferred to intestinal Bacteroides spp. from species that occupy ecologically distinct habitats [19,20,28], facilitating their persistence within highly competitive ecosystems and adaption to spatially and culturally diversified diets.
Although major advances have been made in understanding the diversity of metabolic potential in symbiotic bacteria and the mechanisms of prebiotic utilization, establishing stable engineered microbiomes in complex ecosystems, such as the rumen, will require more detailed knowledge of the competitive and complementary processes that drive metabolic phenotypes at the strain level. To achieve this, "next-generation physiology"based [29] approaches that identify metabolic potentials of individual bacteria, thereby providing critical insights of cellular functions and assigning cellular phenotypes, must be developed. One such approach is fluorescently labeled polysaccharides (FLA-PS). FLA-PS were initially developed to demonstrate selfish uptake of marine polysaccharides in marine Bacteroidetes [30] and have also been applied to the gut bacterium BtVPI-5482 to confirm that YM metabolism also occurs through a selfish mechanism [31]. Fluorescent glucose analogs have been recently used to study glucose uptake by rumen bacteria [32]; however, use of fluorescent polysaccharides in the rumen has been limited until now.
Here, for the first time, we apply FLA-PS as a nextgeneration physiology approach to directly visualize YM metabolism by single cells in a complex rumen community and subsequently classify populations of cells using fluorescence in situ hybridization (FISH). We combine this analysis with a multi-tiered study of the evolution and function of YM metabolism in bovine-adapted B. theta strains (Bt Bov ), which adopt one of two dichotomous growth phenotypes, referred to as "High Grower" (HG) or "Medium Grower" (MG), based on the optical density of cultures after 24 h. Despite displaying distinct growth profiles, the genetic, transcriptomic, or biochemical factors that contributed to the differential growth phenotypes of these strains remained to be defined. Using genomics, transcriptomics, and CAZyme fingerprinting, multiple MAN-PUL architectures were identified in this study that are consistent with reports for human-associated BtVPI-5482 [14] and key differences in the YM utilization systems between MGs and HGs were revealed. To define the mechanisms that contribute to these growth phenotypes, we present a new quantitative application of FLA-PS, which we believe has farreaching implications for elucidating differences in substrate utilization of individual cells within complex microbial communities.

Results
Ex vivo visualization of YM-metabolizing taxa within the rumen community To assess the capability of rumen microbiota to metabolize YM, extracted rumen samples were incubated with FLA-YM and visualized on feed particles and in solution (Fig. 1a, b). The total cell density in 100 μm pre-filtered rumen fluid, as determined by enumerating DAPI-stained cells, was 2.98 × 10 8 ± 6.02 × 10 7 cells ml −1 (Fig. 1c). In these complex communities, on average 6.1% ± 0.5% of cells showed uptake of FLA-YM (0% after 15 min, 6% after 3 h, 7% after 1 day, 6% after 3 days). Fluorescence in situ hybridization (FISH) using the CF968 probe [33] specific for the phylum Bacteroidetes showed that 2.9 ± 0.5% of the cells showing FLA-YM uptake were members of the Bacteroidetes. In total, Bacteroidetes made up 34.8% ± 6.8% of the rumen bacterial community and only a fraction of these (~3%) showed uptake of FLA-YM (Fig. 1c). The microbial community composition of these rumen samples was determined by 16S rRNA metagenomics sequencing. The community was dominated by Bacteroidetes, specifically the genus Prevotella 1, which demonstrated that YM metabolism has penetrated distantly related members of the phylum (Fig. 1d).
Isolation and growth profiling of YM-utilizing Bt Bov strains Targeted isolation approaches were performed to selectively isolate bovine-adapted-bacteria that utilize Saccharomyces cerevisiae YM from enriched rumen and fecal communities. Single colonies were observed within 24 h, with new colonies forming up to 96 h. In total, 50 bacterial isolates were collected and each mannandegrading (MD) isolate was assigned a reference number (e.g., isolate #8 = MD8). The majority of these isolates were identified by 16S rRNA gene sequencing as strains of B. theta using the NCBI BLASTN database [34], and referred to as Bt Bov (Fig. 2a, Supplementary Table 1).
YM metabolism was confirmed for each MD strain by growth in liquid cultures using S. cerevisiae YM as the sole carbon source (Fig. 2b, Supplementary Fig. 1). Interestingly, based on their growth on S. cerevisiae YM, the Bt Bov isolates and BtVPI-5482 control strain were divided into two populations (Supplementary Table 2): "Medium Growers" (MGs; plateaued growth at OD 6000 .4 after 24 h) and "High Growers" (HGs; plateaued growth at OD 600~0 .7 after 24 h). Notably, this growth phenotype is substrate specific and does not extend to other substrates, such as mannose in which all strains have a similar growth curve (data not shown), and YM from Schizosaccharomyces pombe (S. pombe; Supplementary Fig. 1b). In addition, the 16S rRNA gene topology of the Bt Bov isolates did not reveal a discernable relationship with the growth phenotype (Fig. 2a).
Visualization and quantification of differential FLA-YM uptake by Bt Bov isolates To determine if the rate of glycan uptake varied between the two growth types, representative strains, one from each growth population and BtVPI-5482 as a control, were incubated with FLA-YM. BtVPI-5482, MD33 MG , and MD40 HG cells became fluorescent, whereas cells incubated with unlabeled YM did not (Fig. 2c). Phenotypic differences in FLA-YM uptake over time were determined by splitting cell populations by flow cytometric gating into FLA-positive and FLA-negative cells ( Supplementary Fig.  2a). The total fluorescence intensity (rate of uptake) was significantly different between the representative strains (MD40 HG vs. MD33 MG t(10) = 4.4, p value = 0.001; MD40 HG vs. BtVPI-5482 t(10) = 3.9, p value = 0.003; BtVPI-5482 vs. MD33 MG t(10) = 4.0, p value = 0.002) (Fig.  2d). The change in mean fluorescence of the three strains showed a similar temporal pattern, increasing from 0 to 120 min, peaking at 120 min, and declining from 120 to 1440 min (Fig. 2e). Although all three strains showed uptake of FLA-YM after 60 min, MD33 MG had the lowest fluorescence intensity at each time point. MD40 HG displayed the highest fluorescence intensity, 2.7-fold higher than MD33 MG . BtVPI-5482 displayed a fluorescence intensity between the two bovine strains for each time point, with a peak value 1.9-fold higher than MD33 MG . Additionally, MD40 HG cells showed a more rapid uptake (22% at 5 min), BtVPI-5482 cells showed an intermediate rate of uptake (9% at 5 min), and MD33 MG had the slowest uptake rate (2% at 5 min) (Fig. 2e, Supplementary Fig. 2b).
To test if the phenotypic differences were inherited between generations, we measured the differences in FLA-YM uptake with and without prior exposure to YM. The cultures continued to display the same phenotypic uptake patterns (MD40 HG and BtVPI-5482 higher uptake, MD33 MG low uptake, Fig. 3a-c). However, previous exposure to YM resulted in a heightened cellular response as indicated by more rapid rates of FLA-YM uptake relative to cultures previously grown on mannose-MM ( Fig.  3b-d). All cultures grown on mannose-MM reached a lower mean fluorescence and the temporal change in mean fluorescence was slower, with quantifiable uptake occurring only after 4 to 8 h.

Characterization of genotypes by PUL delineation
Whole-genome sequencing and de novo assembly were used to identify genes involved in YM metabolism. SPAdes [35] assembly output and average nucleotide identity based on BLAST+ (ANIb) [36] are shown in Supplementary Table 2. The ANIb results supported the 16S rRNA gene sequence data, confirming that each isolate was a strain of B. theta. Furthermore, comparative genomics revealed that these strains have acquired unique CAZome repositories ( Fig. 4a) and PUL updates ( Supplementary Fig.  3); features that may assist with their colonization of the bovine gut and represents opportunities for developing bovine-adapted probiotics (Supplementary Discussion).
Reconstruction of the three YM-specific PULs and alignment with BtVPI-5482 MAN-PULs determined that there was a high level of synteny among all strains in these pathways ( Supplementary Fig. 4a). MAN-PUL2 and MAN-PUL3 were absolutely conserved, whereas MAN-PUL1, a PUL tailored for the consumption of mannan from S. pombe [14] ( Supplementary Fig. 1b), was only present in BtVPI-5482, MD33 MG , and MD35 MG . The presence of MAN-PUL1 in two MGs indicated this pathway was not responsible for the HG phenotype. The HMNG-PUL, which is specific for digestion of high mannose N-glycans and not activated by YM in BtVPI-5482 [14], was also conserved in each of the Bt Bov genomes.

CAZome fingerprinting
To determine if there was amino acid sequence divergence within MAN-PULs, and potentially the function of homologous enzymes, polyspecific CAZyme families GH92 and GH76 were analyzed by SACCHARIS [37]. Enzyme sequences from GH92 and GH76 were embedded into phylogenetic trees comprised of all characterized enzyme sequences from each family (Supplementary Fig. 4b, c). Notably, every sequence within MAN-PUL1, MAN-PUL2, and MAN-PUL3 displayed the highest level of amino acid sequence conservation with its syntenic homolog. This suggested that each PUL is under strong selective pressure to function as an intact catabolic system. To determine if CAZyme sequences were conserved in other potential α-mannandegrading PULs, a genome-wide approach (i.e., CAZome fingerprinting) was used [37]. Each isolate encoded between twenty-four and twenty-six GH92s and eight or nine GH76s (Fig. 4b, Supplementary Fig. 4b,c). Only MD17 HG and MD51 HG displayed identical conservation for GH76, whereas every GH92 tree was unique.
Topological differences were observed for other αmannan active enzyme families (e.g., GH38, GH99, and GH125), suggesting that despite the high level of functional conservation within the MAN-PULs, metabolic specialization in α-mannan consumption between these strains may be encoded within orphan PULs [38]. Therefore, the contributions of two exogenous GH76s to the foraging behavior of HGs and MGs were investigated. BtGH76-MD40 is a surface-exposed GH76 inserted into PUL55 of HGs (Fig. 4b) and is active on intact S. cerevisiae and S. pombe YM [39]. BT_3782 is a periplasmic endo-α-mannanase that generates small oligosaccharide products [14]. Addition of recombinant BtGH76-MD and BT_3782 to pure cultures of MD33 MG did not augment the MG growth phenotype ( Supplementary Fig. 5), suggesting that acquisition of BtGH76-MD40 or augmented endo-mannanase activity were not responsible for the HG phenotype.

Differences in YM import between phenotypes
The differential transport kinetics of FLA-YM ( Fig. 2ce) and absence of genetic differences in PUL structure between phenotypes ( Supplementary Fig. 4a) suggested that glycan transport processes may be responsible for the MG and HG growth phenotypes. Alignment of the SusC-like amino acid sequences from MAN-PULs 1, 2,  CAZyme fingerprinting of YM metabolism by Bt Bov isolates. a GH enzyme families encoded within the genomes of MD40 HG and MD33 MG that differ in total number of sequences. BtVPI-5482 sequences are provided as a reference for each GH family. b Phylogenetic trees of characterized GH92s and GH76s, and Bt Bov sequences generated with SACCHARIS [37]. Circles represent Bt Bov sequences and white circles highlight sequences from PULs with mannan activity: 1, 2, or 3 = MAN-PULs 1, 2, or 3, respectively; H = HMNG-PUL; 55 = BtMD40 PUL55. Activities assigned to characterized enzymes within each clade for GH92 and GH76 are depicted using the provided legend. Outer ring represents characterized specificities. Numbers in parenthesis indicate the total number of enzymes within each strain exclusively into clades associated with either the MG or HG phenotype. This result is in contrast with the 16S rRNA ( Fig. 2a) and whole-genome (Supplementary  Table 2) alignments, which showed no correlation with growth profiles. Interestingly, this pattern does not exist for MAN-PUL1 or MAN-PUL3 as the SusC-like proteins in these pathways are highly conserved (Fig. 5a), suggesting that syntenic conservation may not always reflect sequence-function relationships.
To study how transporters affect YM uptake, different combinations of susC/D-like genes were excised from the BtVPI-5482 MAN-PULs. Three mutant strains were produced: a MAN-PUL2 susC-like and susD-like gene knock-out strain (ΔMP2susCD), a MAN-PUL1 and 3 susC-like and susD-like deletion mutant (ΔMP1/ 3susCD), and a strain with all three sets of susC-like and susD-like genes deleted (ΔMP1/2/3susCD). Because MAN-PUL1 is absent in every HG except BtVPI-5482, we can conclude it has no effect on YM transport efficiency and that the ΔMP1/3susCD mutant essentially operates as a Bt Bov MAN-PUL3 susCD knock-out strain. The mutants, along with BtVPI-5482, were grown on YM-MM to assess how the loss of transport complexes impacted growth on YM (Fig. 5d). Surprisingly, the mutants retained an identical growth profile to the wildtype, with the exception of the triple knock-out mutant (ΔMP1/2/3susCD), which displayed no growth. Furthermore, when the mutants were incubated with FLA-YM, to study the impact on uptake rates, they displayed identical rates to the wild-type, with only the triple deletion mutant having a complete loss of FLA-YM import (Fig.  5e, f). These results demonstrated that the SusC-like/ SusD-like proteins from MAN-PUL2 and MAN-PUL3 in BtVPI-5482 are functionally redundant. Although the absence of genetic tools prevented the investigation of the interplay between the MD33 MG transporters, the sequence divergence existing between MAN-PUL2 SusC/ D/E-like proteins from MD33 MG and MD40 HG (Supplementary Table 4) suggested that the dichotomous MG and HG growth phenotypes may result from differential transport through these complexes.
Comparative analysis of gene expression between Bt Bov growth phenotypes RNA-seq was performed on BtVPI-5482, MD33 MG , and MD40 HG cultured on either mannose or YM to explore differential patterns in expression of the enzymes and transporters in the MAN-PULs and identify any distally To confirm that gene expression was representative of protein production, a C-Myc tag was fused to the Cterminal of the MAN-PUL2 SusD-like protein (BT_ 3789) in the chromosome of BtVPI-5482. Extracellular display of BT_3789 was demonstrated using antibodies directed at C-Myc when this bacterium was cultured on YM but not mannose ( Supplementary Fig. 7).
The TPM values for every homologous gene transcript from MD33 MG , MD40 HG , and BtVPI-5482 were analyzed. Surprisingly, the sus-like genes (BT_3788 and BT_ 3789) and the surface enzyme transcripts (BT_3792, BT_ 2623, and BT_3858) of MD33 MG consistently displayed significantly higher expression levels than the HG strains (Fig. 6b, Supplementary Fig. 6b). These values ranged between 6.2-log 2 and 7.7-log 2 , suggesting that the expression level of gene products involved in outer membrane processing and intracellular transport is negatively correlated with growth proficiency on YM. The only example of an enzyme that is expressed at a significantly higher level in the MD40 HG strain was BT_3780 (12.5fold higher than MD33 MG ), which encodes a GH130 that is active on β-1,2-mannosides [40].

Differences in YM hydrolysis and import
The enzymatic processing of YM (amount of YM products, extent of YM utilization, and total free mannose present in the post-growth supernatants) by each culture was analyzed using a combination of methods. Thin layer chromatography (TLC) (Fig. 7a) revealed that there was no detectable free mannooligosaccharides or mannose in the supernatant of the YM-MM negative control. Consistent with this observation, the BtMAN-PUL1/2/3 deletion mutant (ΔMAN-PUL1/2/3) did not grow on YM and was unable to release products into the medium. BtVPI-5482 and each of the HGs generated a similar product profile, with a noticeable loss of YM signal and faint detection of oligosaccharides and mannose. In contrast, the post-growth media of MGs contained more mannose and had residual YM (Fig. 7a). Gas chromatography-mass spectrometry determined that the quantity of total mannosides (YM and oligosaccharides) in the supernatant was 1.48 and 1.40-fold higher for MD33 MG (1.36 ± 0.29) than BtVPI-5482 (0.92 ± 0.04) and MD40 HG (0.97 ± 0.05), respectively (Fig. 7b). Furthermore, post-growth BtVPI-5482 and MD40 HG cultures, but not MD33 MG , showed (p < 0.05) lower total mannose concentration in the media relative to the YM-MM negative control (Fig. 7b). This suggests that, consistent with their higher growth densities (Fig. 2b) and thin layer chromatography, BtVPI-5482 and MD40 HG consume more YM.

Discussion
The gut microbiome plays an integral role in digestion and nutrient acquisition. Improved understanding of the functional potential encoded within members of the microbiota is still required to define metabolic abilities and microbial-prebiotic interactions. Next-generation physiology approaches represent promising strategies to rapidly assign cellular phenotypes and can consolidate genomic predictions [29]. By combining phenotypic and sequencing approaches, we have conducted a highresolution study of differential YM utilization by isolated bovine-associated bacterial strains. In liquid culture, the isolates displayed one of two growth patterns: MG or HG; trends that were independent of taxonomic relationships (Fig. 2a, b, Supplementary Table 2). This showed that closely related Bacteroides spp. have evolved different foraging strategies for the same substrate. FLA-PS were successfully used to visualize (Fig. 2c) and quantify the accumulation (Fig. 2d) and uptake rate (Fig. 2e) of YM products in bacterial cells, confirming that HGs use a selfish mode of metabolism on this substrate, as previously reported for BtVPI-5482 [14,31]. In contrast, the MG strains consumed less YM and released mannose into the medium (Fig. 7a, b), suggesting that MGs are less adept at YM catabolism and display some properties consistent with distributive metabolism (Figs. 2b and 7c).
Comparative genomics revealed genotypes with high synteny across genomes and MAN-PUL pathways, with few exceptions. Perhaps the most interesting genetic anomaly is the sequence variability of the MAN-PUL2 SusC/D-like proteins, which elegantly branch into two clades coinciding with the HG and MG growth phenotype (Fig. 5b, c), as well as differential rates and total levels of FLA-PS uptake (Fig. 2c-e, Supplementary Fig.  2b). Previously, it was shown that the amino acid homology of a SusD-like protein involved in utilization of two different fructans was low between two strains of B. theta, despite their taxonomic similarity [41], highlighting that syntenic genes within PULs can evolve independently. Here we report that SusC/D-like amino acid sequences from the major PUL involved in metabolism of YM correlate with differential utilization of a common substrate (Fig. 5b, c). MD40 HG likely has a more efficient transport process (Fig. 7), as suggested by the following results: there is no perceived difference in the structures of surface enzymes encoded within the MAN-PULs ( Supplementary Fig. 4), the outer surface endo-αmannanases are expressed at lower levels in HGs (Fig.  6b), the addition of exogenous endo-GH76s to MG growth cultures did not augment MG growth (Supplementary Fig. 5), and the higher growth and faster disappearance of large YM products in MD40 HG cultures (Fig. 7). Whether this is the direct result of higher transporter efficiency in MD40 HG or indirect result from impoverished transport leading to product inhibition of surface enzymes in MD33 MG is unclear. Intriguingly, deletion of MAN-PUL1/3 susC/D or the MAN-PUL2 susC/D did not impede growth of BtVPI-5482 on YM or uptake of YM (Fig. 5d-f), suggesting SusC/D-like pairs in MAN-PUL2 and 3 are functionally redundant in HGs. Based upon sequence identity (Supplementary Table 4), MGs possess one compromised SusC/D/E complex (MAN-PUL2) and one high-performing SusC/D complex (MAN-PUL3), which are regulated differently between the strains. In MD33 MG , the MAN-PUL3 susClike gene (bt3854 homolog) is expressed at a level similar to the MAN-PUL2 susC-like gene (bt3788 homolog), and at a level 2.9-fold higher than its homologous gene in MD40 HG (Fig. 6b). Higher expression of outer surface proteins in MD33 MG is a consistent pattern (Fig. 6b). Despite the higher expression levels of the MAN-PUL3 SusC/D-like complex in MD33 MG , and the ability of the MAN-PUL3 SusC/D-like complex to compensate for deletion of the MAN-PUL2 transporter in BtVPI-5482 (Fig. 5d, e), the MAN-PUL3 SusC/D-like complex in MD33 MG is unable to rescue the MG growth phenotype of the representative strain. Thus, the SusC/D-like complexes in MAN-PUL2 and MAN-PUL3 appear to compete for substrates and the inefficiencies of transport ascribed to the MAN-PUL2 complex are related to its ability to transport, but not recruit, YM substrates. Further biochemical and structural studies of the MAN-PUL2 SusC/D-like proteins are warranted to tease apart these results.
The "Nutrient Niche Hypothesis" [42] suggests that metabolic abilities are determined by the creation and filling of ecological nutrient niches. In theory, these relationships could be in response to the introduction of a new dietary glycan (i.e., prebiotic), resulting in the selection for or adaptation of a bacterium with the metabolic capacity to consume it. In this study, the MG and HG phenotypes represent a variation on this theme, as two closely related populations (> 98% identity) adapted to the colonization of a common host (Supplementary Fig.  3) display different (Fig. 2b, Supplementary Fig. 1), yet reproducible (Fig. 3a) and inducible (Fig. 3b-d) foraging behaviors on the same substrate. These findings raise several unsolved questions related to the existence and persistence of MGs, and potentially other glycan foragers that are less adept at substrate utilization, in the rumen. If HGs have a superior capacity for YM metabolism, why are MGs not eliminated by competitive exclusion? And if MGs have restricted abilities to digest YM and/or transport YM products (Fig. 7), why are these PULs not selected against and excised from the genome? The existence of multiple metabolic phenotypes suggests that ecological selection factors may be responsible. Firstly, Bacteroides spp. are generalists with the capacity to utilize a wide variety of substrates available in the diet of their hosts and glycan responses are prioritized in Bacteroides spp. [43,44]. MGs may possess a different substrate hierarchy than HGs and, correspondingly, display more prowess for consuming chemically distinct glycans. Alternative substrate priorities would reduce the competitive burden on MGs when provided with complex diets. In this regard, the acquisition of new CAZymes or PULs that endow a microorganism with an ability to consume new substrates has been hypothesized to occur by horizontal gene transfer and is linked to spatial and dietary habits [20,28]. YM from S. cerevisiae (Supplementary Fig. 1a) and S. pombe (Supplementary Fig. 1b) were the substrates used in this study and showcased that HGs are not consistently superior when it comes to glycan utilization. Further investigation into the ability of MGs and HGs to utilize other substrates is warranted to identify additional variability in substrate utilization and preference. Secondly, feeding strategies, such as distributive metabolism, may foster beneficial syntrophic relationships at multiple levels within a community [45,46]. The generation of public goods [13] by MGs provides nutrients to species that are incapable of digesting YM. This event would increase the richness of the community and, potentially, result in the generation of additional secondary metabolites that benefit the lifestyle of MGs. Furthermore, it has been shown that both the concentrations and complexity of available substrate cause differential selection of distributive or selfish foraging strategies [47][48][49].
Comparison of the MD40 HG and MD33 MG CAZomes confirmed that there are many GH families, encoding different enzyme specificities that vary in number (Fig.  4a). Closer inspection of GH3 and GH16, two polyspecific GH families active on β-linkages, revealed CAZyme updates within a PUL in MD33 MG (Fig. 8). This suggests that the acquisition of a putative β-glucan metabolic pathway, and potentially others, may provide a colonization advantage for MD33 MG despite its weakened potential to metabolize YM. Recently, β-glucan utilization pathways were shown to have independently evolving genes that result in the expansion of protein specificity and glycan targets [50]. Thus, clustered mutations or differential acquisition of genes in PULs could unlock previously inaccessible nutrient niches. Conversely, however, there is the risk of impeding nutrient acquisition, as exhibited by the restricted β-glucan utilization of polyspecific proteins in Bacteroides fluxus [50] and, potentially, the inefficiencies of YM uptake governed by transporter specificity or efficiency. Further investigation of total CAZome function and transporter selectivity and efficiency encoded within genomes at the strain level will reveal how microorganisms living in partnership or competition within complex ecosystems tune their metabolic responses to complex dietary landscapes. Coupling "omics" methods and functional methods, such as FLA-PS, will help usher in a new frontier for the assignment of metabolic traits to bacterial populations within microecological food webs

Direct visualization of YM metabolism in rumen communities and cell identification by FISH
Rumen samples were collected from two cannulated cows fed a diet rich in barley grain. The rumen samples were filtered through cheesecloth under CO 2 gas. Subsamples were taken, flash frozen, and stored at − 80°C until genomic extractions could be completed. The rest of the sample was transferred into an anaerobic chamber (atmosphere: 85% N 2 , 10% CO 2 , 5% H 2 , at 37°C) and filtered through a 100-μm pore size nylon net filter (Millipore, USA). The filtered samples from each cow were then aliquoted into three tubes. One tube was immediately fixed with 1% formaldehyde (FA) for 1 h at room temperature as the 0 h control. The other tubes were incubated with 20 μL FLA-YM for a final concentration of 3.1 nM and fixed with FA after 1 day and 3 days. Immediately after fixation, all samples were filtered through a 47 mm (0.2-μm pore size) polycarbonate filter (Millipore), using a 0.45-μm cellulose acetate support filter (Millipore) and a gentle vacuum of < 200 mbar. After drying, the filters were stored at − 20°C.
DNA from the frozen rumen samples was extracted using the Qiagen DNeasy PowerSoil Kit, and samples were sent to McGill GenomeQuebec for Illumina MiSeq PE250 16S rRNA metagenomics sequencing. The 16S rRNA sequences were merged and quality trimmed using the BBTools [52] software and subsequently classified using the standard settings of the SILVAngs pipeline using the SSU rRNA seed of the SILVA database release 132 [53]. All analysis and plotting of the microbial diversity data were done using RStudio version 3.6.3 using the Vegan package [54,55] Isolation of bovine-adapted mannan degraders Bovine rumen and fecal samples were collected for in vitro batch culture experiments. Ruminal and fecal inoculants from cattle were enriched anaerobically (atmosphere: 85% N 2 , 10% CO 2 , 5% H 2 ) at 37°C with one of the following substrates: Bio-Mos® (1% w/v), corn distillers' grains (1% w/v), or YM (1% w/v). Bacteria were isolated from the enriched batch cultures by streaking onto nutrient-restricted media supplemented with 0.5% YM to select for YM-degraders (supplementary methods). In total, 50 YM-degrading bacterial isolates were characterized for their propensity to metabolize YM. Nine of these isolates were selected for detailed analysis in this study.
Post-growth cultures were harvested and centrifuged. Supernatants were taken and 6 μL was ran on a silica sheet in 2:1:1 (butanol to d 2 H2O to acetic acid) running buffer. The plate was dried at ambient temperature and stained with orcinol (diluted to 1% in a solution of 70:3 ethanol to sulfuric acid). Once the plate was dry, it was activated in an oven at 120°C and imaged using a gel doc XR image system (Bio-Rad).

Genome sequencing, assembly, and annotation of Bt Bov strains
The 16S rRNA of 50 bovine bacterial isolates was sequenced to determine taxonomic classification using the universal primers 27F and 1492R (supplementary methods). Based on growth profiles (OD 600 > 0.4) and 16S rRNA sequences, nine isolates were chosen for whole-genome sequencing using Illumina MiSeq PE150 bp. Genomes were assembled using SPAdes de novo assembly [35]. The K-mer value in SPAdes was chosen from (21,33,55, and 77 defaults for 150 bp reads). Quality reporting of the assemblies was done using Quast [56]. SPAdes assembly N50s, largest contigs, and number of contigs are shown in Supplementary Table 2. Genomes were uploaded to the NCBI submission portal and annotated using the NCBI Prokaryotic Annotation Pipeline. Isolate contigs were blasted against the reference genome BtVPI-5482 for MAN-PUL1/2/3 and the HMNG-PUL using NCBI BLAST (2.7.1) [34]. SPAdes contig assemblies were aligned with the JSpeciesWS reference BtVPI-5482 genomes to calculate average nucleotide identity based on BLAST+ (ANIb) [36].

Production of BtVPI-5482 MAN-PUL mutants
Flanking regions (~750 bp) of the susC/D-like genes from each MAN-PUL were PCR amplified, stitched together, and ligated into pExchange-tdk (pEx-tdk). The plasmids were transformed into E. coli strain S17-1λpir, which were donor cells used to conjugate the plasmids into the BtVPI-5482 ΔPUL75Δtdk recipient strain to delete the sus-like gene pairs [57]. Mutants with the MAN-PUL1 Sus genes deleted (ΔMP1susCD) were then conjugated with E. coli cells containing a plasmid with the flanking regions for the MAN-PUL3 Sus genes to create a dual mutant (ΔMP1/3susCD). The dual mutant was then conjugated with E. coli cells that contained a plasmid with the MAN-PUL2 Sus flanks to produce a triple mutant (ΔMP1/2/3susCD). Plasmids and mutants were sequenced at each step of this process.
The three knock-out strains, along with BtVPI-5482 wild-type, were grown on 0.5% YM-MM as described above. In addition, the strains were incubated with FLA-YM and sampled at 0 h, 1 h, 1 day, and 3 days. These samples were fixed and stored at 4°C until analyzed by flow cytometry and epifluorescence microscopy (see below).

PUL delineation and comparative CAZomics
Isolate contigs were processed through EMBOSS GetORF [58] to determine open reading frames; these data were run through the dbCAN [59] HMMscan to identify CAZyme sequences. CAZyme sequences were then analyzed by SACCHARIS [37] (Sequence Analysis and Clustering of CarboHydrate Active enzymes for Rapid Informed prediction of Specificity). User CAZyme sequences were trimmed to their catalytic domain with dbCAN [59], aligned with MUSCLE [60], and fitted to a phylogenetic tree using ProtTest3 [61] to find the appropriate amino acid replacement model. RAxML [62] or FastTree [63] was used to generate the final tree. GHs from families 38, 76, 92, 99, and 125 identified in the genomes of the MD isolates were analyzed by SACCHARI S. Phylogenetic trees were developed using FastTree, and Newick file outputs were viewed and plotted using ITOL (doi.org/10.1093/nar/gkz239).

RNA-seq: assembly, quantitation, and comparative analysis
RNA from BtVPI-5482, MD33 MG , and MD40 HG grown in 0.5% mannose or YM (see supplementary methods) was extracted and purified using a GeneJET RNA Purification kit (Thermo Scientific) within 1 week of storage at − 80°C. RNA was sent to Génome Québec for Illumina HiSeq 4000 PE100bp sequencing. Using Geneious v11.1.2 [64], each set of reads was mapped to their previously assembled genomic sequence or, in the case of BtVPI-5482, to the genomic sequence from the NCBI database (NC_004663). Expression levels were calculated as transcript expression (transcript per kb per million; TPM) for each growth treatment. Ambiguously mapped reads were counted as partial matches. The Geneious DESeq2 [65] plugin was used to compare the expression levels between the two treatments, producing log 2 expression ratios and p values.
Generalized linear mixed models in SAS PROC GLIM MIX (SAS 9.4, SAS Institute, Cary, NC, USA) were used to estimate statistically significant (p < 0.05) differences of TPM means (least squares-means) for the MAN-PUL1/2/3 susC-like genes of each bacterial strain. Based on the Bayesian information criterion (BIC) of the generalized linear mixed models, the response was modeled using the log-normal distribution. The expression of gene transcripts was the dependent variable in models with two independent fixed factors: bacterial strain (i.e., BtVPI-5482, MD33 MG , or MD40 HG ) and media treatment (i.e., YM or mannose). Mixed models of variance heterogeneity were selected based on the BIC. For the studied transcripts, the variance of expression was heterogeneous for the experimental treatments, bacteria, or their interaction. The statistical significance of the interaction between the TPM values of MAN-PUL genes for each bacterial strain and the media treatment was determined using an F test. Bonferroni's method was used for multiple comparisons (Supplementary Table 3).

Production of SusD-like protein C-myc fusion B. theta strain
The C-myc epitope (EQKLISEEDL) was fused to the Cterminal domain of the MAN-PUL2 SusD-like protein (BT_3789) with a linker sequence (STSTST) between the SusD-like nucleotide sequence and the C-myc sequence BtVPI-5482 Δtdk Δpul75 (control), and BtVPI-5482 Δtdk Δpul75 SusD-like C-myc fusion mutant was inoculated in TYG and cultured as described above. The cells were centrifuged and resuspended in 1 mL 2X MM. One hundred microliters of the resuspension was inoculated into 0.5% YM-MM and incubated for 4 h at 37°C. The cells were then centrifuged and washed three times in phosphate-buffered saline (PBS) pH 7.4 (PBS; 137 mM NaCl, 2.7 mM KCl, 10 mM Na 2 HPO 4 ), before resuspension in 2 mL 2X MM. Two hundred twenty-five microliters of the resuspended cells was added to 1.5 mL 0.2% FLA-YM or YM-MM and incubated for 3 h at 37°C. One hundred microliters of each culture was collected and fixed in 1% formaldehyde for 1 h at room temperature. The samples were then incubated with 1: 2500 rabbit IgG anti-C-myc polyclonal antibody (Ther-moFisher #PA1-981) for 1 h at room temperature. The samples were then washed four times in PBS and resuspended in 1:2500 goat anti-rabbit DyLight 650 nm secondary antibody (ThermoFisher #84546) for 1 h at room temperature. The samples were then washed and stored in PBS until further analysis.  Table 4). The 16S rRNA gene and MAN-PUL2 SusC-like and SusD-like amino acid phylogenetic trees were generated using the maximum likelihood method and Tamura-Nei model [66]. Evolutionary analyses were performed by MEGA X [67]. Trees with the highest log likelihood are shown in Fig. 4b and c.

Generation of FLA-YM conjugates
A previously defined protocol [30,68] was used to generate fluorescently labeled YM (FLA-YM), with slight variations (supplementary methods).

Visualization of FLA-YM uptake by strains of Bt Bov
Wild-type BtVPI-5482, BtΔMAN-PUL1/2/3, and rumen isolates MD33 MG and MD40 HG were inoculated in TYG and grown as described above. Cells were harvested at OD 600~1 .0 and centrifuged (4700×g) for 5 min, the supernatant was removed, and pellets resuspended in 2 mL 2X MM for the first two washes. After the third centrifugation, pellets were resuspended in 2 mL MM with 0.5% YM (BtVPI-5482, MD33 MG , and MD40 HG ) or 0.5% glucose + YM (BtΔMAN-PUL1/2/3) as the sole carbon source (not conjugated to FLA). After~18-h incubation, cultures were centrifuged and washed three times in PBS, with the final resuspension in 2 mL 2X MM. Three hundred microliters of the resuspended pellet was aliquoted into 0.2% unlabeled YM or FLA-YM. Twenty microliters of the 2X MM resuspension was used as the 0h time point, as the cells were not exposed to FLA-YM. Forty microliters aliquots of each condition were taken at time points: 5 min, 1 h, and 24 h. The cells were centrifuged (10 min; 2300×g), and the pellet was fixed in 1% formaldehyde (FA; Sigma; F8775) in PBS, at 4°C for 18-24 h. The fixed cells were centrifuged (10 min; 2300×g) and washed in 1X PBS. The samples were centrifuged and stored at 4°C in the dark until visualized by SR-SIM (supplementary methods).
Quantification of the rate of FLA-YM uptake by Bt Bov isolates BtVPI-5482, MD33 MG , and MD40 HG were grown in TYG and prepared as described above. After 24 h of incubation, cultures were placed into 2 mL 0.5% YM. Cells were harvested in exponential phase (OD 600 0.6-1.0), centrifuged (10 min; 2300×g), and resuspended in 2 mL 2X MM. Three hundred microliters of this suspension was added to 1 mL 2 X MM. Then 20 μL of each culture was aliquoted into 1 mL 1% FA and used as the T0 time point. Into the remaining 280 μL, 0.2% FLA-YM and 150 ng/mL fluoresceinamine (FLA) or YM was added and subsamples of 40 μL were taken at 5, 10, 15, 20, 30, and 60 min and 2, 4, 8, and 24 h. The subsamples were centrifuged (10 min, 2300×g), and the cell pellets were fixed in 1% FA in 1X PBS, at 4°C for 18-24 h. The fixed cells were centrifuged (10 min; 2300×g) and resuspended in 1 ml 1X PBS and stored at 4°C in the dark.
Cell fluorescence due to FLA or FLA-YM uptake was quantified in all samples using an Accuri C6 flow cytometer (BD Accuri Cytometers). The 8-peak and 6-peak validation bead suspensions (Spherotech, IL, USA) were used as internal references. All samples were measured under laser excitation at 488 nm from a blue-green laser, and the green fluorescence was collected in the FL1 channel (530 ± 30 nm). Using the medium as a background, an electric threshold of 17,000 FSC-H was set to reduce the background noise. All measurements were done at a slow flow rate and a total of 10,000 (FLA-YM) or 5000 (YM and FLA) events per sample were acquired. Bacteria were detected from the signature plot of SSC-H vs green fluorescence (FL1-H). The FCM output was analyzed using FlowJo v10-4-2 (Tree Star, USA). The FCM files were imported into FlowJo, and both the total population (all events) and main population (automated gating through event density) were determined. For each population (total and main), sample statistics (counts, mean fluorescence, and the standard deviation) were determined from the raw FL1-H data. The results were exported and analyzed using Welch's t tests in R studio using the packages Vegan and Rioja [55,69] to determine statistical difference between the control (YM and FLA) and FLA-YM incubation within each strain and between the FLA-YM incubation of each strain.

Quantification of mannose in minimal medium using GC-MS
Cell culture medium after incubation in 1% YM-MM was collected after 24 h and centrifuged (4700×g for 15 mins), and the supernatant was passed through a syringe filter (0.2 μm cellulose acetate membrane, VWR). The filtrate was kept frozen for 48 h at − 20°C and then thawed and centrifuged (3000×g, 30 min) at room temperature. Concentration of mannose in the resulting supernatant was tested based on our previous report [70], with some modifications to cope with the relatively large amount of starting carbohydrate material and the presence of minimum medium. One milliliter of the supernatant was evaporated to dryness under a gentle flow of nitrogen. The residue was suspended and magnetically stirred in 3.5 mL of 6 M TFA at 100°C for 6 h with headspace filled with nitrogen, followed by addition of internal standard myo-inositol (0.4 mg dissolved in 0.5 mL of water) and evaporation to dryness. Monosaccharides were reduced by magnetic stirring overnight in 10 mg of NaBD4 (99% D, Alfa Aesar) dissolved in 2 mL of 1 M ammonium oxide solution, followed by quenching excess reductant with acetic acid and evaporating to dryness. Boric acid was removed by evaporation to dryness five times in 3 mL of 10% (v/v) acetic acid in methanol followed by five times in 3 mL of absolute methanol. The residue was suspended in 4 mL of acetic anhydride, followed by magnetic stirring at 100°C for 2 h with headspace filled with nitrogen, cooling to room temperature, and evaporation to dryness. The derivatives were purified by partitioning with water and dichloromethane, recovered by collecting and evaporating to dryness the organic phase after three changes of water, and re-dissolved and diluted in ethyl acetate for analysis on an Agilent 7890A-5977B GC-MS system (Agilent Technologies, Inc., CA, USA). Sample solution (1 μL) was splitless-injected to the system, and optimal analyte separation was achieved on a medium polarity SP2380 column (30 m × 0.25 mm × 0.20 μm, Sigma-Aldrich) with a constant helium flow of 0.8 mL/min and with oven temperature programmed to start at 55°C (hold 1 min) followed by increasing at 30°C/min to 120°C then at 12°C/min to 255°C (hold 20 min). Two separate experiments were conducted for each sample. Mannose concentration was calculated based on calibration curve established from a series of mannose standard solution containing internal standard.

Measurement of YM hydrolysis
Samples from BtVPI-5482, MD33 MG , and MD40 HG cultures in 0.2% FLA-YM (as above) and a no-cell negative control were filtered through 0.2-μm cellulose acetate membrane syringe filter (VWR). The filtrates were flash frozen and stored at − 80°C until analysis. Samples were analyzed as described in Arnosti 2003 [68]; in brief, samples were injected onto two columns of Sephadex G50 and G75 gel linked in series, with the column effluent passing through a Hitachi fluorescence detector set to excitation and emission wavelengths of 490 and 530 nm, respectively. The columns were standardized using FITC dextran standards (150 kDa, 70 kDa, 40kDA, 10 kDa, 4 kDa, FITC-glucose, and free fluorophore; Sigma), so the fraction of polysaccharide eluting in each molecular weight class at each time point could be calculated.

Additional file 1. Supplementary figures
Additional file 2. Supplementary materials LK assisted with the rumen extractions and performed the rumen incubations with FLA-YM, selective bacterial growth profiling and TLC analysis, RNA sequencing, production of FLA-YM, FGC-bacterial incubations, construction of the SusD-C-myc mutant and evaluation of production, and extraction of S. pombe YM; prepared figures; and wrote manuscript. GR conducted the epifluorescence microscopy and SR-SIM analysis, FISH analysis, cell enumeration of rumen samples, flow cytometry sorting and analysis, and statistical analysis and assisted with figure and manuscript writing and preparation. JPT performed comparative genome and protein sequence analysis and assisted with figure preparation. DRJ assisted with comparative genome and protein analysis. JHH helped conceive of the study, interpreted the data, and assisted in the preparation of the manuscript. ADS performed isolations of rumen bacteria and 16S rRNA sequencing. TDS conducted the statistical analysis of RNA-seq data. CAr assisted with the generation of FLA-PS, analysis FLA-PS products, and preparation of the manuscript. LJ performed the rumen collections and assisted with the bacterial isolations. TWA assisted with the rumen extractions and preparation of the manuscript. CAm assisted with the RNA-seq analysis and figure generation. DT assisted with the CAZyme fingerprinting and maintenance of the SACCHARIS pipeline. RA helped conceive of the study and assisted with the SR-SIM microscopy and preparation of the manuscript. TAM assisted with the animal study, rumen extractions, and preparation of the manuscript. DPY assisted with the data analysis and preparation of the manuscript. DWA helped conceive of the study, secured the funding, designed the study, performed the data analysis, and assisted with figure and manuscript preparation. The authors read and approved the final manuscript.