- Open Access
Predicting recurrence of Clostridium difficile infection following encapsulated fecal microbiota transplantation
Microbiomevolume 6, Article number: 166 (2018)
Fecal microbiota transplantation (FMT) is an effective treatment for recurrent Clostridium difficile infection (rCDI). The use of freeze-dried, encapsulated donor material for FMT (cap-FMT) allows for an easy route of administration and remains clinically effective in the majority of rCDI patients. We hypothesized that specific shifts in the microbiota in response to cap-FMT could predict clinical outcome. We further evaluated the degree of donor microbiota engraftment to determine the extent that donor transfer contributed to recovery.
In total, 89 patients were treated with 100 separate cap-FMTs, with a success rate (no rCDI 60 days post cap-FMT) of 80%. Among responders, the lower alpha diversity (ANOVA P < 0.05) observed among patient’s pre-FMT samples was restored following cap-FMT. At 1 week post-FMT, community composition varied by clinical outcome (ANOSIM P < 0.001), with similar abundances among families (Lachnospiraceae, Ruminococcaceae, and Bacteroidaceae) in responder and donor samples. Families that showed differential abundances by outcome (response vs. recurrence) from samples collected 7 days following cap-FMT were used to construct a regression tree-based model to predict recurrence. Results showed a training accuracy of 100% to predict recurrence and the model was 97% accurate against a test data set of samples collected 8–20 days following cap-FMT. Evaluation of the extent of engraftment using the Bayesian algorithm SourceTracker revealed that approximately 50% of the post-FMT communities of responders were attributable to donor microbiota, while an additional 20–30% of the communities were similar to a composite healthy microbiota consisting of all donor samples.
Regression tree-based analyses of microbial communities identified taxa significantly related to clinical response after 7 days, which can be targeted to improve microbial therapeutics. Furthermore, reinstatement of a healthy assemblage following cap-FMT was only partially attributable to explicit donor engraftment and continued to develop towards an overall healthy assemblage, independent of donor.
Clostridium difficile infection (CDI) remains a common cause of hospital and community-acquired infection . One of the most difficult clinical challenges associated with CDI is recurrent infection (rCDI) despite multiple courses of antibiotics [2, 3]. Fecal microbiota transplantation (FMT) has emerged recently as a highly effective treatment of rCDI that is now endorsed by professional societies and incorporated into standard treatment guidelines as an option following failure of antibiotic treatment [4, 5]. FMT involves transfer of fecal microorganisms from healthy donors to patients to correct antibiotic-induced dysbiosis, which is the primary causal risk factor for CDI in most patients.
Unlike antibiotics, FMT represents a restorative therapeutic approach that results in donor-like normalization of fecal microbial community structure and functionality [6, 7]. However, despite its impressive overall efficacy in breaking the cycle of CDI recurrence, FMT fails to cure rCDI in a fraction of patients. The reasons for FMT failure are not well understood; potential variables include specific patient factors [8, 9], potential resistance of individual strains of C. difficile bacteria to FMT activity , or failure to activate protective mechanisms or achieve full potency, possibly due to inadequate engraftment of key donor microorganisms . In order to understand why FMT may fail, it is critical to know the mechanisms by which FMT is able to break the cycle of CDI recurrence. Achieving this understanding is also important for development of reliable next-generation anti-CDI therapeutics.
Identification of specific microbial taxa that are essential for resolution of rCDI would allow for a targeted microbiota restoration approach and decrease the FMT failure rate. Specific approaches have included attempts to correlate microbiome analyses with CDI recurrence risk, as well as treatments of rCDI with defined consortia of microorganisms or preparations of fecal microbiota of reduced complexity [11,12,13,14]. However, these investigations have not yielded consistent results. Here, we expand upon our previous studies analyzing microbiome recovery in rCDI patients treated with an encapsulated preparation of freeze-dried microbiota (cap-FMT) . Clinically, cap-FMT has been very successful [15, 16]. However, in contrast to colonoscopic administration of fecal microbiota, which results in prompt engraftment of the entire donor bacterial communities within 24–48 h , treatment with this oral FMT preparation is associated with more gradual, punctuated kinetics of microbiome normalization over a period of approximately a month . The increased period of recovery presents a potential opportunity to capture the essential steps in microbiome repair and identify critical microbial taxa in controlling CDI.
Clinical cohort and fecal samples
The current analysis includes samples from an expanded cohort of 89 consecutively treated rCDI patients who participated in the study. The cohort includes the 49 patients described previously  and reflects 100 separate cap-FMTs from eight different donor lots. A donor lot is defined as the preserved fecal microbiota purified from a single stool sample. The demographics and clinical characteristics of this patient cohort are shown in Additional file 1: Table S1. None of the patients had underlying inflammatory bowel disease (IBD), which is associated with lower success rates of FMT in treating rCDI . The success rate for cap-FMT treatment was 80%. Among the non-responders, the median interval between cap-FMT and diagnosis of CDI relapse was 13 days (range 4–42 days). The general clinical course among patients following cap-FMT was similar to that following colonoscopic FMT experience in our center (> 400 colonoscopic FMTs for rCDI), including the interval between the procedure and diagnosis of CDI relapse in patients without IBD who failed treatment. Patients who had CDI recurrence were subsequently retreated by cap-FMT, received colonoscopic FMT, or remained on a long-term suppressive treatment with a daily dose of vancomycin. During the course of the study, the donor lot, total capsule dosage, and timing of delivery varied based on clinical experience, but none of these factors were shown to be significantly related to patient outcomes (Additional file 1: Table S2). Due to variation in study protocols during the course of this study and logistics of stool collections, analyses were done using samples that were grouped by broad time points encompassing days (days 2–6), weeks (days 7–20), months (days 21–60), and longer-term samples (> 60 days) following cap-FMT (Additional file 1: Figure S1).
FMT is associated with characteristic shifts in bacterial community alpha and beta diversity
As expected, alpha diversity, community richness, and evenness measured by the Shannon index were lowest in pre-FMT samples compared to those collected from donors and patients post-FMT (post hoc P ≤ 0.014; Fig. 1). In addition, post-FMT samples from non-responders had lower alpha diversity relative to samples from donors (P ≤ 0.034), but did not differ significantly from responder post-FMT samples, at any time point. Donor communities were primarily comprised of members of the Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, and Porphyromonadaceae, while pre-FMT communities had greater numbers of Enterobacteriaceae (Fig. 2). The microbial communities in responders began to taxonomically resemble those of the donors at the “weeks” time point post-FMT (Fig. 2), with increases in the donor-associated families described above, and a significant reduction in abundances of members of the Enterobacteriaceae.
Changes in bacterial community composition were evaluated by using Bray-Curtis dissimilarities, which account for differences in the abundances of operational taxonomic units (OTUs) among sample groups. Over time, progressive, temporal changes were observed in bacterial communities in post-FMT samples from patients who responded to cap-FMT, but not those who experienced recurrence. No significant differences were noted between the fecal bacterial communities in cap-FMT responders and non-responders at the “days” time point (analysis of similarity (ANOSIM) R = 0.10, P = 0.052). However, the communities from cap-FMT responders and non-responders diverged after the “weeks” time points (R = 0.11, P = 0.020; Bonferroni-corrected α = 0.002). Specifically, bacterial communities of cap-FMT responders differed significantly at all later time points (“weeks” through “long-term”) relative to the communities at the “days” time point (R = 0.14–0.23, P < 0.001). Moreover, fecal bacterial composition of responder samples did not change significantly following the “weeks” time point (R = − 0.03 to − 0.01, P ≥ 0.627). In contrast, no significant differences between the “days” and “weeks” time points (R = 0.05, P = 0.142) were noted in the bacterial communities in feces from the cap-FMT non-responder group.
To evaluate which metabolic functional traits may be associated with response to cFMT, functional inferences were performed and correlated with days-post-FMT among samples from patients who responded. The abundances of most inferred functional traits were negatively correlated with the number of days following cFMT (Additional file 1: Table S3), including genes associated with nitrogen and sulfur metabolism (ρ = − 0.124 and − 0.158, P = 0.038 and 0.008, respectively). However, functional genes within the tier 2 category of glycan biosynthesis and metabolism showed weak, but significant positive correlations (ρ = 0.125–0.191, P ≤ 0.037). Notably, genes inferred to function in primary and secondary bile acid biosynthesis were also positively correlated with duration following cFMT among responders (ρ = 0.215 and 0.217, respectively; P < 0.0001). Among genes involved in secondary bile acid, choloylglycine hydrolase (K01442), 7α-hydroxysteroid dehydrogenase (hdhA; KO00076), and 3-dehydro-bile acid Δ4,6-reductase (baiN; KO07007) were inferred. Members of the Lachnospiraceae were the predominant contributors of these genes among all samples (56.1, 93.9, and 66.5%, respectively). Members of the Bacteroidaceae, Porphyromonadaceae, Verrucomicrobiaceae, and Ruminococcaceae were also predominant contributors to abundances of choloylglycine hydrolase (9.1, 7.6, 6.9, and 6.6%, respectively), and members of Enterobacteriaceae and Ruminococcaceae were predominant contributors to abundances of baiN (10.5 and 8.1%). Members of Coriobacteriaceae and Synergistaceae were among the only other families inferred to contribute to abundances of hdhA (4.3 and 1.1%).
Identification of markers indicative of clinical outcome
Ordination of Bray-Curtis dissimilarity matrices among samples by principal coordinate analysis (PCoA, Fig. 3) showed separation of samples associated with time and clinical outcome along the x-axis. This is in agreement with ANOSIM analyses (above). The Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, Porphyromonadaceae, and Enterobacteriaceae comprised the predominant families whose abundances were significantly correlated with position along this axis (Fig. 3). Similarly, and more specifically, the predominant OTUs that showed significantly different abundances due to time point and were related to clinical outcome by linear discriminant analysis of effect size (LEfSe), were also primarily classified to genera within the Lachnospiraceae, Ruminococcaceae, and Bacteroidaceae, among others (Table 1).
Since members of the Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, Porphyromonadaceae, and Enterobacteriaceae were related to time points and clinical outcome by various statistical tests, we hypothesized that abundances of these families could be used to predict the likelihood of recurrence. Given a statistical similarity of the microbial community composition in responders at the “weeks” time point, relative to donors, these data were selected to develop a chi-squared automatic interaction detection (CHAID)-based regression tree model, using a machine learning approach. The model was trained on all cap-FMT patient data (responders and non-responders) collected from day 7 (n = 64) and tested on subsequent patient data from days 8–20 post-FMT (n = 67, Table 2). Ten samples from the training data were withheld for validation prior to testing. Thus, the training and test datasets were independent from each other. When more than one sample representing a unique cap-FMT delivery was present in the days 8–20 dataset, only the earliest time point post-FMT was included.
The model generated had an overall training accuracy of 100% (correct classification of all samples) using the 7-day data and a test accuracy of 97.0% against the days 8–20 data set (Table 2). The model specificity was 96.5% to identify a recurrence within weeks of cap-FMT, with a sensitivity of 100% against the test data. Attempts to improve accuracy against the test data were unsuccessful by using all family-level data and by using the same parameters for tree construction, producing a maximum test accuracy of 94.0%.
Evaluation of engraftment in relation to clinical outcome
To determine how donor microbiota engraftment influenced clinical outcome, the extent of engraftment was assessed using SourceTracker software, which employs a Bayesian algorithm to determine the percent similarity in OTU composition between the donor (the source) and patient (the sink) communities . Engraftment among patient samples was assessed by (1) using communities from all donor lots as a “composite” source (evaluates the similarity of patient sample communities to a generalized healthy fecal microbiota); (2) designating each “specific” donor lot as a unique source (testing the ability of SourceTracker to discriminate among donors); and (3) analyzing “individual” donor lots and associated patient samples separately (determines the similarity to the specific donor lot representing empirical engraftment). Among all patient samples, the extent of engraftment based on these analysis methods were in the order composite > individual > specific, with each method significantly different from the others (Tukey’s post hoc P < 0.0001). Furthermore, since all the same OTUs were not consistently transferred to all patients, regardless of the donor lot or the method of analysis (data not shown), taxonomic assignments were performed at the family level to assess patterns of engraftment (Additional file 1: Table S4).
When microbial communities from all donor lots were pooled as a single composite source (Fig. 4a), approximately one quarter (27.6 ± 2.8%) of pre-FMT communities included OTUs shared with donor samples. This percentage similarity, however, was significantly less than all other post-FMT sample groups (post hoc P < 0.001). In contrast, among the post-FMT samples, the extent of engraftment was significantly greater at the “weeks” and later time points among patients who responded, relative to those that had recurrence within days of cap-FMT (P ≤ 0.036). The greatest similarity to all donors was observed among responders at approximately the 1 month post-FMT time point. Furthermore, the percent of donor engraftment was significantly and positively correlated with the number of days post-FMT (Spearman’s ρ = 0.536, P < 0.0001). The OTUs that were associated with engraftment (Fig. 4b) were predominantly classified within the families Lachnospiraceae, followed by Bacteroidaceae and Ruminococcaceae.
Assessment of specific donor engraftment among the entire dataset similarly revealed a small attribution (6.6 ± 1.4%) of the communities in pre-FMT samples to donor communities. This was significantly less than that seen in post-FMT engraftment samples among patients who responded to cap-FMT (P ≤ 0.011). Differences in engraftment between the pre-FMT samples and those from patients who had a recurrence, however, did not differ significantly (P ≥ 0.138). While the percent of engraftment was still significantly correlated with days post-FMT (ρ = 0.296, P < 0.0001), the strength of this relationship was much weaker compared to that obtained from analyses of the composite. Furthermore, when samples were grouped by time point, there were no significant increases in engraftment at later times (P ≥ 0.336). Among patient samples from responders, collected at the “weeks” time point, microbial communities showed a greater proportion of similarity to the specific donor lot the patient received (Fig. 4c, Additional file 1: Figure S2), rather than to the communities from other donor lots. However, the empirical donor lot used for cap-FMT was not assigned a significantly greater percentage of the community than lots not delivered to the patient (post hoc P > 0.05).
Intermediate engraftment percentages were found when an individual donor lot was compared to samples from only those patients who received that lot (Fig. 4a, Table 3). Similar to the prior analyses, this analysis indicated the percent of engraftment in pre-FMT samples was significantly less than that of all post-FMT samples (P ≤ 0.044). In addition, the percent of engraftment was significantly correlated with the number of days post-FMT (ρ = 0.473, P < 0.0001). Moreover, samples from patients who responded to cap-FMT also had significantly greater percentages of engraftment in the “weeks” time point, relative to those from patients who had recurrence within days of receiving cap-FMT (P < 0.042). While the OTUs that engrafted were classified to similar families as those observed for the composite donor (Fig. 4b), donor-specific differences in the abundance of these taxa were observed (Additional file 1: Table S3). Notably, for two donors (numbers 44 and 62), approximately 8–14% of the donor-associated community was comprised of Verrucomicrobia, which was not contributed from other donors, although transfer of this family did not significantly affect efficacy of these donor lots (Additional file 1: Table S2). Moreover, many taxa were not unique to a single donor lot and 23.5 to 84.6% of sequence reads in a single donor lot belonged to taxa found in one or more other donor lot (Table 4).
In this study, we were able to predict, with great accuracy, an eventual recurrence of CDI following cap-FMT at 7 days post-FMT using an unbiased, statistical model incorporating the abundances of members of the families Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, Porphyromonadaceae, and Enterobacteriaceae. The abundances of Bacteroides spp. have been previously suggested to prevent recurrence , and the results of our current work support this supposition. However, while we previously suggested that dysbiotic signatures, e.g., sustained, elevated abundances of Enterobacteriaceae , may be useful in predicting recurrences, our analysis of recurrence events following cap-FMT here did not reflect this. While sustained levels of Enterobacteriaceae were noted following some recurrences, others were characterized by near elimination of this family, but incomplete restoration of diversity within the Firmicutes, Bacteroidetes, or both. This result suggests that the microbial community dynamics surrounding recurrence may be specific to the individual patient [8, 9].
The mechanisms by which FMT resolves rCDI are broadly associated with reinstatement of intestinal microbial diversity, as well as restoration of the functional and beneficial effects of the microbiota on host physiology and gut chemistry . Previous studies have noted significantly reduced alpha diversity that is restored following FMT [17, 22,23,24], suggesting that competition for nutrients between the reinstated flora and C. difficile may play a role in suppressing the infection. Furthermore, a recent longitudinal study found that intestinal microbial diversity was associated with both recurrence of CDI as well as the severity of the disease, with greater diversity associated with decreased severity and reduced likelihood of recurrence . In a mouse model, a single species, C. scindens, was inhibitory to C. difficile , and the suppressive mechanism was found to be associated with secondary bile acid biosynthesis. Similarly, our inferred functional data indicated a positive correlation between time following cFMT and genes associated with secondary bile acid synthesis. Moreover, the restoration of secondary bile acid metabolism was noted in patients treated with FMT . Several studies have also demonstrated the inhibitory effects of some bile acids, including chenodeoxycholic, lithocholic, and ursodeoxycholic acids, on germination of C. difficile spores [26,27,28]. Interestingly, co-administration of bacterial species from Lachnospiraceae and Porphyromonadaceae families enhanced the protective potency of C. scindens against CDI . Thus, the reinstatement of both microbial diversity and specific functional capabilities related to host physiology and microbe-microbe interactions are vital to the efficacy of FMT and restoration of gut health. However, accurate characterization of functions associated with response to FMT will require further experimental characterization.
We employed a regression tree-based machine-learning algorithm in order to assess factors that may be associated with recurrence [29, 30]. Due to the complexity of the microbial dataset, we selected independent variables for this model based on prior statistical analyses to identify highly discriminant taxa, which provided high sensitivity and specificity to detect recurrence at later time points. Thus, this method, using data-specific and statistically derived taxa as independent variables, may allow similar models to be applied to diverse patient populations. While the search for predictive models of recurrence is an active area of research , results using our exhaustive CHAID-regression tree model provide much greater accuracy than a random forest-based method used to predict or determine signatures of dysbiosis in microbiome data (maximum classification accuracy of 85.4%) . Furthermore, using unbiased variable selection resulted in high predictive accuracy independent of donor-specific taxa that likely engrafted successfully from some donor lots, such as Verrucomicrobia. This suggests that donor engraftment by itself is not solely predictive of the success of FMT, similar to our previous findings using colonoscopic FMT .
Full normalization of bacterial community structure following FMT with orally administered, encapsulated, freeze-dried microbiota appears to be significantly delayed relative to that seen with colonoscopic FMT, where donor-like microbiome restoration can be seen as early as 24 h following application . This delay may, in part, be due to variability in location of capsule release of microbiota in different patients given the substantial range in gastric pH and intestinal transit times. Further optimization of microbiota delivery with encapsulated preparations should solve this problem. Currently, our ability to predict failure from a sample obtained at 7 days after FMT, which generally precedes recurrence of CDI symptoms, is already potentially useful clinically. Patients with rCDI syndrome are often discouraged by multiple failures of standard antibiotic therapy and may choose indefinite treatment with vancomycin, despite the expense and risks of furthering antibiotic resistance. A demonstration of incomplete engraftment following cap-FMT suggests a rationale for performing repeat cap-FMTs. Notably, some patients fail even colonoscopic FMT. Unfortunately, we do not have a large systematic collection of stool samples over time following colonoscopic FMT. It is possible that engraftment of some bacteria that support resistance to C. difficile is intrinsically difficult due to some yet unknown host-specific factors. Therefore, continued investigations in patients who fail FMT may help identify some of the key bacteria and potential functions that can improve the consistency of next-generation FMT products.
Finally, the SourceTracker analysis done here revealed that complete donor transfer did not occur in patients, similar to previous reports [10, 18], although significantly greater engraftment was observed among responders. We found that the similarity of microbial community structure in patients relative to the donor lot used for FMT reached a maximum around 40–50%. Through the first month of FMT, however, the patient similarity to a broader pool of healthy donors was approximately 20–30% greater. Similarly, within-donor analyses showed, maximally, only 40 to 50% of the microbial community (to as low as 20%) was shared between healthy individuals. This result highlights the intrapersonal variability in the composition of gut microbiota [17, 33] and is suggestive that formulation of the cap-FMT consortium should focus on use of keystone bacterial species, likely those within the families identified, that promote the healthy reorganization of the intestinal microbiota, independent of donor- or patient-specific factors.
Fecal microbiota transplantation using freeze-dried, encapsulated, microbiota is an effective treatment for rCDI that results in restoration of microbial diversity and reinstatement of healthy microbiota, similar to that observed following colonoscopic and nasoduodenal approaches with frozen or fresh fecal microbiota . While the restoration of bacterial diversity is currently slower than that observed using more traditional approaches , shifts in the microbiota reflective of clinical response can be observed within 7 days of cap-FMT. Using a regression tree-based approach, taxa that are predictive of clinical response can be identified and targeted to improve microbial therapeutics. Furthermore, we found that fecal microbiota transfer from donors accounted for approximately half of the patient community among patients who responded to cap-FMT, but that the microbiota then continued to shift to achieve a stable configuration that was more similar to a universal healthy intestinal assemblage. These results demonstrate the efficacy of an unbiased statistical model to determine which taxa are associated with patient response and inform efforts to optimize the cap-FMT preparation.
Preparation of encapsulated microbiota
Encapsulated fecal microbiota was prepared using eight different fecal samples from six donors [numbers 06, 20, 41 (three lots), 42, 44, and 62] who enrolled in the University of Minnesota donor program, as described previously . Any single course of capsules was constituted from only one lot (a single donation) of donor material. Freeze-dried preparations were encapsulated as described previously . Briefly, fecal material was homogenized by blending under N2 gas, sieved to remove large particles, amended with 5% trehalose, and freeze-dried. Capsules were stored at − 80 °C until provided to patients. An evolving study protocol was used that was adapted based on clinical experience . The principle factors that were varied, besides donor lot, included (1) total dosage of microorganisms, ranging from 2.1 × 1011 to 2.0 × 1012 cells, and (2) number of capsules and days over which capsules were taken, ranging from two capsules in a single day to 27 capsules over 3 days of administration (Additional file 1: Table S2).
Patients and sample collection
Patient inclusion and exclusion criteria, as well as a subset of patient demographics, were described previously . Patients were enrolled if they had at least two prior recurrences of C. difficile infection, failed to respond to antibiotic therapies, and were C.-difficile-toxin-B-positive by PCR, at least 3 months prior to treatment . This work extends our previous work with the inclusion of a total of 89 patients who received a total of 100 cap-FMTs. The patients were taking oral vancomycin until 2 days prior to cap-FMT treatment. Patients received no colon purgative prior to cap-FMT, as described previously . The FMT capsules were delivered to the patients by a research coordinator. Cap-FMT (2–5 capsules, depending on the preparation lot, as a single treatment dose) was administered on an empty stomach with only clear liquids allowed afterwards for 2 h. The patients remained in close contact with the coordinator and clinical staff throughout follow-up. Clinical failure of cap-FMT was determined as return of diarrheal symptoms and a positive test for C. difficile toxin (toxin B PCR) through a 2-month clinical follow-up . Patients who had a recurrence of infection had the option of receiving a follow-up cap-FMT or colonoscopic FMT. The cohort described in this paper includes only the patients who were able to collect the fecal samples for this study. An additional 15 patients treated with cap-FMT did not participate in the stool analysis study because of logistic difficulties or inability to consent. Nevertheless, the clinical outcomes from these patients are included in the Additional file 1.
Patient samples were collected in single-use toilet hats and transferred by the patients to 30 ml polystyrene fecal specimen containers (Globe Scientific, Inc., Paramus, NJ, USA). Samples were stored in the patients’ freezers prior to transport to the laboratory on dry ice, where they were stored at − 20 to − 80 °C prior to DNA extraction.
DNA extraction and sequencing
DNA was extracted from 250 to 500 mg of thawed donor and patient fecal samples using the DNeasy® PowerSoil® Kit (QIAGEN, Hilden, Germany), according to the manufacturer’s instructions. Fecal material from donor 42 was not available for DNA extraction and sequencing. The V5-V6 hypervariable regions of the 16S rRNA gene were amplified using the BSF784/1064R primer set . Amplification was performed by the University of Minnesota Genomics Center (UMGC, Minneapolis, MN, USA) using an initial amplification with primers including Nextera adapter sequences (Illumina, Inc., San Diego, CA, USA) for 25 cycles. An additional 10 cycles of amplification was performed to add dual index tags to forward and reverse reads . Samples were sized-selected and pooled in equal amounts, as previously described , followed by paired-end sequencing at a read length of 300 nucleotides (nt) on the Illumina MiSeq or 250 nt on the HiSeq2500. Negative (sterile water) controls were carried through amplification and sequencing and did not produce amplicons.
Sequence processing and analyses were performed using mothur software ver. 1.35.1 , and the batch commands used are available in Additional file 2. Forward and reverse reads were trimmed to 150 nt to eliminate low-quality 3′-regions and paired-end joined using fastq-join software . Joined reads were quality trimmed at a base score of 35 over a window of 50 nt. In addition, samples with ambiguous bases, homopolymers > 8 nt, and > 2 mismatches from primer sequences were removed. High-quality sequences were aligned against the SILVA database ver. 132 . A 2% pre-cluster was used to remove any remaining likely sequence errors , and chimeric sequences were identified and removed using UCHIME software ver. 4.2.40 . Operational taxonomic units were assigned at 97% similarity using complete-linkage clustering and taxonomic classification was performed against the version 16 data release from the Ribosomal Database Project . For comparisons among samples, the number of sequences per sample was rarefied to 11,000 sequences by random subsampling .
To determine potential structure-function relationships and to examine changes in abundances of traits, functional inferences were made using the PICRUSt (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States) software ver. 1.1.3 . Analyses were done using the Numpy (1.13.3), biom-format (2.1.6), and PyCogent (1.5.3) dependencies. Rarefied sequence data were aligned against the GreenGenes database ver. 13.5  and normalized by copy number. Taxa that contributed to abundances of secondary bile acid genes were determined using the metgenome_contributions.py script. The mean nearest sequence taxon index (NSTI) value among all samples was 0.046 ± 0.022.
SourceTracker ver. 0.9.8 was used to assess engraftment, as the percent of a patient (sink) community that could be attributed to a donor (source) community using a Bayesian inference approach . For all SourceTracker analyses, default parameters were maintained including those for rarefaction (to 1000 reads) and α1, α2, and β Dirichlet hyperparameters (0.001, 0.01, and 0.01, respectively). To assess how differences in donor pools might affect SourceTracker results, engraftment was calculated by (1) specifying all donor lots as a composite source, with all patient samples as a sink (termed “composite donor”); (2) specifying each donor lot as a unique source, with all patient samples as a sink (termed “specific donor”); and (3) separating samples based on donor lot, where a single donor lot was the source and only samples from patients who received that lot were sinks (termed “individual donor”).
The Shannon index of alpha diversity was calculated by using mothur software. Analysis of similarity (ANOSIM), to assess differences in community composition , and ordination by principal coordinate analysis (PCoA)  were done by using Bray-Curtis dissimilarity matrices . To determine OTUs significantly associated with PCoA axis position, the corr.axes function in mothur was used. Linear discriminant analysis of effect size (LEfSe)  was used to evaluate OTUs indicative of clinical outcome at specific time points.
A regression tree approach was used to determine whether it was possible to build a predictive model for the detection of recurrence. In this approach, the exhaustive chi-squared automatic interaction detection (CHAID) procedure  was utilized, with the Pearson measure set at a maximum tree depth of five levels. Ten samples from the training data were withheld for internal model validation. The CHAID-regression tree model was built and tested using XLSTAT software ver. 17.06 (Addinsoft, Belmont, MA), with all other default settings maintained. Input data consisted of the relative abundances (as percent) of the families among all 7-day-post-FMT time points. Following training, the model was tested against all patient samples not used in the training set) from days 8 to 20. For follow-up testing and potential tuning, all tree-building parameters were maintained. The tree structure obtained to predict recurrence is shown in Additional file 1: Table S5.
Chi-squared tests, analysis of variance (ANOVA), followed by Tukey’s post hoc test, and Spearman rank correlations were also performed using XLSTAT. All statistics were evaluated at α = 0.05 with Bonferroni correction for multiple comparisons, where applicable.
Analysis of similarity
Encapsulated fecal microbiota transplantation
Clostridium difficile infection
Fecal microbiota transplantation
Linear discriminant analysis of effect size
Operational taxonomic unit
Principal coordinate analysis
Recurrent Clostridium difficile infection
Lessa FC, Mu Y, Bamberg WM, Beldavs ZG, Dumyati GK, Dunn JR, et al. Burden of Clostridium difficile infection in the United States. N Engl J Med. 2015;372:825–34.
Kelly CP, Lamont JT. Clostridium difficile — more difficult than ever. N Engl J Med. 2008;359:1932–40.
Ma GK, Brensinger CM, Wu Q, Lewis JD. Increasing incidence of multiply recurrent Clostridium difficile infection in the United States. Ann Intern Med. 2017;167:152.
Surawicz CM, Brandt LJ, Binion DG, Ananthakrishnan AN, Curry SR, Gilligan PH, et al. Guidelines for diagnosis, treatment, and prevention of Clostridium difficile infections. Am J Gastroenterol. 2013;108:478–98.
McDonald LC, Gerding DN, Johnson S, Bakken JS, Carroll KC, Coffin SE, et al. Clinical practice guidelines for Clostridium difficile infection in adults and children: 2017 update by the Infectious Diseases Society of America (IDSA) and Society for Healthcare Epidemiology of America (SHEA). Clin Infect Dis. 2018;66:987–94.
van Nood E, Vrieze A, Nieuwdorp M, Fuentes S, Zoetendal EG, de Vos WM, et al. Duodenal infusion of donor feces for recurrent Clostridium difficile. N Engl J Med. 2013;368:407–15.
Weingarden AR, Chen C, Bobr A, Yao D, Lu Y, Nelson VM, et al. Microbiota transplantation restores normal fecal bile acid composition in recurrent Clostridium difficile infection. Am J Physiol Gastrointest Liver Physiol. 2014;306:G310–9.
Li SS, Zhu A, Benes V, Costea PI, Hercog R, Hildebrand F, et al. Durable coexistence of donor and recipient strains after fecal microbiota transplantation. Science. 2016;352:586–9.
Smillie CS, Sauk J, Gevers D, Friedman J, Sung J, Youngster I, et al. Strain tracking reveals the determinants of bacterial engraftment in the human gut following fecal microbiota transplantation. Cell Host Microbe. 2018;23:229–240.e5.
Staley C, Kelly CR, Brandt LJ, Khoruts A, Sadowsky MJ. Complete microbiota engraftment is not essential for recovery from recurrent Clostridium difficile infection following fecal microbiota transplantation. MBio. 2016;7:e01965–16.
Seekatz AM, Rao K, Santhosh K, Young VB. Dynamics of the fecal microbiome in patients with recurrent and nonrecurrent Clostridium difficile infection. Genome Med. 2016;8:47.
Khanna S, Pardi DS, Kelly CR, Kraft CS, Dhere T, Henn MR, et al. A novel microbiome therapeutic increases gut microbial diversity and prevents recurrent Clostridium difficile infection. J Infect Dis. 2016;214:173.
Tvede M, Rask-Madsen J. Bacteriotherapy for chronic relapsing Clostridium difficile diarrhoea in six patients. Lancet. 1989;1:1156–60.
Petrof EO, Gloor GB, Vanner SJ, Weese SJ, Carter D, Daigneault MC, et al. Stool substitute transplant therapy for the eradication of Clostridium difficile infection: “RePOOPulating” the gut. Microbiome. 2013;1:3.
Staley C, Hamilton MJ, Vaughn BP, Graiziger CT, Newman KM, Kabage AJ, et al. Successful resolution of recurrent Clostridium difficile infection using freeze-dried, encapsulated microbiota; pragmatic cohort study. Am J Gastroenterol. 2017;112:940–7.
Kao D, Roach B, Silva M, Beck P, Rioux K, Kaplan GG, et al. Effect of oral capsule- vs colonoscopy-delivered fecal microbiota transplantation on recurrent Clostridium difficile infection. JAMA. 2017;318:1985–93.
Weingarden A, González A, Vázquez-Baeza Y, Weiss S, Humphry G, Berg-Lyons D, et al. Dynamic changes in short- and long-term bacterial composition following fecal microbiota transplantation for recurrent Clostridium difficile infection. Microbiome. 2015;3:10.
Staley C, Vaughn BP, Graiziger CTCT, Singroy S, Hamilton MJMJ, Yao D, et al. Community dynamics drive punctuated engraftment of the fecal microbiome following transplantation using freeze-dried, encapsulated fecal microbiota. Gut Microbes. 2017;8:276–88.
Khoruts A, Rank KM, Newman KM, Viskocil K, Vaughn BP, Hamilton MJ, et al. Inflammatory bowel disease affects the outcome of fecal microbiota transplantation for recurrent Clostridium difficile infection. Clin Gastroenterol Hepatol. 2016;14:1433–8.
Knights D, Kuczynski J, Charlson ES, Zaneveld J, Mozer MC, Collman RG, et al. Bayesian community-wide culture-independent microbial source tracking. Nat Methods. 2011;8:761–U107.
Khoruts A, Sadowsky MJ. Understanding the mechanisms of faecal microbiota transplantation. Nat Rev Gastroenterol Hepatol. 2016;13:508–16.
Hamilton MJ, Weingarden AR, Unno T, Khoruts A, Sadowsky MJ. High-throughput DNA sequence analysis reveals stable engraftment of gut microbiota following transplantation of previously frozen fecal bacteria. Gut Microbes. 2013;4:125–35.
Seekatz AM, Aas J, Gessert CE. Recovery of the gut microbiome following fecal microbiota. MBio. 2014;5:1–9.
Kelly C, Khoruts A, Staley C, Sadowsky M, Abd M, Alani M, et al. Effect of fecal microbiota transplantation on recurrence in multiply recurrent Clostridium difficile infection: a randomized trial. Ann Intern Med. 2016;165:609–16.
Buffie CG, Bucci V, Stein RR, McKenney PT, Ling L, Gobourne A, et al. Precision microbiome reconstitution restores bile acid mediated resistance to Clostridium difficile. Nature. 2015;517:205–8.
Giel JL, Sorg JA, Sonenshein AL, Zhu J. Metabolism of bile salts in mice influences spore germination in Clostridium difficile. PLoS One. 2010;5:e8740.
Theriot CM, Bowman AA, Young VB. Antibiotic-induced alterations of the gut microbiota alter secondary bile acid production and allow for Clostridium difficile spore germination and outgrowth in the large intestine. mSphere. 2016;1:e00045–15.
Weingarden AR, Chen C, Zhang N, Graiziger CT, Dosa PI, Steer CJ, et al. Ursodeoxycholic acid inhibits Clostridium difficile spore germination and vegetative growth, and prevents the recurrence of ileal pouchitis associated with the infection. J Clin Gastroenterol. 2016;50:624–30.
Biggs D, De Ville B, Suen E. A method of choosing multiway partitions for classification and decision trees. J Appl Stat. 1991;18:49–62.
Lemon SC, Roy J, Clark MA, Friedmann PD, Rakowski W. Classification and regression tree analysis in public health: methodological review and comparison with logistic regression. Ann Behav Med. 2003;26:172–81.
Fischer M, Kao D, Mehta SR, Martin T, Dimitry J, Keshteli AH, et al. Predictors of early failure after fecal microbiota transplantation for the therapy of Clostridium difficile infection: a multicenter study. Am J Gastroenterol. 2016;111:1024–31.
Pascal V, Pozuelo M, Borruel N, Casellas F, Campos D, Santiago A, et al. A microbial signature for Crohn’s disease. Gut. 2017;66:813.
Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R, Gordon JI. The human microbiome project: exploring the microbial part of ourselves in a changing world. Nature. 2007;449:804–10.
Hamilton MJ, Weingarden AR, Sadowsky MJ, Khoruts A. Standardized frozen preparation for transplantation of fecal microbiota for recurrent Clostridium difficile infection. Am J Gastroenterol. 2012;107:761–7.
Claesson MJ, Wang QO, O’Sullivan O, Greene-Diniz R, Cole JR, Ross RP, et al. Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regions. Nucleic Acids Res. 2010;38:gkq873.
Gohl DM, Vangay P, Garbe J, MacLean A, Hauge A, Becker A, et al. Systematic improvement of amplicon marker gene methods for increased accuracy in microbiome studies. Nat Biotechnol. 2016;34:942–9.
Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, et al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. 2009;75:7537–41.
Aronesty E. Comparison of sequencing utility programs. Open Bioinforma J. 2013;7:1–8.
Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig WG, Peplies J, et al. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. 2007;35:7188–96.
Huse SM, Welch DM, Morrison HG, Sogin ML. Ironing out the wrinkles in the rare biosphere through improved OTU clustering. Environ Microbiol. 2010;12:1889–98.
Edgar RC, Haas BJ, Clemente JC, Quince C, Knight R. UCHIME improves sensitivity and speed of chimera detection. Bioinformatics. 2011;27:2194–200.
Cole JR, Wang Q, Cardenas E, Fish J, Chai B, Farris RJ, et al. The ribosomal database project: improved alignments and new tools for rRNA analysis. Nucleic Acids Res. 2009;37:D141–5.
Gihring TM, Green SJ, Schadt CW. Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes. Environ Microbiol. 2012;14:285–90.
Langille MGI, Zaneveld J, Caporaso JG, McDonald D, Knights D, Reyes JA, et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol. 2013;31:814–21.
DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, et al. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006;72:5069–72.
Clarke KR. Non-parametric multivariate analyses of changes in community structure. Aust J Ecol. 1993;18:117–43.
Anderson MJ, Willis TJ. Canonical analysis of principal coordinates: a useful method of constrained ordination for ecology. Ecology. 2003;84:511–25.
Bray JR, Curtis JT. An ordination of the upland forest communities of southern Wisconsin. Ecol Monogr. 1957;27:325–49.
Segata N, Huttenhower C. Toward an efficient method of identifying core genes for evolutionary and functional microbial phylogenies. PLoS One. 2011;6:e24704.
Sequence processing and analysis were performed using the resources of the Minnesota Supercomputing Institute.
This research was made possible by support by grants from the NIH 1R21-AI114722–01 (AK, MJS), CIPAC Limited (to AK, MJS), Achieving Cures Together (to AK), and the Hubbard Foundation (to AK).
Availability of data and material
Sequence data are deposited in the Sequence Read Archive at the National Center for Biotechnology Information under BioProject accession number SRP070464.
Ethics approval and consent to participate
Approval for this study was given by the University of Minnesota Institutional Review Board (Protocol Number: 0901M56962). All human subjects provided informed consent for participation in the study and collection and analysis of data. All human subjects gave their permission for their information to be published.
Consent for publication
AK, MJH, and MJS hold the following patent which relates to the content of this manuscript: Compositions and methods for transplantation of colon microbiota, US Patent 2014/0147417 A1, filed 9 March 2012, accepted 29 May 2014. The other authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
- Clostridium difficile
- Encapsulated microbiota
- Fecal microbiota transplantation
- Machine learning
- Prediction model
- Microbial community structure