Skip to main content

Predicting recurrence of Clostridium difficile infection following encapsulated fecal microbiota transplantation



Fecal microbiota transplantation (FMT) is an effective treatment for recurrent Clostridium difficile infection (rCDI). The use of freeze-dried, encapsulated donor material for FMT (cap-FMT) allows for an easy route of administration and remains clinically effective in the majority of rCDI patients. We hypothesized that specific shifts in the microbiota in response to cap-FMT could predict clinical outcome. We further evaluated the degree of donor microbiota engraftment to determine the extent that donor transfer contributed to recovery.


In total, 89 patients were treated with 100 separate cap-FMTs, with a success rate (no rCDI 60 days post cap-FMT) of 80%. Among responders, the lower alpha diversity (ANOVA P < 0.05) observed among patient’s pre-FMT samples was restored following cap-FMT. At 1 week post-FMT, community composition varied by clinical outcome (ANOSIM P < 0.001), with similar abundances among families (Lachnospiraceae, Ruminococcaceae, and Bacteroidaceae) in responder and donor samples. Families that showed differential abundances by outcome (response vs. recurrence) from samples collected 7 days following cap-FMT were used to construct a regression tree-based model to predict recurrence. Results showed a training accuracy of 100% to predict recurrence and the model was 97% accurate against a test data set of samples collected 8–20 days following cap-FMT. Evaluation of the extent of engraftment using the Bayesian algorithm SourceTracker revealed that approximately 50% of the post-FMT communities of responders were attributable to donor microbiota, while an additional 20–30% of the communities were similar to a composite healthy microbiota consisting of all donor samples.


Regression tree-based analyses of microbial communities identified taxa significantly related to clinical response after 7 days, which can be targeted to improve microbial therapeutics. Furthermore, reinstatement of a healthy assemblage following cap-FMT was only partially attributable to explicit donor engraftment and continued to develop towards an overall healthy assemblage, independent of donor.


Clostridium difficile infection (CDI) remains a common cause of hospital and community-acquired infection [1]. One of the most difficult clinical challenges associated with CDI is recurrent infection (rCDI) despite multiple courses of antibiotics [2, 3]. Fecal microbiota transplantation (FMT) has emerged recently as a highly effective treatment of rCDI that is now endorsed by professional societies and incorporated into standard treatment guidelines as an option following failure of antibiotic treatment [4, 5]. FMT involves transfer of fecal microorganisms from healthy donors to patients to correct antibiotic-induced dysbiosis, which is the primary causal risk factor for CDI in most patients.

Unlike antibiotics, FMT represents a restorative therapeutic approach that results in donor-like normalization of fecal microbial community structure and functionality [6, 7]. However, despite its impressive overall efficacy in breaking the cycle of CDI recurrence, FMT fails to cure rCDI in a fraction of patients. The reasons for FMT failure are not well understood; potential variables include specific patient factors [8, 9], potential resistance of individual strains of C. difficile bacteria to FMT activity [9], or failure to activate protective mechanisms or achieve full potency, possibly due to inadequate engraftment of key donor microorganisms [10]. In order to understand why FMT may fail, it is critical to know the mechanisms by which FMT is able to break the cycle of CDI recurrence. Achieving this understanding is also important for development of reliable next-generation anti-CDI therapeutics.

Identification of specific microbial taxa that are essential for resolution of rCDI would allow for a targeted microbiota restoration approach and decrease the FMT failure rate. Specific approaches have included attempts to correlate microbiome analyses with CDI recurrence risk, as well as treatments of rCDI with defined consortia of microorganisms or preparations of fecal microbiota of reduced complexity [11,12,13,14]. However, these investigations have not yielded consistent results. Here, we expand upon our previous studies analyzing microbiome recovery in rCDI patients treated with an encapsulated preparation of freeze-dried microbiota (cap-FMT) [15]. Clinically, cap-FMT has been very successful [15, 16]. However, in contrast to colonoscopic administration of fecal microbiota, which results in prompt engraftment of the entire donor bacterial communities within 24–48 h [17], treatment with this oral FMT preparation is associated with more gradual, punctuated kinetics of microbiome normalization over a period of approximately a month [18]. The increased period of recovery presents a potential opportunity to capture the essential steps in microbiome repair and identify critical microbial taxa in controlling CDI.


Clinical cohort and fecal samples

The current analysis includes samples from an expanded cohort of 89 consecutively treated rCDI patients who participated in the study. The cohort includes the 49 patients described previously [15] and reflects 100 separate cap-FMTs from eight different donor lots. A donor lot is defined as the preserved fecal microbiota purified from a single stool sample. The demographics and clinical characteristics of this patient cohort are shown in Additional file 1: Table S1. None of the patients had underlying inflammatory bowel disease (IBD), which is associated with lower success rates of FMT in treating rCDI [19]. The success rate for cap-FMT treatment was 80%. Among the non-responders, the median interval between cap-FMT and diagnosis of CDI relapse was 13 days (range 4–42 days). The general clinical course among patients following cap-FMT was similar to that following colonoscopic FMT experience in our center (> 400 colonoscopic FMTs for rCDI), including the interval between the procedure and diagnosis of CDI relapse in patients without IBD who failed treatment. Patients who had CDI recurrence were subsequently retreated by cap-FMT, received colonoscopic FMT, or remained on a long-term suppressive treatment with a daily dose of vancomycin. During the course of the study, the donor lot, total capsule dosage, and timing of delivery varied based on clinical experience, but none of these factors were shown to be significantly related to patient outcomes (Additional file 1: Table S2). Due to variation in study protocols during the course of this study and logistics of stool collections, analyses were done using samples that were grouped by broad time points encompassing days (days 2–6), weeks (days 7–20), months (days 21–60), and longer-term samples (> 60 days) following cap-FMT (Additional file 1: Figure S1).

FMT is associated with characteristic shifts in bacterial community alpha and beta diversity

As expected, alpha diversity, community richness, and evenness measured by the Shannon index were lowest in pre-FMT samples compared to those collected from donors and patients post-FMT (post hoc P ≤ 0.014; Fig. 1). In addition, post-FMT samples from non-responders had lower alpha diversity relative to samples from donors (P ≤ 0.034), but did not differ significantly from responder post-FMT samples, at any time point. Donor communities were primarily comprised of members of the Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, and Porphyromonadaceae, while pre-FMT communities had greater numbers of Enterobacteriaceae (Fig. 2). The microbial communities in responders began to taxonomically resemble those of the donors at the “weeks” time point post-FMT (Fig. 2), with increases in the donor-associated families described above, and a significant reduction in abundances of members of the Enterobacteriaceae.

Fig. 1

Shannon indices of donor and patient communities. The number of samples (n) is shown in parentheses. Letters indicate significant differences as assessed by Tukey’s post hoc test (P < 0.05), where sample groups sharing the same letter did not differ significantly

Fig. 2

Distribution of abundant phyla among A) donor and pre-FMT samples, B) post-FMT samples from responders, and C) post-FMT samples from patients who had recurrence. Numbers in parentheses reflect the numbers of samples in each group. Letters on family abundances reflect significant differences by Tukey’s post hoc test (P < 0.05) across all groups (all panels) for each family, separately. No significant differences were observed for abundances of Porphyromonadaceae. Less abundant families represent those present at mean abundance ≤ 5% of sequence reads among all samples

Changes in bacterial community composition were evaluated by using Bray-Curtis dissimilarities, which account for differences in the abundances of operational taxonomic units (OTUs) among sample groups. Over time, progressive, temporal changes were observed in bacterial communities in post-FMT samples from patients who responded to cap-FMT, but not those who experienced recurrence. No significant differences were noted between the fecal bacterial communities in cap-FMT responders and non-responders at the “days” time point (analysis of similarity (ANOSIM) R = 0.10, P = 0.052). However, the communities from cap-FMT responders and non-responders diverged after the “weeks” time points (R = 0.11, P = 0.020; Bonferroni-corrected α = 0.002). Specifically, bacterial communities of cap-FMT responders differed significantly at all later time points (“weeks” through “long-term”) relative to the communities at the “days” time point (R = 0.14–0.23, P < 0.001). Moreover, fecal bacterial composition of responder samples did not change significantly following the “weeks” time point (R = − 0.03 to − 0.01, P ≥ 0.627). In contrast, no significant differences between the “days” and “weeks” time points (R = 0.05, P = 0.142) were noted in the bacterial communities in feces from the cap-FMT non-responder group.

To evaluate which metabolic functional traits may be associated with response to cFMT, functional inferences were performed and correlated with days-post-FMT among samples from patients who responded. The abundances of most inferred functional traits were negatively correlated with the number of days following cFMT (Additional file 1: Table S3), including genes associated with nitrogen and sulfur metabolism (ρ = − 0.124 and − 0.158, P = 0.038 and 0.008, respectively). However, functional genes within the tier 2 category of glycan biosynthesis and metabolism showed weak, but significant positive correlations (ρ = 0.125–0.191, P ≤ 0.037). Notably, genes inferred to function in primary and secondary bile acid biosynthesis were also positively correlated with duration following cFMT among responders (ρ = 0.215 and 0.217, respectively; P < 0.0001). Among genes involved in secondary bile acid, choloylglycine hydrolase (K01442), 7α-hydroxysteroid dehydrogenase (hdhA; KO00076), and 3-dehydro-bile acid Δ4,6-reductase (baiN; KO07007) were inferred. Members of the Lachnospiraceae were the predominant contributors of these genes among all samples (56.1, 93.9, and 66.5%, respectively). Members of the Bacteroidaceae, Porphyromonadaceae, Verrucomicrobiaceae, and Ruminococcaceae were also predominant contributors to abundances of choloylglycine hydrolase (9.1, 7.6, 6.9, and 6.6%, respectively), and members of Enterobacteriaceae and Ruminococcaceae were predominant contributors to abundances of baiN (10.5 and 8.1%). Members of Coriobacteriaceae and Synergistaceae were among the only other families inferred to contribute to abundances of hdhA (4.3 and 1.1%).

Identification of markers indicative of clinical outcome

Ordination of Bray-Curtis dissimilarity matrices among samples by principal coordinate analysis (PCoA, Fig. 3) showed separation of samples associated with time and clinical outcome along the x-axis. This is in agreement with ANOSIM analyses (above). The Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, Porphyromonadaceae, and Enterobacteriaceae comprised the predominant families whose abundances were significantly correlated with position along this axis (Fig. 3). Similarly, and more specifically, the predominant OTUs that showed significantly different abundances due to time point and were related to clinical outcome by linear discriminant analysis of effect size (LEfSe), were also primarily classified to genera within the Lachnospiraceae, Ruminococcaceae, and Bacteroidaceae, among others (Table 1).

Fig. 3

Ordination of Bray-Curtis dissimilarity matrices by PCoA (r2 = 0.23) and the five most abundant families. Panels reflect the same analysis but are colored by time point for clarity: a) pre-FMT, b) days (2–6 days post-FMT), c) weeks (7–20 days), and d) months (21–60 days) and long-term (> 60 days). A total of 471 axes were used to explain all variation with the remaining axes explain < 2.3% of the variation individually. Family abundances were significantly related to x-axis position by Spearman correlation (P < 0.05). Legend: black circle—donor, ex mark—pre-FMT, green circle—responder/months, orange circle—recurrence, blue circle—long-term, gray circle—sample not associated with time point

Table 1 Genus-level classification and relative abundances of OTUs found to be significantly indicative (LDA ≥ 2.0, P < 0.05) of time point and clinical outcome by LEfSe analysis. Only the 15 most abundant genera are shown

Since members of the Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, Porphyromonadaceae, and Enterobacteriaceae were related to time points and clinical outcome by various statistical tests, we hypothesized that abundances of these families could be used to predict the likelihood of recurrence. Given a statistical similarity of the microbial community composition in responders at the “weeks” time point, relative to donors, these data were selected to develop a chi-squared automatic interaction detection (CHAID)-based regression tree model, using a machine learning approach. The model was trained on all cap-FMT patient data (responders and non-responders) collected from day 7 (n = 64) and tested on subsequent patient data from days 8–20 post-FMT (n = 67, Table 2). Ten samples from the training data were withheld for validation prior to testing. Thus, the training and test datasets were independent from each other. When more than one sample representing a unique cap-FMT delivery was present in the days 8–20 dataset, only the earliest time point post-FMT was included.

Table 2 Confusion matrices showing the accuracy of CHAID-regression tree model to (A) training data (post-FMT day 7 composition), (B) validation data (using 10 samples withheld from the training data) and (C) test data (post-FMT day 8–20 composition)

The model generated had an overall training accuracy of 100% (correct classification of all samples) using the 7-day data and a test accuracy of 97.0% against the days 8–20 data set (Table 2). The model specificity was 96.5% to identify a recurrence within weeks of cap-FMT, with a sensitivity of 100% against the test data. Attempts to improve accuracy against the test data were unsuccessful by using all family-level data and by using the same parameters for tree construction, producing a maximum test accuracy of 94.0%.

Evaluation of engraftment in relation to clinical outcome

To determine how donor microbiota engraftment influenced clinical outcome, the extent of engraftment was assessed using SourceTracker software, which employs a Bayesian algorithm to determine the percent similarity in OTU composition between the donor (the source) and patient (the sink) communities [20]. Engraftment among patient samples was assessed by (1) using communities from all donor lots as a “composite” source (evaluates the similarity of patient sample communities to a generalized healthy fecal microbiota); (2) designating each “specific” donor lot as a unique source (testing the ability of SourceTracker to discriminate among donors); and (3) analyzing “individual” donor lots and associated patient samples separately (determines the similarity to the specific donor lot representing empirical engraftment). Among all patient samples, the extent of engraftment based on these analysis methods were in the order composite > individual > specific, with each method significantly different from the others (Tukey’s post hoc P < 0.0001). Furthermore, since all the same OTUs were not consistently transferred to all patients, regardless of the donor lot or the method of analysis (data not shown), taxonomic assignments were performed at the family level to assess patterns of engraftment (Additional file 1: Table S4).

When microbial communities from all donor lots were pooled as a single composite source (Fig. 4a), approximately one quarter (27.6 ± 2.8%) of pre-FMT communities included OTUs shared with donor samples. This percentage similarity, however, was significantly less than all other post-FMT sample groups (post hoc P < 0.001). In contrast, among the post-FMT samples, the extent of engraftment was significantly greater at the “weeks” and later time points among patients who responded, relative to those that had recurrence within days of cap-FMT (P ≤ 0.036). The greatest similarity to all donors was observed among responders at approximately the 1 month post-FMT time point. Furthermore, the percent of donor engraftment was significantly and positively correlated with the number of days post-FMT (Spearman’s ρ = 0.536, P < 0.0001). The OTUs that were associated with engraftment (Fig. 4b) were predominantly classified within the families Lachnospiraceae, followed by Bacteroidaceae and Ruminococcaceae.

Fig. 4

SourceTracker analysis of engraftment in patient samples. A) average engraftment in patient samples with analysis performed using all donors pooled (“composite”, blue) or only samples associated with a single donor lot (“individual”, green). Letters indicated significant differences by Tukey’s post hoc test (P < 0.05) for the composite analysis only, where sample groups sharing the same letter did not differ significantly. B) Classification and abundance of OTUs determined to engraft by SourceTracker in the composite analysis. C) Similarity of responder fecal communities to unique donor lots at the ‘weeks’ time point (separate analysis than that shown in A). Numbers in parentheses reflect the number of samples. Fecal material from donor 42 was not available for sequencing. In all panels, error bars reflect SEM

Assessment of specific donor engraftment among the entire dataset similarly revealed a small attribution (6.6 ± 1.4%) of the communities in pre-FMT samples to donor communities. This was significantly less than that seen in post-FMT engraftment samples among patients who responded to cap-FMT (P ≤ 0.011). Differences in engraftment between the pre-FMT samples and those from patients who had a recurrence, however, did not differ significantly (P ≥ 0.138). While the percent of engraftment was still significantly correlated with days post-FMT (ρ = 0.296, P < 0.0001), the strength of this relationship was much weaker compared to that obtained from analyses of the composite. Furthermore, when samples were grouped by time point, there were no significant increases in engraftment at later times (P ≥ 0.336). Among patient samples from responders, collected at the “weeks” time point, microbial communities showed a greater proportion of similarity to the specific donor lot the patient received (Fig. 4c, Additional file 1: Figure S2), rather than to the communities from other donor lots. However, the empirical donor lot used for cap-FMT was not assigned a significantly greater percentage of the community than lots not delivered to the patient (post hoc P > 0.05).

Intermediate engraftment percentages were found when an individual donor lot was compared to samples from only those patients who received that lot (Fig. 4a, Table 3). Similar to the prior analyses, this analysis indicated the percent of engraftment in pre-FMT samples was significantly less than that of all post-FMT samples (P ≤ 0.044). In addition, the percent of engraftment was significantly correlated with the number of days post-FMT (ρ = 0.473, P < 0.0001). Moreover, samples from patients who responded to cap-FMT also had significantly greater percentages of engraftment in the “weeks” time point, relative to those from patients who had recurrence within days of receiving cap-FMT (P < 0.042). While the OTUs that engrafted were classified to similar families as those observed for the composite donor (Fig. 4b), donor-specific differences in the abundance of these taxa were observed (Additional file 1: Table S3). Notably, for two donors (numbers 44 and 62), approximately 8–14% of the donor-associated community was comprised of Verrucomicrobia, which was not contributed from other donors, although transfer of this family did not significantly affect efficacy of these donor lots (Additional file 1: Table S2). Moreover, many taxa were not unique to a single donor lot and 23.5 to 84.6% of sequence reads in a single donor lot belonged to taxa found in one or more other donor lot (Table 4).

Table 3 Similarity (%) to individual donor lots. Similarity was assessed only on patients who were treated using the individual donor lot. Fecal material from donor 42 was not available for sequencing
Table 4 Similarity (%) among donor lots determined by SourceTracker. Sequence data were not available for donor 42


In this study, we were able to predict, with great accuracy, an eventual recurrence of CDI following cap-FMT at 7 days post-FMT using an unbiased, statistical model incorporating the abundances of members of the families Lachnospiraceae, Ruminococcaceae, Bacteroidaceae, Porphyromonadaceae, and Enterobacteriaceae. The abundances of Bacteroides spp. have been previously suggested to prevent recurrence [13], and the results of our current work support this supposition. However, while we previously suggested that dysbiotic signatures, e.g., sustained, elevated abundances of Enterobacteriaceae [18], may be useful in predicting recurrences, our analysis of recurrence events following cap-FMT here did not reflect this. While sustained levels of Enterobacteriaceae were noted following some recurrences, others were characterized by near elimination of this family, but incomplete restoration of diversity within the Firmicutes, Bacteroidetes, or both. This result suggests that the microbial community dynamics surrounding recurrence may be specific to the individual patient [8, 9].

The mechanisms by which FMT resolves rCDI are broadly associated with reinstatement of intestinal microbial diversity, as well as restoration of the functional and beneficial effects of the microbiota on host physiology and gut chemistry [21]. Previous studies have noted significantly reduced alpha diversity that is restored following FMT [17, 22,23,24], suggesting that competition for nutrients between the reinstated flora and C. difficile may play a role in suppressing the infection. Furthermore, a recent longitudinal study found that intestinal microbial diversity was associated with both recurrence of CDI as well as the severity of the disease, with greater diversity associated with decreased severity and reduced likelihood of recurrence [11]. In a mouse model, a single species, C. scindens, was inhibitory to C. difficile [25], and the suppressive mechanism was found to be associated with secondary bile acid biosynthesis. Similarly, our inferred functional data indicated a positive correlation between time following cFMT and genes associated with secondary bile acid synthesis. Moreover, the restoration of secondary bile acid metabolism was noted in patients treated with FMT [7]. Several studies have also demonstrated the inhibitory effects of some bile acids, including chenodeoxycholic, lithocholic, and ursodeoxycholic acids, on germination of C. difficile spores [26,27,28]. Interestingly, co-administration of bacterial species from Lachnospiraceae and Porphyromonadaceae families enhanced the protective potency of C. scindens against CDI [25]. Thus, the reinstatement of both microbial diversity and specific functional capabilities related to host physiology and microbe-microbe interactions are vital to the efficacy of FMT and restoration of gut health. However, accurate characterization of functions associated with response to FMT will require further experimental characterization.

We employed a regression tree-based machine-learning algorithm in order to assess factors that may be associated with recurrence [29, 30]. Due to the complexity of the microbial dataset, we selected independent variables for this model based on prior statistical analyses to identify highly discriminant taxa, which provided high sensitivity and specificity to detect recurrence at later time points. Thus, this method, using data-specific and statistically derived taxa as independent variables, may allow similar models to be applied to diverse patient populations. While the search for predictive models of recurrence is an active area of research [31], results using our exhaustive CHAID-regression tree model provide much greater accuracy than a random forest-based method used to predict or determine signatures of dysbiosis in microbiome data (maximum classification accuracy of 85.4%) [32]. Furthermore, using unbiased variable selection resulted in high predictive accuracy independent of donor-specific taxa that likely engrafted successfully from some donor lots, such as Verrucomicrobia. This suggests that donor engraftment by itself is not solely predictive of the success of FMT, similar to our previous findings using colonoscopic FMT [10].

Full normalization of bacterial community structure following FMT with orally administered, encapsulated, freeze-dried microbiota appears to be significantly delayed relative to that seen with colonoscopic FMT, where donor-like microbiome restoration can be seen as early as 24 h following application [17]. This delay may, in part, be due to variability in location of capsule release of microbiota in different patients given the substantial range in gastric pH and intestinal transit times. Further optimization of microbiota delivery with encapsulated preparations should solve this problem. Currently, our ability to predict failure from a sample obtained at 7 days after FMT, which generally precedes recurrence of CDI symptoms, is already potentially useful clinically. Patients with rCDI syndrome are often discouraged by multiple failures of standard antibiotic therapy and may choose indefinite treatment with vancomycin, despite the expense and risks of furthering antibiotic resistance. A demonstration of incomplete engraftment following cap-FMT suggests a rationale for performing repeat cap-FMTs. Notably, some patients fail even colonoscopic FMT. Unfortunately, we do not have a large systematic collection of stool samples over time following colonoscopic FMT. It is possible that engraftment of some bacteria that support resistance to C. difficile is intrinsically difficult due to some yet unknown host-specific factors. Therefore, continued investigations in patients who fail FMT may help identify some of the key bacteria and potential functions that can improve the consistency of next-generation FMT products.

Finally, the SourceTracker analysis done here revealed that complete donor transfer did not occur in patients, similar to previous reports [10, 18], although significantly greater engraftment was observed among responders. We found that the similarity of microbial community structure in patients relative to the donor lot used for FMT reached a maximum around 40–50%. Through the first month of FMT, however, the patient similarity to a broader pool of healthy donors was approximately 20–30% greater. Similarly, within-donor analyses showed, maximally, only 40 to 50% of the microbial community (to as low as 20%) was shared between healthy individuals. This result highlights the intrapersonal variability in the composition of gut microbiota [17, 33] and is suggestive that formulation of the cap-FMT consortium should focus on use of keystone bacterial species, likely those within the families identified, that promote the healthy reorganization of the intestinal microbiota, independent of donor- or patient-specific factors.


Fecal microbiota transplantation using freeze-dried, encapsulated, microbiota is an effective treatment for rCDI that results in restoration of microbial diversity and reinstatement of healthy microbiota, similar to that observed following colonoscopic and nasoduodenal approaches with frozen or fresh fecal microbiota [16]. While the restoration of bacterial diversity is currently slower than that observed using more traditional approaches [18], shifts in the microbiota reflective of clinical response can be observed within 7 days of cap-FMT. Using a regression tree-based approach, taxa that are predictive of clinical response can be identified and targeted to improve microbial therapeutics. Furthermore, we found that fecal microbiota transfer from donors accounted for approximately half of the patient community among patients who responded to cap-FMT, but that the microbiota then continued to shift to achieve a stable configuration that was more similar to a universal healthy intestinal assemblage. These results demonstrate the efficacy of an unbiased statistical model to determine which taxa are associated with patient response and inform efforts to optimize the cap-FMT preparation.


Preparation of encapsulated microbiota

Encapsulated fecal microbiota was prepared using eight different fecal samples from six donors [numbers 06, 20, 41 (three lots), 42, 44, and 62] who enrolled in the University of Minnesota donor program, as described previously [34]. Any single course of capsules was constituted from only one lot (a single donation) of donor material. Freeze-dried preparations were encapsulated as described previously [15]. Briefly, fecal material was homogenized by blending under N2 gas, sieved to remove large particles, amended with 5% trehalose, and freeze-dried. Capsules were stored at − 80 °C until provided to patients. An evolving study protocol was used that was adapted based on clinical experience [15]. The principle factors that were varied, besides donor lot, included (1) total dosage of microorganisms, ranging from 2.1 × 1011 to 2.0 × 1012 cells, and (2) number of capsules and days over which capsules were taken, ranging from two capsules in a single day to 27 capsules over 3 days of administration (Additional file 1: Table S2).

Patients and sample collection

Patient inclusion and exclusion criteria, as well as a subset of patient demographics, were described previously [15]. Patients were enrolled if they had at least two prior recurrences of C. difficile infection, failed to respond to antibiotic therapies, and were C.-difficile-toxin-B-positive by PCR, at least 3 months prior to treatment [15]. This work extends our previous work with the inclusion of a total of 89 patients who received a total of 100 cap-FMTs. The patients were taking oral vancomycin until 2 days prior to cap-FMT treatment. Patients received no colon purgative prior to cap-FMT, as described previously [15]. The FMT capsules were delivered to the patients by a research coordinator. Cap-FMT (2–5 capsules, depending on the preparation lot, as a single treatment dose) was administered on an empty stomach with only clear liquids allowed afterwards for 2 h. The patients remained in close contact with the coordinator and clinical staff throughout follow-up. Clinical failure of cap-FMT was determined as return of diarrheal symptoms and a positive test for C. difficile toxin (toxin B PCR) through a 2-month clinical follow-up [15]. Patients who had a recurrence of infection had the option of receiving a follow-up cap-FMT or colonoscopic FMT. The cohort described in this paper includes only the patients who were able to collect the fecal samples for this study. An additional 15 patients treated with cap-FMT did not participate in the stool analysis study because of logistic difficulties or inability to consent. Nevertheless, the clinical outcomes from these patients are included in the Additional file 1.

Patient samples were collected in single-use toilet hats and transferred by the patients to 30 ml polystyrene fecal specimen containers (Globe Scientific, Inc., Paramus, NJ, USA). Samples were stored in the patients’ freezers prior to transport to the laboratory on dry ice, where they were stored at − 20 to − 80 °C prior to DNA extraction.

DNA extraction and sequencing

DNA was extracted from 250 to 500 mg of thawed donor and patient fecal samples using the DNeasy® PowerSoil® Kit (QIAGEN, Hilden, Germany), according to the manufacturer’s instructions. Fecal material from donor 42 was not available for DNA extraction and sequencing. The V5-V6 hypervariable regions of the 16S rRNA gene were amplified using the BSF784/1064R primer set [35]. Amplification was performed by the University of Minnesota Genomics Center (UMGC, Minneapolis, MN, USA) using an initial amplification with primers including Nextera adapter sequences (Illumina, Inc., San Diego, CA, USA) for 25 cycles. An additional 10 cycles of amplification was performed to add dual index tags to forward and reverse reads [36]. Samples were sized-selected and pooled in equal amounts, as previously described [36], followed by paired-end sequencing at a read length of 300 nucleotides (nt) on the Illumina MiSeq or 250 nt on the HiSeq2500. Negative (sterile water) controls were carried through amplification and sequencing and did not produce amplicons.


Sequence processing and analyses were performed using mothur software ver. 1.35.1 [37], and the batch commands used are available in Additional file 2. Forward and reverse reads were trimmed to 150 nt to eliminate low-quality 3′-regions and paired-end joined using fastq-join software [38]. Joined reads were quality trimmed at a base score of 35 over a window of 50 nt. In addition, samples with ambiguous bases, homopolymers > 8 nt, and > 2 mismatches from primer sequences were removed. High-quality sequences were aligned against the SILVA database ver. 132 [39]. A 2% pre-cluster was used to remove any remaining likely sequence errors [40], and chimeric sequences were identified and removed using UCHIME software ver. 4.2.40 [41]. Operational taxonomic units were assigned at 97% similarity using complete-linkage clustering and taxonomic classification was performed against the version 16 data release from the Ribosomal Database Project [42]. For comparisons among samples, the number of sequences per sample was rarefied to 11,000 sequences by random subsampling [43].

To determine potential structure-function relationships and to examine changes in abundances of traits, functional inferences were made using the PICRUSt (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States) software ver. 1.1.3 [44]. Analyses were done using the Numpy (1.13.3), biom-format (2.1.6), and PyCogent (1.5.3) dependencies. Rarefied sequence data were aligned against the GreenGenes database ver. 13.5 [45] and normalized by copy number. Taxa that contributed to abundances of secondary bile acid genes were determined using the script. The mean nearest sequence taxon index (NSTI) value among all samples was 0.046 ± 0.022.

SourceTracker ver. 0.9.8 was used to assess engraftment, as the percent of a patient (sink) community that could be attributed to a donor (source) community using a Bayesian inference approach [20]. For all SourceTracker analyses, default parameters were maintained including those for rarefaction (to 1000 reads) and α1, α2, and β Dirichlet hyperparameters (0.001, 0.01, and 0.01, respectively). To assess how differences in donor pools might affect SourceTracker results, engraftment was calculated by (1) specifying all donor lots as a composite source, with all patient samples as a sink (termed “composite donor”); (2) specifying each donor lot as a unique source, with all patient samples as a sink (termed “specific donor”); and (3) separating samples based on donor lot, where a single donor lot was the source and only samples from patients who received that lot were sinks (termed “individual donor”).

Statistical analyses

The Shannon index of alpha diversity was calculated by using mothur software. Analysis of similarity (ANOSIM), to assess differences in community composition [46], and ordination by principal coordinate analysis (PCoA) [47] were done by using Bray-Curtis dissimilarity matrices [48]. To determine OTUs significantly associated with PCoA axis position, the corr.axes function in mothur was used. Linear discriminant analysis of effect size (LEfSe) [49] was used to evaluate OTUs indicative of clinical outcome at specific time points.

A regression tree approach was used to determine whether it was possible to build a predictive model for the detection of recurrence. In this approach, the exhaustive chi-squared automatic interaction detection (CHAID) procedure [29] was utilized, with the Pearson measure set at a maximum tree depth of five levels. Ten samples from the training data were withheld for internal model validation. The CHAID-regression tree model was built and tested using XLSTAT software ver. 17.06 (Addinsoft, Belmont, MA), with all other default settings maintained. Input data consisted of the relative abundances (as percent) of the families among all 7-day-post-FMT time points. Following training, the model was tested against all patient samples not used in the training set) from days 8 to 20. For follow-up testing and potential tuning, all tree-building parameters were maintained. The tree structure obtained to predict recurrence is shown in Additional file 1: Table S5.

Chi-squared tests, analysis of variance (ANOVA), followed by Tukey’s post hoc test, and Spearman rank correlations were also performed using XLSTAT. All statistics were evaluated at α = 0.05 with Bonferroni correction for multiple comparisons, where applicable.



Analysis of similarity


Encapsulated fecal microbiota transplantation


Clostridium difficile infection


Fecal microbiota transplantation


Linear discriminant analysis of effect size


Operational taxonomic unit


Principal coordinate analysis


Recurrent Clostridium difficile infection


  1. 1.

    Lessa FC, Mu Y, Bamberg WM, Beldavs ZG, Dumyati GK, Dunn JR, et al. Burden of Clostridium difficile infection in the United States. N Engl J Med. 2015;372:825–34.

    Article  CAS  Google Scholar 

  2. 2.

    Kelly CP, Lamont JT. Clostridium difficile — more difficult than ever. N Engl J Med. 2008;359:1932–40.

    Article  CAS  Google Scholar 

  3. 3.

    Ma GK, Brensinger CM, Wu Q, Lewis JD. Increasing incidence of multiply recurrent Clostridium difficile infection in the United States. Ann Intern Med. 2017;167:152.

    Article  Google Scholar 

  4. 4.

    Surawicz CM, Brandt LJ, Binion DG, Ananthakrishnan AN, Curry SR, Gilligan PH, et al. Guidelines for diagnosis, treatment, and prevention of Clostridium difficile infections. Am J Gastroenterol. 2013;108:478–98.

    Article  CAS  Google Scholar 

  5. 5.

    McDonald LC, Gerding DN, Johnson S, Bakken JS, Carroll KC, Coffin SE, et al. Clinical practice guidelines for Clostridium difficile infection in adults and children: 2017 update by the Infectious Diseases Society of America (IDSA) and Society for Healthcare Epidemiology of America (SHEA). Clin Infect Dis. 2018;66:987–94.

    Article  Google Scholar 

  6. 6.

    van Nood E, Vrieze A, Nieuwdorp M, Fuentes S, Zoetendal EG, de Vos WM, et al. Duodenal infusion of donor feces for recurrent Clostridium difficile. N Engl J Med. 2013;368:407–15.

    Article  CAS  Google Scholar 

  7. 7.

    Weingarden AR, Chen C, Bobr A, Yao D, Lu Y, Nelson VM, et al. Microbiota transplantation restores normal fecal bile acid composition in recurrent Clostridium difficile infection. Am J Physiol Gastrointest Liver Physiol. 2014;306:G310–9.

    Article  CAS  Google Scholar 

  8. 8.

    Li SS, Zhu A, Benes V, Costea PI, Hercog R, Hildebrand F, et al. Durable coexistence of donor and recipient strains after fecal microbiota transplantation. Science. 2016;352:586–9.

    Article  CAS  Google Scholar 

  9. 9.

    Smillie CS, Sauk J, Gevers D, Friedman J, Sung J, Youngster I, et al. Strain tracking reveals the determinants of bacterial engraftment in the human gut following fecal microbiota transplantation. Cell Host Microbe. 2018;23:229–240.e5.

    Article  CAS  Google Scholar 

  10. 10.

    Staley C, Kelly CR, Brandt LJ, Khoruts A, Sadowsky MJ. Complete microbiota engraftment is not essential for recovery from recurrent Clostridium difficile infection following fecal microbiota transplantation. MBio. 2016;7:e01965–16.

    Article  Google Scholar 

  11. 11.

    Seekatz AM, Rao K, Santhosh K, Young VB. Dynamics of the fecal microbiome in patients with recurrent and nonrecurrent Clostridium difficile infection. Genome Med. 2016;8:47.

    Article  CAS  Google Scholar 

  12. 12.

    Khanna S, Pardi DS, Kelly CR, Kraft CS, Dhere T, Henn MR, et al. A novel microbiome therapeutic increases gut microbial diversity and prevents recurrent Clostridium difficile infection. J Infect Dis. 2016;214:173.

    Article  Google Scholar 

  13. 13.

    Tvede M, Rask-Madsen J. Bacteriotherapy for chronic relapsing Clostridium difficile diarrhoea in six patients. Lancet. 1989;1:1156–60.

    Article  CAS  Google Scholar 

  14. 14.

    Petrof EO, Gloor GB, Vanner SJ, Weese SJ, Carter D, Daigneault MC, et al. Stool substitute transplant therapy for the eradication of Clostridium difficile infection: “RePOOPulating” the gut. Microbiome. 2013;1:3.

    Article  Google Scholar 

  15. 15.

    Staley C, Hamilton MJ, Vaughn BP, Graiziger CT, Newman KM, Kabage AJ, et al. Successful resolution of recurrent Clostridium difficile infection using freeze-dried, encapsulated microbiota; pragmatic cohort study. Am J Gastroenterol. 2017;112:940–7.

    Article  Google Scholar 

  16. 16.

    Kao D, Roach B, Silva M, Beck P, Rioux K, Kaplan GG, et al. Effect of oral capsule- vs colonoscopy-delivered fecal microbiota transplantation on recurrent Clostridium difficile infection. JAMA. 2017;318:1985–93.

    Article  CAS  Google Scholar 

  17. 17.

    Weingarden A, González A, Vázquez-Baeza Y, Weiss S, Humphry G, Berg-Lyons D, et al. Dynamic changes in short- and long-term bacterial composition following fecal microbiota transplantation for recurrent Clostridium difficile infection. Microbiome. 2015;3:10.

    Article  Google Scholar 

  18. 18.

    Staley C, Vaughn BP, Graiziger CTCT, Singroy S, Hamilton MJMJ, Yao D, et al. Community dynamics drive punctuated engraftment of the fecal microbiome following transplantation using freeze-dried, encapsulated fecal microbiota. Gut Microbes. 2017;8:276–88.

    Article  Google Scholar 

  19. 19.

    Khoruts A, Rank KM, Newman KM, Viskocil K, Vaughn BP, Hamilton MJ, et al. Inflammatory bowel disease affects the outcome of fecal microbiota transplantation for recurrent Clostridium difficile infection. Clin Gastroenterol Hepatol. 2016;14:1433–8.

    Article  Google Scholar 

  20. 20.

    Knights D, Kuczynski J, Charlson ES, Zaneveld J, Mozer MC, Collman RG, et al. Bayesian community-wide culture-independent microbial source tracking. Nat Methods. 2011;8:761–U107.

    Article  CAS  Google Scholar 

  21. 21.

    Khoruts A, Sadowsky MJ. Understanding the mechanisms of faecal microbiota transplantation. Nat Rev Gastroenterol Hepatol. 2016;13:508–16.

    Article  Google Scholar 

  22. 22.

    Hamilton MJ, Weingarden AR, Unno T, Khoruts A, Sadowsky MJ. High-throughput DNA sequence analysis reveals stable engraftment of gut microbiota following transplantation of previously frozen fecal bacteria. Gut Microbes. 2013;4:125–35.

    Article  Google Scholar 

  23. 23.

    Seekatz AM, Aas J, Gessert CE. Recovery of the gut microbiome following fecal microbiota. MBio. 2014;5:1–9.

    Article  CAS  Google Scholar 

  24. 24.

    Kelly C, Khoruts A, Staley C, Sadowsky M, Abd M, Alani M, et al. Effect of fecal microbiota transplantation on recurrence in multiply recurrent Clostridium difficile infection: a randomized trial. Ann Intern Med. 2016;165:609–16.

    Article  Google Scholar 

  25. 25.

    Buffie CG, Bucci V, Stein RR, McKenney PT, Ling L, Gobourne A, et al. Precision microbiome reconstitution restores bile acid mediated resistance to Clostridium difficile. Nature. 2015;517:205–8.

    Article  CAS  Google Scholar 

  26. 26.

    Giel JL, Sorg JA, Sonenshein AL, Zhu J. Metabolism of bile salts in mice influences spore germination in Clostridium difficile. PLoS One. 2010;5:e8740.

    Article  CAS  Google Scholar 

  27. 27.

    Theriot CM, Bowman AA, Young VB. Antibiotic-induced alterations of the gut microbiota alter secondary bile acid production and allow for Clostridium difficile spore germination and outgrowth in the large intestine. mSphere. 2016;1:e00045–15.

  28. 28.

    Weingarden AR, Chen C, Zhang N, Graiziger CT, Dosa PI, Steer CJ, et al. Ursodeoxycholic acid inhibits Clostridium difficile spore germination and vegetative growth, and prevents the recurrence of ileal pouchitis associated with the infection. J Clin Gastroenterol. 2016;50:624–30.

    Article  CAS  Google Scholar 

  29. 29.

    Biggs D, De Ville B, Suen E. A method of choosing multiway partitions for classification and decision trees. J Appl Stat. 1991;18:49–62.

    Article  Google Scholar 

  30. 30.

    Lemon SC, Roy J, Clark MA, Friedmann PD, Rakowski W. Classification and regression tree analysis in public health: methodological review and comparison with logistic regression. Ann Behav Med. 2003;26:172–81.

    Article  Google Scholar 

  31. 31.

    Fischer M, Kao D, Mehta SR, Martin T, Dimitry J, Keshteli AH, et al. Predictors of early failure after fecal microbiota transplantation for the therapy of Clostridium difficile infection: a multicenter study. Am J Gastroenterol. 2016;111:1024–31.

    Article  Google Scholar 

  32. 32.

    Pascal V, Pozuelo M, Borruel N, Casellas F, Campos D, Santiago A, et al. A microbial signature for Crohn’s disease. Gut. 2017;66:813.

    Article  CAS  Google Scholar 

  33. 33.

    Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R, Gordon JI. The human microbiome project: exploring the microbial part of ourselves in a changing world. Nature. 2007;449:804–10.

    Article  CAS  Google Scholar 

  34. 34.

    Hamilton MJ, Weingarden AR, Sadowsky MJ, Khoruts A. Standardized frozen preparation for transplantation of fecal microbiota for recurrent Clostridium difficile infection. Am J Gastroenterol. 2012;107:761–7.

    Article  Google Scholar 

  35. 35.

    Claesson MJ, Wang QO, O’Sullivan O, Greene-Diniz R, Cole JR, Ross RP, et al. Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regions. Nucleic Acids Res. 2010;38:gkq873.

    Article  CAS  Google Scholar 

  36. 36.

    Gohl DM, Vangay P, Garbe J, MacLean A, Hauge A, Becker A, et al. Systematic improvement of amplicon marker gene methods for increased accuracy in microbiome studies. Nat Biotechnol. 2016;34:942–9.

    Article  CAS  Google Scholar 

  37. 37.

    Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, et al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. 2009;75:7537–41.

    Article  CAS  Google Scholar 

  38. 38.

    Aronesty E. Comparison of sequencing utility programs. Open Bioinforma J. 2013;7:1–8.

    Article  Google Scholar 

  39. 39.

    Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig WG, Peplies J, et al. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. 2007;35:7188–96.

    Article  CAS  Google Scholar 

  40. 40.

    Huse SM, Welch DM, Morrison HG, Sogin ML. Ironing out the wrinkles in the rare biosphere through improved OTU clustering. Environ Microbiol. 2010;12:1889–98.

    Article  CAS  Google Scholar 

  41. 41.

    Edgar RC, Haas BJ, Clemente JC, Quince C, Knight R. UCHIME improves sensitivity and speed of chimera detection. Bioinformatics. 2011;27:2194–200.

    Article  CAS  Google Scholar 

  42. 42.

    Cole JR, Wang Q, Cardenas E, Fish J, Chai B, Farris RJ, et al. The ribosomal database project: improved alignments and new tools for rRNA analysis. Nucleic Acids Res. 2009;37:D141–5.

    Article  CAS  Google Scholar 

  43. 43.

    Gihring TM, Green SJ, Schadt CW. Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes. Environ Microbiol. 2012;14:285–90.

    Article  CAS  Google Scholar 

  44. 44.

    Langille MGI, Zaneveld J, Caporaso JG, McDonald D, Knights D, Reyes JA, et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol. 2013;31:814–21.

    Article  CAS  Google Scholar 

  45. 45.

    DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, et al. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006;72:5069–72.

    Article  CAS  Google Scholar 

  46. 46.

    Clarke KR. Non-parametric multivariate analyses of changes in community structure. Aust J Ecol. 1993;18:117–43.

    Article  Google Scholar 

  47. 47.

    Anderson MJ, Willis TJ. Canonical analysis of principal coordinates: a useful method of constrained ordination for ecology. Ecology. 2003;84:511–25.

    Article  Google Scholar 

  48. 48.

    Bray JR, Curtis JT. An ordination of the upland forest communities of southern Wisconsin. Ecol Monogr. 1957;27:325–49.

    Article  Google Scholar 

  49. 49.

    Segata N, Huttenhower C. Toward an efficient method of identifying core genes for evolutionary and functional microbial phylogenies. PLoS One. 2011;6:e24704.

    Article  CAS  Google Scholar 

Download references


Sequence processing and analysis were performed using the resources of the Minnesota Supercomputing Institute.


This research was made possible by support by grants from the NIH 1R21-AI114722–01 (AK, MJS), CIPAC Limited (to AK, MJS), Achieving Cures Together (to AK), and the Hubbard Foundation (to AK).

Availability of data and material

Sequence data are deposited in the Sequence Read Archive at the National Center for Biotechnology Information under BioProject accession number SRP070464.

Author information




CS and TK performed data acquisition and analysis and drafted the manuscript. CTG coordinated with patients, transported patient samples, and maintained patient records. BPV and AK provided clinical supervision for patients. MJH prepared capsules and contributed to writing the manuscript. TR and KS collated and verified metadata. AK and MJS directed research and contributed to writing the manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Michael J. Sadowsky.

Ethics declarations

Ethics approval and consent to participate

Approval for this study was given by the University of Minnesota Institutional Review Board (Protocol Number: 0901M56962). All human subjects provided informed consent for participation in the study and collection and analysis of data. All human subjects gave their permission for their information to be published.

Consent for publication

Not applicable.

Competing interests

AK, MJH, and MJS hold the following patent which relates to the content of this manuscript: Compositions and methods for transplantation of colon microbiota, US Patent 2014/0147417 A1, filed 9 March 2012, accepted 29 May 2014. The other authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Supplemental figures and tables. (DOCX 292 kb)

Additional file 2:

Batch file for sequencing processing. (TXT 12 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Staley, C., Kaiser, T., Vaughn, B.P. et al. Predicting recurrence of Clostridium difficile infection following encapsulated fecal microbiota transplantation. Microbiome 6, 166 (2018).

Download citation


  • Clostridium difficile
  • Encapsulated microbiota
  • Fecal microbiota transplantation
  • Machine learning
  • Prediction model
  • Microbial community structure