Skip to main content

Longitudinal development of the gut microbiome and metabolome in preterm neonates with late onset sepsis and healthy controls



Late onset sepsis (LOS) in preterm infants is associated with considerable morbidity and mortality. While studies have implicated gut bacteria in the aetiology of the disease, functional analysis and mechanistic insights are generally lacking. We performed temporal bacterial (n = 613) and metabolomic (n = 63) profiling on extensively sampled stool from 7 infants with LOS and 28 matched healthy (no LOS or NEC) controls.


The bacteria isolated in diagnostic blood culture usually corresponded to the dominant bacterial genera in the gut microbiome. Longitudinal changes were monitored based on preterm gut community types (PGCTs), where control infants had an increased number of PGCTs compared to LOS infants (P = 0.011). PGCT 6, characterised by Bifidobacteria dominance, was only present in control infants. Metabolite profiles differed between LOS and control infants at diagnosis and 7 days later, but not 7 days prior to diagnosis. Bifidobacteria was positively correlated with control metabolites, including raffinose, sucrose, and acetic acid.


Using multi-omic analysis, we show that the gut microbiome is involved in the pathogenesis of LOS. While the causative agent of LOS varies, it is usually abundant in the gut. Bifidobacteria dominance was associated with control infants, and the presence of this organism may directly protect, or act as a marker for protection, against gut epithelial translocation. While the metabolomic data is preliminary, the findings support that gut development and protection in preterm infants is associated with increased in prebiotic oligosaccharides (e.g. raffinose) and the growth of beneficial bacteria (e.g. Bifidobacterium).


Late onset sepsis (LOS; defined as sepsis after 72 h of life) remains a serious and common complication of prematurity, with rates of 20–40% for infants <32 weeks gestation reported in some studies. LOS in preterm infants impacts negatively on survival (with mortality rates of up to 10%) and on developmental outcomes [1]. Mechanisms of LOS pathogenesis are poorly understood, but bacterial colonisation and low gestational age are key risk factors [2]. Bacterial profiling studies have shown that LOS infants have an altered microbiome and lower bacterial diversity [3,4,5,6,7,8], and the bacterial strain isolated in diagnostic blood culture is frequently present in the gut [9]. Central to LOS pathogenesis are bacterial-host interactions modulating gut and systemic immune responses, tight junction integrity, and host metabolic function [10]. The most common organisms causing LOS in preterm infants include coagulase-negative Staphylococcus, Escherichia, Klebsiella, and Enterococcus [11].

Recent advances in ultra-performance liquid chromatography-mass spectrometry (UPLC-MS) untargeted metabolomics facilitate further understanding of these complex relationships involving host and bacteria, and the complex interactions of immune and metabolic function in relation to health and disease states [12]. While largely pilot in nature, existing metabolomic studies in preterm infants have demonstrated important findings. In necrotizing enterocolitis (NEC), the most prevalent serious preterm disease after LOS, the metabolite profiles are different at diagnosis compared to controls in serum [13,14,15], urine [16], and stool [17]. Stool volatile organic compound and serum UPLC-MS has also demonstrated differences between LOS infants and matched controls at or immediately prior to disease [13, 15, 18]. Stool metabolite profiles are also significantly associated with age [19] and serum metabolite profiles between preterm and term neonates also differ [14].

We aimed to explore relationships between gut microbiome and metabolome to determine key insights into LOS development, impact, and recovery. This is the first study to employ UPLC-MS untargeted metabolomics of stool to determine host and bacterial functioning within the gut of infants diagnosed with LOS.


Infant and sample information

We recruited a large cohort of preterm infants, sampling stool daily where possible (n = >300 infants/>3000 samples) and capturing key health-related outcomes using precise definitions [17, 20, 21]. Using strict classification for LOS (positive blood culture with a >5 days antibiotics) and only including infants with robust temporal sampling before and after disease diagnosis, we present comprehensive longitudinal gut microbiome data on 613 stool samples from LOS infants (n = 7) and well-matched non-diseased (no NEC or LOS) controls (n = 28). A subset of 63 stool samples from LOS infants (n = 4) and matched controls (n = 10) also underwent UPLC-MS.

Infant demographics are shown in Table 1 and further detail provided in Additional file 1: Table S1. The average number of samples for LOS and control infants was comparable (15 vs. 18, respectively). Infant demographics were comparable between LOS and matched controls, although as expected, antibiotic use was increased in LOS (antibiotic information provided in Additional file 2: Table S2). Diagnostic blood culture identified two cases of Staphylococcus aureus, two cases of Staphylococcus epidermidis, one case of Enterococcus faecalis, one case of Streptococcus agalactiae, and one case of Escherichia coli (Additional file 1: Table S1).

Table 1 Summary of infant samples and demographic per group

The abundant bacterial genus in the gut microbiome preceding diagnosis corresponds to the genera of the causative agent in LOS

The gut microbiome of infants with LOS were highly individual and dynamic through time. The pathogen identified by blood culture was one of the most abundant OTUs in the gut microbiota at diagnosis, with the corresponding genus from the gut microbiome the most abundant in four cases and second most abundant OTU in two cases (Fig. 1). An exception was Staphylococcus epidermidis LOS in infant 251, which was the seventh most abundant genus at diagnosis. In all cases, the genera of the bacteria isolated in diagnostic blood culture were present prior to LOS diagnosis. For infant 173 who was diagnosed with S. agalactiae, the organism was detected 2 days before diagnosis and within 6 days of antibiotic treatment (flucloxacillin and gentamicin) it was no longer detected.

Fig. 1
figure 1

Area plots showing the temporal development of the microbiome in infants diagnosed with late onset sepsis (LOS). Dashed red lines represent the day of LOS diagnosis with the bacteria isolated from blood culture identified. Dashed black lines represent the start of an antibiotic treatment as per Additional file 2: Table S2.

Preterm gut community types dominant in Bifidobacterium are protective for LOS

To further explore the complexity in the developing preterm microbiome, we employed PAM clustering analysis to ascertain preterm gut community types (PGCTs), as previously described [17]. All samples grouped into six discrete clusters (Additional file 3: Figure S1): dominance of Klebsiella (PGCT 1), dominance of both Klebsiella and Enterococcus (PGCT 2), dominance of Staphylococcus (PGCT 3), dominance of Enterococcus (PGCT 4), dominance of Escherichia (PGCT 5), and mixed population with high relative abundance of Bifidobacterium (PGCT 6) (Additional file 4: Figure S2). No PGCT was strongly associated with PreLOS samples when compared to all control samples, whereas PGCT 2 and PGCT 6 were never found in any sample from LOS infants before diagnosis (Fig. 2a). PGCT 6 was also never found in any LOS infant after diagnosis, and PGCT 2 was present in only 2 infants after diagnosis, detected >2 weeks following diagnosis and treatment (Fig. 2b and Additional file 4: Figure S2). Conversely PGCT 6, which represents a diverse community high in relative Bifidobacterium abundance, was present frequently throughout the control population from early to late samples. Specifically, PGCT 6 was detected in 65 samples from 12 control infants, representing 43% of the control population. Counting the number of unique PGCTs over the first 25 days of life showed control infants had an average of 3 unique PGCTs, compared to an average of 2 unique PGCTs in preLOS samples (P = 0.011).

Fig. 2
figure 2

Characterisation of the gut microbiome between infants diagnosed with late onset sepsis (LOS) and matched controls. a Transition network analysis showing PGCTs in PreLOS samples compared to matched controls approximated as a Markov chain with subject-independent transition probabilities. Arrow weights reflect the transition probabilities from one sample to the next. Size of circle reflects the relative number of samples associated with that PGCT. Pale blue indicates PGCTs of consisting of control samples only, and the darker shade of purple shows increased number of PreLOS samples in that PGCT. b Temporal change in PGCTs in each individual infant. Red lines represent day of LOS diagnosis. Only samples up to day 50 of life are included. Infant 178 died during the study

Untargeted metabolomic profiling indicates distinct functional profiles between infants with LOS and controls

Given the finding that the dominant genus in the microbiome is associated with the causative agent in LOS, we further investigated the potential functional differences in the gut between infants diagnosed with LOS and matched controls. Due to sample availability, this pilot experiment included 4 infants with LOS (infants 130, 172, 181, and 251) and 10 matched controls, across 5 time points spanning before and after LOS diagnosis: −14 days (time point 1; TP1), −7 days (TP2), 0 days (TP3), +7 days (TP4), and +14 days (TP3), relative to diagnosis of LOS. Although each LOS infant had different bacterial species isolated in diagnostic blood culture (Fig. 1), PCA showed metabolite profiles clustering distinctly between LOS infants and matched controls, with the most profound differences at diagnosis (0 days) and +7 days (Fig. 3). Receivers operating characteristic (ROC) curves were generated, and area under a ROC curve (AUC) was implemented to provide a measure of how well metabolites distinguish between LOS and matched controls. Lines progressing towards the upper-left corner of plots represent better discrimination (higher sensitivity and specificity). At diagnosis, the AUC ranged from 0.787 with 5 metabolites to 0.883 with 25 metabolites and both equated to a sensitivity of 75% and a specificity of 89%.

Fig. 3
figure 3

Metabolomic profiles between infants diagnosed with LOS and matched controls across all 5 time points, where TP3 represents samples at diagnosis. a PCA (unconstrained ordination) of LOS infants (red) and matched controls (green). Each sample represented by the small circle and ellipses represent the 95% confidence interval. b Receiver operating characteristic curves of support vector machine predictions for LOS and control samples. AUC represents the strength of the predictive classifications. Selected number of metabolites computed in intervals from 5, 10, 15, 25, 50, and 100 metabolites

Fourteen stool metabolites were identified as significantly altered between LOS and control infants at diagnosis (TP3), with 7 metabolites (all increased in controls) remaining significant following adjustment for 7 confounders (Table 2). Galactose metabolism was the most frequently increased pathway in control infants, and sucrose (P = 0.001) and raffinose (P = 0.001), both from galactose metabolism, were the most significant metabolites. These metabolites increased through time within control infants, whereas the same metabolites remained at baseline or reduced prior to diagnosis in LOS infants (Fig. 4). Notably, following diagnosis and treatment, these metabolites increased in LOS infants, but tended to remain at lower intensity compared to controls (with the exception of metabolites from C21-steroid hormone biosynthesis that remained at baseline throughout). Taken together, the PCA and box plot analysis of the most significant metabolites suggest altered and delayed functional development in the gut in LOS infants prior to diagnosis.

Table 2 List of metabolites and pathways significantly altered between control and LOS infants at diagnosis (day 0)
Fig. 4
figure 4

Box plots to show the levels of significant metabolites though each time point between infants diagnosed with late onset sepsis (LOS) and matched controls. Plots listed in order of significance. a Sucrose. b Raffinose. c L-Glutamate. d Didemethylcitalopram. e Acetic acid. f 18-Hydroxycortisol. g 18-Oxocortisol. h L-alpha-Acetyl-N-normethadol

Multi-omic analysis shows distinct correlations between significant metabolites and abundant bacterial genera

sPLS correlation analysis was performed using MixOmics to determine the correlations between the dominant bacterial genera and identified metabolites (Fig. 5). Bifidobacterium and Streptococcus showed comparable strong positive correlations with a range of metabolites that were significantly increased in control infants (Table 2), including raffinose, 18-hydroxycortisol, 18-oxocortisol, acetic acid, and L-alpha-acetyl-N-normethadol. These findings were supported when only including control infants in the analysis; demonstrating diagnosis is not confounding the results (Additional file 5: Figure S3). Furthermore, this correlation was lost when analysing a shuffled dataset and is thus not an artifact of autocorrelation. Morganella also showed weak positive correlation with these metabolites. Veillonella showed distinct clustering with strong correlations to a range of metabolites, including vitamin K and ascorbic acid (vitamin C), as well as 10,11-dihydro-12R-hydroxy-leukotriene E4 that was significantly increased (P = 0.032) in LOS infants. Staphylococcus, Bacteroides, Escherichia, Klebsiella, Enterococcus, and Pseudomonas all showed weak correlations with the detected metabolites.

Fig. 5
figure 5

Spares partial least squared correlations (sPLS) between dominant bacterial genera and identified metabolites. sPLS in regression mode (predict Y from X) to model a causal relationship between bacterial genera and metabolites. Bacterial genera represented in green boxes. Red boxes are metabolites significantly increased in LOS, blue boxed significantly increased in controls, yellow boxes are not significantly altered between LOS and controls. Significant metabolites based on the samples at diagnosis (time point 0)


We explored gut microbiome and metabolome factors that are altered in the development of LOS in preterm infants and show that the dominant taxa in the gut microbiome are usually isolated in diagnostic blood culture. Control infants had greater microbiome development and prevalence of PGCT 6 (Bifidobacterium dominant). Novel untargeted stool metabolomics on a subset of samples showed that metabolite profiles are significantly different between LOS and control infants at diagnosis and 7 days later. Stool metabolites from a range of pathways/sources, especially sucrose and raffinose, tended to increase in controls through time compared to LOS infants, where they remained low prior to and at diagnosis. Finally, combining omic datasets to explore correlations between the microbiome and metabolome showed Bifidobacterium was positively correlated with metabolites significantly increased in control infants.

The gut microbiome has been previously implicated in the pathogenesis of LOS [3,4,5,6,7,8,9], with whole genome shotgun sequencing revealing the exact strain in diagnostic blood culture can be detected in stool [9]. Our data concurs, showing the species cultured from blood was typically abundant from birth and corresponds to the most or second most abundant OTU in the gut at diagnosis. One exception occurred where the microbiome was dominated by Klebsiella and Escherichia, but S. epidermidis was isolated in blood culture. Due to the prevalence of S. epidermidis on the skin, this organism is a common contaminant of blood cultures, however, it is also commonly associated with preterm sepsis [22]. It is possible that different mechanisms are in action where LOS results from organisms that typically colonise the skin, for example, through compromised skin barrier from venepuncture, heel pricks etc., than where the mechanism is of translocation of gut bacteria [23].

PGCT 6 (Bifidobacteria dominant) was only found in control infants, detected in nearly half of all control infants (43%), but whether this is a cause of gut health or a marker of gut health is unknown. The potential health-promoting properties of Bifidobacterium are well reported [24,25,26,27,28,29]. Although the largest existing probiotic trial of 1315 preterm infants found no significant improvement to NEC or LOS rates using Bifidobacterium breve [30], only specific species of Bifidobacterium in the preterm gut are able to utilise human milk oligosaccharides (HMOs): these species appear to have key roles in establishing the pioneering species of the gut [31]. While B. breve produce fucosidases and sialidases, only Bifidobacterium longum subspecies infantis are able to digest all HMO structures [32]. The differences between timing, dose, and most importantly the Bifidobacterium species/strains used are likely to account for the inconsistent health benefits between Bifidobacterium probiotic studies. In a separate cohort, we have previously shown that Bifidobacterium bifidum deliberately administered to preterm infants can colonise and persist in threefold greater relative abundance following discharge, compared to control infants [33].

Metabolomics was employed to determine if the changes in gut microbiome were reflected at the functional level and if metabolic markers for LOS could be detected. LOS samples grouped distinctly at diagnosis (sensitivity of 75% and a specificity of 89%). Previous work has shown serum metabolite profiles [13] and stool volatile organic compound profiles [18] altered between LOS infants and controls within 3 days of onset, but no single metabolite being diagnostic of LOS. We found no differences in stool metabolites 14 or 7 days before diagnosis. Although significant in the unadjusted models, following adjustment for potential confounders, no metabolite was significantly increased in all LOS infants. This suggests the pathogenesis is of acute onset, with multiple aetiological components affecting individual infants differently.

Network analysis was employed to determine correlations between the microbiome and metabolome, revealing that Bifidobacterium and Streptococcus have comparable positive correlations. Given Bifidobacterium dominant communities were specific to control infants and Bifidobacterium is strongly associated with a healthy mature gut microbiome [24,25,26,27,28,29], it is notable that metabolites correlated with this genera were significantly increased in control infants. Metabolites correlated with Bifidobacterium and Streptococcus were from a range of pathways: raffinose (Galactose metabolism), L-alpha-acetyl-N-normethadol (Drug metabolism), and acetic acid, 18-hydroxycortisol, and 18-oxocortisol (C21-steroid hormone biosynthesis and metabolism). Raffinose is a derivative of sucrose, and these two metabolites were the most significant overall, with both increased in controls. Raffinose is a α-galactosyl (α-GAL) oligosaccharide, and because humans do not possess the α-GAL enzyme, it is fermented in the gut by bacteria containing the α-GAL enzyme. This metabolite reduced in LOS infants prior to diagnosis, increasing after treatment, whereas it remained consistently high in controls from day 7 (TP2). Raffinose is considered a prebiotic [34] that has been shown to inhibit the growth of potentially pathogenic bacteria [35] and to promote Bifidobacterium spp. in human and animal studies [36]. In addition, raffinose increases short chain fatty acid (SCFA) concentrations, specifically acetic and propionic acid, contributing to increased weight gain in animal models [37, 38] and a reduction in pathogenic bacteria [39]. Due to the use of LCMS, SCFA were not detected in the current study, but further work exploring the exact strains of Bifidobacterium and their specific effects on the gut microbiome and metabolome is warranted.

The study has several potential limitations. The strict inclusion criteria requiring extensive longitudinal sampling before and after disease diagnosis meant only 7 infants with LOS were included, despite collection of samples from >300 infants. However, this cohort size is comparable to existing studies in LOS and the findings here support existing data [3,4,5,6,7, 9]. Cost and sample size considerations meant metabolomics was performed on only four infants with LOS and ten controls and thus conclusions drawn this analysis should be considered preliminary. It is necessary to repeat the correlation analysis at specific time windows in larger cohorts. Nonetheless, this study has yielded important findings that warrant validation in large multi-center studies with extensive longitudinal sampling, particularly within 7 days of LOS diagnosis.


Using novel multi-omic analysis, we show for the first time that the gut microbiome and metabolome are associated with the pathogenesis of LOS. In accordance with published data, we find the causative agent in LOS is usually abundant in the gut microbiome, suggesting translocation though the gut epithelium. This is further supported by the change in bacterial and host metabolism in the gut, which is reflective of altered function. While the causative agent of LOS varies, Bifidobacteria dominant communities were only found in controls and this taxa was further correlated with the metabolites significantly associated with control infants, including raffinose, sucrose, and acetic acid. The current study supports that gut development and protection in preterm infants is associated with increased in prebiotic oligosaccharides (e.g. raffinose) and the growth of beneficial bacteria (e.g. Bifidobacterium). The finding in the current study requires validation in a larger cohort and the exact mechanisms, and the development of therapies aimed at promoting health for preterm infants, such as pre- and probiotics, warrant further investigation.


Participants and study design

The study design, setting, participants, and methods of data collection have been reported previously [17, 40]. Briefly, all infants were cared for in a single unit with standardised feeding, antibiotic, and antifungal guidelines. LOS was defined as a positive blood culture treated with antibiotics for a minimum of 5 days along with signs consistent with sepsis reviewed independently by two clinicians. Cultured isolates from positive blood culture were identified using matrix-assisted laser desorption ionisation–time of flight mass spectrometry. All demographic information is summarised in Table 1, and full information for each infant is provided in Additional file 1: Table S1.

Stool samples and clinical data were collected from a total of 318 preterm infants at study conception. Seven well-sampled cases of LOS and 28 matched controls, free of LOS, or NEC, were selected based on extensive longitudinal sampling and matched by gestational age (GA; +/−1 week), birth weight, and delivery mode. A total of 613 analysed stool samples underwent 16S rRNA gene bacterial profiling. Metabolomic profiling was performed on a subset of 14 infants (63 stools): 4 LOS and 10 matched controls. LOS samples were selected for analysis relative to disease diagnosis at day of life (DOL) −14 (time point 1; TP1), −7 (TP2), 0 (TP3), +7 (TP4), and +14 (TP5), and controls were matched to this by DOL.

16S rRNA gene bacterial profiling

Nucleic acid extraction of stool was carried out on 100 mg of sample using the PowerLyzer™ PowerSoil® DNA Isolation Kit (MoBio, CA, USA) in accordance with the manufacturer’s instructions. Bacterial profiling utilised the 16S rRNA gene targeting variable region 4 based on the Schloss wet-lab MiSeq SOP and resulting raw fastq data were processed using Mothur (version 1.31.2), as described previously [41]. Briefly, combined reads were trimmed to 275 reads with 0 ambiguous bases. Chimeric sequences were detected by Chimera.uchime and were removed from downstream analysis. Alignment was generated via the Silva v4 database [42] and chloroplast, mitochondria, unknown, archaea, and eukaryota linages were removed from the analysis. Raw sequences were deposited in MG-RAST under the accession numbers 4516545.3-4516585.3.

UPLC-MS metabolomic profiling

Metabolomic profiling was performed as previously described [19, 43]. Briefly, 100 mg stool was homogenised (80% methanol), was vortexed for 15 min, centrifuged (10000×g), and was lyophilised. Reverse-phase ultra-performance LCMS tandem mass-spectrometry (UPLC-MS/MS) was performed using an Accucore C18 column (2.6 μm, 150 × 2.1 mm) at 40 °C, 3.0 μl injection, and 300 μl/min flow rate. Gradients increased from 5% acetronitrile (ACN) to 95% ACN over 22 min, followed by 8 min wash, and re-equilibration. Samples were run randomly in triplicate on a Q-Exactive (Thermo) using HESI with high resolution (70,000) positive and negative switching. The mass range was set from 100–1000 m/z. SIEVE (version 2.2) was used to process the Thermo RAW files by component extraction.

Bioinformatic and statistical analysis

Bacterial community analysis

16S bacterial profiles were analysed using a stand-alone tool for analysing and visualising microbiome data sets developed at the Center for Metagenomics and Microbiome Research at Baylor College of Medicine (not published), conducted in R version 3.3 [44]. Each sample was rarefied to 4397 reads. PGCTs were determined using a publically available script for linear mixed-effects modelling, medoid-based clustering, and Markov chain modelling [45]. Bray-Curtis was used to calculate the distance between all samples, and this was denoised by extraction of the most significant Principal Coordinates Analysis (PCoA) eigenvectors before applying the PAM algorithm. Gap statistic was used to determine the number of clusters. Significance of categorical variables was determined using the non-parametric Mann-Whitney test for comparison of LOS and control infants. Only taxa present in >1% relative abundance were included in statistical analysis. All P values were adjusted for multiple comparisons with the false discovery rate (FDR) algorithm [46].

Metabolomics analysis

UPLC-MS data was filtered to include only m/z features that occurred in >20% of samples. Metabolite annotation and pathway enrichment was performed using Mummichog [47]. Mummichog was used to determine significant pathways between infants diagnosed with LOS and matched controls. Unlike the microbiome dataset, metabolomics was performed at five specific time windows relative to LOS onset, with matched day of life control samples. Analysis was therefore cross sectional within the specific time windows. Metabolomic MetaboAnalyst 3.0 [48] was employed to generate PCA plots and to determine the AUC between LOS and control infants at each time point. Receivers operating characteristic (ROC) curves were generated by linear support vector machine (SVM) classification with Monte-Carlo cross validation using balanced subsampling. In each Monte-Carlo cross validation, two thirds of the samples were used to examine the feature importance, and the classification model was validated using the one third of samples left out. Several iterations were performed to determine the optimal number of metabolites to predict MV use, with analysis based on 5, 10, 15, 25, 50, or 100 of the top metabolites based on the average importance. Two-tailed Welch’s t test (<2 variables) or ANOVA (>2 variables) were used to determine significant metabolites. Regression models adjusted for seven potential confounding variables: delivery mode, gestation age, sex, feed (received some maternal breast milk or formula only), number of days of antibiotics treatment, number of antibiotics used, and age. P values were adjusted for multiple comparisons using FDR [46].

Integrated analysis of microbiome and metabolomic datasets

MixOmics [49] was implemented in R to determine the correlation between the relative abundance of the dominant bacterial taxa from 16S rRNA gene sequencing and the intensity of metabolites of interest by sparse partial least squares regression (sPLS) [50].



Late onset sepsis


Necrotising enterocolitis


Neonatal intensive care unit


Preterm gut community type


Ultra-performance liquid chromatography mass spectrometry tandem mass-spectrometry


  1. Mwaniki MK, Atieno M, Lawn JE, Newton CR. Long-term neurodevelopmental outcomes after intrauterine and neonatal insults: a systematic review. Lancet. 2012;379:445–52.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Berrington JE, Stewart CJ, Cummings SP, Embleton ND. The neonatal bowel microbiome in health and infection. Curr Opin Infect Dis. 2014;27:236–43.

    Article  PubMed  Google Scholar 

  3. Madan JC, Salari RC, Saxena D, Davidson L, O’Toole G a, Moore JH, et al. Gut microbial colonisation in premature neonates predicts neonatal sepsis. Arch Dis Child Fetal Neonatal Ed. 2012;97:F456–62.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Stewart CJ, Nelson A, Scribbins D, Marrs ECL, Perry JD, Embleton ND, et al. Bacterial and fungal viability in the preterm gut: NEC and sepsis. Arch Dis Child Fetal Neonatal Ed. 2013;98:F298–303.

    Article  PubMed  Google Scholar 

  5. Stewart CJ, Marrs ECL, Magorrian S, Nelson A, Lanyon C, Perry JD, et al. The preterm gut microbiota: changes associated with necrotizing enterocolitis and infection. Acta Paediatr Int J Paediatr. 2012;101:1121–7.

    Article  CAS  Google Scholar 

  6. Mai V, Torrazza RM, Ukhanova M, Wang X, Sun Y, Li N, et al. Distortions in development of intestinal microbiota associated with late onset sepsis in preterm infants. PLoS One. 2013;8:e52876.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Mshvildadze M, Neu J, Shuster J, Theriaque D, Li N, Mai V. Intestinal microbial ecology in premature infants assessed with non-culture-based techniques. J Pediatr. 2010;156:20–5.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Taft DH, Ambalavanan N, Schibler KR, Yu Z, Newburg DS, Deshmukh H, et al. Center variation in intestinal microbiota prior to late-onset sepsis in preterm infants. Public Libr Sci. 2015;10:e0130604.

    Google Scholar 

  9. Carl MA, Ndao IM, Springman AC, Manning SD, Johnson JR, Johnston BD, et al. Sepsis from the gut: the enteric habitat of bacteria that cause late-onset neonatal bloodstream infections. Clin Infect Dis. 2014;58:1211–8.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Berrington J, Stewart C, Embleton N, Cummings S. Gut microbiota in preterm infants: assessment and relevance to health and disease. Arch Dis Child Fetal Neonatal Ed. 2013;98:F286–90.

    Article  PubMed  Google Scholar 

  11. Kaufman D, Fairchild KD. Clinical microbiology of bacterial and fungal sepsis in very-low-birth-weight infants. Clin Microbiol Rev Am Soc Microbiol. 2004;17:638.

    Article  Google Scholar 

  12. Dessì A, Pintus R, Marras S, Cesare Marincola F, De Magistris A, Fanos V. Metabolomics in necrotizing enterocolitis: the state of the art. Expert Rev Mol Diagn. 2016;16:1053–8.

    Article  PubMed  Google Scholar 

  13. Stewart CJ, Nelson A, Treumann A, Skeath T, Cummings SP, Embleton ND, et al. Metabolomic and proteomic analysis of serum from preterm infants with necrotising entercolitis and late onset sepsis. Pediatr Res Res. 2015;79:425–31.

    Article  Google Scholar 

  14. Wilcock A, Begley P, Stevens A, Whatmore A, Victor S. The metabolomics of necrotising enterocolitis in preterm babies: an exploratory study. J Matern Fetal Neonatal Med. 2016;29:758–62.

    Article  CAS  PubMed  Google Scholar 

  15. Ng PC, Ang IL, Chiu RWK, Li K, Lam HS, Wong RPO, et al. Host-response biomarkers for diagnosis of late-onset septicemia and necrotizing enterocolitis in preterm infants. J Clin Invest. 2010;120:2989–3000.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Morrow AL, Lagomarcino AJ, Schibler KR, Taft DH, Yu Z, Wang B, et al. Early microbial and metabolomic signatures predict later onset of necrotizing enterocolitis in preterm infants. Microbiome. 2013;1:13.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Stewart CJ, Embleton ND, Marrs ECL, Smith DP, Nelson A, Abdulkadir B, et al. Temporal bacterial and metabolic development of the preterm gut reveals specific signatures in health and disease. Microbiome. 2016;4:67.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Berkhout DJC, Niemarkt HJ, Buijck M, van Weissenbruch MM, Brinkman P, Benninga MA, et al. Detection of Sepsis in Preterm Infants by Fecal Volatile Organic Compounds Analysis: A Proof of Principle Study. J Pediatr Gastroenterol Nutr. 2016. [Epub Ahead of Print].

  19. Stewart C, Skeath T, Nelson A, Fernstad S, Marrs E, Perry J, et al. Preterm gut microbiota and metabolome following discharge from intensive care. Sci Rep. 2015;5:17141.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Stewart CJ, Embleton ND, Clements E, Luna PN, Smith DP, Fofanova TY, et al. Cesarean or vaginal birth does not impact the longitudinal development of the gut microbiome in a cohort of exclusively preterm infants. Front Microbiol. 2017;8:1–9.

    Article  Google Scholar 

  21. Embleton ND, Berrington JE, Dorling J, Ewer AK, Juszczak E, Kirby JA, et al. Mechanisms Affecting the Gut of Preterm Infants in Enteral Feeding Trials. Front Nutr. 2017;4:14.

  22. D’Angio CT, McGowan KL, Baumgart S, Geme JS, Harris MC. Surface colonization with coagulase-negative staphylococci in premature neonates. J Pediatr. 1989;114:1029–34.

    Article  PubMed  Google Scholar 

  23. Dong Y, Speer CP. The role of Staphylococcus epidermidis in neonatal sepsis: guarding angel or pathogenic devil? Int J Med Microbiol. 2014;304:513–20.

    Article  CAS  PubMed  Google Scholar 

  24. Fukuda S, Toh H, Hase K, Oshima K, Nakanishi Y, Yoshimura K, et al. Bifidobacteria can protect from enteropathogenic infection through production of acetate. Nature. 2011;469:543–7.

    Article  CAS  PubMed  Google Scholar 

  25. Ling X, Linglong P, Weixia D, Hong W. Protective effects of bifidobacterium on intestinal barrier function in lps-induced enterocyte barrier injury of Caco-2 Monolayers and in a Rat NEC Model. PLoS One. 2016;11:e0161635.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Fanning S, Hall LJ, Cronin M, Zomer A, MacSharry J, Goulding D, et al. Bifidobacterial surface-exopolysaccharide facilitates commensal-host interaction through immune modulation and pathogen protection. Proc Natl Acad Sci. 2012;109:2108–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Čitar M, Hacin B, Tompa G, Štempelj M, Rogelj I, Dolinšek J, et al. Human intestinal mucosa-associated Lactobacillus and Bifidobacterium strains with probiotic properties modulate IL-10, IL-6 and IL-12 gene expression in THP-1 cells. Benef Microbes Wageningen Academic Publishers. 2015;6:325–36.

    Google Scholar 

  28. Gueimonde M, Laitinen K, Salminen S, Isolauri E. Breast milk: a source of bifidobacteria for infant gut development and maturation? Neonatology. 2007;92:64–6.

    Article  PubMed  Google Scholar 

  29. Duranti S, Gaiani F, Mancabelli L, Milani C, Grandi A, Bolchi A, et al. Elucidating the gut microbiome of ulcerative colitis: Bifidobacteria as novel microbial biomarkers. FEMS Microbiol Ecol. 2016;92:fiw191. 2

  30. Costeloe K, Hardy P, Juszczak E, Wilks M, Millar MR. Bifidobacterium breve BBG-001 in very preterm infants: a randomised controlled phase 3 trial. Lancet. 2016;387:649–60.

    Article  PubMed  Google Scholar 

  31. Underwood MA, German JB, Lebrilla CB, Mills DA. Bifidobacterium longum subspecies infantis: champion colonizer of the infant gut. Pediatr Res. 2015;77:229–35.

    CAS  PubMed  Google Scholar 

  32. Sela DA, Mills DA. The marriage of nutrigenomics with the microbiome: the case of infant-associated bifidobacteria and milk. Am J Clin Nutr American Society for Nutrition. 2014;99:697S–703S.

    Article  CAS  Google Scholar 

  33. Abdulkadir B, Nelson A, Skeath T, Marrs ECL, Perry JD, Cummings SP, et al. Routine use of probiotics in preterm infants: longitudinal impact on the microbiome and metabolome. Neonatology. 2016;109:239–47.

    Article  CAS  PubMed  Google Scholar 

  34. Patel S, Goyal A. The current trends and future perspectives of prebiotics research: a review. 3 Biotech. Springer. 2012;2:115–25.

    CAS  Google Scholar 

  35. Vamanu E, Vamanu A. The influence of prebiotics on bacteriocin synthesis using the strain Lactobacillus paracasei CMGB16. African J Microbiol Res. 2010;4:534–7.

    CAS  Google Scholar 

  36. Dinoto A, Marques TM, Sakamoto K, Fukiya S, Watanabe J, Ito S, et al. Population dynamics of Bifidobacterium species in human feces during Raffinose Administration monitored by fluorescence in situ hybridization-flow cytometry. Appl Environ Microbiol. 2006;72:7739–47.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Bai G, Ni K, Tsuruta T, Nishino N. Dietary casein and soy protein isolate modulate the effects of Raffinose and Fructooligosaccharides on the composition and fermentation of gut microbiota in rats. J Food Sci. 2016;81:H2093–8.

    Article  CAS  PubMed  Google Scholar 

  38. Smiricky-Tjardes MR, Flickinger EA, Grieshop CM, Bauer LL, Murphy MR, Fahey GC. In vitro fermentation characteristics of selected oligosaccharides by swine fecal microflora. J Anim Sci. 2003;81:2505–14.

    Article  CAS  PubMed  Google Scholar 

  39. Makras L, De Vuyst L. The in vitro inhibition of Gram-negative pathogenic bacteria by bifidobacteria is caused by the production of organic acids. Int Dairy J. 2006;16:1049–57.

    Article  CAS  Google Scholar 

  40. Stewart CJ, Marrs ECL, Nelson A, Lanyon C, Perry JD, Embleton ND, et al. Development of the preterm gut microbiome in twins at risk of necrotising enterocolitis and sepsis. PLoS One. e73465;8.

  41. Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Appl Environ Microbiol. 2013;79:5112–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Schloss PD, Gevers D, Westcott SL. Reducing the Effects of PCR Amplification and Sequencing Artifacts on 16S rRNA-Based Studies. PLoS One. 2011;6:e27310.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Lane JP, Stewart CJ, Cummings SP, Gennery AR. Functional changes in gut microbiota during hematopoietic stem cell transplantation for severe combined immunodeficiency. J Allergy Clin Immunol. 2016;138:622–5.

    Article  PubMed  Google Scholar 

  44. Team RC. R: A language and environment for statistical computing. R Foundation for Statistical Computing: Vienna, Austria; 2014. Available from:

    Google Scholar 

  45. DiGiulio DB, Callahan BJ, McMurdie PJ, Costello EK, Lyell DJ, Robaczewska A, et al. Temporal and spatial variation of the human microbiota during pregnancy. Proc Natl Acad Sci. 2015;112:11060–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995;57:289–300.

    Google Scholar 

  47. Li S, Park Y, Duraisingham S, Strobel FH, Khan N, Soltow QA, et al. Predicting network activity from high throughput metabolomics. PLoS Comput Biol. 2013;9:e1003123.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Xia J, Sinelnikov IV, Han B, Wishart DS. MetaboAnalyst 3.0—making metabolomics more meaningful. Nucleic Acids Res Oxford University Press. 2015;43:W251–7.

    Article  CAS  Google Scholar 

  49. Lê Cao K-A, González I, Déjean S. integrOmics: an R package to unravel relationships between two omics datasets. Bioinformatics. 2009;25:2855–6.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Lê Cao K-A, Rossouw D, Robert-Granié C, Besse P. A sparse PLS for variable selection when integrating omics data. Stat Appl Genet Mol Biol. 2008;7:Article 35.

Download references


We gratefully acknowledge the help and support of parents who have helped with this and our other studies. We further acknowledge the statistical support provided by Dr. Jacqueline O’Brien from the Centre for Metagenomics and Microbiome Research at Baylor College of Medicine.


This work was supported by funding from Tiny Lives charity (Newcastle upon Tyne, UK), Newcastle upon Tyne Hospitals NHS Charity and the Newcastle Healthcare Charity. The content is solely the responsibility of the authors.

Availability of data and materials

The data set supporting the results of this article is available in the MG-RAST repository [accession numbers 4516545.3-4516585.3].

Author information

Authors and Affiliations



CJS, JEB, NDE, and SPC were involved in the study concept and design. CJS, TF, TS, AN, ECLM, DPS, JFP, and JP were all involved in the collection and sorting of samples and data acquisition and analysis. CJS drafted the manuscript and all authors contributed to critical revisions and approved the final manuscript.

Corresponding author

Correspondence to Christopher J. Stewart.

Ethics declarations

Ethics approval and consent to participate

Ethical approval was obtained from the County Durham and Tees Valley Research Ethics Committee. Parental informed consent was given.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1: Table S1.

Extensive demographic info (DOCX 22 kb)

Additional file 2:

Antibiotic information (DOCX 18 kb)

Additional file 3: Table S2.

Figure S1. Gap statistic showing justification for selecting 6 clusters (PDF 30 kb)

Additional file 4: Figure S2.

Heatmaps of the most dominant bacterial genera per each preterm gut community type (PGCT). (A) PGCT 1. (B) PGCT 2. (C) PGCT 3. (D) PGCT 4. (E) PGCT 15. (F) PGCT 6. Genera shown in bold are dominant in that PGCT. (PDF 348 kb)

Additional file 5: Figure S3.

Spares partial least squared correlations (sPLS) between dominant bacterial genera and identified metabolites from control infants only. Analysis excludes infants diagnosed with late onset sepsis. Only significant metabolites based on the samples at diagnosis (time point 0) and the top 10 most abundant bacterial taxa were included. (PDF 145 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Stewart, C.J., Embleton, N.D., Marrs, E.C.L. et al. Longitudinal development of the gut microbiome and metabolome in preterm neonates with late onset sepsis and healthy controls. Microbiome 5, 75 (2017).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: