Banerjee S, Schlaeppi K, van der Heijden MGA. Keystone taxa as drivers of microbiome structure and functioning. Nature Reviews Microbiol; 2018;16:567–576.
de los Reyes FL. Challenges in determining causation in structure-function studies using molecular biological techniques. Water Res. 2010;44:4948–57.
Article
PubMed
Google Scholar
Koch C, Müller S, Harms H, Harnisch F. Microbiomes in bioenergy production : from analysis to management. Curr Opin Biotechnol. 2014;27:65–72.
Article
CAS
PubMed
Google Scholar
Verstraete W, Wittebolle L, Heylen K, Vanparys B, de Vos P, van de Wiele T, et al. Microbial resource management: the road to go for environmental biotechnology. Eng Life Sci. 2007;2:117–26.
Article
Google Scholar
Kleerebezem R, van Loosdrecht MC. Mixed culture biotechnology for bioenergy production. Curr Opin Biotechnol. 2007;18:207–12.
Article
CAS
PubMed
Google Scholar
Lawson CE, Harcombe WR, Hatzenpichler R. Common principles and best practices for engineering microbiomes. Nat Rev Microbiol. 2019;17:725–41.
Article
CAS
PubMed
PubMed Central
Google Scholar
Goldford JE, Lu N, Bajić D, Estrela S, Tikhonov M, Sanchez-Gorostiaga A, et al. Emergent simplicity in microbial community assembly. Science. 2018;361:469–74.
Article
CAS
PubMed
PubMed Central
Google Scholar
Zuñiga C, Li CT, Yu G, Al-Bassam MM, Li T, Jiang L, et al. Environmental stimuli drive a transition from cooperation to competition in synthetic phototrophic communities. Nature Microbiol; 2019;4:2184–2191.
Angenent LT, Richter H, Buckel W, Spirito CM, Steinbusch KJJ, Plugge CM, et al. Chain elongation with reactor microbiomes: open-culture biotechnology to produce biochemicals. Environ Sci Tech. 2016;50:2796–810.
Article
CAS
Google Scholar
Liu B, Kleinsteuber S, Centler F, Harms H, Sträuber H. Competition between butyrate fermenters and chain-elongating bacteria limits the efficiency of medium-chain carboxylate production. Front Microbiol. 2020;11:336.
Article
CAS
PubMed
PubMed Central
Google Scholar
Lambrecht J, Cichocki N, Schattenberg F, Kleinsteuber S, Harms H, Müller S, et al. Key sub-community dynamics of medium-chain carboxylate production. Microb Cell Fact; 2019;18:92.
Kucek L, Spirito CM, Angenent LT. High n-caprylate productivities and specificities from dilute ethanol and acetate: chain elongation with microbiomes to upgrade products from syngas fermentation. Energ Environ Sci. 2016;9:3482–94.
Article
CAS
Google Scholar
Kucek LA, Nguyen M, Angenent LT. Conversion of L-lactate into n-caproate by a continuously fed reactor microbiome. Water Res. 2016;93:163–71.
Article
CAS
PubMed
Google Scholar
Duber A, Jaroszynski L, Zagrodnik R, Chwialkowska J, Juzwa W, Ciesielski S, et al. Exploiting the real wastewater potential for resource recovery – n-caproate production from acid whey. Green Chem; 2018;20:3790–3803.
Grootscholten TIM, Steinbusch KJJ, Hamelers HVM, Buisman CJN. Improving medium chain fatty acid productivity using chain elongation by reducing the hydraulic retention time in an upflow anaerobic filter. Bioresour Technol. 2013;136:735–8.
Article
CAS
PubMed
Google Scholar
Nzeteu CO, Trego AC, Abram F, OʼFlaherty V. Reproducible, high-yielding, biological caproate production from food waste using a single-phase anaerobic reactor system. Biotechnol Biofuels; 2018;11:108.
Mansfeldt C, Achermann S, Men Y, Walser JC, Villez K, Joss A, et al. Microbial residence time is a controlling parameter of the taxonomic composition and functional profile of microbial communities. ISME J; 2019;13:1589–1601.
Bonk F, Popp D, Weinrich S, Sträuber H, Becker D, Kleinsteuber S, et al. Determination of microbial maintenance in acetogenesis and methanogenesis by experimental and modeling techniques. Front Microbiol. 2019;10:166.
Article
PubMed
PubMed Central
Google Scholar
Werner JJ, Knights D, Garcia ML, Scalfone NB, Smith S, Yarasheski K, et al. Bacterial community structures are unique and resilient in full-scale bioenergy systems. Proc Natl Acad Sci U S A. 2011;108:4158–63.
Article
CAS
PubMed
PubMed Central
Google Scholar
Oyetunde T, Bao FS, Chen JW, Martin HG, Tang YJ. Leveraging knowledge engineering and machine learning for microbial bio-manufacturing. Biotechnology. 2018;36:1308–15.
CAS
Google Scholar
Lopatkin AJ, Collins JJ. Predictive biology: modelling, understanding and harnessing microbial complexity. Nature Reviews Microbiol; 2020;
Astudillo-García C, Hermans SM, Stevenson B, Buckley HL, Lear G. Microbial assemblages and bioindicators as proxies for ecosystem health status: potential and limitations. Applied Microbiology and Biotechnology. Appl Microbiol Biotechnol; 2019;103:6407–6421.
Bodein A, Chapleur O, Droit A, Lê Cao KA. A generic multivariate framework for the integration of microbiome longitudinal studies with other data types. Front Genet. 2019;10:963.
Article
CAS
PubMed
PubMed Central
Google Scholar
Seshan H, Goyal MK, Falk MW, Wuertz S. Support vector regression model of wastewater bioreactor performance using microbial community diversity indices: effect of stress and bioaugmentation. Water Res. 2014;53:282–96.
Article
CAS
PubMed
Google Scholar
Hermans SM, Buckley HL, Case BS, Curran-Cournane F, Taylor M, Lear G. Using soil bacterial communities to predict physico-chemical variables and soil quality. Microbiome; 2020;8:79.
Yang F, Zou Q. mAML: an automated machine learning pipeline with a microbiome repository for human disease classification. Database. 2020;2020:baaa050.
Temudo MF, Mato T, Kleerebezem R, Van Loosdrecht MCM. Xylose anaerobic conversion by open-mixed cultures. Appl Microbiol Biotechnol. 2009;82:231–9.
Article
CAS
PubMed
PubMed Central
Google Scholar
Zhu X, Zhou Y, Wang Y, Wu T, Li X, Li D, et al. Production of high-concentration n-caproic acid from lactate through fermentation using a newly isolated Ruminococcaceae bacterium CPB6. Biotechnology for Biofuels; 2017;10:102.
Yoon SH, Ha SM, Kwon S, Lim J, Kim Y, Seo H, et al. Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies. Int J Syst Evol Microbiol. 2017;67:1613–7.
Article
CAS
PubMed
PubMed Central
Google Scholar
Sträuber H, Bühligen F, Kleinsteuber S, Dittrich-Zechendorf M. Carboxylic acid production from ensiled crops in anaerobic solid-state fermentation - trace elements as pH controlling agents support microbial chain elongation with lactic acid. Eng. Life Sci. 2018;0:447–58.
Google Scholar
Xu J, Hao J, Guzman JJL, Spirito CM, Harroff LA, Angenent LT. Temperature-phased conversion of acid whey waste into medium-chain carboxylic acids via lactic acid: no external e-donor. Joule. 2018;2:1–16.
Article
Google Scholar
Scarborough MJ, Lynch Griffin, Dickson Mitch, McGee Mick, Donohue TJ, Noguera DR. Increasing the economic value of lignocellulosic stillage through medium-chain fatty acid production. Biotechnol Biofuels; 2018;11:200.
Khor WC, Andersen S, Vervaeren H, Rabaey K. Electricity-assisted production of caproic acid from grass. Biotechnol Biofuels; 2017;10:180.
Andersen SJ, de Groof V, Khor WC, Roume H, Props R, Coma M, et al. A Clostridium group IV species dominates and suppresses a mixed culture fermentation by tolerance to medium chain fatty acids products. Front Bioeng Biotechnol. 2017;5:8.
Article
PubMed
PubMed Central
Google Scholar
Contreras-Dávila CA, Carrión VJ, Vonk VR, Buisman CNJ, Strik DPBTB. Consecutive lactate formation and chain elongation to reduce exogenous chemicals input in repeated-batch food waste fermentation. Water Res 2020;1:115215.
Vrancken G, Gregory AC, Huys GRB, Faust K, Raes J. Synthetic ecology of the human gut microbiota. Nature Reviews Microbiology; 2019;17:754–763.
Maus I, Klocke M, Derenkó J, Stolze Y, Beckstette M, Jost C, et al. Impact of process temperature and organic loading rate on cellulolytic/hydrolytic biofilm microbiomes during biomethanation of ryegrass silage revealed by genome-centered metagenomics and metatranscriptomics. Environ Microbiome. 2020;15:7.
Article
CAS
PubMed
PubMed Central
Google Scholar
Detman A, Mielecki D, Pleśniak Ł, Bucha M, Janiga M, Matyasik I, et al. Methane-yielding microbial communities processing lactate-rich substrates: a piece of the anaerobic digestion puzzle. Biotechnol Biofuels. 2018;11:116.
Article
PubMed
PubMed Central
Google Scholar
Zhu X, Feng X, Liang C, Li J, Jia J, Feng L, et al. Microbial ecological mechanism for long-term production of high concentrations of n-caproate via lactate-driven chain elongation. Appl Environ Microbiol. 2021;87.
Westerholm M, Müller B, Isaksson S, Schnürer A. Trace element and temperature effects on microbial communities and links to biogas digester performance at high ammonia levels. Biotechnol Biofuels. 2015;8:154.
Article
PubMed
PubMed Central
Google Scholar
Candry P, Radić L, Favere J, Carvajal-Arroyo JM, Rabaey K, Ganigué R. Mildly acidic pH selects for chain elongation to caproic acid over alternative pathways during lactic acid fermentation. Water Res. 2020;186:116396.
Article
CAS
PubMed
Google Scholar
Wu L, Yang Y, Chen S, Zhao M, Zhu Z, Yang S, et al. Long-term successional dynamics of microbial association networks in anaerobic digestion processes. Water Res. 2016;104:1–10.
Article
PubMed
Google Scholar
Topçuoğlu BD, Lesniak NA, Ruffin M, Wiens J, Schloss PD. A framework for effective application of machine learning to microbiome-based classification problems. mBio. 2020;11:e00434–20.
Article
PubMed
PubMed Central
Google Scholar
Fortino V, Wisgrill L, Werner P, Suomela S, Linder N, Jalonen E, et al. Machine-learning–driven biomarker discovery for the discrimination between allergic and irritant contact dermatitis. PNAS. 2020;117:33474–85.
Article
CAS
PubMed
PubMed Central
Google Scholar
Wirbel J, Zych K, Essex M, Karcher N, Kartal E, Salazar G, et al. Microbiome meta-analysis and cross-disease comparison enabled by the SIAMCAT machine learning toolbox. Genome Biol. 2021;22:93.
Article
PubMed
PubMed Central
Google Scholar
Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inform Decis Mak. 2019;19:281.
Article
PubMed
PubMed Central
Google Scholar
Krawczyk B. Learning from imbalanced data: open challenges and future directions. Prog Artif Intell. 2016;5:221–32.
Article
Google Scholar
Bokulich NA, Ziemski M, Robeson MS, Kaehler BD. Measuring the microbiome: best practices for developing and benchmarking microbiomics methods. Comput Struct Biotechnol J. 2020;18:4048–62.
Article
CAS
PubMed
PubMed Central
Google Scholar
Breiman L. Random forests. Machine Learning. 2001;45:5–32.
Article
Google Scholar
Zhou YH, Gallins P. A review and tutorial of machine learning methods for microbiome host trait prediction. Front Genet. 2019;10:579.
Article
PubMed
PubMed Central
Google Scholar
Pasolli E, Truong DT, Malik F, Waldron L, Segata N. Machine learning meta-analysis of large metagenomic datasets: tools and biological insights. PLOS Computational Biology. Public Library of Science; 2016;12:e1004977.
Xiao J, Chen L, Johnson S, Yu Y, Zhang X, Chen J. Predictive modeling of microbiome data using a phylogeny-regularized generalized linear mixed model. Front Microbiol. 2018;9:1391.
Article
PubMed
PubMed Central
Google Scholar
Xiao J, Chen L, Yu Y, Zhang X, Chen J. A Phylogeny-regularized sparse regression model for predictive modeling of microbial community data. Front Microbiol. 2018;9:3112.
Article
PubMed
PubMed Central
Google Scholar
Saraiva JP, Worrich A, Karakoç C, Kallies R, Chatzinotas A, Centler F, et al. Mining synergistic microbial interactions: a roadmap on how to integrate multi-omics data. Microorganisms; 2021;9:840.
Dʼhoe K, Vet S, Faust K, Moens F, Falony G, Gonze D, et al. Integrated culturing, modeling and transcriptomics uncovers complex interactions and emergent behavior in a three-species synthetic gut community. eLife. 2018;7:e37090.
Article
PubMed
PubMed Central
Google Scholar
Mei R, Liu W-T. Quantifying the contribution of microbial immigration in engineered water systems. Microbiome; 2019;7:144.
Sträuber H, Lucas R, Kleinsteuber S. Metabolic and microbial community dynamics during the anaerobic digestion of maize silage in a two-phase process. In: Applied Microbiology and Biotechnology, vol. 100. Berlin Heidelberg: Springer; 2016. p. 479–91.
Google Scholar
Scarborough MJ, Lawson CE, Hamilton JJ, Donohue TJ, Noguera DR. Metatranscriptomic and thermodynamic insights into medium-chain fatty acid production using an anaerobic microbiome. mSystems. 2018;3:e00221–18.
Article
CAS
PubMed
PubMed Central
Google Scholar
Shahab RL, Brethauer S, Davey MP, Smith AG, Vignolini S, Luterbacher JS, et al. A heterogeneous microbial consortium producing short-chain fatty acids from lignocellulose. Science. 2020;369:eabb1214.
Chase JM. Stochastic community assembly causes higher biodiversity in more productive environments. science. 2010;328:1388–91.
Article
CAS
PubMed
Google Scholar
Ofiteru ID, Lunn M, Curtis TP, Wells GF, Criddle CS, Francis CA, et al. Combined niche and neutral effects in a microbial wastewater treatment community. Proc Natl Acad Sci. 2010;107:15345–50.
Article
CAS
PubMed
PubMed Central
Google Scholar
Urban C, Xu J, Sträuber H, dos Santos Dantas TR, Mühlenberg J, Härtig C, et al. Production of drop-in fuel from biomass by combined microbial and electrochemical conversions. Energ Environ Sci. 2017;10:2231–44.
Article
CAS
Google Scholar
Lucas R, Kuchenbuch A, Fetzer I, Harms H, Kleinsteuber S. Long-term monitoring reveals stable and remarkably similar microbial communities in parallel full-scale biogas reactors digesting energy crops. FEMS Microbiol Ecol 2015;91:fiv004.
Klindworth A, Pruesse E, Schweer T, Peplies J, Quast C, Horn M, et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 2013;41:e1.
Article
CAS
PubMed
Google Scholar
Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Chase J, Cope EK, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol. 2019;37:852–7.
Article
CAS
PubMed
PubMed Central
Google Scholar
Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: high-resolution sample inference from Illumina amplicon data. Nat Methods. 2016;13:581–3.
Article
CAS
PubMed
PubMed Central
Google Scholar
McIlroy SJ, Kirkegaard RH, McIlroy B, Nierychlo M, Kristensen JM, Karst SM, et al. MiDAS 2.0: an ecosystem-specific taxonomy and online database for the organisms of wastewater treatment systems expanded for anaerobic digester groups. Database. 2017;2017:1–9.
Wang Q, Garrity GM, Tiedje JM, Cole JR. Naıve Bayesian classifier for rapid assignment of rRNA sequences. Appl Environ Microbiol. 2007;73:5261–7.
Article
CAS
PubMed
PubMed Central
Google Scholar
Shannon CE. A mathematical theory of communication. Bell Syst Tech J. 1948;27:379–423.
Article
Google Scholar
McMurdie PJ, Holmes S. Phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One. 2013;8:e61217.
Article
CAS
PubMed
PubMed Central
Google Scholar
Bray JR, Curtis JT. An ordination of the upland forest communities of southern Wisconsin. Ecological Monographs. 1957;27:325–49.
Article
Google Scholar
Anderson MJ. A new method for non-parametric multivariate analysis of variance. Austral Ecol. 2001;26:32–46.
Google Scholar
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple Testing. J Royal Stat Soc B (Methodological). 1995;57:289–300.
Article
Google Scholar
Ju F, Xia Y, Guo F, Wang Z, Zhang T. Taxonomic relatedness shapes bacterial assembly in activated sludge of globally distributed wastewater treatment plants. Environ Microbiol. 2014;16:2421–32.
Article
CAS
PubMed
Google Scholar
Bastian M, Heymann S, Jacomy M. Gephi: an open source software for exploring and manipulating networks. BT - International AAAI Conference on Weblogs and Social. International AAAI Conference on Weblogs and Social Media. 2009;8:361–2.
Article
Google Scholar
Pruesse E, Peplies J, Glöckner FO. SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes. Bioinformatics. 2012;28:1823–9.
Article
CAS
PubMed
PubMed Central
Google Scholar
Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig W, Peplies J, et al. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. 2007;35:7188–96.
Article
CAS
PubMed
PubMed Central
Google Scholar
Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44:W242–5.
Article
CAS
PubMed
PubMed Central
Google Scholar
Uritskiy G V., Diruggiero J, Taylor J. MetaWRAP - a flexible pipeline for genome-resolved metagenomic data analysis 08 Information and Computing Sciences 0803 Computer Software 08 Information and Computing Sciences 0806 Information Systems. Microbiome; 2018;6:158.
Galore K. Trim Galore!: a wrapper tool around Cutadapt and FastQC to consistently apply quality and adapter trimming to FastQ files [Internet]. 2015. Available from: https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/
Rotmistrovsky, K. Agarwala R. BMTagger: best match tagger for removing human reads from metagenomics datasets [Internet]. 2011. Available from: ftp://ftp.ncbi.nlm.nih.gov/pub/agarwala/bmtagger/
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. MetaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27:824–34.
Article
CAS
PubMed
PubMed Central
Google Scholar
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.
Article
CAS
PubMed
PubMed Central
Google Scholar
Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3:e1165.
Article
PubMed
PubMed Central
Google Scholar
Wu YW, Simmons BA, Singer SW. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics. 2016;32:605–7.
Article
CAS
PubMed
Google Scholar
Alneberg J, Bjarnason BS, De Bruijn I, Schirmer M, Quick J, Ijaz UZ, et al. Binning metagenomic contigs by coverage and composition. Nat Methods. 2014;11:1144–6.
Article
CAS
PubMed
Google Scholar
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55.
Article
CAS
PubMed
PubMed Central
Google Scholar
Parks DH, Rinke C, Chuvochina M, Chaumeil PA, Woodcroft BJ, Evans PN, et al. Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life. Nature Microbiology; 2017;2:1533–1542.
Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019;36:1925–7.
PubMed Central
Google Scholar
Bushnell B. BBMap short read aligner, and other bioinformatic tools [Internet]. Available from: http://sourceforge.net/projects/bbmap
Katz L, Griswold T, Morrison S, Caravas J, Zhang S, Bakker H, et al. Mashtree: a rapid comparison of whole genome sequence files. J Open Source Software. 2019;4:1762.
Article
Google Scholar
Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun. 2018;9:5114.
Article
PubMed
PubMed Central
Google Scholar
Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–9.
Article
CAS
PubMed
Google Scholar
Bateman A. UniProt: A worldwide hub of protein knowledge. Nucleic Acids Res. 2019;47:D506–15.
Article
Google Scholar
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, et al. The COG database: an updated vesion includes eukaryotes. BMC Bioinformatics. 2003;4:41.
Article
PubMed
PubMed Central
Google Scholar
Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. GenBank. Nucleic Acids Res. 2016;44:D67–72.
Article
CAS
PubMed
Google Scholar
Liaw A, Wiener M. Classification and regression with random forest. R News. 2002;2:18–22.
Google Scholar
Huang BFF, Boutros PC. The parameter sensitivity of random forests. BMC Bioinformatics. BMC Bioinformatics; 2016;17:331.
Menze BH, Kelm BM, Masuch R, Himmelreich U, Bachert P, Petrich W, et al. A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data. BMC Bioinformatics. 2009;10:213.
Article
PubMed
PubMed Central
Google Scholar
Branco P, Ribeiro RP, Torgo L. UBL: an R package for utility-based learning. arXiv preprint. 2016;arXiv:1604.08079.
Wright MN, Ziegler A. Ranger: a fast implementation of random forests for high dimensional data in C++ and R. J Stat Softw. 2017;77:i01.
Article
Google Scholar
Bischl B, Lang M, Kotthoff L, Schiffner J, Richter J, Studerus E, et al. Mlr: machine learning in R. J Machine Learn Res. 2016;17:1–5.
Google Scholar