Skip to main content
Fig. 5 | Microbiome

Fig. 5

From: Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes

Fig. 5

Performance of the random forest model. a Confusion matrix showing the performance of the random forest model on the 20% testing data group of the holdout validation. b Prediction accuracy of the random forest model determined based on 10-fold cross-validation. c ROC curves for evaluating the random forest model created from 10-fold cross-validation. d The completeness and contamination of correctly predicted MAGs and wrongly predicted MAGs. Boxplots along the x- and y-axes show the means and quartiles of the completeness and contamination values of correctly and wrongly predicted MAGs

Back to article page