- Open Access
Mobile antibiotic resistome in wastewater treatment plants revealed by Nanopore metagenomic sequencing
Microbiome volume 7, Article number: 44 (2019)
Wastewater treatment plants (WWTPs) are recognized as hotspots for horizontal gene transfer (HGT) of antibiotic resistance genes (ARGs). Despite our understanding of the composition and distribution of ARGs in WWTPs, the genetic location, host, and fate of ARGs remain largely unknown.
In this study, we combined Oxford Nanopore and Illumina metagenomic sequencing to comprehensively uncover the resistome context of influent, activated sludge, and effluent of three WWTPs and simultaneously track the hosts of the ARGs. The results showed that most of the ARGs detected in all compartments of the WWTPs were carried by plasmids. Transposons and integrons also showed higher prevalence on plasmids than on the ARG-carrying chromosome. Notably, integrative and conjugative elements (ICEs) carrying five types of ARGs were detected, and they may play an important role in facilitating the transfer of ARGs, particularly for tetracycline and macrolide-lincosamide-streptogramin (MLS). A broad spectrum of ARGs carried by plasmids (29 subtypes) and ICEs (4 subtypes) was persistent across the WWTPs. Host tracking showed a variety of antibiotic-resistant bacteria in the effluent, suggesting the high potential for their dissemination into receiving environments. Importantly, phenotype-genotype analysis confirmed the significant role of conjugative plasmids in facilitating the survival and persistence of multidrug-resistant bacteria in the WWTPs. Finally, the consistency in the quantitative results for major ARG types revealed by the Nanopore and Illumina sequencing platforms demonstrated the feasibility of Nanopore sequencing for resistome quantification.
Overall, these findings substantially expand our current knowledge of resistome in WWTPs, and help establish a baseline analysis framework to study ARGs in the environment.
The emergence and spread of antibiotic resistance genes (ARGs) have raised serious public health concerns. Wastewater treatment plants (WWTPs), as a unique interface between humans and environments, harbor a large microbial genetic diversity, facilitating the exchange of ARGs by horizontal gene transfer (HGT). Mobile genetic elements (MGEs), including plasmids, integrative and conjugative elements (ICEs; also called conjugative transposons), transposons and integrons, are a means to transfer genetic information between bacterial cells or within the genome of a cell. Importantly, multiple ARGs are often located on MGEs, which makes the transfer of resistance easy to achieve even between bacteria from distant taxonomic lineages [3,4,5]. Antibiotics and other co-selection factors in sewage form a persistent selection pressure for ARGs and antibiotic-resistant bacteria (ARB) associated with MGEs in the WWTPs. To understand the dynamic dissemination of ARGs and multidrug-resistant ARBs in WWTPs, the ARGs associated with MGEs must be elucidated. Additionally, identifying the hosts of the ARGs is crucial to reveal their fates in the wastewater treatment process.
Previous studies have attempted to reveal the genetic contexts (including MGEs) and hosts of the ARGs with different approaches, including isolation [7, 8], high-throughput sequencing [9, 10], and epicPCR (emulsion, paired isolation, and concatenation PCR). Pure culture isolation, combined with whole-genome sequencing, has been and remains an important method to determine the phenotypic and genotypic correlations for ARBs and identify the MGEs with which they are associated [12, 13]. However, only a limited fraction of bacteria in WWTPs can be cultured and isolated, which seriously limits the application of pure culture isolation to explore the environmental resistome. High-throughput sequencing has greatly increased our knowledge of the diversity and abundance of ARGs in WWTPs [14, 15]. However, the ARG-carrying species and the genetic contexts remain poorly understood because of the short read length generated by Illumina sequencing. Although assembly of short reads might provide such information, the frequent repetitive sequences flanking ARGs carried on MGEs usually hamper the effective assembly of genetic contexts of ARGs. EpicPCR was developed to link functional genes and phylogenetic markers, such as 16S rRNA, in single cells, and has been used to identify the hosts of ARGs. However, this PCR-based method requires the sequence information of the target genes of interest, and only one functional target gene can be sorted each time, although the phylogenetic marker gene could be a universal one. Taken together, a robust method is urgently required to resolve both the genetic environments and the hosts of the ARGs in a high-throughput format.
Third-generation sequencing technologies, including Pacific Biosciences (PacBio) and Oxford Nanopore sequencing, generate long reads (up to 2.27 Mb, as reported) that can span most repetitive sequences and provide an opportunity to link the ARGs and their flanking regions, and thus, the knowledge and technology gap mentioned above can be bridged. Compared with the PacBio platform, Oxford MinION has the advantages of producing raw data in real time and it is a more easily accessible and efficient tool for genome assembly and complex structural detection [18,19,20].
In this study, we report on the first workflow based on Nanopore and Illumina sequencing to rapidly profile the genetic location and track the hosts of ARGs, particularly for potential ARG-carrying pathogens along the wastewater treatment process. Additionally, the correlation between phenotype and genotype of multidrug-resistant bacteria in both influent and effluent of three WWTPs was determined based on the combination of cultivation and Nanopore sequencing.
Sample collection and pretreatment
Nine samples (i.e., influent, activated sludge, and effluent) were collected from three full-scale wastewater treatment plants (WWTPs) in Hong Kong: Shatin STP (22.407° N, 114.214° E), Shek Wu Hui STP (22.510° N, 114.119° E), and Stanley STP (22.219° N, 114.210° E). The influent and activated sludge samples were fixed on site using an equal volume of 100% ethanol. Additionally, 50 mL of influent without fixation was collected from each of the WWTPs for further multidrug-resistant bacteria isolation. All the samples were transported to the lab for immediate processing (within 2 h). For each influent sample, pellets were collected after centrifuging 50 mL of the sample at 5000 rpm for 15 min at room temperature. For each effluent sample, solids were collected after filtering 1 L of the sample through a 0.45 μm cellulose nitrate membrane. All the processed samples were stored at − 20 °C before DNA extraction.
Cultivation of multidrug-resistant bacteria
One hundred microliters of each influent without ethanol fixation was plated onto Lysogeny broth (LB) agar plate supplemented with 100 mg/L ampicillin, 50 mg/L kanamycin, 20 mg/L tetracycline, and 25 mg/L chloramphenicol. For the effluent samples, the bacteria pellets on the membranes were resuspended in 1 mL of LB medium, and 100 μL of the resuspension was plated onto the same kind of LB agar plate as described above. After incubating overnight (12 h) at 37 °C, the mixed multidrug-resistant culture used for DNA extraction and downstream analysis for each sample was collected by washing the colonies/isolates grown on the incubated LB plate (approximately 500 colonies/isolates) three times using LB.
DNA extraction, size selection, and purification
DNA extraction for the influent, activated sludge, and effluent samples was conducted using FastDNA® Spin Kit for Soil (MP Biomedicals, USA) following the manufacturer’s instructions. Total genomic DNA of the mixed isolates from each plate was extracted using a DNeasy PowerSoil Kit (Qiagen, Germany). Equal amounts of DNA from each of the three influent cultures from the three WWTPs were mixed; the same procedure was also conducted for the effluent cultures. After electrophoresis, DNA fragments larger than 8 kb were manually excised from the agarose gel and recovered using the Monarch® DNA Gel Extraction Kit (NEB Inc., USA). The recovered DNA was purified using AMPure XP beads (Beckman Coulter). DNA concentrations and purity were determined using microspectrophotometry (NanoDrop ND-1000; Wilmington, DE). DNA samples of sufficient purity (OD 260/280 of ~ 1.8 and OD 260/230 of 2.0–2.2) were used for library preparation.
MinION library preparation and sequencing
The sequencing library preparation was performed using SQK-LSK108 1D ligation genomic DNA kit following the procedures described below. For each DNA sample, 1.5–2.0 μg of DNA recovered from agarose gel purification was used.
End-repair and dA-tailing were performed using the NEBNext Ultra II End-Repair/dA-tailing Module. In detail, 7 μL of Ultra II End-Prep buffer, 3 μL of Ultra II End-Prep enzyme mix, 5 μL of NFW, and ~ 1.2 μg of DNA were mixed and incubated at 20 °C for 20 min, followed by a second incubation at 65 °C for 15 min.
Sixty microliters of AMPure XP beads was used for purification.
Ligation was performed by adding 20 μL of Adaptor Mix and 50 μL of Blunt/TA Ligation Master Mix to the 30 μL of dA-tailed DNA and then incubation was performed at room temperature for 15 min.
Another purification for the removal of remaining adapters from the adapter-ligated DNA was conducted using 40 μL of AMPure XP beads and ABB buffer supplied in the kit. The purified-ligated DNA was resuspended using 25 μL of ELB, and then the concentration was measured by Qubit to ensure ≥ 500 ng of DNA was retained.
Finally, MinION sequencing was performed using R9.4 flow cells (FLO-MIN 106). A total of eleven independent MinION runs were conducted for all the samples.
MinION data analysis
Raw reads generated by MinKNOW were base-called using Albacore (v2.1.10) to return fastq files. Passed reads were trimmed for adapters using PoreChop (0.2.3, https://github.com/rrwick/Porechop), and the parameter “--discard_middle” was used to remove the reads with internal adapters. Statistical analysis of the MinION sequencing data was generated and visualized using NanoPack. The base-called data sets were deposited into the NCBI SRA database with the following accession numbers: SAMN09603371-SAMN09603381.
To identify the antibiotic resistance genes, 1D reads after adapter removal were aligned to the nucleotide sequences of the SARG database using the LAST tool (version 926), which is recommended for reads with high error rates, with the settings “-a 1 -b 1 -q 2”; next, the alignments were filtered using strict cutoffs for alignment length and similarity. Only alignments covering > 95% of the ARG length with similarity > 80% were used for the analysis. When overlapping ARG-containing regions (> 80%) were detected, only the best ARG hit was kept for that region. Then, PlasFlow was used to identify the ARGs carried by plasmids from all the detected ARG reads. Taxonomic identification of the ARG-carrying reads was conducted using Centrifuge (v1.0.3) with the NCBI nonredundant nucleotide sequence database; the classification results were visualized with Pavian (https://github.com/fbreitwieser/pavian). Identification of transposons and integrons located on the ARG-carrying reads was performed using the LAST tool (version 926) to align the sequences to the concatenated protein database of the NCBI Reference Sequence Database (RefSeq), and only the alignments showing 60% amino acid identity over more than 60% of the marker protein (transposase or integrase) length were kept. ARGs carried by ICEs were identified based on similarity alignment (> 80%) against the ICEs database downloaded from ICEberg, pre-filtered by the criterion of > 50% alignment length over the ARG-carrying reads, and then double-confirmed by cross-validation using a blast search against the NCBI non-redundant nucleotide sequence database and manual inspection.
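The ARG cutoffs described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the `Hit` record and its field names are hypothetical, "best hit" is taken to mean the highest alignment score, and the overlap rule is interpreted as two hits on the same read overlapping by more than 80% of the shorter hit.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hit:
    """One read-to-ARG alignment (hypothetical record, not a LAST output format)."""
    read_id: str
    arg_id: str
    read_start: int   # start of the aligned region on the read
    read_end: int     # end of the aligned region on the read (exclusive)
    aln_len: int      # aligned length on the ARG reference
    arg_len: int      # full length of the ARG reference
    identity: float   # percent similarity
    score: float      # alignment score, used here to rank competing hits


def passes_cutoff(hit: Hit, min_cov: float = 0.95, min_identity: float = 80.0) -> bool:
    """Keep alignments covering > 95% of the ARG with > 80% similarity."""
    return hit.aln_len / hit.arg_len > min_cov and hit.identity > min_identity


def overlap_fraction(a: Hit, b: Hit) -> float:
    """Overlap on the read, as a fraction of the shorter aligned region (assumption)."""
    overlap = min(a.read_end, b.read_end) - max(a.read_start, b.read_start)
    shorter = min(a.read_end - a.read_start, b.read_end - b.read_start)
    return max(0, overlap) / shorter if shorter > 0 else 0.0


def filter_hits(hits: List[Hit], max_overlap: float = 0.80) -> List[Hit]:
    """Apply the cutoffs, then keep only the best-scoring hit per overlapping region."""
    kept: List[Hit] = []
    candidates = sorted((h for h in hits if passes_cutoff(h)), key=lambda h: -h.score)
    for hit in candidates:
        clash = any(
            k.read_id == hit.read_id and overlap_fraction(k, hit) > max_overlap
            for k in kept
        )
        if not clash:
            kept.append(hit)
    return kept
```

Visiting candidates from best to worst score means a region is always claimed by its strongest hit first, so weaker hits on the same region are dropped without a separate clustering step.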
The extracted DNA (~ 5 μg for each sample) was sent out for high-throughput metagenomics sequencing on the Illumina Hiseq4000 platform using the PE150 strategy at the Novogene Corporation (Beijing, China). On average, 14.5 Gb reads were generated for each sample, and all the sequenced datasets were deposited into the NCBI SRA database (PRJNA505617).
Illumina data analysis
For each metagenomic dataset, raw reads were filtered to remove reads in which low-quality bases (Qscore ≤ 5) made up more than 50% of the total bases, reads containing adapters, and reads with ambiguous bases (N > 10%). Clean data generated from each sample were de novo assembled using CLC’s Genomic Workbench (version 6.04, QIAGEN Bioinformatics, Denmark) with the default parameters, yielding a total of 4,062,135 contigs. Open reading frames (ORFs) were predicted for each assembled contig set using Prodigal (v2.6.3). Then the ARG-like ORFs were determined using BLASTN against the SARG nucleotide database mentioned above at E value ≤ 10^-7 with a minimum similarity of 80% over 95% query coverage. PlasFlow was used to predict plasmid sequences for all ARG-carrying contigs. To compare the taxonomic affiliation between Illumina and Nanopore sequencing platforms for the mixed influent multidrug-resistant cultures, Centrifuge was also used for the classification of Illumina metagenomic sequences. In addition, these metagenomic data were used for community profiling with Ribotagger using reads annotated as 16S rRNA V4 and V6 regions. Additionally, EMIRGE was applied to reconstruct near-full-length 16S rRNA sequences with 40 iterations. The resulting 16S rRNA sequences were phylogenetically annotated against the NCBI-nt database (online nucleotide BLAST), and the relative abundance of the reconstructed genes was estimated using the probabilistic accounting of reads in EMIRGE.
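The read-level quality filter described above can be illustrated with a short Python sketch. It is a hedged approximation rather than the provider's actual filtering tool: the function names are hypothetical, Phred+33 quality encoding is assumed, and adapter screening is left out.

```python
def phred_scores(quality: str, offset: int = 33) -> list:
    """Decode a FASTQ quality string into Phred scores (Phred+33 is an assumption)."""
    return [ord(char) - offset for char in quality]


def keep_read(seq: str, quality: str, low_q: int = 5,
              max_low_frac: float = 0.5, max_n_frac: float = 0.10) -> bool:
    """Return True if the read survives the quality and ambiguity filters.

    A read is discarded when bases with Qscore <= low_q exceed max_low_frac of
    the read, or when N bases exceed max_n_frac of the read.
    """
    if not seq or len(seq) != len(quality):
        return False
    low_quality = sum(1 for q in phred_scores(quality) if q <= low_q)
    ambiguous = seq.upper().count("N")
    return (low_quality / len(seq) <= max_low_frac
            and ambiguous / len(seq) <= max_n_frac)
```

For paired-end data, one reasonable design (again an assumption) is to drop the pair when either mate fails `keep_read`, so that the two files stay in sync.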
Comparison of resistome profiles based on Nanopore and Illumina sequencing
To compare the results generated from Illumina and Nanopore sequencing, ARG number per million base pairs was used for the quantification. Briefly, after the ARG-like reads were extracted, the ARG abundance was calculated and normalized by the length of the reference genes in the SARG database and the total sequencing depth. The data were then analyzed using the Pearson correlation coefficient and linear regression analysis.
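A minimal Python sketch of this comparison follows. The exact normalization formula is not given in the text, so the version here (each hit contributes aligned length divided by reference length, and the sum is divided by sequencing depth in millions of base pairs) is an assumption, and all function names are hypothetical.

```python
import math


def arg_copies_per_mbp(hits, total_bases):
    """Length-normalized ARG copies per million sequenced base pairs.

    hits: iterable of (aligned_length, reference_length) pairs for one ARG type.
    total_bases: total sequencing depth of the sample in base pairs.
    """
    copies = sum(aligned / reference for aligned, reference in hits)
    return copies / (total_bases / 1e6)


def _moments(x, y):
    """Return means and centered sums of squares and cross-products."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equally long series with at least two points")
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return mean_x, mean_y, sxy, sxx, syy


def pearson_r(x, y):
    """Pearson correlation coefficient between two abundance profiles."""
    _, _, sxy, sxx, syy = _moments(x, y)
    return sxy / math.sqrt(sxx * syy)


def linear_fit(x, y):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    mean_x, mean_y, sxy, sxx, _ = _moments(x, y)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x
```

In use, `x` and `y` would hold the per-type abundances of the same ARG types from the two platforms, in the same order.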
Results and discussion
MinION and Illumina sequencing read statistics
Nanopore sequencing on average generated 3.4 Gb of base-called data with an N50 range from 5872 to 10,674 bp for the nine metagenomic libraries, and the longest single read was 73,530 bp. Compared with these libraries sequenced directly from environmental samples (Additional file 1: Table S1 and Figure S1), the two sequencing libraries constructed for the multidrug-resistant bacteria cultured from influent and effluent yielded much longer reads, as indicated by N50 (16,578 bp), with relatively high throughput (avg. 3.9 Gb per flow cell), but even longer reads will likely be achievable by choosing different strategies in DNA extraction and library preparation methods. For example, DNA fragment length will be limited to ~ 60 kb when using spin column kits, whereas larger DNA size can be obtained when using kits based on gravity flow columns (100–200 kb) or the traditional phenol-chloroform method (> 150 kb). Notably, the Nanopore reads (avg. 5.3 kb; N50 8.1 kb) were much longer than the contigs (avg. 1.4 kb; N50 1.7 kb) assembled from Illumina short reads, even though the Illumina sequencing depth was as deep as 14.5 Gb (Additional file 1: Table S1). These longer Nanopore reads facilitated profiling the genetic location of ARGs and tracking their hosts in WWTP microbiomes in this study.
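Because the comparison above relies on N50, a short Python sketch of how that statistic is usually computed may help. It uses the common definition (the length L such that reads or contigs of length at least L contain half of all bases); the function name is hypothetical and the study itself reports values from NanoPack.

```python
def n50(lengths):
    """Return the N50 of a collection of read or contig lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one length is required")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the cumulative sum reaches half the bases.
        if running >= half_total:
            return length
```

Unlike the mean, N50 is weighted by bases rather than by sequence count, which is why it sits above the average length in both the Nanopore and Illumina figures quoted above.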
Plasmids and ICEs carrying ARGs dominate the resistome in WWTPs
As shown in Additional file 1: Table S2, 1791 ARG-carrying Nanopore reads with an average length of 8.5 kb were identified, many more than the 316 ARG-carrying contigs (avg. 2.9 kb) assembled from Illumina short reads, indicating the difficulty of assessing the genetic context of ARGs by assembling Illumina short reads into long contigs.
The resistome of influent, activated sludge, and effluent of the WWTPs, represented by the 1791 ARG-carrying long reads and 316 contigs of the nine environmental metagenomic samples, was categorized into two major groups based on HGT potential: (1) the intercellularly mobile group, i.e., ARGs carried by plasmids and ICEs that could be transferred between bacterial cells through transformation or conjugation, and (2) the chromosomal group, i.e., ARGs located on chromosomes that cannot transfer themselves unless integrated into the members of the first group. As shown in Table 1, ARGs in the mobile group on average accounted for 55% of the total number of ARGs revealed by Nanopore sequencing, whereas those on the chromosome accounted for 29%, in addition to 16% that could not be clearly assigned to either of these two groups. Not surprisingly, nearly all types of ARGs in the SARG database were detected in the mobile group, demonstrating the wide distribution of these ARGs on plasmids and ICEs (Fig. 1). Although several ARG types in some samples were only detected by Illumina reads (Additional file 1: Table S3), the relative abundance of these ARGs not detected by Nanopore sequencing was very low (only 4% of the total resistome revealed by Illumina sequencing on average). This result indicated that even though the sequencing throughput was relatively low for Nanopore sequencing at the current stage, major ARG types could be sufficiently covered at the current depth. Note that, except for multidrug ARGs, all other detected ARGs, including aminoglycoside, macrolide-lincosamide-streptogramin (MLS), beta-lactam, tetracycline, sulfonamide, chloramphenicol, quinolone, and trimethoprim, were carried mostly by plasmids, indicating that plasmids were a substantial part of the resistome of the WWTPs (Fig. 1). These findings were consistent with the results of Illumina assemblies (Additional file 1: Table S4).
The quantitative results obtained in this study using Nanopore sequencing further expand the previous descriptive findings that the microbial community of WWTPs can contain a significant collection of plasmids encoding resistance to nearly all clinically relevant antibiotics [34,35,36,37].
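The grouping into mobile, chromosomal, and unassigned ARGs can be summarized with a few lines of Python. This is only a sketch: the location labels ("plasmid", "ICE", "chromosome") are hypothetical stand-ins for the combined PlasFlow and ICEberg annotations, and any other label is treated as unassigned.

```python
from collections import Counter

MOBILE_LOCATIONS = {"plasmid", "ICE"}  # hypothetical annotation labels


def group_of(location: str) -> str:
    """Map a per-read genetic location onto one of the three groups."""
    if location in MOBILE_LOCATIONS:
        return "mobile"
    if location == "chromosome":
        return "chromosomal"
    return "unassigned"


def group_percentages(locations):
    """Percentage of ARG-carrying reads falling into each group."""
    counts = Counter(group_of(location) for location in locations)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no ARG-carrying reads supplied")
    return {group: 100 * counts[group] / total
            for group in ("mobile", "chromosomal", "unassigned")}
```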
In addition to the ARGs carried by plasmids, 13 ARG subtypes belonging to 5 ARG types were carried by ICEs (Fig. 1), including aminoglycoside (e.g., aadE), beta-lactam (e.g., CfxA, CfxA2, and CfxA3), chloramphenicol (e.g., catQ), MLS (e.g., ermB, mefA/E, ermF, ermG, and ermC), and tetracycline (e.g., tetQ, tetM, and tetO). More strikingly, tetracycline and MLS resistance genes located on ICEs had a relatively high abundance compared with that of other ARG types, accounting for 22% and 17% of the corresponding ARG type, respectively. Hence, ICEs may play an important role in the spread of ARGs in WWTPs, particularly for those belonging to tetracycline and MLS. The ARGs carried by ICEs revealed in this study have been reported to be pivotal in driving the emergence of multidrug-resistance in diverse gram-positive and gram-negative pathogens [38,39,40].
ARGs carried by transposons (excluding conjugative transposons) and integrons are incapable of moving intercellularly, but they can “hitch a ride” on the intercellular mobile elements and therefore can further increase the possibility of HGT. As shown by the distribution patterns on both plasmids and chromosomes (Fig. 1), the transposons and integrons were generally detected in association with ARGs more frequently on plasmids than on the chromosome. Six ARG types (trimethoprim, chloramphenicol, sulfonamide, tetracycline, beta-lactam, and aminoglycoside) carried by plasmids were closely related to (> 50% reads in each type) transposons and integrons, whereas two types (chloramphenicol and sulfonamide) were detected on the chromosome. Thus, the combination of these elements creates an ideal environment on plasmids that enables ARGs to be quickly recruited and accumulated through transposition and recombination [41, 42]. Collectively, these results demonstrated that the ARGs carried by plasmids and ICEs dominated the resistome in the WWTPs and that the transposable elements and integrons might further increase the mobility of plasmid-mediated ARGs.
As to the occurrence and dynamics of the ARGs across the treatment process, notably, the effluent had an even higher proportion of plasmids and ICEs carrying ARGs (ranging from 62 to 66%) than that of the influent (ranging from 54 to 57%, P < 0.01 by t test) and activated sludge (ranging from 41 to 53%, P < 0.01 by t test) (Table 1). This result was consistent with the recent findings that the relative abundance of MGE-associated ARGs increased in the effluent compared with influent. The increase in proportion observed in the effluent might be an indication of the threat posed by the effluent of WWTPs on the dissemination of MGE-associated ARGs in the receiving water [44, 45].
Prevalence and persistence of ARGs revealed by MGEs and host-tracking
Because of the remarkable resistance mobility observed in this study, we then tracked the ARGs associated with MGEs and hosts of ARGs localized on the chromosome through the treatment process (Fig. 2). A total of 29 ARG subtypes belonging to 8 types carried by plasmids were identified in both influent and effluent in at least 1 of the 3 WWTPs (Fig. 2a). Of the 29 ARG subtypes, 23 were shared by all the WWTPs, and 10 (e.g., aadA, aadA2, blaA, VEB-3, catB, cmlA, mefC, mphD, sul1, and tetA) were detected in all 3 effluents. Particularly, the tetA (tetracycline-resistant) gene and sul1 (sulfonamide-resistant) gene carried by plasmids had a persistent prevalence in all the compartments of the three WWTPs. These results are consistent with previous reports that many plasmids isolated from WWTPs encode most of the persistent ARGs detected in this study [34, 46]. However, based on metagenomic sequencing alone, it is currently impossible to determine the hosts of plasmids because of their ability to move across different species. In future studies, Nanopore sequencing, combined with other technologies, such as high-throughput chromosome conformation capture (Hi-C), could help to bridge this knowledge gap.
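The persistence criteria used above (detected in both influent and effluent of a plant, shared across plants, present in every effluent) reduce to set operations. The Python sketch below shows one way to express them; the nested dictionary layout and the function name are hypothetical, and a non-empty input is assumed.

```python
def persistent_subtypes(detections):
    """Summarize ARG subtype persistence across plants.

    detections: {plant: {"influent": set_of_subtypes, "effluent": set_of_subtypes}}
    (hypothetical layout; must contain at least one plant).
    """
    # Subtypes seen in both influent and effluent of the same plant.
    per_plant = {plant: stages["influent"] & stages["effluent"]
                 for plant, stages in detections.items()}
    # Persistent in at least one plant.
    in_any_plant = set().union(*per_plant.values())
    # Persistent in every plant.
    in_all_plants = set.intersection(*per_plant.values())
    # Present in the effluent of every plant.
    in_all_effluents = set.intersection(
        *(stages["effluent"] for stages in detections.values()))
    return {
        "per_plant": per_plant,
        "in_any_plant": in_any_plant,
        "in_all_plants": in_all_plants,
        "in_all_effluents": in_all_effluents,
    }
```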
ARGs carried by ICEs were also tracked across the treatment process. Four ARGs, including cfxA (extended-spectrum-beta-lactam-resistant), mefA/E (macrolides-resistant), tetQ (tetracycline-resistant), and tetM (tetracycline-resistant), were detected in all three WWTPs (Fig. 2b). Both cfxA and tetQ genes were carried by ICEs from Bacteroides. It has been reported that the prevalence of tetQ in Bacteroides has increased dramatically from approximately 30% to more than 80% because of HGT, and the involvement of Tn4555 in spreading the cfxA gene in Bacteroides species has also been confirmed. As the most predominant anaerobes in the human colon (~ 25–30%), Bacteroides may serve as reservoirs of ARGs when being released into the WWTPs. Additionally, Tn916-like ICEs from Streptococcus carrying mef(A/E) and tetM genes were detected. This ICE family, which harbors a variety of ARGs, is found in an extremely diverse range of bacteria and has the potential to mobilize non-self-transmissible elements [49,50,51]. Therefore, the ICEs are widespread in WWTPs, making them important vectors in the dissemination of various ARGs between human pathogens and environmental bacteria.
Similarly, the hosts and distribution patterns of ARGs located on the chromosome were also determined (Fig. 2c). The results showed that hosts of all identical ARG subtypes (e.g., OXA-347, MOX-6, VEB-9, ermF, lunB, macB, tetE, tetX, and tetW) across the WWTPs were attributed to seven genera and showed high stability (existing in both influent and effluent) in all the three WWTPs. For example, ermF (macrolides-resistant) and tetX (tetracycline-resistant) carried by Riemerella anatipestifer and Myroides odoratimimus, respectively, were detected in all influent and effluent samples. Overall, our results demonstrated that a high diversity of ARGs persisted through the treatment processes in WWTPs, and their association with plasmids and ICEs might have a large contribution to the spread of ARGs.
Rapid deciphering of potential antimicrobial-resistant pathogens in WWTPs
Rapid identification of antimicrobial-resistant pathogens (ARPs) is necessary for effective pathogen control in wastewater treatment. The real-time nature and long reads of Nanopore sequencing allow for rapid identification and simultaneous fate tracking of potential ARPs in WWTPs. As shown in Fig. 3a, a total of 16 species with relative abundance greater than 2% were detected, of which 10 species were potential pathogenic bacteria, accounting for 48.7% of all the identified ARBs. All the identified potential ARPs were primarily affiliated with the Gammaproteobacteria class (74.4%), including Aeromonas, Escherichia, Klebsiella, Acinetobacter, and Pseudomonas (Fig. 3a). Remarkably, four species in the ESKAPE panel of pathogens, Enterococcus faecium (2.6% of ARBs), Klebsiella pneumoniae (5.5% of ARBs), Acinetobacter baumannii (6.6% of ARBs), and Pseudomonas aeruginosa (3.3% of ARBs), harboring a high diversity of ARGs (at least four types), were identified across the WWTPs, and the most abundant ARGs carried by these potential pathogens were those of beta-lactam, aminoglycoside, and MLS, accounting for 78.8% of the total detected ARGs (Fig. 3b). The prevalence of these potential ARPs in WWTPs, including carbapenem-resistant Acinetobacter baumannii, multidrug-resistant Enterococcus faecium, and CTX-M-producing Klebsiella pneumoniae, has been reported with culture-based techniques, which are usually time-consuming; however, because these ESKAPE pathogens are the leading cause of nosocomial infections throughout the world, reducing their detection time is critical for risk management. Nanopore sequencing can reveal the ARP profile in less than 24 h after receiving a sample, which significantly reduces the time required from sample collection to results delivery, as demonstrated in real-time surveillance of microorganisms in the field [55, 56].
The fate of these potential ARPs was further investigated. As expected, the influent samples possessed the highest ARP diversity; however, five species, including three ESKAPE pathogens (Klebsiella pneumoniae, Acinetobacter baumannii, and Pseudomonas aeruginosa), were found at all treatment stages (Fig. 3c), whereas Clostridium difficile was only detected in influent and effluent. These results indicated that a variety of ARPs had high potential to pass the treatment process of WWTPs and enter the receiving environments and highlighted the importance of effective effluent disinfection, particularly considering their regrowth and reactivation, which have been confirmed by several studies [57, 58].
Persistence of bacteria carrying plasmids encoding multidrug-resistance
Phenotype in combination with genotype was used for further verification of the persistence of bacteria carrying plasmids encoding multidrug-resistance in the WWTPs. Bacteria simultaneously resistant to four antibiotics from different families (ampicillin, kanamycin, tetracycline, and chloramphenicol) were identified from both influent and effluent cultures of the three WWTPs. As shown in Fig. 4a, the identified cultures were primarily affiliated with eight species based on the analysis of Nanopore sequencing data, including Escherichia coli, Klebsiella pneumoniae, Citrobacter freundii, Aeromonas hydrophila, Elizabethkingia anophelis, Elizabethkingia miricola, Salmonella enterica, and Aeromonas media, most of which were human pathogenic bacteria. The most dominant member was Escherichia coli in both influent and effluent cultures, accounting for 67.5% and 72.9% of all the identified resistant bacteria, respectively. Some bacteria in the influent were removed effectively during the treatment process, such as species of Citrobacter and Elizabethkingia (Fig. 4a). Additionally, four species, Aeromonas hydrophila, Salmonella enterica, Klebsiella pneumoniae, and Aeromonas media, were detected in both influent and effluent cultures, which indicated they were persistent across the treatment process. Moreover, the relative abundance of Klebsiella pneumoniae increased in the effluent cultures (from 6.7% to 15%). To verify the accuracy of microbial community analysis based on Nanopore sequencing, Illumina metagenomic sequencing was performed for the mixed influent multidrug-resistant cultures. As shown in Additional file 1: Figure S3a, the taxonomy result generated by Centrifuge with Illumina sequencing data was largely in agreement at the dominant species level with that obtained by Nanopore sequencing.
Meanwhile, consistency was observed regarding the taxonomic classification result at family level when using metagenomic sequences and 16S rRNA (V4 and V6 regions) genes (Additional file 1: Figure S3b). In addition, near-full-length 16S rRNA sequences were reconstructed successfully for those species with relatively high abundance (Additional file 1: Table S5). Although the community complexity of the multidrug-resistant bacteria cultured from influent and effluent was greatly reduced due to the selective pressure from combined antibiotics, it is worth pointing out the possible biases in profiling the microbial community caused by the shallow Nanopore sequencing library, especially for species with low abundance.
The resistance gene profiles were highly correlated with the resistance phenotypes to the given antibiotics, with aminoglycoside as the most abundant ARG, followed by beta-lactam, tetracycline, and chloramphenicol. Five types of ARGs associated with sulfonamide, quinolone, trimethoprim, rifamycin, and bleomycin, which were not used in the selective media, were also detected (Fig. 4b), highlighting the coexistence of ARGs in these resistant bacteria in the WWTPs.
As shown in Fig. 4b, more aminoglycoside and beta-lactam resistance genes were carried by these resistant bacteria than those resistant to tetracycline and chloramphenicol. Notably, a high abundance of sulfonamide resistance genes was observed in both influent and effluent cultures, suggesting their prevalence and persistence in multidrug-resistant bacteria in the WWTPs. Regarding the ARG subtypes, the resistance patterns showed a substantial similarity between the influent and effluent cultures (Fig. 4c); most importantly, except for two chromosomally encoded resistance genes (acrD and AmpC), all other ARGs encoding resistance to the four screening antibiotics were carried by plasmids, demonstrating that the persistent antibiotic resistance was primarily conferred by plasmids. Among the detected ARG types, more aminoglycoside subtypes were detected (ten subtypes) than those of beta-lactam (five subtypes), chloramphenicol (five subtypes), and tetracycline (four subtypes) (Fig. 4c). The most abundant subtypes in each ARG type, i.e., aph(3′)-I, TEM-4, floR, and tetA, were responsible for resistance to the four antibiotics we used. However, several aminoglycoside subtypes, such as aph(3″)-Ib, aadA, and aph(6)-Id, were also detected, but each of these is predicted to cause resistance to streptomycin instead of kanamycin, suggesting that aminoglycoside resistance genes were more likely to co-occur with other subtypes of ARGs, which was consistent with the high abundance observed in Fig. 4b.
It is necessary to point out that since it is not currently possible to build consensus Nanopore sequences to increase the accuracy, raw unpolished reads were used to generate AMR profiles for the mixed cultures, as well as for the environmental samples analyzed above. This may cause ARGs annotation biases to some extent. However, we believe this approach should be a useful start for efficient antibiotic resistome profiling and could be further improved.
Finally, the arrangement of ARGs located on plasmids was investigated. Given the antibiotics used in this study, only reads encoding at least four types of ARGs simultaneously in both influent and effluent cultures were investigated (Fig. 5). Genetic analysis of these reads revealed genes involved in plasmid conjugation (i.e., relaxase and type IV secretion system), implying the prevalence of conjugative plasmids in the multidrug-resistant bacteria. Indeed, multiple ARGs are often co-localized on the same conjugative plasmids, which allows for the relatively easy spread of multidrug resistance. For example, reads_1 (42,639 bp) carried 11 ARGs, which could confer resistance to multiple antibiotic classes, including beta-lactam (CTX-M and TEM-1), aminoglycoside [AAC(6′)-Ib-cr and aadA16], chloramphenicol (floR), tetracycline (tetA), quinolone (qnrS), sulfonamide (sul1), rifampicin (arr-3), trimethoprim (dfrA), and macrolides (mphA). Additionally, a complete class 1 integron carrying AAC(6′)-Ib-cr, arr-3, dfrA, and aadA16 was identified. Such complex structures of a multidrug-resistance gene cluster were easily resolved using Nanopore ultra-long reads. Moreover, the co-occurrence of different aminoglycoside ARG subtypes was detected on the same plasmid (Fig. 5), such as the combination of three subtypes (APH(6)-Id, APH(3″)-Ib, and aph(3′)-I) on reads_4 and similar arrangements on other reads. In addition to the co-occurrence pattern between ARG subtypes, Nanopore long reads also identified the gene cluster encoding mercury resistance (merA, merC, merD, merE, merP, and merR) (shown on reads_4 in Fig. 5). Most strikingly, many types of insertion sequences (IS) and transposable (Tn) elements displayed a mosaic distribution on these conjugative plasmids, which may increase the probability of HGT.
In fact, these abundant repetitive elements also make it difficult to assemble Illumina reads into contigs long enough to elucidate the arrangement of complex ARG clusters. This difficulty could be largely overcome by Nanopore sequencing technology, as demonstrated in this study.
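The read selection step described above (keeping only reads that carry at least four ARG types) can be illustrated with a minimal sketch. It assumes the ARG annotation has already been reduced to (read ID, ARG type) pairs; the function name, the read IDs, and the example annotations are hypothetical and are not taken from the study's pipeline.

```python
from collections import defaultdict


def reads_with_min_arg_types(annotations, min_types=4):
    """Return {read_id: sorted ARG types} for reads carrying at least
    `min_types` distinct ARG types.

    annotations: iterable of (read_id, arg_type) pairs, one per ARG hit.
    """
    types_by_read = defaultdict(set)
    for read_id, arg_type in annotations:
        # A set collapses several subtypes of the same type into one entry.
        types_by_read[read_id].add(arg_type)
    return {
        read_id: sorted(types)
        for read_id, types in types_by_read.items()
        if len(types) >= min_types
    }


# Hypothetical annotations for two reads (illustration only).
hits = [
    ("read_a", "beta-lactam"),
    ("read_a", "aminoglycoside"),
    ("read_a", "aminoglycoside"),  # second subtype of the same type
    ("read_a", "chloramphenicol"),
    ("read_a", "tetracycline"),
    ("read_b", "aminoglycoside"),
    ("read_b", "sulfonamide"),
]
selected = reads_with_min_arg_types(hits)
print(selected)
```

Counting distinct types rather than hits means a read with many subtypes of a single ARG type is not mistaken for a multidrug-resistance read. Requiring the pattern in both influent and effluent cultures would be a further comparison across samples, which this sketch does not attempt.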
Resistome quantification based on Nanopore and Illumina sequencing reads
The abundance of major ARG types (in terms of ARG number per million base pairs) revealed by Nanopore sequencing was compared to that revealed by Illumina sequencing. Overall, the quantitative results for the major ARGs obtained from the two sequencing platforms were comparable, as indicated by the Pearson correlation analyses (Additional file 1: Figure S2), except for two activated sludge samples (i.e., STAS and SWHAS). The discrepancies observed in activated sludge samples probably originated mainly from the limited sequencing depth, particularly considering the highly complex community with relatively low ARG abundance in the activated sludge (Additional file 1: Table S2). Such disagreements in ARG quantification between sequencing strategies have also been reported in previous studies, where some ARG subtypes were detected only by Nanopore sequencing and others only by Illumina sequencing. Additionally, sequencing platform biases and the different ARG prediction algorithms used for the two platforms might also influence the ARG quantification outputs, as the Nanopore data analysis was based on tools designed for alignment of long, high-error-rate sequences, whereas the Illumina analysis was based on BLAST similarity search.
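As a rough illustration of the comparison described above, the sketch below normalizes ARG counts to ARG number per million base pairs and computes a Pearson correlation coefficient between two platforms. The function names and all numbers are hypothetical placeholders, not data from this study, and the sketch makes no claim about how the published analysis was implemented.

```python
import math


def args_per_mbp(arg_hits, total_bases):
    """ARG abundance as number of ARG hits per million sequenced base pairs."""
    if total_bases <= 0:
        raise ValueError("total_bases must be positive")
    return arg_hits / (total_bases / 1e6)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical (ARG hits, total bases) per ARG type for one sample.
nanopore_counts = [(120, 2_000_000_000), (80, 2_000_000_000), (30, 2_000_000_000)]
illumina_counts = [(600, 9_000_000_000), (380, 9_000_000_000), (160, 9_000_000_000)]

nanopore_abund = [args_per_mbp(h, b) for h, b in nanopore_counts]
illumina_abund = [args_per_mbp(h, b) for h, b in illumina_counts]
print(pearson_r(illumina_abund, nanopore_abund))
```

Normalizing by sequenced bases rather than read count keeps the two platforms on a common scale despite their very different read lengths.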
We report the first workflow combining both Nanopore and Illumina sequencing technologies to comprehensively profile the genetic context of ARGs as well as to track their hosts across the wastewater treatment process. The results showed that ARGs associated with MGEs (plasmids and ICEs) dominated the resistome in WWTPs and that their relative abundance increased in the effluent. Nanopore long reads greatly facilitated the characterization of multidrug-resistant conjugative plasmids. The significant role of these plasmids in facilitating the survival and persistence of multidrug-resistant bacteria in WWTPs was further confirmed by phenotype-genotype analysis. In summary, this work established a baseline framework for future studies of the mobile antibiotic resistome in the environment.
Karkman A, Do TT, Walsh F, Virta MPJ. Antibiotic-resistance genes in waste water. Trends Microbiol. 2018;26(3):220–8.
Frost LS, Leplae R, Summers AO, Toussaint A. Mobile genetic elements: the agents of open source evolution. Nat Rev Microbiol. 2005;3(9):722–32.
Johnson TA, Stedtfeld RD, Wang Q, Cole JR, Hashsham SA, Looft T, et al. Clusters of antibiotic resistance genes enriched together stay together in swine agriculture. MBio. 2016;7(2):e02214–15.
Popowska M, Krawczyk-Balska A. Broad-host-range IncP-1 plasmids and their resistance potential. Front Microbiol. 2013;4:44.
Burrus V, Pavlovic G, Decaris B, Guédon G. Conjugative transposons: the tip of the iceberg. Mol Microbiol. 2002;46(3):601–10.
Di Cesare A, Eckert EM, D'Urso S, Bertoni R, Gillan DC, Wattiez R, et al. Co-occurrence of integrase 1, antibiotic and heavy metal resistance genes in municipal wastewater treatment plants. Water Res. 2016;94:208–14.
Paiva MC, Reis MP, Costa PS, Dias MF, Bleicher L, Scholte LLS, et al. Identification of new bacteria harboring qnrS and aac(6′)-Ib/cr and mutations possibly involved in fluoroquinolone resistance in raw sewage and activated sludge samples from a full-scale WWTP. Water Res. 2017;110:27–37.
Hu J, Shi J, Chang H, Li D, Yang M, Kamagata Y. Phenotyping and genotyping of antibiotic-resistant Escherichia coli isolated from a natural river basin. Environ Sci Technol. 2008;42(9):3415–20.
Guo J, Li J, Chen H, Bond PL, Yuan Z. Metagenomic analysis reveals wastewater treatment plants as hotspots of antibiotic resistance genes and mobile genetic elements. Water Res. 2017;123:468–78.
Ma L, Xia Y, Li B, Yang Y, Li LG, Tiedje JM, et al. Metagenomic assembly reveals hosts of antibiotic resistance genes and the shared Resistome in pig, chicken, and human feces. Environ Sci Technol. 2016;50(1):420–7.
Hultman J, Tamminen M, Pärnänen K, Cairns J, Karkman A, Virta M. Host range of antibiotic resistance genes in wastewater treatment plant influent and effluent. FEMS Microbiol Ecol. 2018;94(4).
Zhao S, Tyson GH, Chen Y, Li C, Mukherjee S, Young S, et al. Whole-genome sequencing analysis accurately predicts antimicrobial resistance phenotypes in Campylobacter spp. Appl Environ Microbiol. 2015;82(2):459–66.
Xia Y, Li AD, Deng Y, Jiang XT, Li LG, Zhang T. MinION Nanopore sequencing enables correlation between Resistome phenotype and genotype of coliform Bacteria in municipal sewage. Front Microbiol. 2017;8:2105.
Luo G, Li B, Li LG, Zhang T, Angelidaki I. Antibiotic resistance genes and correlations with microbial community and metal resistance genes in full-scale biogas reactors as revealed by metagenomic analysis. Environ Sci Technol. 2017;51(7):4069–80.
Ju F, Li B, Ma L, Wang Y, Huang D, Zhang T. Antibiotic resistance genes and human bacterial pathogens: co-occurrence, removal, and enrichment in municipal sewage sludge digesters. Water Res. 2016;91:1–10.
Ashton PM, Nair S, Dallman T, Rubino S, Rabsch W, Mwaigwisya S, et al. MinION nanopore sequencing identifies the position and structure of a bacterial antibiotic resistance island. Nat Biotechnol. 2015;33(3):296–300.
Spencer SJ, Tamminen MV, Preheim SP, Guo MT, Briggs AW, Brito IL, et al. Massively parallel sequencing of single cells by epicPCR links functional genes with phylogenetic markers. ISME J. 2016;10(2):427–36.
Loman NJ, Quick J, Simpson JT. A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat Methods. 2015;12(8):733–5.
Jain M, Koren S, Miga KH, Quick J, Rand AC, Sasani TA, et al. Nanopore sequencing and assembly of a human genome with ultra-long reads. Nat Biotechnol. 2018;36(4):338–45.
Gong L, Wong CH, Cheng WC, Tjong H, Menghi F, Ngan CY, et al. Picky comprehensively detects high-resolution structural variants in nanopore long reads. Nat Methods. 2018;15(6):455–60.
De Coster W, D'Hert S, Schultz DT, Cruts M, Van Broeckhoven C. NanoPack: visualizing and processing long read sequencing data. Bioinformatics. 2018;34(15):2666–9.
Yin X, Jiang XT, Chai B, Li L, Yang Y, Cole JR, et al. ARGs-OAP v2.0 with an expanded SARG database and hidden Markov models for enhancement characterization and quantification of antibiotic resistance genes in environmental metagenomes. Bioinformatics. 2018;34(13):2263–70.
Quick J, Ashton P, Calus S, Chatt C, Gossain S, Hawker J, et al. Rapid draft sequencing and real-time nanopore sequencing in a hospital outbreak of Salmonella. Genome Biol. 2015;16:114.
Krawczyk PS, Lipinski L, Dziembowski A. PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic Acids Res. 2018;46(6):e35.
Kim D, Song L, Breitwieser FP, Salzberg SL. Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res. 2016;26(12):1721–9.
Ruvindy R, White RA 3rd, Neilan BA, Burns BP. Unravelling core microbial metabolisms in the hypersaline microbial mats of Shark Bay using high-throughput metagenomics. ISME J. 2016;10(1):183–96.
Bi D, Xu Z, Harrison EM, Tai C, Wei Y, He X, et al. ICEberg: a web-based resource for integrative and conjugative elements found in Bacteria. Nucleic Acids Res. 2012;40(Database issue):D621–6.
Ju F, Beck K, Yin X, Maccagnan A, McArdell CS, Singer HP, et al. Wastewater treatment plant resistomes are shaped by bacterial composition, genetic exchange, and upregulated expression in the effluent microbiomes. ISME J. 2018; Epub ahead of print.
Wang Y, Jiang X, Liu L, Li B, Zhang T. High-resolution temporal and spatial patterns of Virome in wastewater treatment systems. Environ Sci Technol. 2018;52(18):10337–46.
Xiong W, Wang Y, Sun Y, Ma L, Zeng Q, Jiang X, et al. Antibiotic-mediated changes in the fecal microbiome of broiler chickens define the incidence of antibiotic resistance genes. Microbiome. 2018;6(1):34.
Xie C, Goi CL, Huson DH, Little PF, Williams RB. RiboTagger: fast and unbiased 16S/18S profiling using whole community shotgun metagenomic or metatranscriptome surveys. BMC Bioinformatics. 2016;17:508.
Miller CS, Baker BJ, Thomas BC, Singer SW, Banfield JF. EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data. Genome Biol. 2011;12(5):R44.
Branton D, Deamer D. Nanopore sequencing: an introduction. London: World Scientific Pub Co Inc; 2019.
Schlüter A, Szczepanowski R, Pühler A, Top EM. Genomics of IncP-1 antibiotic resistance plasmids isolated from wastewater treatment plants provides evidence for a widely accessible drug resistance gene pool. FEMS Microbiol Rev. 2007;31(4):449–77.
Zhang T, Zhang XX, Ye L. Plasmid metagenome reveals high levels of antibiotic resistance genes and mobile genetic elements in activated sludge. PLoS One. 2011;6(10):e26041.
Sentchilo V, Mayer AP, Guy L, Miyazaki R, Green Tringe S, Barry K, et al. Community-wide plasmid gene mobilization and selection. ISME J. 2013;7(6):1173–86.
Li AD, Li LG, Zhang T. Exploring antibiotic resistance genes and metal resistance genes in plasmid metagenomes from wastewater treatment plants. Front Microbiol. 2015;6:1025.
Palmieri C, Magi G, Mingoia M, Bagnarelli P, Ripa S, Varaldo PE, et al. Characterization of a Streptococcus suis tet(O/W/32/O)-carrying element transferable to major streptococcal pathogens. Antimicrob Agents Chemother. 2012;56(9):4697–702.
Palmieri C, Magi G, Mingoia M, Bagnarelli P, Ripa S, Varaldo PE, et al. Tn5253 family integrative and conjugative elements carrying mef(I) and catQ determinants in Streptococcus pneumoniae and Streptococcus pyogenes. Antimicrob Agents Chemother. 2014;58(10):5886–93.
Wang Y, Wang GR, Shelby A, Shoemaker NB, Salyers AA. A newly discovered Bacteroides conjugative transposon, CTnGERM1, contains genes also found in gram-positive bacteria. Appl Environ Microbiol. 2003;69(8):4595–603.
Gillings MR, Gaze WH, Pruden A, Smalla K, Tiedje JM, Zhu YG. Using the class 1 integron-integrase gene as a proxy for anthropogenic pollution. ISME J. 2015;9(6):1269–79.
Jiang X, Ellabaan MMH, Charusanti P, Munck C, Blin K, Tong Y, et al. Dissemination of antibiotic resistance genes from antibiotic producers to pathogens. Nat Commun. 2017;8:15784.
Petrovich M, Chu B, Wright D, Griffin J, Elfeki M, Murphy BT, et al. Antibiotic resistance genes show enhanced mobilization through suspended growth and biofilm-based wastewater treatment processes. FEMS Microbiol Ecol. 2018;94(11).
Chu BTT, Petrovich ML, Chaudhary A, Wright D, Murphy B, Wells G, et al. Metagenomics reveals the impact of wastewater treatment plants on the dispersal of microorganisms and genes in aquatic sediments. Appl Environ Microbiol. 2018;84(5):e02168–17.
Rodriguez-Mozaz S, Chamorro S, Marti E, Huerta B, Gros M, Sànchez-Melsió A, et al. Occurrence of antibiotics and antibiotic resistance genes in hospital and urban wastewaters and their impact on the receiving river. Water Res. 2015;69:234–42.
Suhartono S, Savin M, Gbur EE. Genetic redundancy and persistence of plasmid-mediated trimethoprim/sulfamethoxazole resistant effluent and stream water Escherichia coli. Water Res. 2016;103:197–204.
Wexler HM. Bacteroides: the good, the bad, and the nitty-gritty. Clin Microbiol Rev. 2007;20(4):593–621.
Ferreira LQ, Avelar KE, Vieira JM, de Paula GR, Colombo AP, Domingues RM, et al. Association between the cfxA gene and transposon Tn4555 in Bacteroides distasonis strains and other Bacteroides species. Curr Microbiol. 2007;54(5):348–53.
Johnson CM, Grossman AD. Integrative and conjugative elements (ICEs): what they do and how they work. Annu Rev Genet. 2015;49:577–601.
Sansevere EA, Robinson DA. Staphylococci on ICE: overlooked agents of horizontal gene transfer. Mob Genet Elements. 2017;7(4):1–10.
Croucher NJ, Harris SR, Fraser C, Quail MA, Burton J, van der Linden M, et al. Rapid pneumococcal evolution in response to clinical interventions. Science. 2011;331(6016):430–4.
Higgins PG, Hrenovic J, Seifert H, Dekic S. Characterization of Acinetobacter baumannii from water and sludge line of secondary wastewater treatment plant. Water Res. 2018;140:261–7.
Łuczkiewicz A, Jankowska K, Fudala-Książek S, Olańczuk-Neyman K. Antimicrobial resistance of fecal indicators in municipal wastewater treatment plant. Water Res. 2010;44(17):5089–97.
Dolejska M, Frolkova P, Florek M, Jamborova I, Purgertova M, Kutilova I, et al. CTX-M-15-producing Escherichia coli clone B2-O25b-ST131 and Klebsiella spp. isolates in municipal wastewater treatment plant effluents. J Antimicrob Chemother. 2011;66(12):2784–90.
Quick J, Loman NJ, Duraffour S, Simpson JT, Severi E, Cowley L, et al. Real-time, portable genome sequencing for Ebola surveillance. Nature. 2016;530(7589):228–32.
Hoenen T, Groseth A, Rosenke K, Fischer RJ, Hoenen A, Judson SD, et al. Nanopore sequencing as a rapidly deployable Ebola outbreak tool. Emerg Infect Dis. 2016;22(2):331–4.
Becerra-Castro C, Macedo G, Silva AMT, Manaia CM, Nunes OC. Proteobacteria become predominant during regrowth after water disinfection. Sci Total Environ. 2016;573:313–23.
Huang JJ, Hu HY, Tang F, Li Y, Lu SQ, Lu Y. Inactivation and reactivation of antibiotic-resistant bacteria by chlorination in secondary effluents of a municipal wastewater treatment plant. Water Res. 2011;45(9):2775–81.
Carattoli A. Plasmids and the spread of resistance. Int J Med Microbiol. 2013;303(6–7):298–304.
Szabó M, Nagy T, Wilk T, Farkas T, Hegyi A, Olasz F, et al. Characterization of two multidrug-resistant IncA/C plasmids from the 1960s by using the MinION sequencer device. Antimicrob Agents Chemother. 2016;60(11):6780–6.
Sanderson ND, Street TL, Foster D, Swann J, Atkins BL, Brent AJ, et al. Real-time analysis of nanopore-based metagenomic sequencing from infected orthopaedic devices. BMC Genomics. 2018;19(1):714.
You Che, Lei Liu, and Yu Yang thank The University of Hong Kong for the postgraduate studentship. We appreciate the help of Vicky, and thank Lilian Y L CHAN for technical assistance with the High Performance Computing & Grid Computing system.
This study was financially supported by (T21-711/16-R).
Availability of data and materials
The datasets generated from Nanopore and Illumina sequencing were deposited in the National Center for Biotechnology Information under the accession numbers SAMN09603371-SAMN09603381 and PRJNA505617.
Ethics approval and consent to participate
The manuscript does not report data collected from humans or animals.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Summary statistics for reads generated by Nanopore (numbers in bold) and Illumina assemblies. Table S2. Summary of ARG-carrying contigs after Illumina assembly and long reads generated by Nanopore sequencing. Table S3. Distribution and relative abundance of ARGs detected only by Illumina sequencing; “√” indicates the ARG type detected in the corresponding samples. Table S4. Genetic location of major ARGs predicted from all Illumina assembled contigs. Table S5. Summary of the near-full-length 16S rRNA sequences reconstructed from mixed influent cultures using EMIRGE. Figure S1. Overview of the read length (a), read number (b), and average base call quality score (c) of the eleven Nanopore metagenomics datasets. Figure S2. Correlation analysis of major ARG abundance (ARG number per million base pairs) quantified based on Illumina sequencing and Nanopore reads; the x-axis and y-axis represent the ARG numbers calculated from the Illumina and Nanopore datasets, respectively. Figure S3. Comparison of phylogenetic taxonomic affiliation at species (a) and family level (b) between Illumina and Nanopore sequencing platforms for the mixed influent multidrug-resistant cultures. (DOCX 1263 kb)