Temporal landscape of human gut RNA and DNA virome in SARS-CoV-2 infection and severity
Microbiome volume 9, Article number: 91 (2021)
Coronavirus disease 2019 (COVID-19) caused by the enveloped RNA virus SARS-CoV-2 primarily affects the respiratory and gastrointestinal tracts. SARS-CoV-2 was isolated from fecal samples, and active viral replication was reported in human intestinal cells. The human gut also harbors an enormous amount of resident viruses (collectively known as the virome) that play a role in regulating host immunity and disease pathophysiology. Understanding gut virome perturbation that underlies SARS-CoV-2 infection and severity is an unmet need.
We enrolled 98 COVID-19 patients with varying disease severity (3 asymptomatic, 53 mild, 34 moderate, 5 severe, 3 critical) and 78 non-COVID-19 controls matched for gender and co-morbidities. All subjects had fecal specimens sampled at inclusion. Blood specimens were collected for COVID-19 patients at admission to test for inflammatory markers and white cell counts. Among COVID-19 cases, 37 (38%) patients had serial fecal samples collected 2 to 3 times per week from time of hospitalization until after discharge. Using shotgun metagenomics sequencing, we sequenced and profiled the fecal RNA and DNA virome. We investigated alterations and longitudinal dynamics of the gut virome in association with disease severity and blood parameters.
Patients with COVID-19 showed underrepresentation of Pepper mild mottle virus (RNA virus) and multiple bacteriophage lineages (DNA viruses) and enrichment of environment-derived eukaryotic DNA viruses in fecal samples, compared to non-COVID-19 subjects. Such gut virome alterations persisted up to 30 days after disease resolution. Fecal virome in SARS-CoV-2 infection harbored more stress-, inflammation-, and virulence-associated gene encoding capacities including those pertaining to bacteriophage integration, DNA repair, and metabolism and virulence associated with their bacterial host. Baseline fecal abundance of 10 virus species (1 RNA virus, pepper chlorotic spot virus, and 9 DNA virus species) inversely correlated with disease COVID-19 severity. These viruses inversely correlated with blood levels of pro-inflammatory proteins, white cells, and neutrophils. Among the 10 COVID-19 severity-associated DNA virus species, 4 showed inverse correlation with age; 5 showed persistent lower abundance both during disease course and after disease resolution relative to non-COVID-19 subjects.
Both enteric RNA and DNA virome in COVID-19 patients were different from non-COVID-19 subjects, which persisted after disease resolution of COVID-19. Gut virome may calibrate host immunity and regulate severity to SARS-CoV-2 infection. Our observation that gut viruses inversely correlated with both severity of COVID-19 and host age may partly explain that older subjects are prone to severe and worse COVID-19 outcomes. Altogether, our data highlight the importance of human gut virome in severity and potentially therapeutics of COVID-19.
A novel RNA virus, SARS-CoV-2, has caused a global pandemic of coronavirus disease 2019 (COVID-19) since its emergence in December 2019. Although most cases of COVID-19 are mild, disease severity varies substantially across patients, and severe cases can result in respiratory failure or death . A significant amount of patients with COVID-19 suffered gastrointestinal (GI) symptoms such as vomiting and diarrhea [2,3,4,5]. In addition, SARS-CoV-2 viral RNA and transcriptional activity were detected in fecal samples [6,7,8], suggesting that the GI tract is an extra-pulmonary site for virus replication and activity.
The GI tract harbors the most abundant viruses in the human body collectively known as the gut virome [9, 10]. It encompasses both RNA and DNA viruses that chronically infect their eukaryotic (humans, animals, plants) and prokaryotic hosts (bacteria, amoeba) during a steady state . The gut viral communities en masse modulate both the innate and adaptive immunity of the host and the ecology of the host gut microbiota . This “normal” immune state varies from person to person and changes over time. Substantial variation in human immune responses is largely driven by both environmental influences  and infection by pathogens [9, 12]. Sequential infection or colonization with acute and chronic viruses can alter host hemostatic immune gene expression and affect response to vaccine . The viral carrier state and its association with host immune-phenotype are under-studied due to a lack of appropriate tools to detect and quantify extremely diverse members of the virome. With the advancement of high-throughput sequencing technology, the highly individual-specific human viromes consisting of members causing acute disease or chronically colonized in asymptomatic individuals are increasingly being uncovered [13, 14]. We have previously demonstrated that the gut microbiome was significantly perturbed in COVID-19 and associated with disease severity and symptoms [8, 15, 16]. Given the co-residing and co-evolutionary nature of the viral/phage and bacterial communities in human gut, we herein hypothesize that the gut virome is altered in patients with COVID-19 and that subject’s baseline virome configurations may associate with immune defense and therefore severity to SARS-CoV-2 infection. We employed RNA and DNA metagenomics to profile the enormous amount of viral members in fecal samples and correlated that with disease immune-phenotype to unravel host-viral relationship and potentially mechanisms underlying severity and recovery in COVID-19.
Here, we enrolled 98 hospitalized patients with COVID-19 (mean age 33 years; 53% male) and 78 non-COVID-19 controls matched for gender and co-morbidities (mean age 48 years; 42% male, Table 1). All COVID-19 patients and non-COVID-19 controls had stool specimens sampled at inclusion. Blood specimens were additionally sampled for COVID-19 patients at admission to test for pro-inflammatory markers and white cell counts (Supplementary Table 1). Among the COVID-19 patients, 37 (38%) had serial fecal samples collected from hospitalization until after discharge (Supplementary Figure 1). We enriched both fecal RNA and DNA virions from a total of 277 fecal samples and performed non-targeted shotgun metagenomic sequencing on the RNA virome (mostly eukaryotic viruses) and DNA virome (mostly prokaryotic bacteriophages). We report gut virome profiles in association with SARS-CoV-2 infection, disease severity, and blood parameters.
Alterations in fecal RNA virome of COVID-19 patients
To understand whether SARS-CoV-2 infection influences the gut RNA virome, we compared fecal RNA virome of COVID-19 patients at baseline (day 0, the first time point of stool collection after hospitalization) with that of non-COVID-19 controls. Among all host factors (SARS-CoV-2 infection, age, gender, medications, co-morbidities), SARS-CoV-2 infection showed the largest effect size on impacting composition of the fecal RNA virome (permanova test p<0.01, R2=0.041, Fig. 1a) followed by chronic hepatitis B (HBV) infection and asthma. At the species level, SARS-CoV-2 was enriched in fecal samples of patients with COVID-19 compared with non-COVID-19 controls (MaAsLin2 analysis with adjustment for HBV infection and asthma, FDR p<0.05, Fig. 1b). In contrast, pepper mild mottle virus (PMMoV), a plant virus known to be prevalent and abundant in human feces , was underrepresented in patients with COVID-19 (FDR p<0.05, Fig. 1b, c). Seven (19%) of the 37 COVID-19 patients who had longitudinal follow-up showed prolonged fecal SARS-CoV-2 shedding, as indicated by continued SARS-CoV-2 RNA detection in feces after nasopharyngeal clearance of the virus (Supplementary Figure 2A). PMMoV virus was persistently underrepresented both during hospitalization and after disease resolution in COVID-19 patients (Fig. 1c, Supplementary Figure 2B). Diet over the course of hospitalization (Supplementary Table 2) did not show significant effect in the temporal variation of the gut RNA virome (permanova test, p=0.2) or the relative abundance of PMMoV virus (permanova test, p=0.4).
Overall, the fecal RNA virome composition of COVID-19 patients remained distinct from that of non-COVID-19 controls during the disease course and after disease resolution (Fig. 1d, e). Among 16 COVID-19 patients who had nasopharyngeal clearance of SARS-CoV-2 virus (disease resolution as determined by negative PCR result for SARS-CoV-2 on nasopharyngeal swabs), 11 (69%) had persistently altered fecal RNA virome after disease resolution and two (13%) lasted up to 30 days (Fig. 1d).
We also performed quantitative RT-PCR assays to examine SARS-CoV-2 viral levels in fecal and nasopharyngeal swab specimens in patients with COVID-19 at hospitalization. We found that SARS-CoV-2 levels were lower in feces of patients with moderate COVID-19 than those with asymptotic/mild COVID-19 (p<0.01, Supplementary Figure 3A), which was also confirmed by RNA virome shotgun metagenomic sequencing assay (p<0.05, Supplementary Figure 3B), though the detection sensitivity of fecal SARS-CoV-2 RNA by shotgun sequencing assay was not on par with quantitative RT-PCR assay. SARS-CoV-2 viral load in fecal specimens was approximately 2-log lower than that in nasopharyngeal specimens (p<0.0001, Supplementary Figure 3C). Importantly, SARS-CoV-2 levels in nasopharyngeal samples significantly correlated with SARS-CoV-2 levels in fecal samples (Pearson correlation rho=0.3, p=0.0038, Supplementary Figure 3D).
Alterations in fecal DNA virome of COVID-19 patients
We then investigated the effect of SARS-CoV-2 infection on fecal DNA virome composition at baseline and during disease course. At the community level, viromes of COVID-19 patients at baseline differed significantly from that of non-COVID-19 controls (permanova p<0.01, Fig. 2a) and were more heterogeneous than that of non-COVID-19 controls (p<0.0001, Fig. 2b). Among all host factors (SARS-CoV-2 infection, age, gender, medications, co-morbidities), SARS-CoV-2 infection again showed the largest effect size on impacting the composition of the fecal DNA virome (R2=0.018, Fig. 2c) followed by hyperlipidemia and the antiviral medication Lopinavir-ritonavir. Administration of Lopinavir-ritonavir was inversely associated with the presence of Listeria phage (correlation coefficient −0.21, p=0.03), a phage infecting the pathogenic bacteria, Listeria.
A total of 45 DNA virus species were found to be significantly different in the fecal DNA virome between COVID-19 patients and non-COVID-19 controls (19 virus species enriched in COVID-19 patients versus 26 virus species enriched in non-COVID-19 controls, identified via DESeq, while controlling for the factors hyperlipidemia and Lopinavir-ritonavir, shown in Fig. 2d). A majority (69%, 18 out of 26 virus species) of the DNA viruses enriched in feces of non-COVID-19 controls were prokaryotic viruses, particularly bacteriophages (62%, 16 out of 26). In contrast, more eukaryotic viruses, particularly environment-derived eukaryotic viruses with unknown host, were enriched in feces of COVID-19 patients.
The differentially enriched gut DNA virus species in COVID-19 patients showed substantial temporal variations during the disease course (Fig. 3a). Diet during the time of hospitalization did not show significant effect in the temporal variation of the fecal DNA virome (permanova test, p=0.3). Overall, fecal DNA virome composition of COVID-19 patients differed markedly from that of non-COVID-19 controls during the disease course and after clearance of SARS-CoV-2 (Fig. 3b, c). Among COVID-19 patients who had follow-up after disease resolution, six (32%) showed markedly more dissimilar fecal DNA virome to non-COVID-19 controls at the last follow-up (three patients lasted up to 20–30 days), compared to their dissimilarity to non-COVID-19 controls at baseline (Fig. 3b).
Alterations in the functionality of the enteric virome in COVID-19 patients
We next investigated functionality alterations of the gut virome using HUMAnN2 prediction. A larger number of gene families were enriched in COVID-19 viromes at baseline than non-COVID-19 controls (28 versus 9 gene families, FDR p<0.05, Fig. 4). We found a significant enhancement in the functional capacities of gene mobilization and viral/phage integration into the host in COVID-19 viromes (Fig. 4). Features of viral integration (expansion of temperate virions/phages) have been observed in the gut under inflammatory conditions in both humans and mice [18, 19]. In addition, functions involved in host stress/inflammation/virulence response (DNA repair, Arginine repressor, Hemolysin channel protein, DNA polymerase IV), bacterial metabolism, and membrane transport were also enriched in the fecal virome of COVID-19 patients (Fig. 4). Diet during time of hospitalization did not show significant effect on virome functionality variation (p=0.45). These data highlight that SARS-CoV-2 infection may associate with a functionality shift of the human gut virome to inflammation- and stress-related responses in relation to their hosts (both the commensal bacteria and humans). The viral functions enriched in COVID-19 (particularly those associated with host metabolism) were significantly associated with the abundances of viruses enriched in COVID-19, including Streptococcus phage, Escherichia phage, Homavirus, Lactococcus phage, Ralstonia phage, Solumvirus, and Microcystis phage (Supplementary Figure 4).
Fecal virome alterations correlated with disease severity of COVID-19
Based on COVID-19 disease symptoms and severity classification criteria , we stratified our patients into non-severe (N=56; asymptomatic/mild cases) and moderate/severe groups (N=42; moderate/severe/critical cases) (Fig. 5a). Compared to non-severe cases, moderate/severe cases showed a significantly higher blood levels of LDH, neutrophil count, C-reactive protein (CRP), alanine aminotransferase (ALT), and lower blood levels of albumin at admission (all p<0.05, Fig. 5b–f, Supplementary Figure 5). Our data are in line with recent reports highlighting that more severe cases had more pronounced systemic inflammatory responses [2, 21,22,23,24]. We then explored association between baseline fecal RNA and DNA virome profiles with COVID-19 severity and blood measurements at hospitalization. Abundance of the plant-derived RNA virus, pepper chlorotic spot virus (PCSV) was higher in patients with non-severe than those with moderate/severe disease (p=0.013, Fig. 5g). In addition, a high abundance of PCSV in feces was associated with low blood concentrations of the inflammation markers, LDH and CRP (correlation coefficient Rho=−0.269 and −0.276, respectively, Fig. 5h, i). Similarly, abundance of 9 DNA virus species (Myxococcus phage, Rheinheimera phage, Microcystis virus, Bacteroides phage, Murmansk poxvirus, Saudi moumouvirus, Sphaerotilus phage, Tomelloso virus, and Ruegeria phage) in feces negatively correlated with COVID-19 severity (all FDR p<0.05, Fig. 5j). In particular, 8 out of the 9 DNA virus species showed strong negative correlation with blood levels of the inflammation indicators LDH, neutrophil count, white cell count, or CRP (Fig. 5k). Interestingly, among them, four viral species, Myxococcus phage, Bacteroides phage, Murmansk poxvirus, and Sphaerotilus phage, which inversely associated with inflammation indicators, also inversely correlated with host age (Fig. 5k). This result coincides with the observation that elderly individuals were at higher risk for unfavorable severe COVID-19 outcomes [2, 25]. These data suggest that such RNA and DNA viruses may counteract the effect of SARS-CoV-2 infection predisposing infected subjects to a less severe COVID-19 course. Five out of the 9 severity-associated DNA virus species showed persistent lower abundances in the feces of COVID-19 patients during disease course and after disease resolution compared to non-COVID-19 controls (all p<0.05, Fig. 6), indicating an unfavorable effect of SARS-CoV-2 infection on these gut viruses. The cause or consequence of such associations needs to be further explored. In addition, a large number of DNA virus species in feces (n=132) showed significant correlations with blood parameters in COVID-19 patients, most of which were negative correlations with blood LDH concentrations, neutrophil, and white cell counts (Supplementary Figure 6). These data underscore the potential significance of gut DNA virome in calibrating host immunity and counteracting infection of SARS-CoV-2, warranting further investigation.
In this study, we provide the first comprehensive characterization of both the gut RNA and DNA virome in patients with COVID-19 and explore the association between host gut virome features with COVID-19 severity. Diet-originated plant RNA viruses are known to be prevalent in a healthy human gut . In the present study, two pepper-derived RNA virus species were found to be underrepresented (PMMoV) or correlated inversely with disease severity in COVID-19 patients (PCSV). In addition, COVID-19 patients had higher abundances of eukaryotic- and environment-derived viruses whereas non-COVID-19 subjects harbored more bacteriophages in feces. Such virome compositional features might be a consequence of SARS-CoV-2 infection along with its impact on the host immunity and the holistic ecology of the host gut microbiota [8, 15, 16]. The gut virome changes in COVID-19 were modest, given that COVID-19 is an acute respiratory disease primarily inflicting the respiratory tract with miniscule enteric involvement [2,3,4,5] and that human gut virome is substantially diverse across healthy individuals . We also observed persistently discrepant viromes in COVID-19 patients with that of healthy controls over time, both during disease course and after disease resolution. In a subset of COVID-19 patients, even more dissimilar virome configurations to that of healthy controls at the last follow-up were observed. Concordantly, a study has shown that a proportion (~35%) of COVID-19 patients reported not returning to their usual state of health after hospital discharge, manifesting fatigue, and dyspnea . Such data indicates that SARS-CoV-2 infection may have long-term detrimental effect on certain COVID-19 patients. Together, the persistently different gut RNA and DNA viromes in COVID-19 versus healthy controls, during disease course and after disease resolution, imply a potential long-lasting detrimental effect of SARS-CoV-2 infection on the host.
The immediate response to any infectious-disease outbreak is to approach it from the pathogen perspective because disease severity is assumed to be a direct function of pathogen burden . However, the complexities of SARS-CoV-2 infection serve as an important reminder that this perspective is not sufficient for understanding the survival of infectious diseases [30, 31]. Disease and immune tolerance as opposed to hyper-inflammatory (cytokine storm) state is proposed to be critical for limiting damage to the host and for improving patient survival in COVID-19 [31,32,33]. In line with findings from recent studies probing blood immune cell and biochemical measurements in COVID-19 patients [21,22,23], we found elevations of innate immune cells (neutrophils and white cells) and pro-inflammatory indicators (C-reactive protein and LDH) in the blood of more severe COVID-19 cases. Of note, fecal SARS-CoV-2 levels were significantly higher in asymptomatic/mild COVID-19 patients than in patients with a moderate disease course. This might be due to an immune-tolerant phenotype in mild COVID-19 patients which accommodates non-symptomatic existence of viruses in the host, rather than fulminant host immune defense against SARS-CoV-2 which results in viral attenuation but host tissue damage in severer COVID-19 patients. We did not find significant linear correlation between fecal SARS-CoV-2 levels and COVID-19 severity, yet a large number of gut prokaryotic and eukaryotic viruses correlated negatively with blood concentrations of immune cells and pro-inflammatory proteins and COVID-19 severity, suggesting that gut virome may calibrate host physiology and immune response against SARS-CoV-2 infection.
The bacteriophages, Escherichia virus (phage) and Enterobacter phage, were also enriched in COVID-19 patients. Expansion of these phages has been causally implicated in gut inflammation and host interferon response in mice and humans [14, 19]. These data highlight that the enteric virome alteration in COVID-19 may further contribute to immunological and physiological changes in the host during the disease course. In our study, we observed administration of Lopinavir-ritonavir was associated with decreased abundance of Listeria phage, a phage infecting the pathogenic Listeria (bacteria). It suggests that use of antivirals may tune host bacteriophage-bacteria ecology in the gut, beyond its role in enhancing host immune defense against infectious viruses.
Interestingly, among the identified 9 viral species inversely correlated with disease severity of COVID-19 and inflammation markers in the blood, 4 (Myxococcus phage, Bacteroides phage, Murmansk poxvirus, and Sphaerotilus phage) also inversely correlated with human age. This finding may partly explain the observation that elderly individuals are prone to unfavorable severe COVID-19 outcomes [2, 25] and shed light on the importance of gut viruses in human pathophysiology. Those phages associated with a mild COVID-19 course and younger human age might be leveraged for therapeutic exploration.
The human gut RNA virome, comprising multiple plant-related viruses, are both compositionally and functionally under-explored. The question of whether plant viruses can infect humans remains unresolved, though anecdotal evidence suggests so . PMMoV is the most abundant and prevalent RNA virus infecting pepper in human feces, which is proposed to be a viral indicator for human fecal pollution in aquatic environments and water treatment systems . Current literature favors that it unlikely to infect human cells. Its relationships with human immune/physiological responses are also unknown. Analogous to the changes in the bacteriophage fraction of the gut virome in COVID-19, alterations in the plant-related RNA virome fraction in COVID-19 might also be a consequence of SARS-CoV-2 infection. SARS-CoV-2 may modulate host immunity to create an unfavorable environment for certain RNA viruses, such as PMMoV. In return, the demise or lysis of such viruses in human gut could result in a release of nucleic acids, proteins, and lipids that serve as pathogen-associated molecular patterns (PAMP) that trigger inflammatory responses to induce cellular infiltration, cytokine production, and even tissue damage. Though study has shown that some plant viruses are much resistant and can remain long time under the harsh GI conditions , data are lacking about the transit time of various plant-related viruses through the human GI tract, particularly for those viruses correlated with COVID-19.
A population-based study from France reported that PMMoV was detected with a low prevalence in hospitalized adults (7%) and children (<1%) . In contrast, PMMoV is much more prevalent in our cohort (~67%) and another US cohort . Human gut virome is known to vary substantially across population, geography and lifestyle , and highly individual-specific . Therefore, the observed differences in gut virome configuration between studies might be accounted for by the host and environmental factors in conjunction with population characteristics.
Over the long co-evolutionary history of viruses with their hosts, the parasitic viruses are able to snatch genetic elements/remnants form their hosts during infection and/or co-existence [37, 38]. This is particularly true for bacteriophages and their bacteria hosts. Phages are genetically diverse and their genome architectures are characteristically mosaic, driven by horizontal gene transfer (HGT) with other phages and host genomes [39, 40]. Therefore, the genomes of phages are highly complex and diverse, and they are composed of genes with distinct and varied evolutionary histories [41, 42]. For instance, numerous cultivated and uncultivated viruses encode ribosomal proteins . In addition, bacteria-encoded tRNA synthase was also found in phages . Concordantly, certain ribosomal protein- and tRNA synthase-coding genes were enriched in COVID-19 viromes seen in our study. Overall, the enrichment of bacterial genes in COVID-19 virome may partly be ascribed to the co-evolution nature of virus-host relationship or the bacterial dysfunction in COVID-19.
There are some limitations of this study. First, it is exploratory in nature without clear cause or consequence effect established. Confirmation of gut virome alterations and their impact on disease severity or trajectory will require functional validation in other populations and animal studies. Second, stool collected after hospitalization for virome analysis does not represent the bona fide baseline microbiome before or at COVID-19 onset. Further studies should prospectively include healthy subjects, and if infected with SARS-CoV-2, followed-up at disease onset, during disease course, and long term after discovery, to delineate the role of virome changes in SARS-CoV-2 infection, severity, and post-infection recovery. Third, we documented the dietary record of patients over the course of hospitalization, yet did not capture the diet record before disease onset nor that of non-COVD-19 controls, which precludes us to investigate the virtual effect of diet on the gut virome differences between COVID-19 and non-COVID-19 cases. However, the diets provided during time of patient hospitalization were diverse as opposed to monotonous and were consistent with habitual daily diet consumed by Hong Kong Chinese. In addition, diets over the course of hospitalization did not show significant effects in the temporal variation of the gut DNA or RNA virome of hospitalized COVID-19 patients. Moreover, most of the viruses differed between COVID-19 patients and non-COVID-19 subjects were persistently different over the course of hospitalization, despite various diets were provided during time of hospitalization. Altogether, it is likely that the virome alterations in COVID-19 patients are at least partially reflective of the disease course rather than an effect of diet.
In summary, both enteric RNA and DNA virome were different in COVID-19 versus non-COVID-19 subjects, which persisted even after disease resolution. Gut virome may calibrate host immunity and regulate severity to SARS-CoV-2 infection. Our observation that gut viruses inversely correlated with both severity of COVID-19 and host age may partly explain that older subjects are prone to severe and unfavorable COVID-19 outcomes. Our data altogether highlight the significance of human gut virome in COVID-19 disease course and potentially therapeutics.
Materials and methods
Study subject and design
This prospective study enrolled 98 hospitalized patients with COVID-19 and 78 gender- and co-morbidity-matched non-COVID-19 controls (Table 1 and Supplementary Table 1). All study subjects had a single stool specimen collected at inclusion. All COVID-19 patients had blood samples collected at admission and measured for white cells and biochemical markers in plasma (Supplementary Table 1). Among the included COVID-19 patients, 37 COVID-17 patients had longitudinal follow-up from hospitalization till after discharge and had serial stool specimens collected (2–3 times per week, Supplementary Figure. 1). SARS-CoV-2 infection was confirmed by two consecutive RT-PCR test targeting different regions of the RdRp gene performed by the local hospital and Public Health Laboratory Service. All COVID-19 patients were admitted to the Prince of Wales Hospital or the United Christian Hospital, Hong Kong, from February through May, 2020. COVID-19 severity was categorized as (i) asymptomatic, if there was no clinical symptoms; (ii) mild, if there was no radiographic evidence of pneumonia and the clinical symptoms were light; (iii) moderate, if fever, respiratory tract and other symptoms present and imaging suggests pneumonia; (v) severe, if respiratory rate ≥30/min, or oxygen saturation ≤93% when breathing ambient air; or PaO2/FiO2 ≤ 300 mmHg (1mmHg = 0.133 kPa); or (v) critical, if there was respiratory failure requiring mechanical ventilation, shock, or organ failure requiring intensive care . All hospitalized patients were provided 3 standardized meals daily, varying from Monday to Sunday, by the department of hospital catering service (Supplementary Table 2). The dietary component and pattern were consistent with the habitual diets commonly consumed by Hong Kong Chinese.
Non-COVID-19 controls were average healthy Hong Kong individuals recruited at Prince of Wales Hospital before the COVID-19 outbreak and tested negative for SARS-CoV-2 during the COVID-19 pandemic. The inclusion criteria for non-COVID-19 subjects are (1) aged ≥18 years old, (2) competent to provide informed consent (no mental illness or dementia), (3) have no underlying infectious or acute disease, and (4) have lived in the same geographic area for the preceding 6 months. The exclusion criteria for non-COVID-19 subjects are (1) the use of laxatives or anti-diarrheal drugs in the last 3 months; (2) recent dietary changes (e.g., becoming vegetarian/vegan); (3) known complex infections or sepsis (excluding uncomplicated infections such as influenza); (4) known history of severe organ failure (including decompensated cirrhosis, malignant disease, kidney failure, epilepsy, active serious infection, acquired immunodeficiency syndrome); (5) bowel surgery in the last 6 months (excluding colonoscopy/procedure related to perianal disease); (6) presence of an ileostomy/stoma; (7) current pregnancy; and (8) colonoscopy in the last month prior to sampling. This study was approved by the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee (Reference number: 2020.076). All subjects provided informed consent to participate in this study and agreed for publication of the research results. Data including demographic, epidemiological, clinical, laboratory results, treatment, and medication were extracted from the electronic medical records in Hong Kong Hospital Authority clinical management system. This study was conducted in accordance with the Declaration of Helsinki.
Fecal viral DNA and RNA extraction, isolation, and shotgun metagenomics
The total viral nucleic acid was extracted from fecal sample, using TaKaRa MiniBEST Viral RNA/DNA Extraction Kit (Takara, Japan) following manufacturer’s instructions. Extracted total viral nucleic acid was then purified by the DNA and RNA Clean & Concentrator Kits (Zymo Research, CA, USA) to obtain viral DNA and RNA, respectively. After the quality control procedures by Qubit 2.0, agarose gel electrophoresis, and Agilent 2100, the qualified DNA and RNA were subjected to library preparation using Nextera DNA Flex Library Preparation kit (Illumina, USA) and KAPA RNA HyperPrep Kit (Roche, Swiss). The qualified libraries were then sequenced (150bp paired end) on Illumina Novaseq platform.
RNA virome classification and quantification
Raw RNA virome metagenomic sequence reads were filtered and quality-trimmed using Trimmomatic v0.36  as follows: (1) trimming low-quality base (quality score < 20), (2) removing reads shorter than 50bp, and (3) removing sequencing adapters. Contaminating human reads were filtering using Kneaddata (Reference database: GRCh38 p12) with default parameters. Cleaned RNA viral reads were rarefied to even depth (20 million reads per sample) and profiled via Kraken2  against the NCBI viral Refseq database as of April 20, 2020. Kraken2 has demonstrated appropriate performance in viral metagenomic classification for time-constrained viral diagnostics and profiling purposes, as well as for surveillance and outbreak source tracing . The abundance of constituent RNA virus species was calculated as count per million sequencing reads (CPM). A CPM filter of > 1 was used to select species for downstream differential analysis. Differential viral taxa between COVID-19 patients and non-COVID-19 controls were identified using Multivariate Association with Linear Models (MaAsLin2), while adjusting for confounding factors.
DNA virome classification and quantification
Raw sequence reads were filtered utilizing Trimmomatic using the following parameters; SLIDINGWINDOW: 4:20, MINLEN: 60 HEADCROP 15; CROP 225. Contaminating human reads were filtering using Kneaddata (Reference database: GRCh38 p12) with default parameters. Megahit, with default parameters, was chosen to assemble the reads into contigs per sample. Assemblies were subsequently pooled and retained if longer than 1kb. Bacterial contigs were removed by using an extensive set of inclusion criteria to select viral sequences only. Briefly, contigs were required to fulfill one of the following criteria: (1) categories 1–6 from VirSorter when running with default parameters and Refseqdb (–db 1) positive, (2) circular, (3) greater than 3kb with no BLASTn alignments to the NT database (e value threshold: 1e−10), (4) a minimum of 2 pVogs with at least 3 per 1kb, (5) BLASTn alignments to viral RefSeq database (v.89) (e value threshold: 1e−10), and (6) less than 3 ribosomal proteins as predicted using the COG database. HMMscan was used to search the viral contig ORFs against the pVOGs hmm profile database with an e value filter of 1e−5, retaining the top hit in each case. Afterwards, a fasta file combining viral contigs was compiled. This viral database includes the viral contigs recovered by the screening criteria from the bulk metagenomic assemblies. Then, the paired reads were mapped to the viral contig database with BWA, using default parameters. The viral operational taxonomic unit (OTU) table of viral abundance was pulled from BWA sam output files by customized script and normalized by the number of metagenomic reads (metagenomic reads were rarefied to even depth, 20 million reads per sample). The contigs were then subjected to open reading frame (ORF) prediction, using MetaProdigal (v2.6.3) with the metagenomics procedure (-p meta). To annotate the predicted ORFs, the amino acid sequences of the ORFs were queried by Diamond against the viral RefSeq protein (v84) with an e value <10˗5 and a bitscore >50. The viral Refseq proteins with the top closest homologies (e value <10˗5 and bitscore >50) were considered for each ORF. Contigs were taxonomically binned to a taxon according to the predominant assignment of its constituent ORFs. Contigs shared the same taxonomic identity were therefore collapsed. The viral contig abundances in each sample were finally summed as reads per kilobase per million mapped reads (RPKM, shown in Supplementary Table 3). The virome community differences were visualized via PCoA analysis based on the Euclidean distance between each individual’s virome. Differential DNA viral taxa between COVID-19 patients and non-COVID-19 controls were identified using DESeq, while adjusting for confounding factors.
Virome functionality analysis
Viral reads were retrieved via BBMap against the collection of assembled viral contigs, followed by functionality enquiry and annotation via HUMANN2 v0.9.4. Predicted functions were collapsed by gene family identity, with abundance values expressed in RPK (reads per kilobase). To establish the presence or absence a function within a sample, a stringent RPK threshold value > 10 was used to be defined as present. Differential functions between COVID-19 and non-COVID-19 subjects were identified by DESeq, while adjusting for co-morbidities, antiviral, and age.
Detection of fecal SARS-CoV-2 viral load
SARS-CoV-2 viral loads in stool and nasopharyngeal swab specimens were measured using real-time reverse-transcriptase-polymerase chain-reaction (RT-PCR) assay. Viral RNA from specimens was extracted using QIAamp Viral RNA Mini Kit (Qiagen, Hilden, Germany). SARS-CoV-2 RNA was quantified using real-time reverse-transcriptase-polymerase-chain-reaction (RT-PCR). The primer-probe set N1 (2019-nCoV_N1-F: 5’-GAC CCC AAA ATC AGC GAA AT-3’, 2019-nCoV_N1-R: 5’-TCT GGT TAC TGC CAG TTG AAT CTG-3’ and 2019-nCoV_N1-P: 5’-FAM-ACC CCG CAT TAC GTT TGG TGG ACC-BHQ1-3’) designed by the US Centers for Disease Control and Prevention (CDC) was purchased from the Integrated DNA Technologies, USA. The one-step real-time RT-PCR reaction contained 10 μL of the extracted preparation, 4 μL TaqMan™ Fast Virus 1-Step Master Mix (Applied Biosystems, USA) in a final reaction volume of 20 μL. The primer and probe concentration were 0.5 μM and 0.125 μM, respectively. The cycling conditions, 25°C for 2 min, 50°C for 15 min, 95 °C for 2 min, followed by 45 cycles of 95 °C for 15 s, and 55 °C for 30 s, were performed with the StepOnePlus Real-Time PCR System (Applied Biosystems, USA). The cycle threshold (Ct) values of real-time RT-PCR were converted into viral RNA copies based on a standard curve prepared from 10-fold serial dilutions of know copies of plasmid containing the full N gene (2019-nCoV_N_Positive Control, Integrated DNA Technologies, USA). Samples were considered as negative if the Ct values exceeded 39.9 cycles.
Correlation between viruses, COVID-19 severity, and blood measurements
Spearman correlation analyses were conducted to correlate both RNA and DNA virome abundance profiles with blood parameter measurements. Spearman correlation analyses with multiple comparison adjustment were conducted to correlate the DNA virome abundance profile with COVID-19 severity across COVID-19 patients.
Availability of data and materials
Virome metagenomic sequencing data has been deposited to the NCBI Sequence Read Archive under BioProject accession number PRJNA657711.
Onder G, Rezza G, Brusaferro S. Case-fatality rate and characteristics of patients dying in relation to COVID-19 in Italy. Jama. 2020. https://doi.org/10.1001/jama.2020.4683.
Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497–506. https://doi.org/10.1016/S0140-6736(20)30183-5.
Chen N, Zhou M, Dong X, Qu J, Gong F, Han Y, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395(10223):507–13. https://doi.org/10.1016/S0140-6736(20)30211-7.
Liang W, Feng Z, Rao S, Xiao C, Xue X, Lin Z, et al. Diarrhoea may be underestimated: a missing link in 2019 novel coronavirus. Gut. 2020;69(6):1141–3. https://doi.org/10.1136/gutjnl-2020-320832.
Cheung, KS, et al. "Gastrointestinal manifestations of SARS-CoV-2 infection and virus load in fecal samples from a Hong Kong cohort: systematic review and meta-analysis." Gastroenterology. 2020;159(1):81–95.
Wölfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Müller MA, et al. Virological assessment of hospitalized patients with COVID-2019. Nature. 2020;581(7809):465–9. https://doi.org/10.1038/s41586-020-2196-x.
Xu Y, Li X, Zhu B, Liang H, Fang C, Gong Y, Gong S. Characteristics of pediatric SARS-CoV-2 infection and potential evidence for persistent fecal viral shedding. Nat Med. 2020;26(4):502–5.
Zuo T, Liu Q, Zhang F, Lui GC-Y, Tso EYK, Yeoh YK, et al. Depicting SARS-CoV-2 faecal viral activity in association with gut microbiota composition in patients with COVID-19. Gut. 2020:gutjnl-2020-322294. https://doi.org/10.1136/gutjnl-2020-322294.
Virgin HW. The virome in mammalian physiology and disease. Cell. 2014;157(1):142–50. https://doi.org/10.1016/j.cell.2014.02.032.
Neil JA, Cadwell K. The intestinal virome and immunity. J Immunol. 2018;201(6):1615–24. https://doi.org/10.4049/jimmunol.1800631.
Brodin P, Jojic V, Gao T, Bhattacharya S, Angel CJL, Furman D, et al. Variation in the human immune system is largely driven by non-heritable influences. Cell. 2015;160(1-2):37–47. https://doi.org/10.1016/j.cell.2014.12.020.
Reese TA, Bi K, Kambal A, Filali-Mouhim A, Beura LK, Bürger MC, et al. Sequential infection with common pathogens promotes human-like immune gene expression and altered vaccine response. Cell Host Microbe. 2016;19(5):713–9. https://doi.org/10.1016/j.chom.2016.04.003.
Shkoporov AN, Clooney AG, Sutton TDS, Ryan FJ, Daly KM, Nolan JA, et al. The human gut virome is highly diverse, stable, and individual specific. Cell Host Microbe. 2019;26(4):527–541. e525.
Zuo T, Lu X-J, Zhang Y, Cheung CP, Lam S, Zhang F, et al. Gut mucosal virome alterations in ulcerative colitis. Gut. 2019;68(7):1169–79. https://doi.org/10.1136/gutjnl-2018-318131.
Zuo T, Zhang F, Lui GC, Yeoh YK, Li AY, Zhan H, Ng SC. Alterations in gut microbiota of patients with COVID-19 during time of hospitalization. Gastroenterology, 2020;159(3):944–55.
Zuo T, Zhan H, Zhang F, Liu Q, Tso EY, Lui GC, Ng SC. Alterations in fecal fungal microbiome of patients with COVID-19 during time of hospitalization until discharge. Gastroenterology. 2020;159(4):1302–10.
Kitajima M, Sassi HP, Torrey JR. Pepper mild mottle virus as a water quality indicator. NPJ Clean Water. 2018;1(1):1–9.
Clooney AG, Sutton TDS, Shkoporov AN, Holohan RK, Daly KM, O’Regan O, et al. Whole-virome analysis sheds light on viral dark matter in inflammatory bowel disease. Cell Host Microbe. 2019;26(6):764–778. e765.
Gogokhia L, Buhrke K, Bell R, Hoffman B, Brown DG, Hanke-Gogokhia C, et al. Expansion of bacteriophages is linked to aggravated intestinal inflammation and colitis. Cell Host Microbe. 2019;25(2):285–299. e288.
Wu J, Liu J, Zhao X, Liu C, Wang W, Wang D, Xu W, Zhang C, Yu J, Jiang B, Cao H, Li L. Clinical Characteristics of Imported Cases of Coronavirus Disease 2019 (COVID-19) in Jiangsu Province: A Multicenter Descriptive Study. Clin Infect Dis. 2020;71(15):706-712. https://doi.org/10.1093/cid/ciaa199.
Giamarellos-Bourboulis EJ, Netea MG, Rovina N, Akinosoglou K, Antoniadou A, Antonakos N, et al. Complex immune dysregulation in COVID-19 patients with severe respiratory failure. Cell Host Microbe. 2020;27(6):992–1000.e3. https://doi.org/10.1016/j.chom.2020.04.009.
Wang L. C-reactive protein levels in the early stage of COVID-19. Med Mal Infect. 2020;50(4):332–4.
Sun S, Cai X, Wang H, He G, Lin Y, Lu B, Chen C, Pan Y, Hu X: Abnormalities of peripheral blood system in patients with COVID-19 in Wenzhou, China. Clinica Chimica Acta 2020
Lechien JR, Chiesa‐Estomba CM, Place S, Van Laethem Y, Cabaraux P, Mat Q, Cantarella G. Clinical and epidemiological characteristics of 1420 European patients with mild‐to‐moderate coronavirus disease 2019. J Intern Med. 2020;288(3):335–44.
Gupta A, Madhavan MV, Sehgal K, Nair N, Mahajan S, Sehrawat TS, Landry DW. Extrapulmonary manifestations of COVID-19. Nat Med. 2020;26(7):1017–32.
Zhang T, Breitbart M, Lee WH, Run J-Q, Wei CL, Soh SWL, et al. RNA viral community in human feces: prevalence of plant pathogenic viruses. Plos Biol. 2005;4(1):e3. https://doi.org/10.1371/journal.pbio.0040003.
Zuo T, Sun Y, Wan Y, Yeoh YK, Zhang F, Cheung CP, Ng SC. Human-Gut-DNA Virome Variations across Geography, Ethnicity, and Urbanization. Cell Host & Microbe. 2020;28(5):741–51.
del Rio C, Collins LF, Malani P. Long-term Health Consequences of COVID-19. Jama. 2020;324(17):1723–4.
Schneider DS, Ayres JS. Two ways to survive infection: what resistance and tolerance can teach us about treating infectious diseases. Nat Rev Immunol. 2008;8(11):889–95. https://doi.org/10.1038/nri2432.
Ayres JS. A metabolic handbook for the COVID-19 pandemic. Nat Metab. 2020;2(7):572–85.
Ayres JS. Surviving COVID-19: a disease tolerance perspective. In: American Association for the Advancement of Science; 2020.
Vabret N, Britton GJ, Gruber C, Hegde S, Kim J, Kuksin M, et al. Immunology of COVID-19: current state of the science. Immunity. 2020;52(6):910–41. https://doi.org/10.1016/j.immuni.2020.05.002.
Tufan A, AA GÜL, Matucci-Cerinic M. COVID-19, immune system response, hyperinflammation and repurposing antirheumatic drugs. Turkish J Med Sci. 2020;50(SI-1):620–32.
Balique F, Lecoq H, Raoult D, Colson P. Can plant viruses cross the kingdom border and be pathogenic to humans? Viruses. 2015;7(4):2074–98. https://doi.org/10.3390/v7042074.
Berardi A, Evans DJ, Bombelli FB, Lomonossoff GP. Stability of plant virus-based nanocarriers in gastrointestinal fluids. Nanoscale. 2018;10(4):1667–79. https://doi.org/10.1039/C7NR07182E.
Colson P, Richet H, Desnues C, Balique F, Moal V, Grob J-J, et al. Pepper mild mottle virus, a plant virus associated with specific immune responses, fever, abdominal pains, and pruritus in humans. PloS one. 2010;5(4):e10041. https://doi.org/10.1371/journal.pone.0010041.
Hatfull GF, Hendrix RW. Bacteriophages and their genomes. Curr Opin Virol. 2011;1(4):298–303. https://doi.org/10.1016/j.coviro.2011.06.009.
Colson P, La Scola B, Levasseur A, Caetano-Anollés G, Raoult D. Mimivirus: leading the way in the discovery of giant viruses of amoebae. Nat Rev Microbiol. 2017;15(4):243–54. https://doi.org/10.1038/nrmicro.2016.197.
Mavrich TN, Hatfull GF. Bacteriophage evolution differs by host, lifestyle and genome. Nat Microbiol. 2017;2(9):17112. https://doi.org/10.1038/nmicrobiol.2017.112.
Pedulla ML, Ford ME, Houtz JM, Karthikeyan T, Wadsworth C, Lewis JA, et al. Origins of highly mosaic mycobacteriophage genomes. Cell. 2003;113(2):171–82. https://doi.org/10.1016/S0092-8674(03)00233-2.
Lawrence JG, Hatfull GF, Hendrix RW. Imbroglios of viral taxonomy: genetic exchange and failings of phenetic approaches. J Bacteriol. 2002;184(17):4891–905. https://doi.org/10.1128/JB.184.17.4891-4905.2002.
Hendrix RW, Hatfull GF, Ford ME, Smith MCM, Burns RN. Evolutionary relationships among diverse bacteriophages and prophages: all the world's a phage. In Horizontal gene transfer. Academic Press; 2020:133-VI.
Mizuno CM, Guyomar C, Roux S, Lavigne R, Rodriguez-Valera F, Sullivan MB, et al. Numerous cultivated and uncultivated viruses encode ribosomal proteins. Nat Commun. 2019;10(1):1–11.
Hennecke H, Springer M, Böck A. A specialized transducing λ phage carrying the Escherichia coli genes for phenylalanyl-tRNA synthetase. Mol Gen Genet. 1977;152(2):205–10. https://doi.org/10.1007/BF00268819.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. https://doi.org/10.1093/bioinformatics/btu170.
Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15(3):R46. https://doi.org/10.1186/gb-2014-15-3-r46.
Nooij S, Schmitz D, Vennema H, Kroneman A, Koopmans MPG. Overview of virus metagenomic classification methods and their biological applications. Front Microbiol. 2018;9:749. https://doi.org/10.3389/fmicb.2018.00749.
We would like to thank all healthcare workers working in isolation wards of the Prince of Wales Hospital, Hong Kong, China. We thank Apple CM Yeung, Wendy CS Ho, Miu L Chin, Rity Wong, Barry Wong, and Vickie Li for their technical contributions in this study.
This work was supported by the Hui Hoy & Chow Sin Lan Charity Fund Limited, Pine and Crane Company Limited, and Mr. Hui Ming; Center for Gut Microbiota Research (Faculty of Medicine, The Chinese University of Hong Kong); and InnoHK, The Government of Hong Kong, Special Administrative Region of the People’s Republic of China.
Ethics approval and consent to participate
This study was approved by the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee (Reference number: 2020.076). All subjects provided informed consent to participate in this study and agreed for publication of the research results.
Consent for publication
All subjects provided informed consent to participate in this study and agreed for publication of the research results.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Schematic diagram of COVID-19 patient (n=37) follow-up, including disease onset, admission, stool sample collection, duration of hospital stay. “CoV” denotes patient with COVID-19. Stool specimens were serially collected for separate shotgun metagenomic sequencing of RNA and DNA virome; “SARS-CoV-2 PCR negative in nasopharyngeal test”: the first negative result for SARS-CoV-2 virus in two consecutive negative nasopharyngeal tests, upon which patient was then discharged.
Temporal changes of the RNA viruses, SARS-CoV-2 (A) and PMMoV (B), in the faecal RNA virome in each COVID-19 case. CPM, count per million reads.
SARS-CoV-2 viral load. Stool SARS-CoV-2 viral load in COVID-19 patients were detected by quantitative RT-PCR (A) and RNA virome shotgun metagenomics sequencing (B) respectively. Between-group comparison was conducted by one-way anova, **p<0.01, *p<0.05. (C) Comparison of SARS-CoV-2 viral loads between nasopharyngeal swab and fecal specimens. Statistical analysis was conducted by Mann-Whitney test. (D) Correlation of levels of SARS-CoV-2 viral load between nasopharyngeal swab and fecal specimens, calculated by Pearson correlation analysis.
Heatmap of correlations of COVID-19-enriched faecal viral functions and species. The color and intensity denote the spearman correlation direction and coefficient, where only the significant correlations were shown. Viruses labeled with asterisk were species enriched in COVID-19. Only the most abundant 80 species in faecal virome were plotted and ranked in descending order (from left to right) on the basis of the relative abundance.
Blood parameter levels in COVID-19 patients between non-severe and moderate/severe groups. Data are shown in mean±s.e. Statistical significance was performed by Mann-Whitney test.
The faecal DNA viruses at patient baseline correlated with blood parameters in COVID-19 patients. The color and intensity denote the spearman correlation direction and coefficient, where only the significant correlations with |correlation coefficient| > 0.3 were shown.
Subject characteristics and blood measurements.
Diet regimen during time of hospitalization.
Viral abundance profile in faecal samples. Abundance is expressed in RPKM.
About this article
Cite this article
Zuo, T., Liu, Q., Zhang, F. et al. Temporal landscape of human gut RNA and DNA virome in SARS-CoV-2 infection and severity. Microbiome 9, 91 (2021). https://doi.org/10.1186/s40168-021-01008-x