Skip to main content

Table 1 Existing datasets for disease- and species-related entities. Note that there are only two datasets which contain both diseases and species (miRNA and variome). In addition, species-level datasets are not specific to the human microbiome, so there is a need to create datasets curated for human microbiota

From: Challenges in the construction of knowledge bases for human microbiome-disease associations

Dataset

Entity type

No. of annotations

No. of unique annotations

CDR [34]

Disease

12,694

3459

Variome [35]

Disease

6025

629

miRNA [36]

Disease

2123

671

NCBI Disease [37]

Disease

6881

2129

Arizona Disease [38]

Disease

3206

1188

SCAI [39]

Disease

2226

1048

CellFinder [40]

Species

435

51

Variome [35]

Species

182

8

miRNA [36]

Species

726

47

S800 [41]

Species

3646

1564

LocText [42]

Species

276

39

Linneaus [43]

Species

4077

419

BioNLP-ST 16 [43]

Species

619

277