
Table 1 MME techniques yield data sets with common structures and, often, limitations

From: The community ecology perspective of omics data

 

| Data type | Common techniques | 0-inflated | No N | Compositional |
| --- | --- | --- | --- | --- |
| Genomics: The system-wide identification and quantification of DNA sequences and the encoded functions in an organism or population [2]. | High throughput sequencing | | (+) | |
| Transcriptomics: The system-wide identification and quantification of the RNA transcripts in an organism or population [3]. | High throughput sequencing, microarrays | + | + | + |
| Proteomics: The use of quantitative protein-level measurements of gene translation to characterize biological processes and decipher the mechanisms of gene expression control [4]. | Mass spectrometry | + | + | |
| Metabolomics: The systematic identification and quantification of metabolites (small molecule substrates, intermediates, products of cell metabolism) in an organism or population [5]. | Nuclear magnetic resonance spectroscopy, mass spectrometry | + | (+) | |
| Metabarcoding: The large-scale identification and quantification of variation of diversity in an environmental sample in terms of a specific genomic region (DNA) [6]. | High throughput amplicon sequencing | ++ | + | + |
| Metagenomics: Large-scale identification and quantification of all DNA in an environmental sample [7]. | High throughput shotgun metagenomic sequencing | ++ | | + |
| Metatranscriptomics: Large-scale identification and quantification of all RNA transcripts in an environmental sample [8]. | High throughput RNA sequencing, (microarrays) | ++ | | + |
| Metaproteomics: Large-scale identification and quantification of the entire protein complement from an environmental sample [9]. | Mass spectrometry | ++ | + | |
| Meta-metabolomics: Large-scale identification and quantification of small molecules from an environmental sample [10]. | Nuclear magnetic resonance spectroscopy, mass spectrometry | + | + | |

For techniques with “no N,” the total number of molecular entities measured contains no biological information. For techniques that produce 0-inflated datasets, the data matrices contain more zeros than non-zero values, while for compositional datasets, the abundance of species is correlated with the technique and contains no biological information. For each limitation, + marks an issue and ++ marks a serious issue for the given data type. (+) marks a limitation that a data type faces but that seldom affects the scientific questions asked with these data.
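
As a rough illustration of two of the properties defined in the note above, the following Python sketch checks a small count matrix for zero inflation and rescales each sample to proportions. It is a minimal sketch under stated assumptions, not a method taken from the source: the samples-by-features layout, the function names, and the toy numbers are all hypothetical. The only criterion borrowed from the note is "more zeros than non-zero values". Rescaling to proportions is used here only to show why a sample's total can carry no information.

```python
from typing import List

Matrix = List[List[float]]  # assumed layout: rows are samples, columns are features


def zero_fraction(counts: Matrix) -> float:
    """Return the fraction of matrix entries that are exactly zero."""
    total = sum(len(row) for row in counts)
    zeros = sum(1 for row in counts for value in row if value == 0)
    return zeros / total


def is_zero_inflated(counts: Matrix) -> bool:
    """Apply the note's criterion: more zeros than non-zero values."""
    return zero_fraction(counts) > 0.5


def close_rows(counts: Matrix) -> Matrix:
    """Rescale each sample (row) to proportions that sum to 1."""
    closed = []
    for row in counts:
        row_total = sum(row)
        if row_total == 0:
            raise ValueError("cannot rescale an all-zero sample")
        closed.append([value / row_total for value in row])
    return closed


# Hypothetical toy data: 3 samples x 5 features.
# Sample 3 has ten times the total of sample 1 but the same proportions.
toy_counts = [
    [10, 0, 0, 5, 0],
    [0, 0, 20, 0, 0],
    [100, 0, 0, 50, 0],
]

if __name__ == "__main__":
    print("zero-inflated:", is_zero_inflated(toy_counts))
    proportions = close_rows(toy_counts)
    same = all(abs(a - b) < 1e-12 for a, b in zip(proportions[0], proportions[2]))
    print("samples 1 and 3 match after rescaling:", same)
```

In the toy matrix, samples 1 and 3 differ tenfold in total count yet become indistinguishable once rescaled, which is the situation the "no N" column flags: if the total is set by the technique rather than by biology, only the relative values can be interpreted.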