ARPHA Conference Abstracts :
Conference Abstract
|
Corresponding author: Emiliano Pereira-Flores (pereiramemo@gmail.com)
Received: 23 Feb 2021 | Published: 04 Mar 2021
© 2021 Emiliano Pereira-Flores, Albert Barberan, Frank Oliver Glöckner, Antonio Fernandez-Guerra
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Pereira-Flores E, Barberan A, Glöckner FO, Fernandez-Guerra A (2021) Mg-Traits pipeline: advancing functional trait-based approaches in metagenomics. ARPHA Conference Abstracts 4: e64908. https://doi.org/10.3897/aca.4.e64908
|
Microorganisms comprise an immense phylogenetic and metabolic diversity, inhabit every conceivable niche on earth, and play a fundamental role in global biogeochemical processes. Among others, their study is highly relevant to develop biotechnological applications, understand ecosystem processes and monitor environmental systems. Functional traits (FTs) (i.e., measurable properties of an organism that influence its fitness (
In this work, we present the Metagenomic Traits pipeline: Mg-Traits. Mg-Traits is dedicated to the computation of 25 (and counting) functional traits in short-read metagenomic data, ranging from GC content and amino acid composition to functional diversity and average genome size (see Fig.
Mg-Traits pipeline. The 25 metagenomic traits computed by the Mg-Traits pipeline are divided into four different groups. The first includes the metagenomic traits computed at the nucleotide level: (1) GC content, (2) GC variance, and (3) Tetranucleotide frequency. The second group includes the traits obtained from the open reading frame (ORF) sequence data: (4) ORFs to Base Pairs (BPs) ratio, (5) Codon frequency, (6) Amino acid frequency, and (7) Acidic to basic amino acid ratio. The third group is based on the functional annotation of the ORF amino acid sequences. The first 12 metagenomic traits (from 8 to 19 in the figure) comprise the composition, diversity, richness, and percentage of annotated genes for three different sets of genes: Pfam (https://pfam.xfam.org), Resfam (http://www.dantaslab.org/resfams), and Biosynthetic Gene Cluster (BGC) domains (https://doi.org/10.1101/2021.01.20.427441). Additionally, this group includes (20) the percentage of transcription factors (TFs) and (21) the average genome size (AGS). Lastly, in the fourth group are included the taxonomy-related metagenomic traits: (22) average copy number of 16S rRNA genes (ACN), taxonomic (23) composition, (24) diversity, and (25) richness.
Mg-Traits allows the systematic computation of a comprehensive set of metagenomic functional traits, which can be used to generate a functional and taxonomic fingerprint and reveal the predominant life-history strategies and ecological processes in a microbial community. Mg-Traits contributes to improving the exploitation of metagenomic data and facilitates comparative and quantitative studies. Considering the high genomic plasticity of microorganisms and their capacity to rapidly adapt to changing environmental conditions, Mg-Traits constitutes a valuable tool to monitor environmental systems.
The Mg-Traits pipeline is available at https://github.com/pereiramemo/metagenomic_pipelines. It is programmed in AWK, BASH, and R, and it was devised using a modular design to facilitate the integration of new metagenomic traits.
Bioinformatics: Metagenomics; Functional Traits; Microbial Ecology; Environmental Monitoring
Emiliano Pereira Flores
1st DNAQUA International Conference (March 9-11, 2021)