81urn:lsid:arphahub.com:pub:EE60897E-EDEE-5EC8-86EB-7203BECD195DARPHA Conference AbstractsACA2603-3925Pensoft Publishers10.3897/aca.4.e649086490815765Conference AbstractDATA ANALYSIS & STORAGЕ - Revealing, comparing and storing biodiversity patterns in massive amplicon datasetsMg-Traits pipeline: advancing functional trait-based approaches in metagenomicsPereira-FloresEmilianopereiramemo@gmail.comhttps://orcid.org/0000-0001-5830-33281BarberanAlbert2GlöcknerFrank Oliverhttps://orcid.org/0000-0001-8528-90233Fernandez-GuerraAntonio4University of the Republic, Rocha, UruguayUniversity of the RepublicRochaUruguayThe University of Arizona, Arizona, United States of AmericaThe University of ArizonaArizonaUnited States of AmericaUniversity of Bremen, Bremen, GermanyUniversity of BremenBremenGermanyUniversity of Copenhagen, Copenhagen, DenmarkUniversity of CopenhagenCopenhagenDenmark
2021040320214e6490827949B36-C493-5DAA-9D9B-E67E5658194623022021Emiliano Pereira-Flores, Albert Barberan, Frank Oliver Glöckner, Antonio Fernandez-GuerraThis is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Microorganisms comprise an immense phylogenetic and metabolic diversity, inhabit every conceivable niche on earth, and play a fundamental role in global biogeochemical processes. Among others, their study is highly relevant to develop biotechnological applications, understand ecosystem processes and monitor environmental systems. Functional traits (FTs) (i.e., measurable properties of an organism that influence its fitness (McGill et al. 2006)) provide complementary information to the taxonomic composition to improve the characterization of microbial communities and study their ecology (Martiny et al. 2012). The application of FT-based approaches can be particularly enhanced when coupled with metagenomics, which as a culture-independent method, allows us to obtain the genetic material of microorganisms from the environment: Metagenomic data can be used to compute functional traits at the genome level from a random sample of individuals in a microbial community, irrespective of their taxonomic affiliation (Fierer et al. 2014). Previous works using FT-based approaches in metagenomics include the study of community assembly processes (Burke et al. 2011) and responses to environmental change (Leff et al. 2015), and ecosystem functioning (Babilonia et al. 2018).
In this work, we present the Metagenomic Traits pipeline: Mg-Traits. Mg-Traits is dedicated to the computation of 25 (and counting) functional traits in short-read metagenomic data, ranging from GC content and amino acid composition to functional diversity and average genome size (see Fig. 1). As an example application, we used the Mg-Traits pipeline to process the 139 prokaryotic metagenomes of the TARA Oceans data set (Sunagawa et al. 2015). In this analysis, we observed that the computed metagenomic traits track community changes along the water column, which denote microorganisms’ environmental adaptations.
Mg-Traits allows the systematic computation of a comprehensive set of metagenomic functional traits, which can be used to generate a functional and taxonomic fingerprint and reveal the predominant life-history strategies and ecological processes in a microbial community. Mg-Traits contributes to improving the exploitation of metagenomic data and facilitates comparative and quantitative studies. Considering the high genomic plasticity of microorganisms and their capacity to rapidly adapt to changing environmental conditions, Mg-Traits constitutes a valuable tool to monitor environmental systems.
The Mg-Traits pipeline is available at https://github.com/pereiramemo/metagenomic_pipelines. It is programmed in AWK, BASH, and R, and it was devised using a modular design to facilitate the integration of new metagenomic traits.
1st DNAQUA International Conference (March 9-11, 2021)
ReferencesBabiloniaJoanyConesaAnaCasaburiGiorgioPereiraCecileLouyakisArtemis SReidR PamelaFosterJamie S2018Comparative Metagenomics Provides Insight Into the Ecosystem Functioning of the Shark Bay Stromatolites, Western Australia.9135910.3389/fmicb.2018.01359BurkeCatherineSteinbergPeterRuschDougKjellebergStaffanThomasTorsten2011Bacterial Community Assembly Based on Functional Genes Rather than Species108341428814293Englishhttps://www.pnas.org/content/early/2011/08/03/110159110810.1073/pnas.1101591108FiererNoahBarberánAlbertLaughlinDaniel C2014Seeing the forest for the genes: using metagenomics to infer the aggregated traits of microbial communities.561410.3389/fmicb.2014.00614LeffJonathan WJonesStuart EProberSuzanne MBarberánAlbertBorerElizabeth TFirnJennifer LHarpoleW StanleyHobbieSarah EHofmockelKirsten SKnopsJohannes M HMcCulleyRebecca LLa PierreKimberlyRischAnita CSeabloomEric WSchützMartinSteenbockChristopherStevensCarly JFiererNoah2015Consistent responses of soil microbial communities to elevated nutrient inputs in grasslands across the globe.11235109677210.1073/pnas.1508382112MartinyAdam CTresederKathleenPuschGordon2012Phylogenetic conservatism of functional traits in microorganisms.74830810.1038/ismej.2012.160McGillBrian JEnquistBrian JWeiherEvanWestobyMark2006Rebuilding community ecology from functional traits.2141788510.1016/j.tree.2006.02.002SunagawaShinichiCoelhoLuis PedroChaffronSamuelKultimaJens RoatLabadieKarineSalazarGuillemDjahanschiriBardyaZellerGeorgMendeDaniel RAlbertiAdrianaCornejo-CastilloFrancisco MCosteaPaul ICruaudCorinned'OvidioFrancescoEngelenStefanFerreraIsabelGasolJosep MGuidiLionelHildebrandFalkKokoszkaFlorianLepoivreCyrilleLima-MendezGipsiPoulainJuliePoulosBonnie TRoyo-LlonchMartaSarmentoHugoVieira-SilvaSaraDimierCélinePicheralMarcSearsonSarahKandels-LewisStefanieBowlerChrisde VargasColombanGorskyGabrielGrimsleyNigelHingampPascalIudiconeDanieleJaillonOlivierNotFabriceOgataHiroyukiPesantStephaneSpeichSabrinaStemmannLarsSullivanMatthew BWeissenbachJeanWinckerPatrickKarsentiEricRaesJeroenAcinasSilvia GBorkPeer2015Ocean plankton. Structure and function of the global ocean microbiome.3486237126135910.1126/science.12613593F059DA8-2DDA-5AD8-9388-EA6F37C288E810.3897/aca.4.e64908.figure1
Mg-Traits pipeline. The 25 metagenomic traits computed by the Mg-Traits pipeline are divided into four different groups. The first includes the metagenomic traits computed at the nucleotide level: (1) GC content, (2) GC variance, and (3) Tetranucleotide frequency. The second group includes the traits obtained from the open reading frame (ORF) sequence data: (4) ORFs to Base Pairs (BPs) ratio, (5) Codon frequency, (6) Amino acid frequency, and (7) Acidic to basic amino acid ratio. The third group is based on the functional annotation of the ORF amino acid sequences. The first 12 metagenomic traits (from 8 to 19 in the figure) comprise the composition, diversity, richness, and percentage of annotated genes for three different sets of genes: Pfam (https://pfam.xfam.org), Resfam (http://www.dantaslab.org/resfams), and Biosynthetic Gene Cluster (BGC) domains (https://doi.org/10.1101/2021.01.20.427441). Additionally, this group includes (20) the percentage of transcription factors (TFs) and (21) the average genome size (AGS). Lastly, in the fourth group are included the taxonomy-related metagenomic traits: (22) average copy number of 16S rRNA genes (ACN), taxonomic (23) composition, (24) diversity, and (25) richness.