Direkt zum Inhalt

Pichler, Maximilian ; Hartig, Florian

A new joint species distribution model for faster and more accurate inference of species associations from big community data

Pichler, Maximilian und Hartig, Florian (2021) A new joint species distribution model for faster and more accurate inference of species associations from big community data. Methods in Ecology and Evolution 12, S. 2159-2173.

Veröffentlichungsdatum dieses Volltextes: 13 Aug 2021 10:18
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.47837


Zusammenfassung

Joint species distribution models (JSDMs) explain spatial variation in community composition by contributions of the environment, biotic associations and possibly spatially structured residual covariance. They show great promise as a general analytical framework for community ecology and macroecology, but current JSDMs, even when approximated by latent variables, scale poorly on large datasets, ...

Joint species distribution models (JSDMs) explain spatial variation in community composition by contributions of the environment, biotic associations and possibly spatially structured residual covariance. They show great promise as a general analytical framework for community ecology and macroecology, but current JSDMs, even when approximated by latent variables, scale poorly on large datasets, limiting their usefulness for currently emerging big (e.g. metabarcoding and metagenomics) community datasets. Here, we present a novel, more scalable JSDM (sjSDM) that circumvents the need to use latent variables by using a Monte Carlo integration of the joint JSDM likelihood together with flexible elastic net regularization on all model components. We implemented sjSDM in PyTorch, a modern machine learning framework, which allows making use of both CPU and GPU calculations. Using simulated communities with known species-species associations and different number of species and sites, we compare sjSDM with state-of-the-art JSDM implementations to determine computational runtimes and accuracy of the inferred species-species and species-environment associations. We find that sjSDM is orders of magnitude faster than existing JSDM algorithms (even when run on the CPU) and can be scaled to very large datasets. Despite the dramatically improved speed, sjSDM produces more accurate estimates of species association structures than alternative JSDM implementations. We demonstrate the applicability of sjSDM to big community data using eDNA case study with thousands of fungi operational taxonomic units (OTU). Our sjSDM approach makes the analysis of JSDMs to large community datasets with hundreds or thousands of species possible, substantially extending the applicability of JSDMs in ecology. We provide our method in an R package to facilitate its applicability for practical data analysis.



Beteiligte Einrichtungen


Details

DokumentenartArtikel
Titel eines Journals oder einer ZeitschriftMethods in Ecology and Evolution
Verlag:Wiley
Ort der Veröffentlichung:HOBOKEN
Band:12
Seitenbereich:S. 2159-2173
Datum28 Juli 2021
InstitutionenBiologie und Vorklinische Medizin > Institut für Pflanzenwissenschaften > Arbeitsgruppe Theoretische Ökologie (Prof. Dr. Florian Hartig)
Identifikationsnummer
WertTyp
10.1111/2041-210X.13687DOI
Stichwörter / KeywordsBIOTIC INTERACTIONS; ENVIRONMENTAL DNA; CLIMATE-CHANGE; MULTIVARIATE; BIODIVERSITY; ABUNDANCE; COOCCURRENCE; ARTHROPODS; MECHANISMS; FRAMEWORK; big data; co-occurrence; machine learning; metacommunity; regularization; statistics
Dewey-Dezimal-Klassifikation500 Naturwissenschaften und Mathematik > 570 Biowissenschaften, Biologie
500 Naturwissenschaften und Mathematik > 580 Pflanzen (Botanik)
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenJa
URN der UB Regensburgurn:nbn:de:bvb:355-epub-478374
Dokumenten-ID47837

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben