Direkt zum Inhalt

Canzar, Stefan ; Do, Van Hoan ; Jelić, Slobodan ; Laue, Sören ; Matijević, Domagoj ; Prusina, Tomislav

Metric multidimensional scaling for large single-cell datasets using neural networks

Canzar, Stefan, Do, Van Hoan, Jelić, Slobodan, Laue, Sören, Matijević, Domagoj und Prusina, Tomislav (2024) Metric multidimensional scaling for large single-cell datasets using neural networks. Algorithms for Molecular Biology 19 (1).

Veröffentlichungsdatum dieses Volltextes: 18 Jun 2024 06:21
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.58424


Zusammenfassung

Metric multidimensional scaling is one of the classical methods for embedding data into low-dimensional Euclidean space. It creates the low-dimensional embedding by approximately preserving the pairwise distances between the input points. However, current state-of-the-art approaches only scale to a few thousand data points. For larger data sets such as those occurring in single-cell RNA ...

Metric multidimensional scaling is one of the classical methods for embedding data into low-dimensional Euclidean space. It creates the low-dimensional embedding by approximately preserving the pairwise distances between the input points. However, current state-of-the-art approaches only scale to a few thousand data points. For larger data sets such as those occurring in single-cell RNA sequencing experiments, the running time becomes prohibitively large and thus alternative methods such as PCA are widely used instead. Here, we propose a simple neural network-based approach for solving the metric multidimensional scaling problem that is orders of magnitude faster than previous state-of-the-art approaches, and hence scales to data sets with up to a few million cells. At the same time, it provides a non-linear mapping between high- and low-dimensional space that can place previously unseen cells in the same embedding.



Beteiligte Einrichtungen


Details

DokumentenartArtikel
Titel eines Journals oder einer ZeitschriftAlgorithms for Molecular Biology
Verlag:Springer
Band:19
Nummer des Zeitschriftenheftes oder des Kapitels:1
Datum11 Juni 2024
InstitutionenInformatik und Data Science > Fachbereich Bioinformatik > Algorithmische Bioinformatik (Prof. Dr. Stefan Canzar)
Identifikationsnummer
WertTyp
10.1186/s13015-024-00265-3DOI
Stichwörter / KeywordsMetric multidimensional scaling, Neural networks, Large-scale data, Dimensionality reduction, Single-cell RNA-seq, Clustering
Dewey-Dezimal-Klassifikation000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenZum Teil
URN der UB Regensburgurn:nbn:de:bvb:355-epub-584242
Dokumenten-ID58424

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben