| Veröffentlichte Version Download ( PDF | 3MB) | Lizenz: Creative Commons Namensnennung 4.0 International |
Metric multidimensional scaling for large single-cell datasets using neural networks
Canzar, Stefan, Do, Van Hoan, Jelić, Slobodan, Laue, Sören, Matijević, Domagoj und Prusina, Tomislav (2024) Metric multidimensional scaling for large single-cell datasets using neural networks. Algorithms for Molecular Biology 19 (1).Veröffentlichungsdatum dieses Volltextes: 18 Jun 2024 06:21
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.58424
Zusammenfassung
Metric multidimensional scaling is one of the classical methods for embedding data into low-dimensional Euclidean space. It creates the low-dimensional embedding by approximately preserving the pairwise distances between the input points. However, current state-of-the-art approaches only scale to a few thousand data points. For larger data sets such as those occurring in single-cell RNA ...
Metric multidimensional scaling is one of the classical methods for embedding data into low-dimensional Euclidean space. It creates the low-dimensional embedding by approximately preserving the pairwise distances between the input points. However, current state-of-the-art approaches only scale to a few thousand data points. For larger data sets such as those occurring in single-cell RNA sequencing experiments, the running time becomes prohibitively large and thus alternative methods such as PCA are widely used instead. Here, we propose a simple neural network-based approach for solving the metric multidimensional scaling problem that is orders of magnitude faster than previous state-of-the-art approaches, and hence scales to data sets with up to a few million cells. At the same time, it provides a non-linear mapping between high- and low-dimensional space that can place previously unseen cells in the same embedding.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Artikel | ||||
| Titel eines Journals oder einer Zeitschrift | Algorithms for Molecular Biology | ||||
| Verlag: | Springer | ||||
|---|---|---|---|---|---|
| Band: | 19 | ||||
| Nummer des Zeitschriftenheftes oder des Kapitels: | 1 | ||||
| Datum | 11 Juni 2024 | ||||
| Institutionen | Informatik und Data Science > Fachbereich Bioinformatik > Algorithmische Bioinformatik (Prof. Dr. Stefan Canzar) | ||||
| Identifikationsnummer |
| ||||
| Stichwörter / Keywords | Metric multidimensional scaling, Neural networks, Large-scale data, Dimensionality reduction, Single-cell RNA-seq, Clustering | ||||
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik | ||||
| Status | Veröffentlicht | ||||
| Begutachtet | Ja, diese Version wurde begutachtet | ||||
| An der Universität Regensburg entstanden | Zum Teil | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-584242 | ||||
| Dokumenten-ID | 58424 |
Downloadstatistik
Downloadstatistik