Liu, Junzhuo ; Eckstein, Markus ; Wang, Zhixiang ; Feuerhake, Friedrich ; Merhof, Dorit

Spatial transcriptomics expression prediction from histopathology based on cross-modal mask reconstruction and contrastive learning

Liu, Junzhuo, Eckstein, Markus, Wang, Zhixiang, Feuerhake, Friedrich and Merhof, Dorit

(2025) Spatial transcriptomics expression prediction from histopathology based on cross-modal mask reconstruction and contrastive learning. Medical Image Analysis 108, p. 103889.

Publication date of this full text: 03 Dec 2025 05:41
Article
DOI for citing this document: 10.5283/epub.78252

Published version (PDF, 14 MB)

License: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International

Abstract

Spatial transcriptomics is a technology that captures gene expression at different spatial locations. It is widely used in tumor microenvironment analysis and molecular profiling of histopathology, and it provides valuable insights into resolving gene expression and into the clinical diagnosis of cancer. Due to the high cost of data acquisition, large-scale spatial transcriptomics data remain challenging to obtain. In this study, we develop a contrastive learning-based deep learning method to predict spatially resolved gene expression from whole-slide images (WSIs). Unlike existing end-to-end prediction frameworks, our method leverages multi-modal contrastive learning to establish a correspondence between histopathological morphology and spatial gene expression in the feature space. By computing cross-modal feature similarity, our method generates spatially resolved gene expression directly from WSIs. Furthermore, to enhance the standard contrastive learning paradigm, a cross-modal masked reconstruction is designed as a pretext task, enabling feature-level fusion between modalities. Notably, our method does not rely on large-scale pretraining datasets or abstract semantic representations from either modality, making it particularly effective for scenarios with limited spatial transcriptomics data. Evaluation across six different disease datasets demonstrates that, compared to existing studies, our method improves the Pearson Correlation Coefficient (PCC) in the prediction of highly expressed genes, highly variable genes, and marker genes by 6.27%, 6.11%, and 11.26%, respectively. Further analysis indicates that our method preserves gene-gene correlations and applies to datasets with limited samples. Additionally, our method exhibits potential in cancer tissue localization based on biomarker expression. The code repository for this work is available at https://github.com/ngfufdrdh/CMRCNet.
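
The abstract names three ingredients that can be illustrated generically: a contrastive objective that aligns image and expression features, prediction by cross-modal feature similarity, and evaluation by PCC. The following is a minimal NumPy sketch under stated assumptions and is not the authors' CMRCNet implementation. The symmetric InfoNCE loss is a common stand-in for "multi-modal contrastive learning"; the function names, the `temperature` and `top_k` parameters, and the similarity-weighted averaging over reference spots are all hypothetical choices made for illustration. The cross-modal masked reconstruction pretext task is not shown.

```python
import numpy as np


def l2_normalize(x, eps=1e-8):
    """Scale each row to (approximately) unit length."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)


def symmetric_info_nce(img_emb, expr_emb, temperature=0.07):
    """Symmetric InfoNCE loss; row i of both inputs is assumed to be a matched pair."""
    logits = l2_normalize(img_emb) @ l2_normalize(expr_emb).T / temperature

    def diagonal_cross_entropy(l):
        l = l - l.max(axis=1, keepdims=True)  # numerical stability
        log_prob = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_prob))

    # Average the image-to-expression and expression-to-image directions.
    return 0.5 * (diagonal_cross_entropy(logits) + diagonal_cross_entropy(logits.T))


def predict_by_similarity(query_img_emb, ref_expr_emb, ref_expression,
                          top_k=3, temperature=0.07):
    """Predict expression for query patches from the most similar reference spots.

    query_img_emb:  (Q, D) image-patch embeddings
    ref_expr_emb:   (R, D) expression embeddings of reference spots
    ref_expression: (R, G) measured expression of the same reference spots
    Assumption: a softmax-weighted average of the top_k matches.
    """
    sim = l2_normalize(query_img_emb) @ l2_normalize(ref_expr_emb).T  # (Q, R)
    top_idx = np.argsort(-sim, axis=1)[:, :top_k]                     # (Q, k)
    top_sim = np.take_along_axis(sim, top_idx, axis=1)                # (Q, k)
    weights = np.exp((top_sim - top_sim.max(axis=1, keepdims=True)) / temperature)
    weights /= weights.sum(axis=1, keepdims=True)
    # ref_expression[top_idx] has shape (Q, k, G); weight and sum over k.
    return np.einsum("qk,qkg->qg", weights, ref_expression[top_idx])


def mean_gene_pcc(pred, target):
    """Mean Pearson correlation across genes (columns), computed over spots (rows)."""
    pc = pred - pred.mean(axis=0)
    tc = target - target.mean(axis=0)
    denom = np.sqrt((pc ** 2).sum(axis=0) * (tc ** 2).sum(axis=0)) + 1e-12
    return float(np.mean((pc * tc).sum(axis=0) / denom))
```

In this sketch the expression values are never regressed directly: a query patch is matched against reference spots in the shared feature space and inherits a weighted mix of their measured expression, which is one plausible reading of "computing cross-modal feature similarity".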

Participating institutions

Informatics and Data Science > Department of Bioinformatics
Informatics and Data Science > Department of Bioinformatics > Chair of Image Processing (Prof. Dr.-Ing. Dorit Merhof)

Details

Document type

Article

Journal or periodical title

Medical Image Analysis

Publisher:

Elsevier

Volume:

108

Page range:

p. 103889

Date

25 November 2025

Identification number

Value	Type
10.1016/j.media.2025.103889	DOI

Keywords

Histopathology, Spatial transcriptomics, Contrastive learning, Multimodal fusion

Dewey Decimal Classification

000 Computer science, information science, general works > 004 Computer science

Status

Published

Peer reviewed

Yes, this version has been peer reviewed

Created at the University of Regensburg

In part

URN of the Regensburg University Library

urn:nbn:de:bvb:355-epub-782528

Document ID

78252
