| Eingereichte Version Download ( PDF | 652kB) | Lizenz: Creative Commons Namensnennung-NichtKommerziell-KeineBearbeitung 4.0 International |
Overcoming Observation Bias for Cancer Progression Modeling
Schill, Rudolf, Klever, Maren, Lösch, Andreas, Hu, Y. Linda
, Vocht, Stefan, Rupp, Kevin, Grasedyck, Lars, Spang, Rainer
und Beerenwinkel, Niko
(2023)
Overcoming Observation Bias for Cancer Progression Modeling.
bioRxiv.
(Eingereicht)
Veröffentlichungsdatum dieses Volltextes: 24 Jan 2024 11:48
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.55392
Zusammenfassung
Cancers evolve by accumulating genetic alterations, such as mutations and copy number changes. The chronological order of these events is important for understanding the disease, but not directly observable from cross-sectional genomic data. Cancer progression models (CPMs), such as Mutual Hazard Networks (MHNs), reconstruct the progression dynamics of tumors by learning a network of causal ...
Cancers evolve by accumulating genetic alterations, such as mutations and copy number changes. The chronological order of these events is important for understanding the disease, but not directly observable from cross-sectional genomic data. Cancer progression models (CPMs), such as Mutual Hazard Networks (MHNs), reconstruct the progression dynamics of tumors by learning a network of causal interactions between genetic events from their co-occurrence patterns. However, current CPMs fail to include effects of genetic events on the observation of the tumor itself and assume that observation occurs independently of all genetic events. Since a dataset contains by definition only tumors at their moment of observation, neglecting any causal effects on this event leads to the “conditioning on a collider” bias: Events that make the tumor more likely to be observed appear anti-correlated, which results in spurious suppressive effects or masks promoting effects among genetic events. Here, we extend MHNs by modeling effects from genetic progression events on the observation event, thereby correcting for the collider bias. We derive an efficient tensor formula for the likelihood function and learn two models on somatic mutation datasets from the MSK-IMPACT study. In colon adenocarcinoma, we find a strong effect on observation by mutations in TP53, and in lung adenocarcinoma by mutations in EGFR. Compared to classical MHNs, this explains away many spurious suppressive interactions and uncovers several promoting effects. The data, code, and results are available at https://github.com/cbg-ethz/ObservationMHN.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Artikel | ||||
| Titel eines Journals oder einer Zeitschrift | bioRxiv | ||||
| Buchtitel: | Overcoming Observation Bias for Cancer Progression Modeling | ||||
|---|---|---|---|---|---|
| Datum | 5 Dezember 2023 | ||||
| Zusätzliche Informationen (Öffentlich) | "This article is a preprint and has not been certified by peer review" | ||||
| Institutionen | Medizin > Institut für Funktionelle Genomik > Lehrstuhl für Statistische Bioinformatik (Prof. Spang) Informatik und Data Science > Fachbereich Bioinformatik > Lehrstuhl für Statistische Bioinformatik (Prof. Spang) | ||||
| Identifikationsnummer |
| ||||
| Stichwörter / Keywords | Cancer progression model · Selection bias · Collider bias | ||||
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik | ||||
| Status | Eingereicht | ||||
| Begutachtet | Nein, diese Version wurde noch nicht begutachtet (bei preprints) | ||||
| An der Universität Regensburg entstanden | Zum Teil | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-553921 | ||||
| Dokumenten-ID | 55392 |
Downloadstatistik
Downloadstatistik