Direkt zum Inhalt

Schill, Rudolf ; Klever, Maren ; Lösch, Andreas ; Hu, Y. Linda ; Vocht, Stefan ; Rupp, Kevin ; Grasedyck, Lars ; Spang, Rainer ; Beerenwinkel, Niko

Overcoming Observation Bias for Cancer Progression Modeling

Schill, Rudolf, Klever, Maren, Lösch, Andreas, Hu, Y. Linda , Vocht, Stefan, Rupp, Kevin, Grasedyck, Lars, Spang, Rainer und Beerenwinkel, Niko (2023) Overcoming Observation Bias for Cancer Progression Modeling. bioRxiv. (Eingereicht)

Veröffentlichungsdatum dieses Volltextes: 24 Jan 2024 11:48
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.55392


Zusammenfassung

Cancers evolve by accumulating genetic alterations, such as mutations and copy number changes. The chronological order of these events is important for understanding the disease, but not directly observable from cross-sectional genomic data. Cancer progression models (CPMs), such as Mutual Hazard Networks (MHNs), reconstruct the progression dynamics of tumors by learning a network of causal ...

Cancers evolve by accumulating genetic alterations, such as mutations and copy number changes. The chronological order of these events is important for understanding the disease, but not directly observable from cross-sectional genomic data. Cancer progression models (CPMs), such as Mutual Hazard Networks (MHNs), reconstruct the progression dynamics of tumors by learning a network of causal interactions between genetic events from their co-occurrence patterns. However, current CPMs fail to include effects of genetic events on the observation of the tumor itself and assume that observation occurs independently of all genetic events. Since a dataset contains by definition only tumors at their moment of observation, neglecting any causal effects on this event leads to the “conditioning on a collider” bias: Events that make the tumor more likely to be observed appear anti-correlated, which results in spurious suppressive effects or masks promoting effects among genetic events. Here, we extend MHNs by modeling effects from genetic progression events on the observation event, thereby correcting for the collider bias. We derive an efficient tensor formula for the likelihood function and learn two models on somatic mutation datasets from the MSK-IMPACT study. In colon adenocarcinoma, we find a strong effect on observation by mutations in TP53, and in lung adenocarcinoma by mutations in EGFR. Compared to classical MHNs, this explains away many spurious suppressive interactions and uncovers several promoting effects. The data, code, and results are available at https://github.com/cbg-ethz/ObservationMHN.



Beteiligte Einrichtungen


Details

DokumentenartArtikel
Titel eines Journals oder einer ZeitschriftbioRxiv
Buchtitel:Overcoming Observation Bias for Cancer Progression Modeling
Datum5 Dezember 2023
Zusätzliche Informationen (Öffentlich)"This article is a preprint and has not been certified by peer review"
InstitutionenMedizin > Institut für Funktionelle Genomik > Lehrstuhl für Statistische Bioinformatik (Prof. Spang)
Informatik und Data Science > Fachbereich Bioinformatik > Lehrstuhl für Statistische Bioinformatik (Prof. Spang)
Identifikationsnummer
WertTyp
10.1101/2023.12.03.569824DOI
Stichwörter / KeywordsCancer progression model · Selection bias · Collider bias
Dewey-Dezimal-Klassifikation000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
StatusEingereicht
BegutachtetNein, diese Version wurde noch nicht begutachtet (bei preprints)
An der Universität Regensburg entstandenZum Teil
URN der UB Regensburgurn:nbn:de:bvb:355-epub-553921
Dokumenten-ID55392

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben