| Veröffentlichte Version Download ( PDF | 2MB) | Lizenz: Creative Commons Namensnennung 4.0 International |
Chances and challenges of machine learning based disease classification in genetic association studies illustrated on age-related macular degeneration
Günther, Felix
, Brandl, Caroline, Winkler, Thomas W.
, Wanner, Veronika, Stark, Klaus, Kuechenhoff, Helmut und Heid, Iris M.
(2020)
Chances and challenges of machine learning based disease classification in genetic association studies illustrated on age-related macular degeneration.
Genetic Epidemiology 44, S. 759-777.
Veröffentlichungsdatum dieses Volltextes: 28 Jan 2021 14:19
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.44655
Zusammenfassung
Imaging technology and machine learning algorithms for disease classification set the stage for high-throughput phenotyping and promising new avenues for genome-wide association studies (GWAS). Despite emerging algorithms, there has been no successful application in GWAS so far. We establish machine learning-based phenotyping in genetic association analysis as misclassification problem. To ...
Imaging technology and machine learning algorithms for disease classification set the stage for high-throughput phenotyping and promising new avenues for genome-wide association studies (GWAS). Despite emerging algorithms, there has been no successful application in GWAS so far. We establish machine learning-based phenotyping in genetic association analysis as misclassification problem. To evaluate chances and challenges, we performed a GWAS based on automatically classified age-related macular degeneration (AMD) in UK Biobank (images from 135,500 eyes; 68,400 persons). We quantified misclassification of automatically derived AMD in internal validation data (4,001 eyes; 2,013 persons) and developed a maximum likelihood approach (MLA) to account for it when estimating genetic association. We demonstrate that our MLA guards against bias and artifacts in simulation studies. By combining a GWAS on automatically derived AMD and our MLA in UK Biobank data, we were able to dissect true association (ARMS2/HTRA1,CFH) from artifacts (nearHERC2) and identified eye color as associated with the misclassification. On this example, we provide a proof-of-concept that a GWAS using machine learning-derived disease classification yields relevant results and that misclassification needs to be considered in analysis. These findings generalize to other phenotypes and emphasize the utility of genetic data for understanding misclassification structure of machine learning algorithms.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Artikel | ||||
| Titel eines Journals oder einer Zeitschrift | Genetic Epidemiology | ||||
| Verlag: | Wiley | ||||
|---|---|---|---|---|---|
| Ort der Veröffentlichung: | HOBOKEN | ||||
| Band: | 44 | ||||
| Seitenbereich: | S. 759-777 | ||||
| Datum | 2020 | ||||
| Institutionen | Medizin > Lehrstuhl für Augenheilkunde Medizin > Institut für Epidemiologie und Präventivmedizin Medizin > Institut für Epidemiologie und Präventivmedizin > Lehrstuhl für Genetische Epidemiologie | ||||
| Identifikationsnummer |
| ||||
| Stichwörter / Keywords | MISCLASSIFICATION; VALIDATION; IMAGES; RARE; age-related macular degeneration (AMD); genome-wide association study; machine learning-based disease classification; response misclassification; UK Biobank | ||||
| Dewey-Dezimal-Klassifikation | 600 Technik, Medizin, angewandte Wissenschaften > 610 Medizin | ||||
| Status | Veröffentlicht | ||||
| Begutachtet | Ja, diese Version wurde begutachtet | ||||
| An der Universität Regensburg entstanden | Ja | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-446556 | ||||
| Dokumenten-ID | 44655 |
Downloadstatistik
Downloadstatistik