Direkt zum Inhalt

Wankerl, Heribert ; Stern, Maike L. ; Mahdavi, Ali ; Eichler, Christoph ; Lang, Elmar W.

Parameterized reinforcement learning for optical system optimization

Wankerl, Heribert , Stern, Maike L. , Mahdavi, Ali, Eichler, Christoph und Lang, Elmar W. (2021) Parameterized reinforcement learning for optical system optimization. Journal of Physics D: Applied Physics 54 (30), S. 305104.

Veröffentlichungsdatum dieses Volltextes: 12 Jul 2021 08:21
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.46324


Zusammenfassung

Engineering a physical system to feature designated characteristics states an inverse design problem, which is often determined by several discrete and continuous parameters. If such a system must feature a particular behavior, the mentioned combination of both, discrete and continuous, parameters results in a challenging optimization problem that requires an extensive search for an optimal ...

Engineering a physical system to feature designated characteristics states an inverse design problem, which is often determined by several discrete and continuous parameters. If such a system must feature a particular behavior, the mentioned combination of both, discrete and continuous, parameters results in a challenging optimization problem that requires an extensive search for an optimal system design. However, if the corresponding inverse design problem can be reformulated as a parameterized Markov decision process, reinforcement learning (RL) provides a heuristic framework to solve it. In this work, we use multi-layer thin films as an example of the aforementioned optimization problems and consider three design parameters: Each of the thin film layer's dielectric material (discrete) and thickness (continuous), as well as the total number of layers (discrete). While recent methods merely determine the optimal thicknesses and-less commonly-the layers' materials, our approach optimizes the total number of stacked layers as well. In summary, we further develop a Q-learning variant to solve inverse design optimization and thereby outperform human experts and current approaches like needle-point optimization or naive RL. For this purpose, we propose an exponentially transformed reward signal that eases policy search and enables constrained optimization. Moreover, the learned Q-values contain information about the optical properties of multi-layer thin films, which allows us a physical interpretation or what-if analysis and thus enables explainability.



Beteiligte Einrichtungen


Details

DokumentenartArtikel
Titel eines Journals oder einer ZeitschriftJournal of Physics D: Applied Physics
Verlag:IOP Publishing Ltd
Ort der Veröffentlichung:BRISTOL
Band:54
Nummer des Zeitschriftenheftes oder des Kapitels:30
Seitenbereich:S. 305104
Datum18 Mai 2021
InstitutionenBiologie und Vorklinische Medizin > Institut für Biophysik und physikalische Biochemie > Prof. Dr. Elmar Lang
Identifikationsnummer
WertTyp
10.1088/1361-6463/abfddbDOI
Stichwörter / KeywordsREFRACTIVE-INDEX PROFILE; LIGHT-EMITTING-DIODES; DEEP NEURAL-NETWORKS; ANTIREFLECTION COATINGS; INVERSE DESIGN; GO; machine learning; reinforcement learning; inverse design problem; optics; multi-layer thin-film; optimization
Dewey-Dezimal-Klassifikation500 Naturwissenschaften und Mathematik > 570 Biowissenschaften, Biologie
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenJa
URN der UB Regensburgurn:nbn:de:bvb:355-epub-463244
Dokumenten-ID46324

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben