| Veröffentlichte Version Download ( PDF | 1MB) | Lizenz: Creative Commons Namensnennung 4.0 International |
Parameterized reinforcement learning for optical system optimization
Wankerl, Heribert
, Stern, Maike L.
, Mahdavi, Ali, Eichler, Christoph und Lang, Elmar W.
(2021)
Parameterized reinforcement learning for optical system optimization.
Journal of Physics D: Applied Physics 54 (30), S. 305104.
Veröffentlichungsdatum dieses Volltextes: 12 Jul 2021 08:21
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.46324
Zusammenfassung
Engineering a physical system to feature designated characteristics states an inverse design problem, which is often determined by several discrete and continuous parameters. If such a system must feature a particular behavior, the mentioned combination of both, discrete and continuous, parameters results in a challenging optimization problem that requires an extensive search for an optimal ...
Engineering a physical system to feature designated characteristics states an inverse design problem, which is often determined by several discrete and continuous parameters. If such a system must feature a particular behavior, the mentioned combination of both, discrete and continuous, parameters results in a challenging optimization problem that requires an extensive search for an optimal system design. However, if the corresponding inverse design problem can be reformulated as a parameterized Markov decision process, reinforcement learning (RL) provides a heuristic framework to solve it. In this work, we use multi-layer thin films as an example of the aforementioned optimization problems and consider three design parameters: Each of the thin film layer's dielectric material (discrete) and thickness (continuous), as well as the total number of layers (discrete). While recent methods merely determine the optimal thicknesses and-less commonly-the layers' materials, our approach optimizes the total number of stacked layers as well. In summary, we further develop a Q-learning variant to solve inverse design optimization and thereby outperform human experts and current approaches like needle-point optimization or naive RL. For this purpose, we propose an exponentially transformed reward signal that eases policy search and enables constrained optimization. Moreover, the learned Q-values contain information about the optical properties of multi-layer thin films, which allows us a physical interpretation or what-if analysis and thus enables explainability.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Artikel | ||||
| Titel eines Journals oder einer Zeitschrift | Journal of Physics D: Applied Physics | ||||
| Verlag: | IOP Publishing Ltd | ||||
|---|---|---|---|---|---|
| Ort der Veröffentlichung: | BRISTOL | ||||
| Band: | 54 | ||||
| Nummer des Zeitschriftenheftes oder des Kapitels: | 30 | ||||
| Seitenbereich: | S. 305104 | ||||
| Datum | 18 Mai 2021 | ||||
| Institutionen | Biologie und Vorklinische Medizin > Institut für Biophysik und physikalische Biochemie > Prof. Dr. Elmar Lang | ||||
| Identifikationsnummer |
| ||||
| Stichwörter / Keywords | REFRACTIVE-INDEX PROFILE; LIGHT-EMITTING-DIODES; DEEP NEURAL-NETWORKS; ANTIREFLECTION COATINGS; INVERSE DESIGN; GO; machine learning; reinforcement learning; inverse design problem; optics; multi-layer thin-film; optimization | ||||
| Dewey-Dezimal-Klassifikation | 500 Naturwissenschaften und Mathematik > 570 Biowissenschaften, Biologie | ||||
| Status | Veröffentlicht | ||||
| Begutachtet | Ja, diese Version wurde begutachtet | ||||
| An der Universität Regensburg entstanden | Ja | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-463244 | ||||
| Dokumenten-ID | 46324 |
Downloadstatistik
Downloadstatistik