| Veröffentlichte Version Download ( PDF | 226kB) | Lizenz: Creative Commons Namensnennung-Weitergabe unter gleichen Bedingungen 3.0 de | |
| Download ( PDF | 198kB) |
Using Deep Learning for Emotion Analysis of 18th and 19th Century German Plays
Schmidt, Thomas, Dennerlein, Katrin und Wolff, Christian
(2021)
Using Deep Learning for Emotion Analysis of 18th and 19th Century German Plays.
In: Burghardt, Manuel und Dieckmann, Lisa und Steyer, Timo und Trilcke, Peer und Walkowski, Niels-Oliver und Weis, Joëlle und Wuttke, Ulrike, (eds.)
Fabrikation von Erkenntnis: Experimente in den Digital Humanities. Teilband 1.
Melusina Press, Esch-sur-Alzette, Luxembourg.
ISBN 978-2-919815-25-8.
Veröffentlichungsdatum dieses Volltextes: 21 Okt 2021 06:45
Buchkapitel
Zusammenfassung
We present first results of the project “Emotions in Drama” in which we explore the annotation of emotions and the application of computational emotion analysis, predominantly deep learning-based methods, in the context of historical German plays of the time around 1800. We performed a pilot annotation study with five plays generating over 6,500 annotations for up to 13 sub-emotions structured in ...
We present first results of the project “Emotions in Drama” in which we explore the annotation of emotions and the application of computational emotion analysis, predominantly deep learning-based methods, in the context of historical German plays of the time around 1800. We performed a pilot annotation study with five plays generating over 6,500 annotations for up to 13 sub-emotions structured in a hierarchical scheme. This emotion scheme includes common types like joy, anger or hate but also concepts that are specifically important for German literary criticism of this period like friendship, compassion or Schadenfreude. We evaluate the performance of various methods of emotion-based text sequence classification including lexicon-based methods, traditional machine learning, fastText as static word embedding, various transformer models based on BERT- or ELECTRA-architectures and pretrained with contemporary language, transformer-based methods pretrained or finetuned for historical and/or poetic language as well as the finetuning of BERT models via our own corpora and plays. We do achieve state-of-the-art results with hierarchical levels with two or three classes, i. e. the classification of valence (positive/negative). The best models are the transformer-based models gbert-large and gelectra-large by deepset pretrained on large corpora of contemporary German, which achieve accuracy values of up to 83%. Lexicon-based methods, traditional machine learning as well as static word embeddings are consistently outperformed by transformer-based models. Models trained on historical texts show small and inconsistent improvements. The performance becomes significantly smaller for settings with multiple sub-emotions like 6 or 13 due to the general challenge and class imbalances in which the models achieve 57% and 47% respectively. We discuss how we intend to continue our annotations and how to improve the prediction results via various optimization techniques in future work.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Buchkapitel | ||||
| ISBN | 978-2-919815-25-8 | ||||
| Buchtitel: | Fabrikation von Erkenntnis: Experimente in den Digital Humanities. Teilband 1 | ||||
|---|---|---|---|---|---|
| Verlag: | Melusina Press | ||||
| Ort der Veröffentlichung: | Esch-sur-Alzette, Luxembourg | ||||
| Datum | August 2021 | ||||
| Institutionen | Sprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff) Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff) | ||||
| Identifikationsnummer |
| ||||
| Stichwörter / Keywords | German Drama Studies, Emotion Analysis, BERT, Deep Neural Networks, Sentiment Analysis, Deep Learning, ELECTRA | ||||
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik 400 Sprache > 400 Sprachwissenschaft, Linguistik 400 Sprache > 430 Deutsch 700 Künste und Unterhaltung > 792 Theater, Tanz 800 Literatur > 800 Literatur, Rhetorik, Literaturwissenschaft 800 Literatur > 830 Deutsche Literatur | ||||
| Status | Veröffentlicht | ||||
| Begutachtet | Ja, diese Version wurde begutachtet | ||||
| An der Universität Regensburg entstanden | Ja | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-508273 | ||||
| Dokumenten-ID | 50827 |
Downloadstatistik
Downloadstatistik