| Published Version Download ( PDF | 226kB) | License: Creative Commons: Attribution-Share Alike 3.0 | |
| Download ( PDF | 198kB) |
Using Deep Learning for Emotion Analysis of 18th and 19th Century German Plays
Schmidt, Thomas, Dennerlein, Katrin and Wolff, Christian
(2021)
Using Deep Learning for Emotion Analysis of 18th and 19th Century German Plays.
In: Burghardt, Manuel and Dieckmann, Lisa and Steyer, Timo and Trilcke, Peer and Walkowski, Niels-Oliver and Weis, Joëlle and Wuttke, Ulrike, (eds.)
Fabrikation von Erkenntnis: Experimente in den Digital Humanities. Teilband 1.
Melusina Press, Esch-sur-Alzette, Luxembourg.
ISBN 978-2-919815-25-8.
Date of publication of this fulltext: 21 Oct 2021 06:45
Book section
Abstract
We present first results of the project “Emotions in Drama” in which we explore the annotation of emotions and the application of computational emotion analysis, predominantly deep learning-based methods, in the context of historical German plays of the time around 1800. We performed a pilot annotation study with five plays generating over 6,500 annotations for up to 13 sub-emotions structured in ...
We present first results of the project “Emotions in Drama” in which we explore the annotation of emotions and the application of computational emotion analysis, predominantly deep learning-based methods, in the context of historical German plays of the time around 1800. We performed a pilot annotation study with five plays generating over 6,500 annotations for up to 13 sub-emotions structured in a hierarchical scheme. This emotion scheme includes common types like joy, anger or hate but also concepts that are specifically important for German literary criticism of this period like friendship, compassion or Schadenfreude. We evaluate the performance of various methods of emotion-based text sequence classification including lexicon-based methods, traditional machine learning, fastText as static word embedding, various transformer models based on BERT- or ELECTRA-architectures and pretrained with contemporary language, transformer-based methods pretrained or finetuned for historical and/or poetic language as well as the finetuning of BERT models via our own corpora and plays. We do achieve state-of-the-art results with hierarchical levels with two or three classes, i. e. the classification of valence (positive/negative). The best models are the transformer-based models gbert-large and gelectra-large by deepset pretrained on large corpora of contemporary German, which achieve accuracy values of up to 83%. Lexicon-based methods, traditional machine learning as well as static word embeddings are consistently outperformed by transformer-based models. Models trained on historical texts show small and inconsistent improvements. The performance becomes significantly smaller for settings with multiple sub-emotions like 6 or 13 due to the general challenge and class imbalances in which the models achieve 57% and 47% respectively. We discuss how we intend to continue our annotations and how to improve the prediction results via various optimization techniques in future work.
Alternative links to fulltext
Involved Institutions
Details
| Item type | Book section | ||||
| ISBN | 978-2-919815-25-8 | ||||
| Title of Book: | Fabrikation von Erkenntnis: Experimente in den Digital Humanities. Teilband 1 | ||||
|---|---|---|---|---|---|
| Publisher: | Melusina Press | ||||
| Place of Publication: | Esch-sur-Alzette, Luxembourg | ||||
| Date | August 2021 | ||||
| Institutions | Languages and Literatures > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff) Informatics and Data Science > Department Human-Centered Computing > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff) | ||||
| Identification Number |
| ||||
| Keywords | German Drama Studies, Emotion Analysis, BERT, Deep Neural Networks, Sentiment Analysis, Deep Learning, ELECTRA | ||||
| Dewey Decimal Classification | 000 Computer science, information & general works > 004 Computer science 400 Language > 400 Language, Linguistics 400 Language > 430 Germanic 700 Arts & recreation > 792 Stage presentations 800 Literature > 800 Literature & rhetoric 800 Literature > 830 Literatures of Germanic languages | ||||
| Status | Published | ||||
| Refereed | Yes, this version has been refereed | ||||
| Created at the University of Regensburg | Yes | ||||
| URN of the UB Regensburg | urn:nbn:de:bvb:355-epub-508273 | ||||
| Item ID | 50827 |
Download Statistics
Download Statistics