Direkt zum Inhalt

Schmidt, Thomas ; Dennerlein, Katrin ; Wolff, Christian

Using Deep Learning for Emotion Analysis of 18th and 19th Century German Plays

Schmidt, Thomas, Dennerlein, Katrin und Wolff, Christian (2021) Using Deep Learning for Emotion Analysis of 18th and 19th Century German Plays. In: Burghardt, Manuel und Dieckmann, Lisa und Steyer, Timo und Trilcke, Peer und Walkowski, Niels-Oliver und Weis, Joëlle und Wuttke, Ulrike, (eds.) Fabrikation von Erkenntnis: Experimente in den Digital Humanities. Teilband 1. Melusina Press, Esch-sur-Alzette, Luxembourg. ISBN 978-2-919815-25-8.

Veröffentlichungsdatum dieses Volltextes: 21 Okt 2021 06:45
Buchkapitel


Zusammenfassung

We present first results of the project “Emotions in Drama” in which we explore the annotation of emotions and the application of computational emotion analysis, predominantly deep learning-based methods, in the context of historical German plays of the time around 1800. We performed a pilot annotation study with five plays generating over 6,500 annotations for up to 13 sub-emotions structured in ...

We present first results of the project “Emotions in Drama” in which we explore the annotation of emotions and the application of computational emotion analysis, predominantly deep learning-based methods, in the context of historical German plays of the time around 1800. We performed a pilot annotation study with five plays generating over 6,500 annotations for up to 13 sub-emotions structured in a hierarchical scheme. This emotion scheme includes common types like joy, anger or hate but also concepts that are specifically important for German literary criticism of this period like friendship, compassion or Schadenfreude. We evaluate the performance of various methods of emotion-based text sequence classification including lexicon-based methods, traditional machine learning, fastText as static word embedding, various transformer models based on BERT- or ELECTRA-architectures and pretrained with contemporary language, transformer-based methods pretrained or finetuned for historical and/or poetic language as well as the finetuning of BERT models via our own corpora and plays. We do achieve state-of-the-art results with hierarchical levels with two or three classes, i. e. the classification of valence (positive/negative). The best models are the transformer-based models gbert-large and gelectra-large by deepset pretrained on large corpora of contemporary German, which achieve accuracy values of up to 83%. Lexicon-based methods, traditional machine learning as well as static word embeddings are consistently outperformed by transformer-based models. Models trained on historical texts show small and inconsistent improvements. The performance becomes significantly smaller for settings with multiple sub-emotions like 6 or 13 due to the general challenge and class imbalances in which the models achieve 57% and 47% respectively. We discuss how we intend to continue our annotations and how to improve the prediction results via various optimization techniques in future work.



Beteiligte Einrichtungen


Details

DokumentenartBuchkapitel
ISBN978-2-919815-25-8
Buchtitel:Fabrikation von Erkenntnis: Experimente in den Digital Humanities. Teilband 1
Verlag:Melusina Press
Ort der Veröffentlichung:Esch-sur-Alzette, Luxembourg
DatumAugust 2021
InstitutionenSprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff)
Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff)
Identifikationsnummer
WertTyp
10.26298/melusina.8f8w-y749-udlfDOI
Stichwörter / KeywordsGerman Drama Studies, Emotion Analysis, BERT, Deep Neural Networks, Sentiment Analysis, Deep Learning, ELECTRA
Dewey-Dezimal-Klassifikation000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
400 Sprache > 400 Sprachwissenschaft, Linguistik
400 Sprache > 430 Deutsch
700 Künste und Unterhaltung > 792 Theater, Tanz
800 Literatur > 800 Literatur, Rhetorik, Literaturwissenschaft
800 Literatur > 830 Deutsche Literatur
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenJa
URN der UB Regensburgurn:nbn:de:bvb:355-epub-508273
Dokumenten-ID50827

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben