| Veröffentlichte Version Download ( PDF | 1MB) | Lizenz: Creative Commons Namensnennung 4.0 International |
Making Sense of Subtitles: Sentence Boundary Detection and Speaker Change Detection in Unpunctuated Texts
Kruschwitz, Udo
, Donabauer, Gregor und Corney, David
(2021)
Making Sense of Subtitles: Sentence Boundary Detection and Speaker Change Detection in Unpunctuated Texts.
In: WWW '21: The Web Conference 2021, Apr 19, 2021 - Apr 23, 2021, virtuell Ljubljana Slovenia.
Veröffentlichungsdatum dieses Volltextes: 24 Mrz 2021 06:29
Konferenz- oder Workshop-Beitrag
Zusammenfassung
The rise of deep learning methods has transformed the research area of natural language processing beyond recognition. New benchmark performances are reported on a daily basis ranging from machine translation to question-answering. Yet, some of the unsolved practical research questions are not in the spotlight and this includes, for example, issues arising at the interface between spoken and ...
The rise of deep learning methods has transformed the research area of natural language processing beyond recognition. New benchmark performances are reported on a daily basis ranging from machine translation to question-answering. Yet, some of the unsolved practical research questions are not in the spotlight and this includes, for example, issues arising at the interface between spoken and written language processing.
We identify sentence boundary detection and speaker change detection applied to automatically transcribed texts as two NLP problems that have not yet received much attention but are nevertheless of practical relevance. We frame both problems as binary tagging tasks that can be addressed by fine-tuning a transformer model and we report promising results.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Konferenz- oder Workshop-Beitrag (Paper) | ||||
| ISBN | 978-1-4503-8313-4 | ||||
| Buchtitel: | WWW '21: Companion Proceedings of the Web Conference 2021 | ||||
|---|---|---|---|---|---|
| Verlag: | Association for Computing Machinery | ||||
| Ort der Veröffentlichung: | New York, United States | ||||
| Seitenbereich: | S. 357-362 | ||||
| Datum | 2021 | ||||
| Institutionen | Sprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz) Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz) | ||||
| Identifikationsnummer |
| ||||
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 020 Bibliotheks- und Informationswissenschaft | ||||
| Status | Veröffentlicht | ||||
| Begutachtet | Ja, diese Version wurde begutachtet | ||||
| An der Universität Regensburg entstanden | Ja | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-452811 | ||||
| Dokumenten-ID | 45281 |
Downloadstatistik
Downloadstatistik