| Veröffentlichte Version Download ( PDF | 315kB) | Lizenz: Creative Commons Namensnennung-NichtKommerziell 4.0 International |
Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing
Burghardt, Manuel
, Granvogl, Daniel und Wolff, Christian
(2016)
Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing.
In:
LREC 2016, Tenth Int. Conf. on Language Resources and Evaluation : May 23-28, 2016, Portorož, Slovenia; Proc.
European Language Resources Association, Paris, S. 2029-2033.
ISBN 978-2-9517408-9-1.
Veröffentlichungsdatum dieses Volltextes: 29 Mai 2017 12:00
Buchkapitel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.35701
Zusammenfassung
Data acquisition in dialectology is typically a tedious task, as dialect samples of spoken language have to be collected via questionnaires or interviews. In this article, we suggest to use the “web as a corpus” approach for dialectology. We present a case study that demonstrates how authentic language data for the Bavarian dialect (ISO 639-3:bar) can be collected automatically from the social ...
Data acquisition in dialectology is typically a tedious task, as dialect samples of spoken language have to be collected via questionnaires or interviews. In this article, we suggest to use the “web as a corpus” approach for dialectology. We present a case study that demonstrates how authentic language data for the Bavarian dialect (ISO 639-3:bar) can be collected automatically from the social network Facebook. We also show that Facebook can be used effectively as a crowdsourcing platform, where users are willing to translate dialect words
collaboratively in order to create a common lexicon of their Bavarian dialect. Key insights from the case study are summarized as “lessons learned”, together with suggestions for future enhancements of the lexicon creation approach.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Buchkapitel |
| ISBN | 978-2-9517408-9-1 |
| Buchtitel: | LREC 2016, Tenth Int. Conf. on Language Resources and Evaluation : May 23-28, 2016, Portorož, Slovenia; Proc. |
|---|---|
| Verlag: | European Language Resources Association |
| Ort der Veröffentlichung: | Paris |
| Seitenbereich: | S. 2029-2033 |
| Datum | 2016 |
| Institutionen | Sprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff) Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff) |
| Stichwörter / Keywords | dialectology, Bavarian, ISO 639-3:bar, dialect lexicon, crowdsourcing, social media, Facebook |
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik 000 Informatik, Informationswissenschaft, allgemeine Werke > 020 Bibliotheks- und Informationswissenschaft |
| Status | Veröffentlicht |
| Begutachtet | Ja, diese Version wurde begutachtet |
| An der Universität Regensburg entstanden | Ja |
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-357012 |
| Dokumenten-ID | 35701 |
Downloadstatistik
Downloadstatistik