Burghardt, Manuel; Granvogl, Daniel; Wolff, Christian

Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing

Burghardt, Manuel, Granvogl, Daniel and Wolff, Christian (2016) Creating a Lexicon of Bavarian Dialect by Means of Facebook Language Data and Crowdsourcing. In: LREC 2016, Tenth Int. Conf. on Language Resources and Evaluation: May 23-28, 2016, Portorož, Slovenia; Proc. European Language Resources Association, Paris, pp. 2029-2033. ISBN 978-2-9517408-9-1.

Publication date of this full text: 29 May 2017 12:00
Book chapter
DOI for citing this document: 10.5283/epub.35701


Abstract

Data acquisition in dialectology is typically a tedious task, as dialect samples of spoken language have to be collected via questionnaires or interviews. In this article, we suggest using the “web as a corpus” approach for dialectology. We present a case study that demonstrates how authentic language data for the Bavarian dialect (ISO 639-3:bar) can be collected automatically from the social network Facebook. We also show that Facebook can be used effectively as a crowdsourcing platform, where users are willing to translate dialect words collaboratively in order to create a common lexicon of their Bavarian dialect. Key insights from the case study are summarized as “lessons learned”, together with suggestions for future enhancements of the lexicon creation approach.



Details

Document type: Book chapter
ISBN: 978-2-9517408-9-1
Book title: LREC 2016, Tenth Int. Conf. on Language Resources and Evaluation: May 23-28, 2016, Portorož, Slovenia; Proc.
Publisher: European Language Resources Association
Place of publication: Paris
Page range: pp. 2029-2033
Date: 2016
Institutions: Sprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff)
Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Medieninformatik (Prof. Dr. Christian Wolff)
Keywords: dialectology, Bavarian, ISO 639-3:bar, dialect lexicon, crowdsourcing, social media, Facebook
Dewey Decimal Classification: 000 Computer science, information science, general works > 004 Computer science
000 Computer science, information science, general works > 020 Library and information science
Status: Published
Peer-reviewed: Yes, this version has been peer-reviewed
Created at the University of Regensburg: Yes
URN of UB Regensburg: urn:nbn:de:bvb:355-epub-357012
Document ID: 35701
