| Veröffentlichte Version Download ( PDF | 997kB) | Lizenz: Creative Commons Namensnennung-Weitergabe unter gleichen Bedingungen 4.0 International |
Combining Minimally-supervised Methods for Arabic Named Entity Recognition
Althobaiti, M., Kruschwitz, Udo
und Poesio, Massimo
(2015)
Combining Minimally-supervised Methods for Arabic Named Entity Recognition.
Transactions of the Association for Computational Linguistics TACL 3 (3), S. 243-256.
Veröffentlichungsdatum dieses Volltextes: 13 Jun 2019 13:22
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.40345
Zusammenfassung
Supervised methods can achieve high performance on NLP tasks, such as Named Entity Recognition (NER), but new annotations are required for every new domain and/or genre change. This has motivated research in minimally supervised methods such as semi-supervised learning and distant learning, but neither technique has yet achieved performance levels comparable to those of supervised methods. ...
Supervised methods can achieve high performance on NLP tasks, such as Named Entity Recognition (NER), but new annotations are required for every new domain and/or genre change. This has motivated research in minimally supervised methods such as semi-supervised learning and distant learning, but neither technique has yet achieved performance levels comparable to those of supervised methods. Semi-supervised methods tend to have very high precision but comparatively low recall, whereas distant learning tends to achieve higher recall but lower precision. This complementarity suggests that better results may be obtained by combining the two types of minimally supervised methods. In this paper we present a novel approach to Arabic NER using a combination of semi-supervised and distant learning techniques. We trained a semi-supervised NER classifier and another one using distant learning techniques, and then combined them using a variety of classifier combination schemes, including the Bayesian Classifier Combination (BCC) procedure recently proposed for sentiment analysis. According to our results, the BCC model leads to an increase in performance of 8 percentage points over the best base classifiers.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Artikel |
| Titel eines Journals oder einer Zeitschrift | Transactions of the Association for Computational Linguistics TACL |
| Verlag: | Association for Computational Linguistics |
|---|---|
| Band: | 3 |
| Nummer des Zeitschriftenheftes oder des Kapitels: | 3 |
| Seitenbereich: | S. 243-256 |
| Datum | 2015 |
| Institutionen | Sprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz) Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz) |
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 020 Bibliotheks- und Informationswissenschaft |
| Status | Veröffentlicht |
| Begutachtet | Ja, diese Version wurde begutachtet |
| An der Universität Regensburg entstanden | Unbekannt / Keine Angabe |
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-403453 |
| Dokumenten-ID | 40345 |
Downloadstatistik
Downloadstatistik