Direkt zum Inhalt

Althobaiti, M. ; Kruschwitz, Udo ; Poesio, Massimo

Combining Minimally-supervised Methods for Arabic Named Entity Recognition

Althobaiti, M., Kruschwitz, Udo und Poesio, Massimo (2015) Combining Minimally-supervised Methods for Arabic Named Entity Recognition. Transactions of the Association for Computational Linguistics TACL 3 (3), S. 243-256.

Veröffentlichungsdatum dieses Volltextes: 13 Jun 2019 13:22
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.40345


Zusammenfassung

Supervised methods can achieve high performance on NLP tasks, such as Named Entity Recognition (NER), but new annotations are required for every new domain and/or genre change. This has motivated research in minimally supervised methods such as semi-supervised learning and distant learning, but neither technique has yet achieved performance levels comparable to those of supervised methods. ...

Supervised methods can achieve high performance on NLP tasks, such as Named Entity Recognition (NER), but new annotations are required for every new domain and/or genre change. This has motivated research in minimally supervised methods such as semi-supervised learning and distant learning, but neither technique has yet achieved performance levels comparable to those of supervised methods. Semi-supervised methods tend to have very high precision but comparatively low recall, whereas distant learning tends to achieve higher recall but lower precision. This complementarity suggests that better results may be obtained by combining the two types of minimally supervised methods. In this paper we present a novel approach to Arabic NER using a combination of semi-supervised and distant learning techniques. We trained a semi-supervised NER classifier and another one using distant learning techniques, and then combined them using a variety of classifier combination schemes, including the Bayesian Classifier Combination (BCC) procedure recently proposed for sentiment analysis. According to our results, the BCC model leads to an increase in performance of 8 percentage points over the best base classifiers.



Beteiligte Einrichtungen


Details

DokumentenartArtikel
Titel eines Journals oder einer ZeitschriftTransactions of the Association for Computational Linguistics TACL
Verlag:Association for Computational Linguistics
Band:3
Nummer des Zeitschriftenheftes oder des Kapitels:3
Seitenbereich:S. 243-256
Datum2015
InstitutionenSprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz)
Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz)
Dewey-Dezimal-Klassifikation000 Informatik, Informationswissenschaft, allgemeine Werke > 020 Bibliotheks- und Informationswissenschaft
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenUnbekannt / Keine Angabe
URN der UB Regensburgurn:nbn:de:bvb:355-epub-403453
Dokumenten-ID40345

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben