| Download ( PDF | 755kB) | Lizenz: Creative Commons Namensnennung-NichtKommerziell 4.0 International |
LLM-as-an-Annotator: Training Lightweight Models with LLM-Annotated Examples for Aspect Sentiment Tuple Prediction
Hellwig, Nils Constantin
, Fehle, Jakob
, Kruschwitz, Udo
und Wolff, Christian
(2026)
LLM-as-an-Annotator: Training Lightweight Models with LLM-Annotated Examples for Aspect Sentiment Tuple Prediction.
In: Piperidis, Stelios und Bel, Núria und van den Heuvel, Henk und Ide, Nancy und Krek, Simon und Toral, Antonio, (eds.)
Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026).
European Language Resources Association (ELRA), Paris, S. 7955-7972.
ISBN 978-2-493814-49-4.
Veröffentlichungsdatum dieses Volltextes: 20 Mai 2026 06:46
Buchkapitel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.79458
Zusammenfassung
Training models for Aspect-Based Sentiment Analysis (ABSA) tasks requires manually annotated data, which is expensive and time-consuming to obtain. This paper introduces LA-ABSA, a novel approach that leverages Large Language Model (LLM)-generated annotations to fine-tune lightweight models for complex ABSA tasks. We evaluate our approach on five datasets for Target Aspect Sentiment Detection ...
Training models for Aspect-Based Sentiment Analysis (ABSA) tasks requires manually annotated data, which is expensive and time-consuming to obtain. This paper introduces LA-ABSA, a novel approach that leverages Large Language Model (LLM)-generated annotations to fine-tune lightweight models for complex ABSA tasks. We evaluate our approach on five datasets for Target Aspect Sentiment Detection (TASD) and Aspect Sentiment Quad Prediction (ASQP). Our approach outperformed previously reported augmentation strategies and achieved competitive performance with LLM-prompting in low-resource scenarios, while providing substantial energy efficiency benefits. For example, using 50 annotated examples for in-context learning (ICL) to guide the annotation of unlabeled data, LA-ABSA achieved an F1 score of 49.85 for ASQP on the SemEval Rest16 dataset, closely matching the performance of ICL prompting with Gemma-3-27B (51.10), while requiring significantly lower computational resources.
Downloadstatistik
Downloadstatistik