| Veröffentlichte Version Download ( PDF | 270kB) | Lizenz: Creative Commons Namensnennung 4.0 International |
Comparing Wizard of Oz & Observational Studies for Conversational IR Evaluation: Lessons Learned from These two Diverse Approaches
Elsweiler, David
, Frummet, Alexander
und Harvey, Morgan
(2020)
Comparing Wizard of Oz & Observational Studies for Conversational IR Evaluation: Lessons Learned from These two Diverse Approaches.
Datenbank-Spektrum 20 (1), S. 37-41.
Veröffentlichungsdatum dieses Volltextes: 25 Jun 2020 05:37
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.43374
Zusammenfassung
Systematic and repeatable measurement of information systems via test collections, the Cranfield model, has been the mainstay of Information Retrieval since the 1960s. However, this may not be appropriate for newer, more interactive systems, such as Conversational Search agents. Such systems rely on Machine Learning technologies, which are not yet sufficiently advanced to permit true human-like ...
Systematic and repeatable measurement of information systems via test collections, the Cranfield model, has been the mainstay of Information Retrieval since the 1960s. However, this may not be appropriate for newer, more interactive systems, such as Conversational Search agents. Such systems rely on Machine Learning technologies, which are not yet sufficiently advanced to permit true human-like dialogues, and so research can be enabled by simulation via human agents.
In this work we compare dialogues obtained from two studies with the same context, assistance in the kitchen, but with different experimental setups, allowing us to learn about and evaluate conversational IR systems. We discover that users adapt their behaviour when they think they are interacting with a system and that human-like conversations in one of the studies were unpredictable to an extent we did not expect. Our results have implications for the development of new studies in this area and, ultimately, the design of future conversational agents.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Artikel | ||||
| Titel eines Journals oder einer Zeitschrift | Datenbank-Spektrum | ||||
| Verlag: | Springer | ||||
|---|---|---|---|---|---|
| Band: | 20 | ||||
| Nummer des Zeitschriftenheftes oder des Kapitels: | 1 | ||||
| Seitenbereich: | S. 37-41 | ||||
| Datum | 10 Februar 2020 | ||||
| Institutionen | Sprach- und Literatur- und Kulturwissenschaften > Institut für Information und Medien, Sprache und Kultur (I:IMSK) > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz) Informatik und Data Science > Fachbereich Menschzentrierte Informatik > Lehrstuhl für Informationswissenschaft (Prof. Dr. Udo Kruschwitz) | ||||
| Identifikationsnummer |
| ||||
| Stichwörter / Keywords | Conversational search, Evaluation | ||||
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 020 Bibliotheks- und Informationswissenschaft | ||||
| Status | Veröffentlicht | ||||
| Begutachtet | Ja, diese Version wurde begutachtet | ||||
| An der Universität Regensburg entstanden | Ja | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-433745 | ||||
| Dokumenten-ID | 43374 |
Downloadstatistik
Downloadstatistik