Heinrich, Bernd ; Klier, Mathias ; Schiller, Alexander Paul Rudolf ; Wagner, Gerit

Assessing Data Quality - A Probability-based Metric for Semantic Consistency

Heinrich, Bernd, Klier, Mathias, Schiller, Alexander Paul Rudolf and Wagner, Gerit (2018) Assessing Data Quality - A Probability-based Metric for Semantic Consistency. Decision Support Systems (DSS) 10, pp. 95-106.

Publication date of this full text: 09 May 2018 08:25
Article
DOI for citing this document: 10.5283/epub.37290


Abstract

We present a probability-based metric for semantic consistency using a set of uncertain rules. As opposed to existing metrics for semantic consistency, our metric can take into account rules that are expected to be fulfilled with specific probabilities. The resulting metric values represent the probability that the assessed dataset is free of internal contradictions with regard to the uncertain rules and thus have a clear interpretation. The metric values are determined on the basis of statistical tests and the concept of the p-value, which allows each metric value to be interpreted as a probability. We demonstrate the practical applicability and effectiveness of the metric in a real-world setting by analyzing a customer dataset of an insurance company. Here, the metric was applied to identify semantic consistency problems in the data and to support decision-making, for instance, when offering individual products to customers.
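
The abstract does not give the metric's formula, so the following is only a minimal sketch of the general idea of turning an uncertain rule into a p-value, not the authors' definition. It assumes a hypothetical rule that is expected to hold with probability `p_expected`, and computes an exact two-sided binomial test p-value for observing `k_fulfilled` rule-conforming records among `n_relevant` records to which the rule applies. All function and parameter names are illustrative assumptions.

```python
from math import comb


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def two_sided_binomial_p_value(k_fulfilled: int, n_relevant: int,
                               p_expected: float) -> float:
    """Exact two-sided binomial test p-value (illustrative assumption).

    Sums the probabilities of all outcomes that are at most as likely
    as the observed one under the expected fulfilment probability.
    """
    if not 0 <= k_fulfilled <= n_relevant:
        raise ValueError("k_fulfilled must lie between 0 and n_relevant")
    if not 0.0 <= p_expected <= 1.0:
        raise ValueError("p_expected must lie between 0 and 1")

    observed = binomial_pmf(k_fulfilled, n_relevant, p_expected)
    # Small relative tolerance so that ties are not lost to rounding.
    threshold = observed * (1 + 1e-12)
    total = 0.0
    for k in range(n_relevant + 1):
        pmf = binomial_pmf(k, n_relevant, p_expected)
        if pmf <= threshold:
            total += pmf
    return min(1.0, total)


if __name__ == "__main__":
    # Hypothetical numbers: a rule expected to hold for 90% of the
    # relevant records, observed to hold for 80 out of 100 of them.
    print(two_sided_binomial_p_value(80, 100, 0.9))
```

Under this sketch, a value close to 1 means the observed fulfilment rate is well in line with the expected probability, while a value close to 0 points to a possible consistency problem for that rule.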



Details

Document type: Article
Journal or publication title: Decision Support Systems (DSS)
Publisher: ELSEVIER SCIENCE BV
Place of publication: AMSTERDAM
Volume: 10
Page range: pp. 95-106
Date: June 2018
Institutions: Wirtschaftswissenschaften > Institut für Wirtschaftsinformatik > Lehrstuhl für Wirtschaftsinformatik II (Prof. Dr. Bernd Heinrich); Informatik und Data Science > Fachbereich Wirtschaftsinformatik > Lehrstuhl für Wirtschaftsinformatik II (Prof. Dr. Bernd Heinrich)
Identification number: 10.1016/j.dss.2018.03.011 (DOI)
Keywords: DATA CURRENCY; TAXONOMY; Data quality; Data quality assessment; Data quality metric; Data consistency
Dewey Decimal Classification: 300 Social sciences > 330 Economics
Status: Published
Refereed: Yes, this version has been refereed
Created at the University of Regensburg: Yes
URN of the University Library of Regensburg: urn:nbn:de:bvb:355-epub-372906
Document ID: 37290
