Direkt zum Inhalt

Restat, Valerie ; Klettke, Meike ; Störl, Uta

Towards a Holistic Data Preparation Tool

Restat, Valerie, Klettke, Meike und Störl, Uta (2022) Towards a Holistic Data Preparation Tool. In: Workshops of the EDBT/ICDT 2022 Joint Conference, March 29, 2022, Edinburgh, UK.

Veröffentlichungsdatum dieses Volltextes: 28 Aug 2025 06:42
Konferenz- oder Workshop-Beitrag
DOI zum Zitieren dieses Dokuments: 10.5283/epub.77285


Zusammenfassung

Data-driven systems and machine learning-based decisions are becoming increasingly important and are having an impact on our everyday lives. The prerequisite for this is good data quality, which must be ensured by preprocessing the data. However, a number of challenges arise in the process. These include the results of the process in terms of data quality, e.g., combating bias and ensuring ...

Data-driven systems and machine learning-based decisions are becoming increasingly important and are having an impact on our everyday lives. The prerequisite for this is good data quality, which must be ensured by preprocessing the data. However, a number of challenges arise in the process. These include the results of the process in terms of data quality, e.g., combating bias and ensuring fairness, and the preprocessing process itself. Here, human involvement and the lack of intelligent solutions and applications for domain experts without in-depth IT knowledge play a major role. This paper summarizes these challenges and provides an overview of the current state of the art. It proposes the design of a holistic tool, along with the necessary tasks to overcome these challenges and to support data preprocessing.



Beteiligte Einrichtungen


Details

DokumentenartKonferenz- oder Workshop-Beitrag (Nicht ausgewählt)
Buchtitel:Proceedings of the Workshops of the EDBT/ICDT 2022 Joint Conference
Verlag:CEUR-WS.org
Sonstige Reihe:CEUR Workshop Proceedings
Band:3135
Datum2022
InstitutionenInformatik und Data Science > Allgemeine Informatik > Data Engineering (Prof. Dr.-Ing. Meike Klettke)
Stichwörter / Keywordsdata preparation, data quality, data preprocessing, data wrangling, data cleaning
Dewey-Dezimal-Klassifikation000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenNein
URN der UB Regensburgurn:nbn:de:bvb:355-epub-772855
Dokumenten-ID77285

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben