| Veröffentlichte Version Download ( PDF | 835kB) | Lizenz: Creative Commons Namensnennung 4.0 International |
Towards a Holistic Data Preparation Tool
Restat, Valerie, Klettke, Meike
und Störl, Uta
(2022)
Towards a Holistic Data Preparation Tool.
In: Workshops of the EDBT/ICDT 2022 Joint Conference, March 29, 2022, Edinburgh, UK.
Veröffentlichungsdatum dieses Volltextes: 28 Aug 2025 06:42
Konferenz- oder Workshop-Beitrag
DOI zum Zitieren dieses Dokuments: 10.5283/epub.77285
Zusammenfassung
Data-driven systems and machine learning-based decisions are becoming increasingly important and are having an impact on our everyday lives. The prerequisite for this is good data quality, which must be ensured by preprocessing the data. However, a number of challenges arise in the process. These include the results of the process in terms of data quality, e.g., combating bias and ensuring ...
Data-driven systems and machine learning-based decisions are becoming increasingly important and are having an impact on our everyday lives. The prerequisite for this is good data quality, which must be ensured by preprocessing the data. However, a number of challenges arise in the process. These include the results of the process in terms of data quality, e.g., combating bias and ensuring fairness, and the preprocessing process itself. Here, human involvement and the lack of intelligent solutions and applications for domain experts without in-depth IT knowledge play a major role. This paper summarizes these challenges and provides an overview of the current state of the art. It proposes the design of a holistic tool, along with the necessary tasks to overcome these challenges and to support data preprocessing.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Konferenz- oder Workshop-Beitrag (Nicht ausgewählt) |
| Buchtitel: | Proceedings of the Workshops of the EDBT/ICDT 2022 Joint Conference |
|---|---|
| Verlag: | CEUR-WS.org |
| Sonstige Reihe: | CEUR Workshop Proceedings |
| Band: | 3135 |
| Datum | 2022 |
| Institutionen | Informatik und Data Science > Allgemeine Informatik > Data Engineering (Prof. Dr.-Ing. Meike Klettke) |
| Stichwörter / Keywords | data preparation, data quality, data preprocessing, data wrangling, data cleaning |
| Dewey-Dezimal-Klassifikation | 000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik |
| Status | Veröffentlicht |
| Begutachtet | Ja, diese Version wurde begutachtet |
| An der Universität Regensburg entstanden | Nein |
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-772855 |
| Dokumenten-ID | 77285 |
Downloadstatistik
Downloadstatistik