Deploying machine learning based data quality controls – Design principles and insights from the field

Details

Ressource 1Request a copy Under indefinite embargo.
UNIL restricted access
State: Public
Version: Author's accepted manuscript
License: Not specified
Serval ID
serval:BIB_46B575AAA402
Type
Inproceedings: an article in a conference proceedings.
Collection
Publications
Institution
Title
Deploying machine learning based data quality controls – Design principles and insights from the field
Title of the conference
Proceedings of the International Conference Wirtschaftsinformatik 2022 (WI2022)
Author(s)
Walter Valérianne, Gyoery Andreas, Legner Christine
Publisher
AIS Electronic Library (AISeL).
Organization
17th International Conference Wirtschaftsinformatik (WI 2022)
Publication state
Published
Issued date
21/02/2022
Peer-reviewed
Oui
Language
english
Abstract
Machine Learning (ML) has become one of the most promising technological advances for enterprises to improve manual, highly resource- and time-consuming processes. Developing and deploying these ML based systems in an organizational setting, however, is linked to a range of processual and technical requirements and implications that researchers and enterprises have only started to comprehend. Based on an Action Design Research approach, this study develops a ML based solution for data quality (DQ) controls, an essential instrument in Data Quality Management. We synthesize our findings through a set of design principles for ML based DQ controls that describe key components in the three phases from proof-of-concept to deployment and business process integration. Our findings lay groundwork for future research in the field of ML based systems for DQ and contribute to the broader IS discourse on how to embed learning-based systems in real-world organizational contexts.
Keywords
data quality, data quality controls, machine learning, AI-based systems, rule-based systems
Funding(s)
Other / Industry grant
Create date
27/02/2022 11:15
Last modification date
28/02/2022 6:39
Usage data