Data governance taxonomy for machine learning business applications with consideration of data modality
[Preprint]
Publication date
2025-09-12
Document type
Preprint
Author
Organisational unit
Publisher
Universitätsbibliothek der HSU/UniBw H
Part of the university bibliography
✅
Language
English
DDC Class
004 Informatik
Keyword
Data governance
Data quality management
Data dictionary
Knowledge graphs
Abstract
Technology regulation and data quality considerations demand greater control over company data. In this literature review, we synthesize a data governance taxonomy that emphasizes the evolving challenges of managing diverse data modalities, including numerical tabular data, big data, images, videos and textual content, for learning algorithms. This systematic literature review collects foundational concepts, theoretical frameworks and organizational structures, highlighting the critical roles of stakeholders, governance principles and policy developments, to synthesize a comprehensive taxonomy that includes the data modalities. The analysis underscores the importance of a tailored governance approach that addresses modality-specific issues such as metadata management, privacy and security. It also covers technological and methodological considerations, including data quality management, lifecycle policies, interoperability and standardization. Our study combines knowledge management with considerations about data modalities, which are especially relevant for general artificial intelligence, and provides a robust foundation for advancing both theoretical understanding and practical implementation of effective data governance. The paper contributes to robust data governance, aims at advancing theoretical understanding as well as practical implications for quality management across heterogeneous data environments, and creates insight for policy makers.
Version
Submitted version under review
Access right on openHSU
Open access
