Data for Digital Forensics: Why a Discussion on ‘How Realistic is Synthetic Data’ is Dispensable

Détails

Ressource 1Télécharger: 3609863.pdf (806.49 [Ko])
Etat: Public
Version: Final published version
Licence: Non spécifiée
ID Serval
serval:BIB_F89ADA33B54A
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
Data for Digital Forensics: Why a Discussion on ‘How Realistic is Synthetic Data’ is Dispensable
Périodique
Digital Threats
Auteur⸱e⸱s
Göbel Thomas, Baier Harald, Breitinger Frank
ISSN
2692-1626
Statut éditorial
Publié
Date de publication
07/2023
Peer-reviewed
Oui
Langue
anglais
Résumé
Digital forensics depends on data sets for various purposes like concept evaluation, educational training, and tool validation. Researchers have gathered such data sets into repositories and created data simulation frameworks for producing large amounts of data. Synthetic data often face skepticism due to its perceived deviation from real-world data, raising doubts about its realism. This paper addresses this concern, arguing that there is no definitive answer. We focus on four common digital forensic use cases that rely on data. Through these, we elucidate the specifications and prerequisites of data sets within their respective contexts. Our discourse uncovers that both real-world and synthetic data are indispensable for advancing digital forensic science, software, tools, and the competence of practitioners. Additionally, we provide an overview of available data set repositories and data generation frameworks, contributing to the ongoing dialogue on digital forensic data sets’ utility. Keywords: Digital forensic corpora; Data sets; Real-world data; Synthetic data; Data usage; Data synthesis, Types of data; Use cases; Realistic data; Data simulation frameworks
Open Access
Oui
Création de la notice
24/07/2023 13:16
Dernière modification de la notice
15/01/2024 8:29
Données d'usage