TraceGen: user activity emulation for digital forensic test image generation

<p>Digital forensic test images are commonly used across a variety of digital forensic use cases including education and training, tool testing and validation, proficiency testing, malware analysis, and research and development. Using real digital evidence for these purposes is often not viabl...

Full description

Bibliographic Details
Main Authors: Du, X, Hargreaves, C, Sheppard, J, Scanlon, M
Format: Journal article
Language:English
Published: Elsevier 2021
_version_ 1826310109429497856
author Du, X
Hargreaves, C
Sheppard, J
Scanlon, M
author_facet Du, X
Hargreaves, C
Sheppard, J
Scanlon, M
author_sort Du, X
collection OXFORD
description <p>Digital forensic test images are commonly used across a variety of digital forensic use cases including education and training, tool testing and validation, proficiency testing, malware analysis, and research and development. Using real digital evidence for these purposes is often not viable or permissible, especially when factoring in the ethical and in some cases legal considerations of working with individuals' personal data. Furthermore, when using real data it is not usually known what actions were performed when, i.e., what was the ‘ground truth’. The creation of synthetic digital forensic test images typically involves an arduous, time-consuming process of manually performing a list of actions, or following a ‘story’ to generate artefacts in a subsequently imaged disk. Besides the manual effort and time needed in executing the relevant actions in the scenario, there is often little room to build a realistic volume of non-pertinent wear-and-tear or ‘background noise’ on the suspect device, meaning the resulting disk images are inherently limited and to a certain extent simplistic.</p> <br> <p>This work presents the TraceGen framework, an automated system focused on the emulation of user actions to create realistic and comprehensive artefacts in an auditable and reproducible manner. The framework consists of a series of actions contained within scripts that are executed both externally and internally to a target virtual machine. These actions use existing automation APIs to emulate a real user's behaviour on a Windows system to generate realistic and comprehensive artefacts. These actions can be quickly scripted together to form complex stories or to emulate wear-and-tear on the test image. In addition to the development of the framework, evaluation is also performed in terms of the ability to produce background artefacts at scale, and also the realism of the artefacts compared with their human-generated counterparts.</p>
first_indexed 2024-03-07T07:45:47Z
format Journal article
id oxford-uuid:a9e9e596-1848-4afc-88d9-1e1d373af716
institution University of Oxford
language English
last_indexed 2024-03-07T07:45:47Z
publishDate 2021
publisher Elsevier
record_format dspace
spelling oxford-uuid:a9e9e596-1848-4afc-88d9-1e1d373af7162023-06-08T09:27:51ZTraceGen: user activity emulation for digital forensic test image generationJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:a9e9e596-1848-4afc-88d9-1e1d373af716EnglishSymplectic ElementsElsevier2021Du, XHargreaves, CSheppard, JScanlon, M<p>Digital forensic test images are commonly used across a variety of digital forensic use cases including education and training, tool testing and validation, proficiency testing, malware analysis, and research and development. Using real digital evidence for these purposes is often not viable or permissible, especially when factoring in the ethical and in some cases legal considerations of working with individuals' personal data. Furthermore, when using real data it is not usually known what actions were performed when, i.e., what was the ‘ground truth’. The creation of synthetic digital forensic test images typically involves an arduous, time-consuming process of manually performing a list of actions, or following a ‘story’ to generate artefacts in a subsequently imaged disk. Besides the manual effort and time needed in executing the relevant actions in the scenario, there is often little room to build a realistic volume of non-pertinent wear-and-tear or ‘background noise’ on the suspect device, meaning the resulting disk images are inherently limited and to a certain extent simplistic.</p> <br> <p>This work presents the TraceGen framework, an automated system focused on the emulation of user actions to create realistic and comprehensive artefacts in an auditable and reproducible manner. The framework consists of a series of actions contained within scripts that are executed both externally and internally to a target virtual machine. These actions use existing automation APIs to emulate a real user's behaviour on a Windows system to generate realistic and comprehensive artefacts. These actions can be quickly scripted together to form complex stories or to emulate wear-and-tear on the test image. In addition to the development of the framework, evaluation is also performed in terms of the ability to produce background artefacts at scale, and also the realism of the artefacts compared with their human-generated counterparts.</p>
spellingShingle Du, X
Hargreaves, C
Sheppard, J
Scanlon, M
TraceGen: user activity emulation for digital forensic test image generation
title TraceGen: user activity emulation for digital forensic test image generation
title_full TraceGen: user activity emulation for digital forensic test image generation
title_fullStr TraceGen: user activity emulation for digital forensic test image generation
title_full_unstemmed TraceGen: user activity emulation for digital forensic test image generation
title_short TraceGen: user activity emulation for digital forensic test image generation
title_sort tracegen user activity emulation for digital forensic test image generation
work_keys_str_mv AT dux tracegenuseractivityemulationfordigitalforensictestimagegeneration
AT hargreavesc tracegenuseractivityemulationfordigitalforensictestimagegeneration
AT sheppardj tracegenuseractivityemulationfordigitalforensictestimagegeneration
AT scanlonm tracegenuseractivityemulationfordigitalforensictestimagegeneration