Performance Issues About Context-Triggered Piecewise Hashing

Details

Serval ID
serval:BIB_1CBCA8182CC3
Type
Inproceedings: an article in a conference proceedings.
Collection
Publications
Title
Performance Issues About Context-Triggered Piecewise Hashing
Title of the conference
Digital Forensics and Cyber Crime
Author(s)
Breitinger Frank, Baier Harald
Publisher
Springer Berlin Heidelberg
ISBN
978-3-642-35514-1
Publication state
Published
Issued date
2012
Editor
Gladyshev Pavel, Rogers MarcusK.
Volume
88
Pages
141-155
Language
english
Abstract
A hash function is a well-known method in computer science to map arbitrary large data to bit strings of a fixed short length. This property is used in computer forensics to identify known files on base of their hash value. As of today, in a pre-step process hash values of files are generated and stored in a database; typically a cryptographic hash function like MD5 or SHA-1 is used. Later the investigator computes hash values of files, which he finds on a storage medium, and performs look ups in his database. Due to security properties of cryptographic hash functions, they can not be used to identify similar files. Therefore Jesse Kornblum proposed a similarity preserving hash function to identify similar files. This paper discusses the efficiency of Kornblum’s approach. We present some enhancements that increase the performance of his algorithm by 55% if applied to a real life scenario. Furthermore, we discuss some characteristics of a sample Windows XP system, which are relevant for the performance of Kornblum’s approach.
Keywords
Digital forensics techniques and tools, context-triggered piecewise hash functions, fuzzy hashing, efficiency of ssdeep, fuzzy hashing
Create date
06/05/2021 11:01
Last modification date
06/05/2021 11:22
Usage data