Electrical & Computer Engineering and Computer Science Faculty Publications

An Efficient Similarity Digests Database Lookup -- a Logarithmic Divide and Conquer Approach

Author URLs

Professor Breitinger's Faculty Profile

Professor Breitinger's web page

Professor Breitinger's Full Bibliography

Document Type

Article

Publication Date

2014

Subject: LCSH

Cyber forensics, Computer forensics, Hashing (Computer science)

Disciplines

Computer Engineering | Computer Sciences | Electrical and Computer Engineering | Forensic Science and Technology | Information Security

Abstract

Investigating seized devices in digital forensics is a challenging task because of the increasing amount of data. Common procedures use automated file identification, which reduces the amount of data an investigator has to examine manually. In recent years, the research field of approximate matching has emerged to detect similar data. However, if n denotes the number of similarity digests in a database, then the lookup for a single similarity digest has complexity O(n). This paper presents a concept to extend existing approximate matching algorithms that reduces the lookup complexity from O(n) to O(log(n)). Our proposed approach is based on the well-known divide and conquer paradigm and builds a Bloom filter-based tree data structure to enable an efficient lookup of similarity digests. Further, it is demonstrated that the presented technique is highly scalable, offering a trade-off between storage requirements and computational efficiency. We perform a theoretical assessment based on recently published results and reasonable magnitudes of input data, and show that the complexity reduction achieved by the proposed technique yields a 220-fold acceleration of look-up costs.
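
The abstract does not spell out how the Bloom filter-based tree is built, so the following is only a minimal sketch of one plausible reading of the divide and conquer idea, not the authors' implementation. It assumes each file is represented by a set of byte-string features, that the root filter holds the features of all files, and that each child holds the features of one half of its parent's files. The names `BloomFilter`, `Node`, `build_tree`, and `lookup`, as well as the filter size and number of hash functions, are hypothetical choices made for illustration.

```python
import hashlib


class BloomFilter:
    """Small Bloom filter backed by a Python int (illustrative parameters)."""

    def __init__(self, m_bits=4096, k=4):
        self.m = m_bits   # number of bits in the filter
        self.k = k        # number of hash positions per item (k <= 8 here)
        self.bits = 0

    def _positions(self, item):
        # Derive k positions from one SHA-256 digest, 4 bytes per position.
        digest = hashlib.sha256(item).digest()
        for i in range(self.k):
            yield int.from_bytes(digest[4 * i:4 * i + 4], "big") % self.m

    def add(self, item):
        for p in self._positions(item):
            self.bits |= 1 << p

    def __contains__(self, item):
        return all((self.bits >> p) & 1 for p in self._positions(item))


class Node:
    """Tree node whose filter covers every file stored below it."""

    def __init__(self, files):
        # files: non-empty list of (file_id, set_of_byte_features)
        self.filter = BloomFilter()
        for _, features in files:
            for feature in features:
                self.filter.add(feature)
        if len(files) == 1:
            self.file_id = files[0][0]   # leaf: identifies exactly one file
            self.left = self.right = None
        else:
            mid = len(files) // 2        # divide: split the file set in half
            self.file_id = None
            self.left = Node(files[:mid])
            self.right = Node(files[mid:])


def build_tree(files):
    return Node(files)


def lookup(node, feature):
    """Return candidate file ids whose leaf filter matches the feature.

    A subtree is skipped as soon as its filter reports a miss. Bloom filters
    can give false positives, so more than one branch may be followed and the
    result is a list of candidates rather than a single guaranteed match.
    """
    if feature not in node.filter:
        return []
    if node.left is None:
        return [node.file_id]
    return lookup(node.left, feature) + lookup(node.right, feature)


# Toy database: 8 files with 3 made-up features each.
database = [
    (f"file{i}", {f"file{i}-chunk{j}".encode() for j in range(3)})
    for i in range(8)
]
root = build_tree(database)
candidates = lookup(root, b"file5-chunk1")   # always includes "file5"
```

When false positives are rare, a query follows roughly one path from the root to a leaf, so the number of filter checks grows with the tree depth, about log2(n) for n files, instead of with n. The cost is extra storage, since each feature is inserted into one filter per level.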

Comments

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Repository Citation

Breitinger, Frank; Rathgeb, Christian; and Baier, Harald, "An Efficient Similarity Digests Database Lookup -- a Logarithmic Divide and Conquer Approach" (2014). Electrical & Computer Engineering and Computer Science Faculty Publications. 7.
https://digitalcommons.newhaven.edu/electricalcomputerengineering-facpubs/7

Publisher Citation

Breitinger, F., Rathgeb, C., and Baier, H. (2014) An efficient similarity digests database lookup -- a logarithmic divide and conquer approach. Journal of Digital Forensics, Security and Law 9(2): 152-166.

Included in

Computer Engineering Commons, Electrical and Computer Engineering Commons, Forensic Science and Technology Commons, Information Security Commons
