System Files

IN THIS ARTICLE:

Spectra uses two complementary methods to identify and remove system files from the data population.

MIME-type file signatures

When setting up a volume for processing, you have the option to Remove System Files. This will apply a filter based on the following doctype and MIME-type signatures:

DocTypeGroup (System file) MIME-Type
application/vnd.apple-safari-cache
application/vnd.google-chrome-history-entry
application/vnd.google-chrome-history-index
application/vnd.google-chrome-shortcuts
application/vnd.linux-syslog
application/vnd.logstash-log
application/vnd.ms-ie-cache-entry
application/vnd.ms-installer
application/vnd.ms-installer-patch-package
application/vnd.ms-registry
application/vnd.ms-registry-journal
application/vnd.ms-shell-scrap
application/vnd.ms-windows-event-log
application/vnd.ms-windows-event-logx
application/x-browser-search
application/x-empty
application/x-font-ttf
application/x-thumbs-db
image/vnd.ms-windows-cursor
text/x-windows-registry

DeNIST

When creating a new matter, you have the option to DeNIST your files (on by default). This uses the MD5 hash to filter out any documents that match against the NIST's NSFL reference data set, which is a collection of digital signatures of known, traceable software applications.
Go to www.nsrl.nist.gov for more information.

Back to top