duplicate files

You may find yourself overwhelmed by files and need to keep the filesystem organized. If deleting duplicates is the best approach, consider these two options:

Dupinator

Dupinator tries to find duplicate files and report them, so that you can clean up the organization of your files.

changelog

It works as follows:

  • it is launched from the command line with a set of directories to scan

  • it traverses all directories and groups all files by size

  • for each group of files of the same size, it computes an MD5 checksum of the first 1024 bytes

  • for files whose first 1024 bytes have the same checksum, it checksums the whole file and collects the real duplicates together

  • it deletes all duplicates of any one file, leaving the first encountered file as the one remaining copy
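
The steps above can be sketched in Python roughly as follows. This is a minimal illustration of the size, partial-checksum, full-checksum approach described in the list, not Dupinator's actual source; the names md5_of, group_by and find_duplicates are hypothetical, and the sketch only reports duplicate groups rather than deleting anything.

```python
import hashlib
import os
from collections import defaultdict


def md5_of(path, limit=None, chunk_size=65536):
    """Return the MD5 hex digest of a file, or of its first `limit` bytes."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        if limit is not None:
            digest.update(handle.read(limit))
        else:
            # Read in chunks so large files are never loaded whole.
            for chunk in iter(lambda: handle.read(chunk_size), b""):
                digest.update(chunk)
    return digest.hexdigest()


def group_by(paths, key):
    """Bucket paths by key(path); keep only buckets with two or more files."""
    buckets = defaultdict(list)
    for path in paths:
        buckets[key(path)].append(path)
    return [group for group in buckets.values() if len(group) > 1]


def find_duplicates(directories):
    """Return lists of identical files, each in the order first encountered."""
    all_files = []
    for directory in directories:
        for root, _dirs, names in os.walk(directory):
            for name in names:
                path = os.path.join(root, name)
                # Assumption: symbolic links are skipped, regular files only.
                if os.path.isfile(path) and not os.path.islink(path):
                    all_files.append(path)

    duplicates = []
    # Step 1: same size. Step 2: same MD5 of the first 1024 bytes.
    # Step 3: same MD5 of the whole file.
    for same_size in group_by(all_files, os.path.getsize):
        for same_head in group_by(same_size, lambda p: md5_of(p, limit=1024)):
            duplicates.extend(group_by(same_head, md5_of))
    return duplicates
```

Calling find_duplicates(["/some/dir", "/other/dir"]) (placeholder paths) returns one list per set of identical files. To mirror the last step of the list, a caller could keep the first entry of each list and pass the rest to os.remove, ideally after reviewing the report first. The cheap size and 1024-byte checks come first so that whole files are only read when they are already likely to match.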