X-Git-Url: https://fleuret.org/cgi-bin/gitweb/gitweb.cgi?p=finddup.git;a=blobdiff_plain;f=finddup.1;h=94e31b28df3c181d836cd0c23db0be1f90c650e2;hp=2da4ee3dc12e5b24d40182090bd9cfca0d4e2943;hb=fb007ddf575e75c2cd53398cd9c48590a6e34bf4;hpb=8c988a4aca00501c9a9d53f4ff228dcb0bce0acb diff --git a/finddup.1 b/finddup.1 index 2da4ee3..94e31b2 100644 --- a/finddup.1 +++ b/finddup.1 @@ -72,10 +72,10 @@ file content. Here are the things I tried, which did not help at all: (1) Computing md5s on the whole files, which is not satisfactory because files are -often never read entirely hence the md5s can not be properly computed, -(2) computing XOR of the first 4, 16 and 256 bytes with rejection as -soon as one does not match, (3) reading parts of the files of -increasing sizes so that rejection could be done with a small fraction +often not read entirely, hence the md5s can not be properly computed, +(2) computing XORs of the first 4, 16 and 256 bytes with rejection as +soon as one does not match, (3) reading files in parts of increasing +sizes so that rejection could be done with only a small fraction read when possible, (4) using mmap instead of open/read. .SH "WISH LIST"