Regarding this well-known quote, often attributed to Chris's famous "Fri,
08 Oct 2004 03:29:03 GMT" speech:
[color=blue]
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Michael Vilain wrote:
>
> [snip][color=green]
>> Basically, crc32 hashes aren't unique while md5 hashes are. SUN
>> offers md5 checksums of all the files in the Solaris distributions
>> as a
>> 'fingerprint' to verify if a file is authentic. That way a sysadmin
>> can verify if the "ls" or "ps" they're using is the original from
>> SUN.
>>[/color]
>
> Hi,
> I'm sorry, but MD5 hashes are *not* unique. An MD5 hash is 128 bits
> long; therefore, for any input length > 128 bits, there must be at
> *least* two possible inputs which produce the same output. For the
> given file lengths measured in megabytes, there would be an immense
> number of possible inputs that give the same output: the only thing
> is, it's relatively difficult to arbitrarily *find* another file with
> the same MD5 as a given input. They do exist, however, as a little
> math demonstrates:
>
> (snipped: big files/small hashes--some will be the same)[/color]
But the idea, IIRC, is that although there may be collisions, the chance of
two *legible* inputs with the same MD5 are immensely small. Most collisions
will just be one intelligible value, and one with unusable garbage. Hence,
MD5's usefulness in calculating file integrity (it would be very difficult,
and quite detectable, to inject malware into a file and keep the MD5), and
its dubious state as a password-security mechanism (since a password needs
to be legible in no other way except to pass the MD5 check).
--
-- Rudy Fleminger
--
sp@mmers.and.evil.ones.will.bow-down-to.us
(put "Hey!" in the Subject line for priority processing!)
--
http://www.pixelsaredead.com