Could anyone verify the correctness of getting a md5 hash using this method?

There are a few weird things in your code. UTF-8 encoding of a character may use more than one byte. So you should not use the string length as final parameter to the update() call, but the length of the array of bytes that getBytes() actually returned. As suggested by Paŭlo, use the update() method which takes a single byte[] as parameter.
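To make the length mismatch concrete, here is a minimal sketch. The class name, the helper md5OfUtf8 and the sample string are made up for illustration and are not taken from the question's code.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

class Utf8DigestExample {

    // Hypothetical helper: hashes the complete UTF-8 encoding of the text with MD5.
    static byte[] md5OfUtf8(String text) {
        byte[] encoded = text.getBytes(StandardCharsets.UTF_8);
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            // update(byte[]) consumes the whole array, so no length is passed.
            md.update(encoded);
            return md.digest();
        } catch (NoSuchAlgorithmException e) {
            // Every Java platform is required to provide MD5.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String text = "caf\u00e9"; // sample input ending with an accented e
        byte[] encoded = text.getBytes(StandardCharsets.UTF_8);
        System.out.println(text.length());          // 4 chars
        System.out.println(encoded.length);         // 5 bytes
        System.out.println(md5OfUtf8(text).length); // 16 bytes
    }
}
```

With update(encoded, 0, text.length()) the last byte of this sample would silently be left out of the hash.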

The output of MD5 is a sequence of 16 bytes with quite arbitrary values. If you interpret it as an integer (that's what you do with your call to BigInteger()), then you will get a numerical value which will be smaller than 2^128, possibly much smaller. When converted back to hexadecimal digits, you may get 32, 31, 30... or fewer than 30 characters.
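A more direct route is to format the bytes one at a time, so leading zero bytes are never lost. This is only a sketch: the class name, the toHex helper and the three sample bytes are invented for the demonstration.

```java
import java.math.BigInteger;

class HexDigestExample {

    // Hypothetical helper: two uppercase hex digits per byte, zeros included.
    static String toHex(byte[] bytes) {
        StringBuilder hex = new StringBuilder(bytes.length * 2);
        for (byte b : bytes) {
            // Mask to 0..255 so negative bytes are not sign-extended.
            hex.append(String.format("%02X", b & 0xFF));
        }
        return hex.toString();
    }

    public static void main(String[] args) {
        byte[] sample = {0x00, 0x0A, (byte) 0xFF}; // made-up bytes with a leading zero
        // Going through an integer drops the leading zeros.
        System.out.println(new BigInteger(1, sample).toString(16)); // aff
        // Formatting per byte keeps every digit.
        System.out.println(toHex(sample));                          // 000AFF
    }
}
```

The same helper can be applied to the 16 bytes returned by digest() and will always yield 32 characters.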

Your usage of the "%032X" format string left-pads with enough zeros, so your code works, but it is kind of indirect (the output of MD5 has never been an integer to begin with).

You assemble the hash input elements with raw concatenation. This may induce issues.

For instance, if modeName is "foo" and modeParentName is "barqux", then the MD5 input will begin with (the UTF-8 encoding of) "foobarqux". If modeName is "foobar" and modeParentName is "qux", then the MD5 input will also begin with "foobarqux". You do not say why you want to use a hash function, but usually, when one uses a hash function, it is to have a unique trace of some piece of data; two distinct data elements should yield distinct hash inputs.
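One common way around this ambiguity is to put each field's byte length in front of it, so that the field boundaries become part of the hash input. The sketch below assumes that approach; the class and method names are hypothetical, and a separator character that cannot occur in the data would be another option.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;

class FieldHashExample {

    private static MessageDigest newMd5() {
        try {
            return MessageDigest.getInstance("MD5");
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // Raw concatenation: the boundaries between fields are lost.
    static byte[] hashConcatenated(String... fields) {
        MessageDigest md = newMd5();
        for (String field : fields) {
            md.update(field.getBytes(StandardCharsets.UTF_8));
        }
        return md.digest();
    }

    // Each field is preceded by its length as a 4-byte big-endian integer.
    static byte[] hashLengthPrefixed(String... fields) {
        MessageDigest md = newMd5();
        for (String field : fields) {
            byte[] encoded = field.getBytes(StandardCharsets.UTF_8);
            md.update(ByteBuffer.allocate(4).putInt(encoded.length).array());
            md.update(encoded);
        }
        return md.digest();
    }

    public static void main(String[] args) {
        // Same concatenated text, different field split.
        System.out.println(Arrays.equals(
                hashConcatenated("foo", "barqux"),
                hashConcatenated("foobar", "qux")));   // true
        System.out.println(Arrays.equals(
                hashLengthPrefixed("foo", "barqux"),
                hashLengthPrefixed("foobar", "qux"))); // false
    }
}
```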

When handling nodeValue, you call trim(), which means that this string could begin and/or end with whitespace, and you do not want to include that whitespace into the hash input -- but you do include it, since you append nodeValue and not nodeValue.trim().

If what you are trying to do has any relation to security then you should not use MD5, which is cryptographically broken. Use SHA-256 instead.
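Both points can be combined in a few lines. This is a sketch under assumptions: the class name, the sha256Hex helper and the sample value are hypothetical, and the important detail is that the trimmed string is the one that gets hashed.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

class Sha256Example {

    // Hypothetical helper: trims the value, then returns its SHA-256 as lowercase hex.
    static String sha256Hex(String value) {
        String trimmed = value.trim(); // hash the trimmed text, not the original
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] digest = md.digest(trimmed.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(digest.length * 2);
            for (byte b : digest) {
                hex.append(String.format("%02x", b & 0xFF));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // Every Java platform is required to provide SHA-256.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String nodeValue = "  abc \n"; // sample value with surrounding whitespace
        // Prints the same digest as for "abc", since the whitespace is trimmed first.
        System.out.println(sha256Hex(nodeValue));
        System.out.println(sha256Hex(nodeValue).equals(sha256Hex("abc"))); // true
    }
}
```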

Hashing an XML element is normally done through canonicalization (which handles whitespace, attribute order, text representation, and so on). See this question on the topic of canonicalizing XML data with Java.

One possible problem is here: m.update(sb.toString().getBytes("UTF-8"),0,sb.toString().length()); As said by Robin Green, the UTF-8 encoding can produce a byte[] which is longer than your original string (it will do this exactly when the String contains non-ASCII characters).

In this case, you are only hashing the start of your String. Better write it like this: m.update(sb.toString().getBytes("UTF-8")); Of course, this would not cause an exception, simply another hash than would be produced otherwise, if you have non-ASCII characters in your string. You should try to boil your failure down to an SSCCE, like lesmana recommended.

