@Anders Anderson,
Sure I have no problem sharing. I will be posting the code soon. I am going to remove all the bin and obj files and post soon. I imagine anything above Visual Studio 2010 will work. This was originally done on a Windows 7 virtual in VS 2010 (you can see it in the videos).
The WT library contents themselves are about 300 MB unzipped, 164 MB zipped. This is somewhat of a larger upload. But I will give it a shot at some point here. The code is definitely going up soon.
About the small changes: Yes, I ran across a lot of those. I filter a great deal out. Since the diff code works on the character level, it detects all these annoying little changes. As I mentioned, the WT, in some cases, decided to change the unicode characters it used for the Greek letters. The characters render the same, but are different byte-wise. You have to handle these things on a case-by-case basis. When you see an annoying little change, you can code an exception and it will ignore it.
Note: if you have your own version of WT library, and you get the code running, you can export your own copy of the WT library.
I think at some point I will post just the raw program itself. This way a non-programmer could get their own copy (for fun I guess?)
The code has some fun little gems in it, for those Windows developers that want to mess with the API. For example, you can iterate through the index numbers of a ListView without a lot of trouble. But as soon as you want to get the text value of that ListViewItem, it gets harder. The Windows API allows you to allocate memory and provide the memory address so that it can be filled with the item text. But that allocated memory has to be a part of the ListView process - in this case, the WT library. So at one point I had to allocate virtual memory in the WT library process and then copy the memory back into the automated program.
MMM