> Subject: [EE]: How to find the changes in a new version of a file

James:

My advice? Don't reinvent the "diff" tools, use them. I'd take this route:

1) Strip the HTML tags out of the two files.
2) Diff 'em (turn off white-space differences).
3) Parse the result of the diff file, possibly detecting "false hits" that may not matter.

This probably isn't too much initial work. You can refine the process if this shows good results.

-Rob

--
http://www.piclist.com hint: The PICList is archived three different ways.
See http://www.piclist.com/#archives for details.
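
For what it's worth, the three steps could be sketched roughly like this in Python, using only the standard library (html.parser and difflib) in place of an external diff program. The names TagStripper, strip_tags, normalized_lines and changed_lines, and the optional ignore callback for weeding out "false hits", are made up for illustration and are not from any existing tool. Treat it as a starting point under those assumptions, not a finished solution.

```python
import difflib
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Step 1: collect text content and drop the tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Called for text between tags; script/style bodies also land here.
        self.chunks.append(data)


def strip_tags(html_text):
    parser = TagStripper()
    parser.feed(html_text)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.chunks)


def normalized_lines(text):
    # Step 2 (preparation): collapse runs of white space and drop blank
    # lines, so white-space-only edits never show up in the diff.
    lines = (" ".join(line.split()) for line in text.splitlines())
    return [line for line in lines if line]


def changed_lines(old_html, new_html, ignore=None):
    """Return (removed, added) text lines between two HTML documents.

    `ignore` is an optional, hypothetical callback: it receives a line and
    returns True for changes that should be treated as "false hits".
    """
    old = normalized_lines(strip_tags(old_html))
    new = normalized_lines(strip_tags(new_html))
    removed, added = [], []
    # Step 3: parse the diff output. ndiff prefixes each line with a
    # two-character code: "- " removed, "+ " added, "  " unchanged,
    # "? " intraline hints (skipped here).
    for line in difflib.ndiff(old, new):
        code, text = line[:2], line[2:]
        if code not in ("- ", "+ "):
            continue
        if ignore is not None and ignore(text):
            continue
        (removed if code == "- " else added).append(text)
    return removed, added
```

Calling changed_lines(old_html, new_html) on the two versions of the page gives back the lines that disappeared and the lines that appeared. Passing something like ignore=lambda s: s.startswith("Last updated") is one way to drop changes that don't matter.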