1. HTML document content comparison algorithm