What is HTML?

HTML, or HyperText Markup Language is the predominate code behind the pages you see on the web.  It is written in the form of "tags" that are surrounded by angle brackets.  These tags tell the web browser to create paragraphs, tables, links, bold or emphasized text and so on. 

It's important to note that different browsers render HTML each in their own way, and all browsers follow relaxed rules for handling HTML code that may be non-standard or incorrect.  At Aware Research we use a three-level hierarchy of fallback methods for parsing broken HTML while preserving the information content.