Testing charset encoding support in Web Browsers
13 Feb 2012
Note: To jump straight to test page click here http://www.lookout.net/test/charsets/ascii-unsafe/
Web browsers support a variety of character set encodings mostly for legacy reasons and backwards compatibility. After all, UTF-8 and a handful of other encodings today are capable of representing all of the characters that were once relegated to a wide assortment of character encodings. It's clearly evident from Google's February 2012 report that UTF-8 is dominating the Web, with 60% of Web documents using UTF-8 - and that number is rising as other legacy character encodings are declining in use.
Those of us who test Web application security are often concerned with character encodings in our attempts to manipulate string input in ways that would eventually lead to mayhem. For that reason it's good to know a bit not just about which encodings the server-side components support, but also which ones the Web browser supports. I've documented the results of testing character set support in Web browsers in the table below, along with a brief summary.
Web browsers support a variety of character set encodings mostly for legacy reasons and backwards compatibility. After all, UTF-8 and a handful of other encodings today are capable of representing all of the characters that were once relegated to a wide assortment of character encodings. It's clearly evident from Google's February 2012 report that UTF-8 is dominating the Web, with 60% of Web documents using UTF-8 - and that number is rising as other legacy character encodings are declining in use.
Those of us who test Web application security are often concerned with character encodings in our attempts to manipulate string input in ways that would eventually lead to mayhem. For that reason it's good to know a bit not just about which encodings the server-side components support, but also which ones the Web browser supports. I've documented the results of testing character set support in Web browsers in the table below, along with a brief summary.