Some great information from UTF-8: The Secret of Character Encoding.
I've used the charset
meta tag with a UTF-8
value for a long long time and now it's the default in HTML5.
While this can also be achieved on the server, the meta tag is an easy and human-readable way to instruct the browser to use UTF-8.
<!DOCTYPE html>
<html>
<head>
<meta charset="UTF-8">
Note that the characters used to write the META are in ASCII. This solves the problem of the browser not knowing how to interpret the meta tag text until it has parsed its charset
value.
Character entity references are required when using mostly character set A, but needing the odd character from set B or C.
These are difficult to remember and fiddly to write. See Character Entities and Unicode Lookup.
But UTF-8 supports every character. This means that Character Entities are a thing of the past - just copy the real character into your web page and it will just work.