Why use Unicode?

Allan Odgaard, the developer of TextMate, has put up a great post about the advantages of Unicode. I consider it mandatory reading for anyone who produces text files (especially HTML) and is still unaware of the benefits of using Unicode.

You can read the full post here.

2 Responses to “Why use Unicode?”


  1. Roger, Jun 2nd, 2006 at 8:35 am

    There may well be some benefits to using Unicode, but let’s not forget about the drawbacks:
    http://www.schneier.com/crypto-gram-0007.html#9

  2. Geir-Tore, Jun 2nd, 2006 at 9:03 am

    Security issues are obviously something to take into account when using Unicode, or more specifically, when implementing support for Unicode. For character data (XML, HTML, etc.), UTF-8 is the most common Unicode encoding scheme, and successive updates of the Unicode specification have placed numerous restrictions on implementations of this scheme. Regarding the article by Schneier, one approach is to restrict the length of the character sequences that are parsed, so that illegal sequences can be ignored. This is also a requirement set by the Unicode Standard.

    Here are some additional links with information on Unicode and security:

    1. Unicode Security Considerations
    2. Secure Programming for Linux and Unix HOWTO
    3. UTF-8 and Unicode FAQ for Unix/Linux
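
The restriction described in the second comment can be illustrated with a small sketch. It assumes the "illegal sequences" in question are overlong (non-shortest-form) UTF-8 encodings, such as the two bytes C0 AF standing in for "/". The helper names `decode_strict` and `is_well_formed_utf8` are hypothetical and chosen only for this example; the sketch relies on the strict mode of Python 3's built-in UTF-8 decoder rather than on any particular implementation discussed in the linked articles.

```python
def decode_strict(data: bytes) -> str:
    """Decode UTF-8, raising UnicodeDecodeError on any ill-formed sequence."""
    # In "strict" mode the built-in decoder refuses overlong forms
    # instead of silently mapping them to the character they imitate.
    return data.decode("utf-8", errors="strict")


def is_well_formed_utf8(data: bytes) -> bool:
    """Return True only if every byte sequence in data is valid UTF-8."""
    try:
        decode_strict(data)
    except UnicodeDecodeError:
        return False
    return True


if __name__ == "__main__":
    # "/" in its only legal form: the single byte 0x2F.
    print(is_well_formed_utf8(b"/"))
    # Overlong two-byte form of "/"; a lenient parser might accept it
    # and let it slip past a filter that only looks for 0x2F.
    print(is_well_formed_utf8(b"\xc0\xaf"))
    # Overlong three-byte form of the same character.
    print(is_well_formed_utf8(b"\xe0\x80\xaf"))
```

Validating input this way before any path or markup filtering means a check for "/" sees the same characters the rest of the program will see.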
