design:utf8-requirements

This is an old revision of the document!


All documents will be UTF-8 encoded and MUST apply Normalization Form C (NFC) to all metadata fields, such as document name, authors, and references, unless the RSE grants a specific exception. The body of the document MAY use other normalization forms where the authors declare them necessary. Non-ASCII characters are allowed only in author names, contact information, examples, and references. Author names will also require an ASCII representation to encourage broader indexing. (This requirement is under discussion with the i18n program and is likely to be modified.)
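As an illustration of the Normalization Form C requirement for metadata, the following minimal sketch uses Python's standard unicodedata module to find and fix metadata fields that are not in NFC. The field names, the sample values, and the helper name non_nfc_fields are hypothetical and are not taken from any RFC Editor tool.

```python
import unicodedata


def non_nfc_fields(metadata):
    """Return the names of metadata fields whose values are not in NFC.

    `metadata` is a hypothetical mapping of field name to string value.
    """
    return [
        name
        for name, value in metadata.items()
        if unicodedata.normalize("NFC", value) != value
    ]


# Hypothetical metadata: the author name uses "e" followed by a combining
# acute accent (U+0301), which is a decomposed form and therefore not NFC.
metadata = {
    "title": "UTF-8 Requirements",
    "author": "Jose\u0301 Example",
}

bad = non_nfc_fields(metadata)

# Normalizing every field to NFC composes the sequence into U+00E9.
fixed = {name: unicodedata.normalize("NFC", value) for name, value in metadata.items()}
```

A check like this could run before publication so that any field needing an RSE exception is flagged rather than silently rewritten.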

All documents should identify themselves as being UTF-8. Both the canonical XML format and the non-canonical HTML format must contain metadata that specifies that the encoding is UTF-8. The non-canonical text-only format must begin with a UTF-8 BOM.
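The self-identification requirement can be sketched in Python as follows. The sketch assumes that the XML format declares its encoding in the XML declaration, which the text above does not specify; the function names are hypothetical.

```python
import codecs
import re


def starts_with_utf8_bom(data: bytes) -> bool:
    """Check whether a text-only file begins with the UTF-8 BOM (EF BB BF)."""
    return data.startswith(codecs.BOM_UTF8)


def add_utf8_bom(text: str) -> bytes:
    """Encode text as UTF-8 and prepend the BOM."""
    return codecs.BOM_UTF8 + text.encode("utf-8")


def xml_declares_utf8(data: bytes) -> bool:
    """Check for encoding="UTF-8" in the XML declaration.

    Assumption: the encoding metadata is carried by the XML declaration
    at the very start of the file.
    """
    pattern = rb'<\?xml[^>]*\bencoding\s*=\s*["\']utf-8["\']'
    return re.match(pattern, data, re.IGNORECASE) is not None


text_output = add_utf8_bom("Example RFC text")
xml_output = b'<?xml version="1.0" encoding="UTF-8"?><rfc/>'
```

The same bytes can also be produced with Python's "utf-8-sig" codec, which writes the BOM on encoding and strips it on decoding.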

An implementer must be able to implement the specification without any confusion or ambiguity introduced by the use of UTF-8 rather than ASCII.

People must be able to reference (cite) the RFC from elsewhere in a standard way, including from documents that only support ASCII.

The RFC must be able to reference (cite) other documents in an unambiguous way.

Cross-references (including references to other documents) must be unambiguous even from a printed document.

Tools must be able to index the RFC in various ways, so searching for keywords, author names, and so on can work.

design/utf8-requirements.1383676698.txt.gz · Last modified: 2013/11/05 10:38 by rsewikiadmin