[rfc-i] UTF-8 and Unicode examples

Henning Schulzrinne hgs at cs.columbia.edu
Mon May 3 10:52:23 PDT 2004

Increasingly, protocols use UTF-8 as their 'native' format. If a 
document wants to present an example, the US-ASCII rule prevents it 
from using the character itself. A possible solution is to use the 
common U+1234 notation for Unicode instead, or some specific notation 
for UTF-8.
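
To make the two options concrete, here is a minimal Python sketch that 
renders a character in ASCII only, once as a U+XXXX code point and once 
as its UTF-8 octets in hex. The helper name `describe` and the output 
layout are assumptions made for illustration, not an agreed RFC 
convention.

```python
def describe(ch: str) -> str:
    """Return an ASCII-only description of a single character."""
    # Code point in the common U+XXXX notation (at least four hex digits).
    code_point = f"U+{ord(ch):04X}"
    # The UTF-8 encoding of the same character, one hex pair per octet.
    utf8_hex = " ".join(f"{b:02X}" for b in ch.encode("utf-8"))
    return f"{code_point} (UTF-8: {utf8_hex})"


# Escapes keep this source file itself within US-ASCII.
print(describe("\u00e9"))  # U+00E9 (UTF-8: C3 A9)
print(describe("\u20ac"))  # U+20AC (UTF-8: E2 82 AC)
```

Showing both forms side by side would let an example name the character 
unambiguously while also showing the octets an implementation would 
actually see on the wire.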

The same problem exists for names, e.g., in acknowledgements, albeit 
less urgently.

It would be nice to find a general solution rather than each author 
winging it. (The inability to specify non-ASCII examples might well 
contribute to implementor laziness as well, as implementors code from 
examples and thus simply consider the UTF-8 thing some political 
correctness item that they can safely ignore :-))

