[rfc-i] UTF-8 and Unicode examples
Henning Schulzrinne
hgs at cs.columbia.edu
Mon May 3 10:52:23 PDT 2004
Increasingly, protocols use UTF-8 as their 'native' format. If a
document wants to present an example, it can, due to the US-ASCII rule,
not use the character itself. A possible solution is to use the common
U+1234 notation for Unicode instead, or some specific notation for UTF-8
in ASCII.
The same problem exists for names, e.g., in acknowledgements, albeit
less urgently.
It would be nice to find a general solution rather than each author
winging it. (The inability to specify non-ASCII examples might well
contribute to implementor laziness as well, as implementors code from
examples and thus simply consider the UTF-8 thing some political
correctness item that they can safely ignore :-))
Henning
More information about the rfc-interest
mailing list