RFC 2044

UTF-8, a transformation format of Unicode and ISO 10646, October 1996

Canonical URL:
File formats:
Plain TextPDF
Obsoleted by:
RFC 2279
F. Yergeau

Cite this RFC: TXT  |  XML

DOI:  http://dx.doi.org/10.17487/RFC2044

Other actions: Find Errata (if any)  |  Submit Errata  |  Find IPR Disclosures from the IETF


The Unicode Standard, version 1.1, and ISO/IEC 10646-1:1993 jointly define a 16 bit character set which encompasses most of the world's writing systems. UTF-8, the object of this memo, has the characteristic of preserving the full US-ASCII range. This memo provides information for the Internet community. This memo does not specify an Internet standard of any kind.

For the definition of Status, see RFC 2026.

For the definition of Stream, see RFC 4844.

Download PDF Reader

Search RFCs
Advanced Search