RFC Errata


Errata Search

 
Source of RFC  
Summary Table Full Records

RFC 2640, "Internationalization of the File Transfer Protocol", July 1999

Source of RFC: ftpext (app)

Errata ID: 8007
Status: Reported
Type: Technical
Publication Format(s) : TEXT

Reported By: Tim Geiser
Date Reported: 2024-06-27

Section B.2.1 & B.2.3 says:

The first step in the process can be performed by maintaining a
mapping table that includes the local character set code and the
corresponding UCS code. For instance the ISO/IEC 8859-8 [ISO-8859]
code for the Hebrew letter "VAV" is 0xE4. The corresponding 4 byte
ISO/IEC 10646 code is 0x000005D5.

=====================================================================

This example demonstrates mapping ISO/IEC 8859-8 character set to
UTF-8 and back to ISO/IEC 8859-8. As noted earlier, the Hebrew letter
"VAV" is convertd from the ISO/IEC 8859-8 character code 0xE4 to the
corresponding 4 byte ISO/IEC 10646 code of 0x000005D5 by a simple
lookup of a conversion/mapping file.

=====================================================================

Finally, the UCS-4 character code is converted to ISO/IEC 8859-8
character code (using the mapping table which matches ISO/IEC 8859-8
to UCS-4 ) to produce the original 0xE4 code for the Hebrew letter
"VAV".

It should say:

The first step in the process can be performed by maintaining a
mapping table that includes the local character set code and the
corresponding UCS code. For instance the ISO/IEC 8859-8 [ISO-8859]
code for the Hebrew letter "VAV" is 0xE5. The corresponding 4 byte
ISO/IEC 10646 code is 0x000005D5.

=====================================================================

This example demonstrates mapping ISO/IEC 8859-8 character set to
UTF-8 and back to ISO/IEC 8859-8. As noted earlier, the Hebrew letter
"VAV" is convertd from the ISO/IEC 8859-8 character code 0xE5 to the
corresponding 4 byte ISO/IEC 10646 code of 0x000005D5 by a simple
lookup of a conversion/mapping file.

=====================================================================

Finally, the UCS-4 character code is converted to ISO/IEC 8859-8
character code (using the mapping table which matches ISO/IEC 8859-8
to UCS-4 ) to produce the original 0xE5 code for the Hebrew letter
"VAV".

Notes:

The ISO-8859-8 encoding for the Hebrew letter "VAV" is 0xE5, not 0xE4.
The Unicode U+05D5 (0x000005D5) is the Hebrew letter "VAV" so everything else about these examples are correct. If you actually try converting 0xE4 you'll get Hebrew letter "HE" (U+05D4).

Report New Errata



Advanced Search