RFC Errata
RFC 2640, "Internationalization of the File Transfer Protocol", July 1999
Source of RFC: ftpext (app)
Errata ID: 8007
Status: Reported
Type: Technical
Publication Format(s) : TEXT
Reported By: Tim Geiser
Date Reported: 2024-06-27
Section B.2.1 & B.2.3 says:
The first step in the process can be performed by maintaining a mapping table that includes the local character set code and the corresponding UCS code. For instance the ISO/IEC 8859-8 [ISO-8859] code for the Hebrew letter "VAV" is 0xE4. The corresponding 4 byte ISO/IEC 10646 code is 0x000005D5. ===================================================================== This example demonstrates mapping ISO/IEC 8859-8 character set to UTF-8 and back to ISO/IEC 8859-8. As noted earlier, the Hebrew letter "VAV" is convertd from the ISO/IEC 8859-8 character code 0xE4 to the corresponding 4 byte ISO/IEC 10646 code of 0x000005D5 by a simple lookup of a conversion/mapping file. ===================================================================== Finally, the UCS-4 character code is converted to ISO/IEC 8859-8 character code (using the mapping table which matches ISO/IEC 8859-8 to UCS-4 ) to produce the original 0xE4 code for the Hebrew letter "VAV".
It should say:
The first step in the process can be performed by maintaining a mapping table that includes the local character set code and the corresponding UCS code. For instance the ISO/IEC 8859-8 [ISO-8859] code for the Hebrew letter "VAV" is 0xE5. The corresponding 4 byte ISO/IEC 10646 code is 0x000005D5. ===================================================================== This example demonstrates mapping ISO/IEC 8859-8 character set to UTF-8 and back to ISO/IEC 8859-8. As noted earlier, the Hebrew letter "VAV" is convertd from the ISO/IEC 8859-8 character code 0xE5 to the corresponding 4 byte ISO/IEC 10646 code of 0x000005D5 by a simple lookup of a conversion/mapping file. ===================================================================== Finally, the UCS-4 character code is converted to ISO/IEC 8859-8 character code (using the mapping table which matches ISO/IEC 8859-8 to UCS-4 ) to produce the original 0xE5 code for the Hebrew letter "VAV".
Notes:
The ISO-8859-8 encoding for the Hebrew letter "VAV" is 0xE5, not 0xE4.
The Unicode U+05D5 (0x000005D5) is the Hebrew letter "VAV" so everything else about these examples are correct. If you actually try converting 0xE4 you'll get Hebrew letter "HE" (U+05D4).