RFC Errata
RFC 3454, "Preparation of Internationalized Strings ("stringprep")", December 2002
Note: This RFC has been obsoleted by RFC 7564
Source of RFC: IETF - NON WORKING GROUP
Area Assignment: int
Errata ID: 4577
Status: Held for Document Update
Type: Technical
Publication Format(s): TEXT
Reported By: Loïc Jonas Etienne
Date Reported: 2016-01-04
Held for Document Update by: Brian Haberman
Date Held: 2016-04-05
Section 3.1 says:
Some characters are only useful in line-based text, and are otherwise invisible and ignored.

00AD; SOFT HYPHEN
1806; MONGOLIAN TODO SOFT HYPHEN
200B; ZERO WIDTH SPACE
2060; WORD JOINER
FEFF; ZERO WIDTH NO-BREAK SPACE
It should say:
Some characters are only useful in line-based text, and are otherwise invisible and ignored.

00AD; SOFT HYPHEN
200B; ZERO WIDTH SPACE
2060; WORD JOINER
FEFF; ZERO WIDTH NO-BREAK SPACE
Notes:
This issue has been reported to the Unicode Consortium (http://www.unicode.org/L2/L2015/15277-pubrev.html), according to which U+1806 is not a control character and RFC 3454 is mistaken in mapping it to nothing, since the character always has a distinct visual appearance. For more information about the character, see page 528 of the Core Specification, Version 8.0.
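
As an illustration of the practical effect of this correction, the following is a minimal Python sketch of a "map to nothing" step built only from the four code points in the corrected Section 3.1 text above. The names MAP_TO_NOTHING and map_to_nothing are hypothetical, and the set is limited to the characters quoted in this report; it is not a full stringprep implementation.

```python
# Code points from the corrected Section 3.1 text quoted in this report.
# U+1806 (MONGOLIAN TODO SOFT HYPHEN) is deliberately absent.
# Assumption: only the characters listed in the quoted passage are covered.
MAP_TO_NOTHING = {
    0x00AD,  # SOFT HYPHEN
    0x200B,  # ZERO WIDTH SPACE
    0x2060,  # WORD JOINER
    0xFEFF,  # ZERO WIDTH NO-BREAK SPACE
}


def map_to_nothing(text: str) -> str:
    """Return text with every character in MAP_TO_NOTHING removed."""
    return "".join(ch for ch in text if ord(ch) not in MAP_TO_NOTHING)


# U+00AD and U+200B are dropped; U+1806 is kept because it is visible.
sample = "a\u00adb\u1806c\u200bd"
result = map_to_nothing(sample)
```

With the original Section 3.1 list, U+1806 would also be removed from the sample string; with the corrected list it is preserved.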