[rfc-i] Byte Order Marks for UTF-8

Dave Thaler dthaler at microsoft.com
Wed Jul 18 11:57:15 PDT 2012


> -----Original Message-----
> From: rfc-interest-bounces at rfc-editor.org [mailto:rfc-interest-bounces at rfc-
> editor.org] On Behalf Of Paul Hoffman
> Sent: Wednesday, July 18, 2012 9:39 AM
> To: Tim Bray
> Cc: rfc-interest at rfc-editor.org
> Subject: Re: [rfc-i] Byte Order Marks for UTF-8
> 
> On Jul 18, 2012, at 9:23 AM, Tim Bray wrote:
> 
> > That's probably a good recommendation, if we couple it with a mandate to
> never generate UTF-16.
> 
> Did I misread the messages from yesterday? I thought some text-reading
> software worked when it saw a UTF8 BOM but not if it didn't. If I
> misunderstood, then Phill's idea (don't include it in generated text formats) is
> fine. If not, the RFC Editor should investigate further.

Right, there's plenty of software that displays UTF-8 text correctly when a UTF-8
BOM is present and does not display it correctly when it's absent.  (Usually because
there are many possible encodings, and UTF-8 isn't that software's default guess.)
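
A minimal sketch of that behavior, for illustration only: the function name
decode_text and the cp1252 fallback are assumptions, not taken from any
particular product. It treats a leading UTF-8 BOM as a definite signal and
otherwise falls back to a legacy default guess.

```python
import codecs


def decode_text(data: bytes, fallback: str = "cp1252") -> str:
    """Decode bytes the way a BOM-dependent reader might (hypothetical helper)."""
    # A leading EF BB BF is taken as proof of UTF-8; strip it and decode.
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):].decode("utf-8")
    # No BOM: fall back to the assumed default guess, which is not UTF-8.
    return data.decode(fallback)


sample = "naïve café"
with_bom = codecs.BOM_UTF8 + sample.encode("utf-8")
without_bom = sample.encode("utf-8")

print(decode_text(with_bom))     # decoded as UTF-8
print(decode_text(without_bom))  # decoded with the fallback, so non-ASCII is garbled
```

The same bytes without the BOM come out as mojibake, because each multi-byte
UTF-8 sequence is read as several single-byte characters.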

-Dave


