Friday, 14 July 2017

UTF-8 processing

Looking in detail at some recent HTML email attachments (received in
Messenger Pro), NetSurf's rendering of them seems to depend on how the
content attribute of the <meta> tag is written.

When this tag is present and includes:

content="text/html charset=utf-8"

any non-ASCII characters are shown as the three bytes of their UTF-8
encoding, whereas if the media type and charset are separated by a semicolon:

content="text/html; charset=utf-8"

the rendering (using Unicode font DejaVu) looks correct.
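
One possible explanation, sketched below in Python, is that the charset
parameter is only recognised when it follows a semicolon, so the first form
falls back to a single-byte default encoding. This is an illustration only,
not NetSurf's actual code: the function name charset_from_content and the
Windows-1252 fallback are assumptions made for the example.

```python
def charset_from_content(content, default="cp1252"):
    """Hypothetical strict parser: parameters count only after a ';'."""
    parts = content.split(";")
    # parts[0] is the media type; anything after it is a parameter.
    for param in parts[1:]:
        name, _, value = param.strip().partition("=")
        if name.strip().lower() == "charset" and value:
            return value.strip().strip("\"'").lower()
    # No charset parameter found: fall back to the assumed default.
    return default


def render(data, content):
    """Decode raw bytes using whatever charset the parser reports."""
    return data.decode(charset_from_content(content))


euro = "\u20ac".encode("utf-8")  # the euro sign is three bytes in UTF-8

without_semicolon = render(euro, "text/html charset=utf-8")
with_semicolon = render(euro, "text/html; charset=utf-8")

print(len(without_semicolon), len(with_semicolon))
```

With these assumptions, the form without the semicolon decodes the euro sign
as three separate characters, while the form with the semicolon decodes it as
one, which matches the symptom described above.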

Is this a known problem?

--
Bernard
