Re: [SLUG] Linux and Windows character sets

From: Eben King (eben1@tampabay.rr.com)
Date: Mon Dec 01 2003 - 10:44:34 EST


On Mon, 1 Dec 2003, Russell Hires wrote:

> Is this the difference between the 7-bit character sets vs the 8-bit ones? I
> don't know how to fix it, but I'm wondering if that would be a useful amount
> of information for a possible fix...

Yeah, the 7-bit ones are defined as ASCII, and the rest are
implementation-dependent. What you see is the standard, which those pages
don't follow. If you made web pages containing raw non-7-bit characters,
some web users would see them as boxes or question marks too. 'man ascii'
if you have that man page. The proper way of encoding 'á' in HTML is
'á'; if you just include the raw character, it'll look incorrect
depending on the web server. Mine (thttpd) sends this header:

Content-Type: text/html; charset=iso-8859-1

If no Content-Type is sent, and the client guesses wrong, you get the
boxes or question marks, or some incorrect character.

-- 
The people don't want war, but it's always simple to drag the people along.
The people can always be brought to the bidding of the leaders.  All you
have to do is tell them they are being attacked, and denounce the pacifists
for lack of patriotism, and exposing the country. -- Hermann Goering

----------------------------------------------------------------------- This list is provided as an unmoderated internet service by Networked Knowledge Systems (NKS). Views and opinions expressed in messages posted are those of the author and do not necessarily reflect the official policy or position of NKS or any of its employees.



This archive was generated by hypermail 2.1.3 : Fri Aug 01 2014 - 19:09:18 EDT