473,583 Members | 3,097 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Whcih Charset is Correct? And, How to Render a Macron?

I don't reallly understand charsets, but I see that you are supposed
to declare one. I am putting a paper up in HTML. It uses some special
characters, like umlauts and diacriticals. The biggest problem I've
had with it is creating characters with lines over them (macrons), as
is done to indicate a long vowel sound, for example. There is no way
to do this using my word processor. Is there a way to do it in an HTML
document? Which charset should I declare in the <meta> tag?
Jul 20 '05 #1
8 3680
bs*****@mac.com (Ben Sharvy) wrote:
I don't reallly understand charsets,
Then I suggest reading http://ppewww.ph.gla.ac.uk/~flavell/charset/ .
I am putting a paper up in HTML. It uses some special
characters, like umlauts and diacriticals. The biggest problem I've
had with it is creating characters with lines over them (macrons), as
is done to indicate a long vowel sound, for example.
The vowels with macron are included the Baltic Rim character set,
ISO-8859-13 or Windows-1257.
http://ppewww.ph.gla.ac.uk/~flavell/charset/baltic.html
http://www.unics.uni-hannover.de/nhtcapri/baltic.html13
http://www.unics.uni-hannover.de/nhtcapri/baltic.win
Nevertheless, it seems best for you to include all special characters
as &#number; expressions.
http://www.unics.uni-hannover.de/nht...al2.html#latin
Both Netscape 4.0 and Internet Explorer 4.0 can display them (if you
have fonts with the necessary glyphs, of course).
There is no way to do this using my word processor.
Meaning: You didn't find it. Which word processor and which operating
system do you use? For Macintosh, see
http://www.unics.uni-hannover.de/nht...-european.html .
For MS Windows, see http://www.microsoft.com/typography/multilang/ .
Which charset should I declare in the <meta> tag?


None. Declare the encoding ("charset=UT F-8") in the HTTP header, see
http://www.w3.org/International/O-HTTP-charset.html and
http://ppewww.ph.gla.ac.uk/~flavell/...t/ns-burp.html .

--
But thats what FP puts in to the page, so i asume thats correct
Harry H. Arends in microsoft.publi c.frontpage.cli ent
Jul 20 '05 #2
bs*****@mac.com (Ben Sharvy) wrote:
I don't reallly understand charsets,
Then I suggest reading http://ppewww.ph.gla.ac.uk/~flavell/charset/ .
I am putting a paper up in HTML. It uses some special
characters, like umlauts and diacriticals. The biggest problem I've
had with it is creating characters with lines over them (macrons), as
is done to indicate a long vowel sound, for example.
The vowels with macron are included in the Baltic Rim character set,
ISO-8859-13 or Windows-1257.
http://ppewww.ph.gla.ac.uk/~flavell/charset/baltic.html
http://www.unics.uni-hannover.de/nhtcapri/baltic.html13
http://www.unics.uni-hannover.de/nhtcapri/baltic.win
Nevertheless, it seems best for you to include all special characters
as &#number; expressions.
http://www.unics.uni-hannover.de/nht...al2.html#latin
Both Netscape 4.0 and Internet Explorer 4.0 can display them (if you
have fonts with the necessary glyphs, of course).
There is no way to do this using my word processor.
Meaning: You didn't find it. Which word processor and which operating
system do you use? For Macintosh, see
http://www.unics.uni-hannover.de/nht...-european.html .
For MS Windows, see http://www.microsoft.com/typography/multilang/ .
Which charset should I declare in the <meta> tag?


None. Declare the encoding ("charset=UT F-8") in the HTTP header, see
http://www.w3.org/International/O-HTTP-charset.html and
http://ppewww.ph.gla.ac.uk/~flavell/...t/ns-burp.html .

--
But thats what FP puts in to the page, so i asume thats correct
Harry H. Arends in microsoft.publi c.frontpage.cli ent
Jul 20 '05 #3
On Mon, 29 Sep 2003, Andreas Prilop wrote:

[amongst some good advice... but I /would/ say that, wouldn't I? ;-) ]
There is no way to do this using my word processor.


Meaning: You didn't find it. Which word processor and which operating
system do you use?


One might ask just what part a "word processor" is playing towards
making a well-engineered web page, though.

Good luck to the O.P anyway.

cheers
Jul 20 '05 #4
Andreas Prilop <nh******@rrz n-user.uni-hannover.de> wrote in message news:<290920032 001027517%nh*** ***@rrzn-user.uni-hannover.de>...
bs*****@mac.com (Ben Sharvy) wrote:
I don't reallly understand charsets,
Then I suggest reading http://ppewww.ph.gla.ac.uk/~flavell/charset/ .


I've read it. It didn't help.
The vowels with macron are included the Baltic Rim character set,
ISO-8859-13 or Windows-1257.
http://ppewww.ph.gla.ac.uk/~flavell/charset/baltic.html
http://www.unics.uni-hannover.de/nhtcapri/baltic.html13
http://www.unics.uni-hannover.de/nhtcapri/baltic.win
Nevertheless, it seems best for you to include all special characters
as &#number; expressions.
I found the &#xxx; form. However, it doesn't display correctly on
Opera 5 (Mac). I've also discovered the CSS "overline" property, which
I might employ instead.
There is no way to do this using my word processor.


Meaning: You didn't find it


"There is no way to do this" does not mean "I didn't find it." It
means: there is no way to do this.
Which word processor and which operating
system do you use? For Macintosh, see
http://www.unics.uni-hannover.de/nht...-european.html .
For MS Windows, see http://www.microsoft.com/typography/multilang/ .


Appleworks 6, MacOS 9.
Which charset should I declare in the <meta> tag?


None. Declare the encoding ("charset=UT F-8") in the HTTP header, see
http://www.w3.org/International/O-HTTP-charset.html and
http://ppewww.ph.gla.ac.uk/~flavell/...t/ns-burp.html .


I don't know what an "HTTP" header is, and that page doesn't say. But,
I did look at the source of that page, and saw that it declares its
charset in a <meta> tag.

I looked at some other pages with the W3C's "Valid HTML 4.0" icon, and
they usually use:

<META HTTP-EQUIV="Content-Type" CONTENT="text/html;
CHARSET=iso-8859-1">

Why do you say that's a mistake?
Jul 20 '05 #5
On 30 Sep 2003, Ben Sharvy wrote:
Then I suggest reading http://ppewww.ph.gla.ac.uk/~flavell/charset/ .
I've read it. It didn't help.


You seem to be one of those persons who need to be guided by the hand
through everything.
Nevertheless, it seems best for you to include all special characters
as &#number; expressions.


I found the &#xxx; form. However, it doesn't display correctly on
Opera 5 (Mac).


Opera is infamous for its complete failure in internationaliz ation.
It works only with West European Latin-1. I suggest Netscape 7 or
Mozilla on Mac OS 9. http://www.mozilla.org/ http://wamcom.org/
I've also discovered the CSS "overline" property, which
I might employ instead.
No, you must not. "A with macron" is a different character than "A"
and therefore cannot be represented by "A" with some style.
"There is no way to do this" does not mean "I didn't find it." It
means: there is no way to do this.
Appleworks 6, MacOS 9.
You need to install the Central European (CE) language kit:
http://www.unics.uni-hannover.de/nhtcapri/#tech_info
For sources of additional CE fonts see
http://www.unics.uni-hannover.de/nht...-european.html
Especially have a look at
http://hyperarchive.lcs.mit.edu/cgi-...=european+html
I don't know what an "HTTP" header is,
As a webmaster/author, you need to know!
I did look at the source of that page, and saw that it declares its
charset in a <meta> tag.
I consider this a mistake/oversight. (Cc of this message to w3.org.)
I looked at some other pages with the W3C's "Valid HTML 4.0" icon
That doesn't mean anything.
http://www.google.com/search?q=%22Va...title:Untitled
<META HTTP-EQUIV="Content-Type" CONTENT="text/html;
CHARSET=iso-8859-1">
Why do you say that's a mistake?


Did you read
http://ppewww.ph.gla.ac.uk/~flavell/...t/ns-burp.html ?

Jul 20 '05 #6
bs*****@mac.com (Ben Sharvy) wrote:
http://ppewww.ph.gla.ac.uk/~flavell/charset/ .
I've read it. It didn't help.


Read it again.
I found the &#xxx; form. However, it doesn't display correctly on
Opera 5 (Mac).
I'm usually against the idea that we should expect users to upgrade
their browsers, but surely we can regard Opera users as educated and
avantgardistic enough to update to something newer than that. At least
Opera 7 (Win) can display, say, Ā correctly an an uppercase A with
macron.

The problems are elsewhere, especially in Windows PC's that haven't got
"multilingu al support" installed.
I've also discovered the CSS "overline" property,
which I might employ instead.
There's something to be said in favor of that idea, if the macrons are
just additional information working as length mark (e.g., as in Latin
grammars). They might - just might - be seen as "presentational ".

But on the practical side anyway, the overline property means what it
says, a line over the text line. Thus, for most characters, it is way
too high, since most characters (calculated by frequency in normal
text, where lower case dominates) have their tops well below the top of
the text line. In a word, the overline flies well over letters.
Besides, if you overline, say, "A", the overline extends over the full
character width, as opposite to a macron, which is far shorter.
Appleworks 6, MacOS 9.
I have no idea of the properties of that software, but the real
question is whether it can work using standardized character encodings,
or at least convert the result to them, instead of using Mac's
proprietary encodings.
I don't know what an "HTTP" header is, and that page doesn't say.
But, I did look at the source of that page, and saw that it
declares its charset in a <meta> tag.

I looked at some other pages with the W3C's "Valid HTML 4.0" icon,
and they usually use:

<META HTTP-EQUIV="Content-Type" CONTENT="text/html;
CHARSET=iso-8859-1">

Why do you say that's a mistake?


Well, you can put the "Valid HTML 4.0" icon on any page you like (the
W3C can't really prevent you), and validity is about formal grammar
_only_, so why would it affect the question how to specify the
character encoding properly? Besides, you are not really forbidden from
using the <meta> hack too; actually there are reasons in favor of using
it, too. The HTTP header is still the right way to specify the
encoding. Actually, the <meta> hack is just a plastic surrogate for it.

--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html

Jul 20 '05 #7
"Jukka K. Korpela" <jk******@cs.tu t.fi> wrote:
There's something to be said in favor of that idea, if the macrons are
just additional information working as length mark (e.g., as in Latin
grammars). They might - just might - be seen as "presentational ".


But only in Latin. In other languages, vowels with macron are characters
of their own - just as å ä ö.
Jul 20 '05 #8
bs*****@mac.com (Ben Sharvy) wrote in message news:<d1******* *************** ****@posting.go ogle.com>...
The vowels with macron are included the Baltic Rim character set,
ISO-8859-13 or Windows-1257.
Nevertheless, it seems best for you to include all special characters
as &#number; expressions.


I found the &#xxx; form. However, it doesn't display correctly on
Opera 5 (Mac). I've also discovered the CSS "overline" property, which
I might employ instead.


Opera 5 and below does not support Unicode on Mac or Windows.

Upgrade to Opera 6 for Mac, which does support Unicode.

--
Alan Wood
http://www.alanwood.net (Unicode, special characters, pesticide names)
Jul 20 '05 #9

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

30
20269
by: The Plankmeister | last post by:
In the <meta http-equiv="Content-Type" content="text/html; charset=????????"> tag, what is the best charset to use? I tend to use UTF-8, for no reason other than it sounds much cooler than iso-8859-1 etc etc... Are there any guidelines for this sort of thing? P.
1
1563
by: Nick Kew | last post by:
Someone just drew attention to an open bug report in Apache concerning shipping with an AddDefaultCharset set by default in httpd.conf. This leads to bogus charsets being served in many cases. I've just put forward a suggestion, but I'd welcome review from those familiar with character encoding issues and CA-2000-02 (which is the main...
37
3942
by: Haines Brown | last post by:
I understand that <br /> is marginal in CSS, and so am looking for a substitute for the EOL character. I've failed in both approaches and seek advice. The first thing I tried was to use the unicode EOL characer, which is 000A. I figured with was 10 in decimal notation, and so tried: line 1 line 2 No go. What was wrong with my thinking...
4
26204
by: N. Demos | last post by:
The following code renders as intended in IE (A TABLE, with cells of fixed width and height, inside of a DIV with fixed width and height and overflow set to hidden.) In Firefox, the table cells assume a narrower with than specified. If I comment out the width for the DIV, then the cells render with the correct width and height. Why is this...
28
3213
by: Xiaotian Sun | last post by:
I added the following line to the header of my html file <meta http-equiv="content-type" content="text/html; charset=utf-8"> hoping browsers will use UTF-8 encoding. But all browsers I tried still use ISO-8859-1. What did I do wrong? Thanks,
4
2052
by: Rémi | last post by:
Question: How can you determine the character set used by a webpage you built? My understanding of the issue is that the character set used by an HTML file (or any other file, for that matter) depends on your own system, and the encoding used by it; you cannot randomly insert a <META HTTP-EQUIV="Content-Type" content="text/html;...
7
5972
by: gmclee | last post by:
Hi there, I am writing a program to load HTML from file and send it to IE directly. I've met some problem in charset setting. Most of HTML have charset "us-ascii", for some reason, some UNICODE TEXT will be inserted into the HTML before sending to IE. The problem is 1) Can I specify special charset for some component, e.g. <span...
3
2413
by: mistral | last post by:
how to set correct charset type for multi-language website? Site is on cms, and same headers are dynamically generated for all pages. The recommened charset type is <meta http-equiv="Content-Type" content="text/html;charset=UTF-8">. But if use this type, one of lanuages will not be displayed correctly. Probably this done via server side...
2
3801
by: vunet.us | last post by:
I translate my website to German and some ASCII characters are not translated by my browser. For example I see this: &#x00FC; I use charset: <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> How can I determine correct charset? Thanks
0
8186
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. ...
1
7936
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For...
0
8197
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the...
0
6583
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then...
0
5377
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert...
0
3821
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in...
0
3847
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
1
2335
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
1
1436
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.