473,770 Members | 1,841 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Non-ASCII characters in web pages

I run a website for my very extended family. The site is not a static one,
and pages are frequently added and changed. I constructed it by myself, but
I can best be described as a casual and unsophisticated web designer.

Because I have always had difficulty in producing dashes on my pages, I
generally use double hyphens instead. Books that I possess or have seen on
HTML tell me that I could make e.g. an em-dash by using the escape sequence
(without the quotes) "&emdash;", but this is displayed *literally* on
browsers, not as an em-dash. A friend has told me now that I can make the
desired dashes with the strings "&#8211" and "&#8212" for en- and em-dashes
respectively, and sure enough this works.

These strings are very unintuitive (which is an understatement) ; there is no
obvious way to form a mnemonic to remember them. Why do books tell me the
easily remembered strings I have mentioned above? Has the spec changed? When
and why?

More importantly, is there a list somewhere on the Net that I could
download, and that would list all the other similar strings for non-ASCII
characters: quotes, spaces, diacritics, etc.?

--
Stan Goodman
Qiryat Tiv'on
Israel

To send me email, please replace the CAPITAL_LETTERS with "sig". Please do
not send me HTML-formatted messages.Please do not send me attachments
without telling me beforehand.

Jul 20 '05 #1
23 7177
Stan Goodman:
Books that I possess or have seen on
HTML tell me that I could make e.g. an em-dash by using the escape sequence
(without the quotes) "&emdash;", but this is displayed *literally* on
browsers, not as an em-dash. A friend has told me now that I can make the
desired dashes with the strings "&#8211" and "&#8212" for en- and em-dashes
respectively, and sure enough this works.
It should be "mdash" (not "emdash"). Either your books are crap, or you
should get new glasses.
More importantly, is there a list somewhere on the Net that I could
download, and that would list all the other similar strings for non-ASCII
characters: quotes, spaces, diacritics, etc.?


Try the HTML 4.01 specification:

<URL:http://www.w3.org/TR/html40/sgml/entities.html>

--
Bertilo Wennergren <be******@gmx.n et> <http://www.bertilow.co m>
Jul 20 '05 #2
Stan Goodman wrote:
Books that I possess or have seen on
HTML tell me that I could make e.g. an em-dash by using the escape sequence
(without the quotes) "&emdash;", but this is displayed *literally* on
browsers, not as an em-dash. A friend has told me now that I can make the
desired dashes with the strings "&#8211" and "&#8212" for en- and em-dashes
respectively, and sure enough this works.
Bertilo Wennergren <be******@gmx.n et> wrote:
It should be "mdash" (not "emdash"). Either your books are crap, or you
should get new glasses.


To be fair, HTML 3.0 (RIP) specified &endash; and &emdash; rather than the
&ndash; and &mdash; specified by HTML 4.x and implemented by modern
browsers.
More importantly, is there a list somewhere on the Net that I could
download, and that would list all the other similar strings for non-ASCII
characters: quotes, spaces, diacritics, etc.?


Try the HTML 4.01 specification:

<URL:http://www.w3.org/TR/html40/sgml/entities.html>


The situation is complicated by browser support, or lack thereof.

There are (old, obsolete) browsers that display — properly, but don't
display &mdash; properly. So by using — instead of &mdash;, you
improve the situation for readers using those browsers.

There are other (older, more obsolete) browsers that display neither
— nor &mdash; properly. When the character name is displayed "as is"
by such browsers, "mdash" might be more sensible than "#8212", and by using
&mdash; instead of —, you might improve the situation for readers
using those browsers.

See also http://www.htmlhelp.com/faq/html/bas...l#special-char
--
Darin McGrew, mc****@stanford alumni.org, http://www.rahul.net/mcgrew/
Web Design Group, da***@htmlhelp. com, http://www.HTMLHelp.com/

"Experience is something you don't get until just after you need it."
Jul 20 '05 #3
On Tue, 24 Feb 2004 00:59:46 +0100, Bertilo Wennergren <be******@gmx.n et>
wrote:

It should be "mdash" (not "emdash").
This is true.
Either your books are crap, or you
should get new glasses.


This is insulting, and for no purpose.
More importantly, is there a list somewhere on the Net that I could
download, and that would list all the other similar strings for
non-ASCII
characters: quotes, spaces, diacritics, etc.?


Try the HTML 4.01 specification:

<URL:http://www.w3.org/TR/html40/sgml/entities.html>


I'll also give this very comprehensive link:

http://www.pemberley.com/janeinfo/latin1.html
Jul 20 '05 #4
Neal:
On Tue, 24 Feb 2004 00:59:46 +0100, Bertilo Wennergren <be******@gmx.n et>
wrote:
It should be "mdash" (not "emdash"). This is true.

Either your books are crap, or you should get new glasses.

This is insulting, and for no purpose.


Sorry, I didn't mean to insult. It was a bad joke. I regret it.

--
Bertilo Wennergren <be******@gmx.n et> <http://www.bertilow.co m>
Jul 20 '05 #5
On 23 Feb 2004 23:31:11 GMT, "Stan Goodman" <SP*********@ha shkedim.com>
wrote:
Because I have always had difficulty in producing dashes on my pages, I
generally use double hyphens instead. Books that I possess or have seen on
HTML tell me that I could make e.g. an em-dash by using the escape sequence
(without the quotes) "&emdash;", but this is displayed *literally* on
browsers, not as an em-dash. A friend has told me now that I can make the
desired dashes with the strings "&#8211" and "&#8212" for en- and em-dashes
respectively , and sure enough this works.

These strings are very unintuitive (which is an understatement) ; there is no
obvious way to form a mnemonic to remember them. Why do books tell me the
easily remembered strings I have mentioned above? Has the spec changed? When
and why?


To what others have said, I would add:

- At least the commoner characters are supported in the mnemonic form
(character entity reference) by most/all browsers newer than Netscape 4.

- I use a couple of sed scripts to produce these characters myself. They
are available on my site if you'd like to try them:
http://www.xs4all.nl/~sbpoley/webmat...er_quotes.html

--
Stephen Poley

http://www.xs4all.nl/~sbpoley/webmatters/
Jul 20 '05 #6
On 23 Feb 2004, Stan Goodman wrote:
A friend has told me now that I can make the
desired dashes with the strings "&#8211" and "&#8212" for en- and em-dashes
respectively, and sure enough this works.
In addition to the other answers, see also
<http://ppewww.ph.gla.a c.uk/~flavell/charset/checklist.html# NoteUTF>
More importantly, is there a list somewhere on the Net that I could
download, and that would list all the other similar strings for non-ASCII
characters: quotes, spaces, diacritics, etc.?


For example <http://www.unics.uni-hannover.de/nhtcapri/multilingual2.h tml>

Jul 20 '05 #7
On Mon, 23 Feb 2004 23:59:46 UTC, Bertilo Wennergren <be******@gmx.n et>
opined:
Stan Goodman:
Books that I possess or have seen on
HTML tell me that I could make e.g. an em-dash by using the escape sequence
(without the quotes) "&emdash;", but this is displayed *literally* on
browsers, not as an em-dash. A friend has told me now that I can make the
desired dashes with the strings "&#8211" and "&#8212" for en- and em-dashes
respectively, and sure enough this works.


It should be "mdash" (not "emdash"). Either your books are crap, or you
should get new glasses.
More importantly, is there a list somewhere on the Net that I could
download, and that would list all the other similar strings for non-ASCII
characters: quotes, spaces, diacritics, etc.?


Try the HTML 4.01 specification:

<URL:http://www.w3.org/TR/html40/sgml/entities.html>


Thank you,

--
Stan Goodman
Qiryat Tiv'on
Israel

Saddam is gone. Ceterum, censeo Arafat esse delendum.

To send me email, please replace the CAPITAL_LETTERS with "sig". Please do
not send me HTML-formatted messages.Please do not send me attachments
without telling me beforehand.

Jul 20 '05 #8
On Tue, 24 Feb 2004 01:06:53 UTC, Darin McGrew <mc****@stanfor dalumni.org>
opined:
Stan Goodman wrote:
Books that I possess or have seen on
HTML tell me that I could make e.g. an em-dash by using the escape sequence
(without the quotes) "&emdash;", but this is displayed *literally* on
browsers, not as an em-dash. A friend has told me now that I can make the
desired dashes with the strings "&#8211" and "&#8212" for en- and em-dashes
respectively, and sure enough this works.
Bertilo Wennergren <be******@gmx.n et> wrote:
It should be "mdash" (not "emdash"). Either your books are crap, or you
should get new glasses.


To be fair, HTML 3.0 (RIP) specified &endash; and &emdash; rather than the
&ndash; and &mdash; specified by HTML 4.x and implemented by modern
browsers.
More importantly, is there a list somewhere on the Net that I could
download, and that would list all the other similar strings for non-ASCII
characters: quotes, spaces, diacritics, etc.?


Try the HTML 4.01 specification:

<URL:http://www.w3.org/TR/html40/sgml/entities.html>


The situation is complicated by browser support, or lack thereof.

There are (old, obsolete) browsers that display — properly, but don't
display &mdash; properly. So by using — instead of &mdash;, you
improve the situation for readers using those browsers.

There are other (older, more obsolete) browsers that display neither
— nor &mdash; properly. When the character name is displayed "as is"
by such browsers, "mdash" might be more sensible than "#8212", and by using
&mdash; instead of —, you might improve the situation for readers
using those browsers.

See also http://www.htmlhelp.com/faq/html/bas...l#special-char


I am grateful to you for your fuller (and, incidentally, more temperate)
explanation of the situation. Evidently, the book most readily available to
me may not be actual crap, but merely obsolete (HTML v3.2), and that I still
do not need glasses. A reading of my query will show that a change in the
specification was exactly what I was asking about.

--
Stan Goodman
Qiryat Tiv'on
Israel

Saddam is gone. Ceterum, censeo Arafat esse delendum.

To send me email, please replace the CAPITAL_LETTERS with "sig". Please do
not send me HTML-formatted messages.Please do not send me attachments
without telling me beforehand.

Jul 20 '05 #9
On Tue, 24 Feb 2004 10:42:28 UTC, Bertilo Wennergren <be******@gmx.n et>
opined:
Neal:
On Tue, 24 Feb 2004 00:59:46 +0100, Bertilo Wennergren <be******@gmx.n et>
wrote:
It should be "mdash" (not "emdash").

This is true.

Either your books are crap, or you should get new glasses.

This is insulting, and for no purpose.


Sorry, I didn't mean to insult. It was a bad joke. I regret it.


All is forgiven.

--
Stan Goodman
Qiryat Tiv'on
Israel

Saddam is gone. Ceterum, censeo Arafat esse delendum.

To send me email, please replace the CAPITAL_LETTERS with "sig". Please do
not send me HTML-formatted messages.Please do not send me attachments
without telling me beforehand.

Jul 20 '05 #10

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

12
4429
by: lothar | last post by:
re: 4.2.1 Regular Expression Syntax http://docs.python.org/lib/re-syntax.html *?, +?, ?? Adding "?" after the qualifier makes it perform the match in non-greedy or minimal fashion; as few characters as possible will be matched. the regular expression module fails to perform non-greedy matches as described in the documentation: more than "as few characters as possible"
5
3754
by: klaus triendl | last post by:
hi, recently i discovered a memory leak in our code; after some investigation i could reduce it to the following problem: return objects of functions are handled as temporary objects, hence their dtor is called immediately and not at the end of the function. to be able to use return objects (to avoid copying) i often assign them to a const reference. now, casting a const return object from a function to a non-const reference to this...
3
12264
by: Mario | last post by:
Hello, I couldn't find a solution to the following problem (tried google and dejanews), maybe I'm using the wrong keywords? Is there a way to open a file (a linux fifo pipe actually) in nonblocking mode in c++? I did something ugly like --- c/c++ mixture --- mkfifo( "testpipe", 777);
25
7645
by: Yves Glodt | last post by:
Hello, if I do this: for row in sqlsth: ________pkcolumns.append(row.strip()) ________etc without a prior:
32
4526
by: Adrian Herscu | last post by:
Hi all, In which circumstances it is appropriate to declare methods as non-virtual? Thanx, Adrian.
8
3514
by: Bern McCarty | last post by:
Is it at all possible to leverage mixed-mode assemblies from AppDomains other than the default AppDomain? Is there any means at all of doing this? Mixed-mode is incredibly convenient, but if I cannot load/unload/reload extensions into my large and slow-to-load application during development without restarting the process then the disadvantages may outweigh the advantages. I've got a mixed-mode program in which I create a new AppDomain...
14
8469
by: Patrick Kowalzick | last post by:
Dear all, I have an existing piece of code with a struct with some PODs. struct A { int x; int y; };
11
3453
by: ypjofficial | last post by:
Hello All, So far I have been reading that in case of a polymorphic class ( having at least one virtual function in it), the virtual function call get resolved at run time and during that the vtable pointer is made use of.. eg. class one {
2
6118
by: Ian825 | last post by:
I need help writing a function for a program that is based upon the various operations of a matrix and I keep getting a "non-aggregate type" error. My guess is that I need to dereference my pointers, but I'm not sure. Please help. The code: void equate(matrix *A, matrix *B) { int i, j; assert(A.row_dim == B.col_dim && A.col_dim == B.col_dim); for(i=0; i < A.row_dim; i++) for(j=0; j < A.col_dim; j++)
12
29913
by: puzzlecracker | last post by:
is it even possible or/and there is a better alternative to accept input in a nonblocking manner?
0
9617
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look ! Part I. Meaning of...
0
9453
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
0
9904
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
0
8929
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
1
7451
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
6710
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
0
5354
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
0
5481
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
1
4007
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.