473,785 Members | 2,801 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

What special character notation is this

I saved an html page the other day that encoded some punctuation with codes
like

?T --> apostrophe

(in case those characters don't show up in your news reader that's
a_circumflex + euro + trademark)

--Art

Jul 23 '05 #1
7 3793
"Art M" <aa*****@cox.ne t> wrote:
I saved an html page the other day that encoded some punctuation with
codes like

?T --> apostrophe
This would be much easier if you had told _which_ page (and which
apostrophe - there are several characters that are called "apostrophe ").

The odds are that you have saved a UTF-8 encoded page and are now looking
at its source code using an editor that does not grok UTF-8 or simply
does not know the encoding.
(in case those characters don't show up in your news reader that's
a_circumflex + euro + trademark)


Your Usenet message specified no encoding, so all characters are assumed
to be Ascii characters. So the problem is not so much in our newsreaders
but in your way of sending data.

--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html

Jul 23 '05 #2
Art M wrote:
I saved an html page the other day that encoded some punctuation with codes
like

?T --> apostrophe

(in case those characters don't show up in your news reader that's
a_circumflex + euro + trademark)


Some discussion of character sets / encodings is in these pages:

http://webtips.dan.info/char.html
http://mailformat.dan.info/body/charsets.html

--
== Dan ==
Dan's Mail Format Site: http://mailformat.dan.info/
Dan's Web Tips: http://webtips.dan.info/
Dan's Domain Site: http://domains.dan.info/
Jul 23 '05 #3

"Jukka K. Korpela" <jk******@cs.tu t.fi> wrote in message
news:Xn******** *************** *****@193.229.0 .31...
"Art M" <aa*****@cox.ne t> wrote:
I saved an html page the other day that encoded some punctuation with
codes like

?T --> apostrophe


This would be much easier if you had told _which_ page (and which
apostrophe - there are several characters that are called "apostrophe ").

The odds are that you have saved a UTF-8 encoded page and are now looking
at its source code using an editor that does not grok UTF-8 or simply
does not know the encoding.
(in case those characters don't show up in your news reader that's
a_circumflex + euro + trademark)


Your Usenet message specified no encoding, so all characters are assumed
to be Ascii characters. So the problem is not so much in our newsreaders
but in your way of sending data.

--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html


The page is at
http://216.239.39.104/search?q=cache...Lucid%22&hl=en

Odd thing is that when I view source from either NS 4.8 or IE 6.0 they show
the desired character, but when saved from either program they use these 3
character codes that begin with a_circumflex + euro.

What encoding should I have used to send my original post as plain text so
that the correct latter 2 characters showed?

Thanks
--Art
Jul 23 '05 #4
Never mind. It seems to be a problem with the Rich Text Box control or
program in which I'm viewing the saved file.

--Art
"Art M" <aa*****@cox.ne t> wrote in message
news:BDDed.1571 16$a85.29973@fe d1read04...

"Jukka K. Korpela" <jk******@cs.tu t.fi> wrote in message
news:Xn******** *************** *****@193.229.0 .31...
"Art M" <aa*****@cox.ne t> wrote:
I saved an html page the other day that encoded some punctuation with
codes like

?T --> apostrophe


This would be much easier if you had told _which_ page (and which
apostrophe - there are several characters that are called "apostrophe ").

The odds are that you have saved a UTF-8 encoded page and are now looking
at its source code using an editor that does not grok UTF-8 or simply
does not know the encoding.
(in case those characters don't show up in your news reader that's
a_circumflex + euro + trademark)


Your Usenet message specified no encoding, so all characters are assumed
to be Ascii characters. So the problem is not so much in our newsreaders
but in your way of sending data.

--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html


The page is at
http://216.239.39.104/search?q=cache...Lucid%22&hl=en

Odd thing is that when I view source from either NS 4.8 or IE 6.0 they
show the desired character, but when saved from either program they use
these 3 character codes that begin with a_circumflex + euro.

What encoding should I have used to send my original post as plain text so
that the correct latter 2 characters showed?

Thanks
--Art

Jul 23 '05 #5
Art,

I read the first little bit of the site you posted. I noticed it talks of
Hawaii. Should I then assume that the a_circumflex and apostrophe you are
talking about are the Hawaiian letters "a with kahako" and the okina? If
that is the case, please do some research for the proper unicode
characters for these letters.

http://www.geocities.com/aloha_spiri.../alphabet.html

HTH,
La'ie Techie

Jul 23 '05 #6
On Sat, 23 Oct 2004, Art M wrote:
"Jukka K. Korpela" <jk******@cs.tu t.fi> wrote in message
news:Xn******** *************** *****@193.229.0 .31...

The odds are that you have saved a UTF-8 encoded page and are now
looking at its source code using an editor that does not grok
UTF-8 or simply does not know the encoding.
Odd thing is that when I view source from either NS 4.8 or IE 6.0
they show the desired character, but when saved from either program
they use these 3 character codes that begin with a_circumflex +
euro.


Nothing odd about that - indeed, you quoted the answer to your own
question.

When a page is sent out with its encoding specified (charset) on the
HTTP header, and then saved to a local file, the charset information
typically goes missing. When you then browse the local file, the
browser uses whichever encoding has been chosen as its default.
In this case, you'd want to switch that to utf-8 to view the page
correctly.

There's some folks who favour putting the charset into a <meta...>
in the document itself, for this very reason, although doing so does
also have some negative implications. Discussing the minutiae of that
here would probably not be useful: the first step is to get you up to
speed with the character representation model of HTML.

This was already set out in RFC2070, and there's a reasonable
presentation of it in the HTML4 specification; the W3C are developing
a more comprehensive document about it here:
http://www.w3.org/TR/charmod/
What encoding should I have used to send my original post as plain
text so that the correct latter 2 characters showed?


Well, that's a question about usenet, rather than about authoring HTML
for the WWW. Usenet specifications are still rather old-fashioned
about this kind of thing, and don't really welcome postings in utf-8
encoding, except maybe in specialised contexts.

Part of the problem with your usenet postings is that they don't have
proper MIME headers, and, as such, doesn't even conform to the
decade-old MIME specification. I'm told that it's possible to tame
that Lookout-Excess thingy into posting to usenet in a form that's in
somewhat closer agreement with generally-accepted usenet conventions.
But as I don't use it myself, I'm not best-placed to give advice,
other than to suggest using something else as your usenet client.

all the best

Jul 23 '05 #7
On Fri, 22 Oct 2004, Art M wrote:
X-Newsreader: Microsoft Outlook Express 6.00.2900.2180

??T --> apostrophe
(in case those characters don't show up in your news reader that's
a_circumflex + euro + trademark)


Perhaps _you_ should use a newsreader instead of this surrogate
from Microsoft. If you continue to use Outlook Express, then
select

Tools > Options > Send
Mail Sending Format > Plain Text Settings > Message format MIME
News Sending Format > Plain Text Settings > Message format MIME
Encode text using: None

to send special, non-ASCII characters.

--
Top-posting.
What's the most irritating thing on Usenet?

Jul 23 '05 #8

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

3
17210
by: Barry Olly | last post by:
Hi, I'm working on a mini content management system and need help with dealing with special characters. The input are taken from html form which are then stored into a varchar column in oracle database. When i retrieve the data, some of the special characters have been changed to ??? and also
28
3306
by: David MacQuigg | last post by:
I'm concerned that with all the focus on obj$func binding, &closures, and other not-so-pretty details of Prothon, that we are missing what is really good - the simplification of classes. There are a number of aspects to this simplification, but for me the unification of methods and functions is the biggest benefit. All methods look like functions (which students already understand). Prototypes (classes) look like modules. This will...
3
2375
by: ATH0 | last post by:
How to search for special character { } and how to count them.. I got field called text ( undefined length ) and in this field you must define "{" as start and "}" as end of some text line. If you can find both {} then you have to count them. example: bla bla bla {bla { asd asd} Result:
1
2294
by: lkrubner | last post by:
>Alan J. Flavell Oct 7 2004, 1:44 pm show options >>On Thu, 7 Oct 2004, Shmuel (Seymour J.) Metz wrote: >> at 08:24 PM, "Alan J. Flavell" <flav...@ph.gla.ac.uk> said: >> >I think you mean "multiple character encoding schemes". >> Yes, although a different character set would imply a different >> encoding scheme. > >Absolutely not. That's the whole point! > >In (X)HTML you can (if you so choose) represent any Unicode character
6
2724
by: sviau | last post by:
http://www.mls.ca/PropertyDetails.aspx?vd=&SearchURL=%3fMode%3d0%26Page%3d1%26vs%3d1%26rlt%3d%26cp%3d%26pt%3d1%26mp%3d0-0-0%26mrt%3d-1-0-0%26Beds%3d0-0% 26Baths%3d0-0%26f%3d%26ft%3dall%26o%3dA%26of%3d1%26ps%3d10%26aid%3d3490%26Ma pURL%3d%253fAreaID%253d3476&PropertyID=2127525
5
2386
by: Cylix | last post by:
this.menus = { root: new Array };
10
1944
by: Aray | last post by:
<!ENTITY % testEntity "(test)"> <!ELEMENT testElement %testEntity;> Above is a valid DTD file. But it doesn't work when I try to put a charater '/' in to the Content of testEntity. like following <!ENTITY % testEntity "(test/test)"> <!ELEMENT testElement %testEntity;> I am newbie to xml, Could you please tell me how to make it works?
2
5583
by: Diilb | last post by:
I am using DOM to create an rss feed. The problem I am running into is "special characters" such as . If I try adding them to the XML as character data (CData), DOM chokes and throws out errors. If I do a search and replace on the characters changing them to HTML entities instead of using CDATA, DOM chokes up again. If I do a combination of both (search and replace then add as Character Data) it does not choke, but the RSS...
0
2000
by: AAaron123 | last post by:
Been playing with asp:changepassword and have it looking OK except that I can't elininate or change the title at the top that says "Change Your Password". It's a repeat of my pages title. But more importantly, I can't find the regular expression for at least 7 characters and at least 1 special character. Can you help?
0
9645
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, well explore What is ONU, What Is Router, ONU & Routers main usage, and What is the difference between ONU and Router. Lets take a closer look ! Part I. Meaning of...
0
9480
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
0
9949
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
1
7499
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupr who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
6739
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
0
5511
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
1
4050
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
2
3645
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
3
2879
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.