I saved an html page the other day that encoded some punctuation with codes
like
?T --> apostrophe
(in case those characters don't show up in your news reader that's
a_circumflex + euro + trademark)
--Art 7 3793
"Art M" <aa*****@cox.ne t> wrote: I saved an html page the other day that encoded some punctuation with codes like
?T --> apostrophe
This would be much easier if you had told _which_ page (and which
apostrophe - there are several characters that are called "apostrophe ").
The odds are that you have saved a UTF-8 encoded page and are now looking
at its source code using an editor that does not grok UTF-8 or simply
does not know the encoding.
(in case those characters don't show up in your news reader that's a_circumflex + euro + trademark)
Your Usenet message specified no encoding, so all characters are assumed
to be Ascii characters. So the problem is not so much in our newsreaders
but in your way of sending data.
--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html
"Jukka K. Korpela" <jk******@cs.tu t.fi> wrote in message
news:Xn******** *************** *****@193.229.0 .31... "Art M" <aa*****@cox.ne t> wrote:
I saved an html page the other day that encoded some punctuation with codes like
?T --> apostrophe
This would be much easier if you had told _which_ page (and which apostrophe - there are several characters that are called "apostrophe ").
The odds are that you have saved a UTF-8 encoded page and are now looking at its source code using an editor that does not grok UTF-8 or simply does not know the encoding.
(in case those characters don't show up in your news reader that's a_circumflex + euro + trademark)
Your Usenet message specified no encoding, so all characters are assumed to be Ascii characters. So the problem is not so much in our newsreaders but in your way of sending data.
-- Yucca, http://www.cs.tut.fi/~jkorpela/ Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html
The page is at http://216.239.39.104/search?q=cache...Lucid%22&hl=en
Odd thing is that when I view source from either NS 4.8 or IE 6.0 they show
the desired character, but when saved from either program they use these 3
character codes that begin with a_circumflex + euro.
What encoding should I have used to send my original post as plain text so
that the correct latter 2 characters showed?
Thanks
--Art
Never mind. It seems to be a problem with the Rich Text Box control or
program in which I'm viewing the saved file.
--Art
"Art M" <aa*****@cox.ne t> wrote in message
news:BDDed.1571 16$a85.29973@fe d1read04... "Jukka K. Korpela" <jk******@cs.tu t.fi> wrote in message news:Xn******** *************** *****@193.229.0 .31... "Art M" <aa*****@cox.ne t> wrote:
I saved an html page the other day that encoded some punctuation with codes like
?T --> apostrophe
This would be much easier if you had told _which_ page (and which apostrophe - there are several characters that are called "apostrophe ").
The odds are that you have saved a UTF-8 encoded page and are now looking at its source code using an editor that does not grok UTF-8 or simply does not know the encoding.
(in case those characters don't show up in your news reader that's a_circumflex + euro + trademark)
Your Usenet message specified no encoding, so all characters are assumed to be Ascii characters. So the problem is not so much in our newsreaders but in your way of sending data.
-- Yucca, http://www.cs.tut.fi/~jkorpela/ Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html
The page is at http://216.239.39.104/search?q=cache...Lucid%22&hl=en
Odd thing is that when I view source from either NS 4.8 or IE 6.0 they show the desired character, but when saved from either program they use these 3 character codes that begin with a_circumflex + euro.
What encoding should I have used to send my original post as plain text so that the correct latter 2 characters showed?
Thanks --Art
Art,
I read the first little bit of the site you posted. I noticed it talks of
Hawaii. Should I then assume that the a_circumflex and apostrophe you are
talking about are the Hawaiian letters "a with kahako" and the okina? If
that is the case, please do some research for the proper unicode
characters for these letters. http://www.geocities.com/aloha_spiri.../alphabet.html
HTH,
La'ie Techie
On Sat, 23 Oct 2004, Art M wrote: "Jukka K. Korpela" <jk******@cs.tu t.fi> wrote in message news:Xn******** *************** *****@193.229.0 .31... The odds are that you have saved a UTF-8 encoded page and are now looking at its source code using an editor that does not grok UTF-8 or simply does not know the encoding. Odd thing is that when I view source from either NS 4.8 or IE 6.0 they show the desired character, but when saved from either program they use these 3 character codes that begin with a_circumflex + euro.
Nothing odd about that - indeed, you quoted the answer to your own
question.
When a page is sent out with its encoding specified (charset) on the
HTTP header, and then saved to a local file, the charset information
typically goes missing. When you then browse the local file, the
browser uses whichever encoding has been chosen as its default.
In this case, you'd want to switch that to utf-8 to view the page
correctly.
There's some folks who favour putting the charset into a <meta...>
in the document itself, for this very reason, although doing so does
also have some negative implications. Discussing the minutiae of that
here would probably not be useful: the first step is to get you up to
speed with the character representation model of HTML.
This was already set out in RFC2070, and there's a reasonable
presentation of it in the HTML4 specification; the W3C are developing
a more comprehensive document about it here: http://www.w3.org/TR/charmod/
What encoding should I have used to send my original post as plain text so that the correct latter 2 characters showed?
Well, that's a question about usenet, rather than about authoring HTML
for the WWW. Usenet specifications are still rather old-fashioned
about this kind of thing, and don't really welcome postings in utf-8
encoding, except maybe in specialised contexts.
Part of the problem with your usenet postings is that they don't have
proper MIME headers, and, as such, doesn't even conform to the
decade-old MIME specification. I'm told that it's possible to tame
that Lookout-Excess thingy into posting to usenet in a form that's in
somewhat closer agreement with generally-accepted usenet conventions.
But as I don't use it myself, I'm not best-placed to give advice,
other than to suggest using something else as your usenet client.
all the best
On Fri, 22 Oct 2004, Art M wrote: X-Newsreader: Microsoft Outlook Express 6.00.2900.2180
??T --> apostrophe (in case those characters don't show up in your news reader that's a_circumflex + euro + trademark)
Perhaps _you_ should use a newsreader instead of this surrogate
from Microsoft. If you continue to use Outlook Express, then
select
Tools > Options > Send
Mail Sending Format > Plain Text Settings > Message format MIME
News Sending Format > Plain Text Settings > Message format MIME
Encode text using: None
to send special, non-ASCII characters.
--
Top-posting.
What's the most irritating thing on Usenet? This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics |
by: Barry Olly |
last post by:
Hi,
I'm working on a mini content management system and need help with
dealing with special characters.
The input are taken from html form which are then stored into a
varchar column in oracle database.
When i retrieve the data, some of the special characters have been
changed to ??? and also
|
by: David MacQuigg |
last post by:
I'm concerned that with all the focus on obj$func binding, &closures,
and other not-so-pretty details of Prothon, that we are missing what
is really good - the simplification of classes. There are a number of
aspects to this simplification, but for me the unification of methods
and functions is the biggest benefit.
All methods look like functions (which students already understand).
Prototypes (classes) look like modules. This will...
|
by: ATH0 |
last post by:
How to search for special character { } and how to count them..
I got field called text ( undefined length ) and in this field you must
define "{" as start and "}" as end of some text line. If you can find both
{} then you have to count them.
example:
bla bla bla {bla { asd asd}
Result:
|
by: lkrubner |
last post by:
>Alan J. Flavell Oct 7 2004, 1:44 pm show options
>>On Thu, 7 Oct 2004, Shmuel (Seymour J.) Metz wrote:
>> at 08:24 PM, "Alan J. Flavell" <flav...@ph.gla.ac.uk> said:
>> >I think you mean "multiple character encoding schemes".
>> Yes, although a different character set would imply a different
>> encoding scheme.
>
>Absolutely not. That's the whole point!
>
>In (X)HTML you can (if you so choose) represent any Unicode character
|
by: sviau |
last post by:
http://www.mls.ca/PropertyDetails.aspx?vd=&SearchURL=%3fMode%3d0%26Page%3d1%26vs%3d1%26rlt%3d%26cp%3d%26pt%3d1%26mp%3d0-0-0%26mrt%3d-1-0-0%26Beds%3d0-0%
26Baths%3d0-0%26f%3d%26ft%3dall%26o%3dA%26of%3d1%26ps%3d10%26aid%3d3490%26Ma
pURL%3d%253fAreaID%253d3476&PropertyID=2127525
| |
by: Cylix |
last post by:
this.menus = { root: new Array };
|
by: Aray |
last post by:
<!ENTITY % testEntity "(test)">
<!ELEMENT testElement %testEntity;>
Above is a valid DTD file. But it doesn't work when I try to put a charater
'/' in to the Content of testEntity. like following
<!ENTITY % testEntity "(test/test)">
<!ELEMENT testElement %testEntity;>
I am newbie to xml, Could you please tell me how to make it works?
|
by: Diilb |
last post by:
I am using DOM to create an rss feed. The problem I am running into
is "special characters" such as . If I try adding them to the
XML as character data (CData), DOM chokes and throws out errors. If I
do a search and replace on the characters changing them to HTML
entities instead of using CDATA, DOM chokes up again. If I do a
combination of both (search and replace then add as Character Data) it
does not choke, but the RSS...
|
by: AAaron123 |
last post by:
Been playing with asp:changepassword and have it looking OK except that I
can't elininate or change the title at the top that says "Change Your
Password". It's a repeat of my pages title.
But more importantly, I can't find the regular expression for at least 7
characters and at least 1 special character.
Can you help?
|
by: marktang |
last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, well explore What is ONU, What Is Router, ONU & Routers main usage, and What is the difference between ONU and Router. Lets take a closer look !
Part I. Meaning of...
|
by: Hystou |
last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it.
First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
| |
by: tracyyun |
last post by:
Dear forum friends,
With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
|
by: isladogs |
last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM).
In this session, we are pleased to welcome a new presenter, Adolph Dupr who will be discussing some powerful techniques for using class modules.
He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms.
Adolph will...
|
by: conductexam |
last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one.
At the time of converting from word file to html my equations which are in the word document file was convert into image.
Globals.ThisAddIn.Application.ActiveDocument.Select();...
|
by: adsilva |
last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
|
by: 6302768590 |
last post by:
Hai team
i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
|
by: muto222 |
last post by:
How can i add a mobile payment intergratation into php mysql website.
| |
by: bsmnconsultancy |
last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...
| |