473,889 Members | 1,557 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

unicode and numeric character reference in html

I've got some html-files with a lot of greek characters in utf-8. When
I open my source code, I don't see the numeric character reference but
the character itself. How can I change this?
I really need source codes with numeric character references.

Oct 18 '07 #1
6 3952
st************* ***@gmail.com wrote:
I've got some html-files with a lot of greek characters in utf-8. When
I open my source code, I don't see the numeric character reference but
the character itself.
Most probably, you used a text editor supporting UTF-8 and the greek
characters are encoded in UTF-8. This is perfectly fine.
How can I change this?
I really need source codes with numeric character references.
To see the Unicode code points, use an hex editor.
Do you want to convert these characters to HTML character references such
as &#N; where N is a number in decimal notation?

--
If you've a question that doesn't belong to Usenet, contact me at
<ta************ *****@yahoDELET ETHATo.fr>
Oct 18 '07 #2
To see the Unicode code points, use an hex editor.
Do you want to convert these characters to HTML character references such
as &#N; where N is a number in decimal notation?
yes, I do...

i'll have a look at this recode program.

Oct 19 '07 #3
On Thu, 18 Oct 2007, st************* ***@gmail.com wrote:
I've got some html-files with a lot of greek characters in utf-8. When
I open my source code, I don't see the numeric character reference but
the character itself. How can I change this?
I really need source codes with numeric character references.
Open in Mozilla or a similar creature and save your files
in the encoding ISO-8859-1. All characters outside ISO-8859-1
will be converted to numeric references.

--
In memoriam Alan J. Flavell
http://groups.google.com/groups/sear...Alan.J.Flavell
Oct 19 '07 #4
st************* ***@gmail.com wrote:
I've got some html-files with a lot of greek characters in utf-8. When
I open my source code, I don't see the numeric character reference but
the character itself.
Well--that IS the point of using UTF-8--so you can enter the characters
directly in a text editor and read them in a file viewer, instead of
having to use &#nnnn;.
How can I change this?
I really need source codes with numeric character references.
Use global search and replace in your text editor.

Oct 19 '07 #5
Scripsit st************* ***@gmail.com:
You ask me why?
Thank you for Upside-down Fullquoting, the standard cluelessness indicator.
This search engine can only
handle numeric character reference. I know this is weird,
No, it's simply crappy software and should not be used for anything related
to HTML authoring for the WWW. There you have the real problem.
I managed to solve the problem
No, you just created the illusion of being able to live with the problem.
The problem, I repeat, is a "search engine" that is not useful, in a world
with free search engines available around the globe.

--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/

Nov 2 '07 #6
Jukka K. Korpela wrote:
Scripsit st************* ***@gmail.com:
>This search engine can only
handle numeric character reference. I know this is weird,

No, it's simply crappy software and should not be used for anything
related to HTML authoring for the WWW. There you have the real problem.
He didn't specify what the search engine searches.
It may (or may not) be a very specific search engine, with highly complex
underlying technologies so that, there may be no alternative. For
instance, a search engine looking for specific english grammatical
constructs with a true english language grammar parser.
In that case, he may have to live with the quirks and bugs of this search
engine, or ask for support from the software developers, if there's still
support for this software.

--
If you've a question that doesn't belong to Usenet, contact me at
<ta************ *****@yahoDELET ETHATo.fr>
Nov 2 '07 #7

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

6
2794
by: S. | last post by:
if in my website i am using the sgml { notation, is it accurate to say to my users that the site uses unicode or that it requires unicode? is there a mathematical formula to calculate a unicode value given its utf8 value? Rgds, Sam
5
4422
by: Nancy | last post by:
I recently completed a web page, "Browser Tests of Entities in 2004". http://www.santagata.us/characters/CharacterEntities.html It shows those characters that work in all of the version 5.2+ browsers that were tested and those that only work in some of them. Take a look, maybe you'll consider it useful. This is not my field (I'm an architect - you know the house construction kind), so if you notice any inaccuracies I'd appreciate a...
11
3681
by: Patrick Van Esch | last post by:
Hello, I have the following problem of principle: in writing HTML pages containing ancient greek, there are two possibilities: one is to write the unicode characters directly (encoded as two bytes) into the HTML source, and save this source not as an ASCII text, but as a UNICODE text file (using 16 bits per character, also for the Western ASCII characters, which are usually encoded as Ox00XX with XX the ASCII code) ; or to write a pure...
1
3213
by: Mark Johnson | last post by:
I wonder if anyone has a solution? I wanted to use the web browser control as a 'zoom' box for a smaller textbox. I can format in the control, and save whatever formatting as HTML code back to the textbox when the web browser is closed. The only problem comes in the use of numeric entities to specify Unicode. The web browser control is fine when it comes to named entities, like &nbsp; . And there are a lot of named entities, just...
15
2128
by: John Salerno | last post by:
Forgive my newbieness, but I don't quite understand why Unicode is still something that needs special treatment in Python (and perhaps elsewhere). I'm reading Dive Into Python right now, and it constantly refers to a 'regular string' versus a 'Unicode string' and how you need to convert back and forth. But why isn't Unicode considered a regular string by now? Is it for historical reasons that we still use ASCII and Latin-1? Why can't...
40
3287
by: apprentice | last post by:
Hello, I'm writing an class library that I imagine people from different countries might be interested in using, so I'm considering what needs to be provided to support foreign languages, including asian languages (chinese, japanese, korean, etc). First of all, strings will be passed to my class methods, some of which based on the language (and on the encoding) might contain characters that require more that a single byte.
8
2817
by: lorenzo.viscanti | last post by:
X-No-Archive: yes Hi, I've found lots of material on the net about unicode html conversions, but still i'm having many problems converting unicode characters to html entities. Is there any available function to solve this issue? As an example I would like to do this kind of conversion: \uc3B4 =&ocirc; for all available html entities. thanks,
4
2738
by: Jason | last post by:
This is a Chinese character in unicode: 挪 I made it in Javascript by adding "&#"+"25"+"386" I need to convert it in Javascript to this: 挪 (The actual character) How do I achieve this conversion in Javascript? Jas
29
2139
by: Ioannis Vranos | last post by:
Hi, I am currently learning QT, a portable C++ framework which comes with both a commercial and GPL license, and which provides conversion operations to its various types to/from standard C++ types. For example its QString type provides a toWString() that returns a std::wstring with its Unicode contents. So, since wstring supports the largest character set, why do we need explicit Unicode types in C++?
0
9962
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look ! Part I. Meaning of...
0
9810
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
0
11198
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. Here is my compilation command: g++-12 -std=c++20 -Wnarrowing bit_field.cpp Here is the code in...
0
10791
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth. The Art of Business Website Design Your website is...
1
10889
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
0
9609
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
1
7993
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
6029
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
2
4251
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.