HTML Encoded Translation

Dave

How can I translate this:

&#103;&#105;

to this:

"gi"

I've tried urllib.unencode and it doesn't work.

Thanks!

Oct 17 '06 #1

Fredrik Lundh

Dave wrote:

> How can I translate this:
>
> &#103;&#105;
>
> to this:
>
> "gi"

the easiest way is to run it through an HTML or XML parser (depending on
what the source is). or you could use something like this:

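import re

def fix_charrefs(text):
    # replace numeric character references with the characters they name
    def fixup(m):
        text = m.group(0)
        try:
            # unichr is the Python 2 built-in; on Python 3 use chr
            if text[:3] == "&#x":
                return unichr(int(text[3:-1], 16))  # hexadecimal reference
            else:
                return unichr(int(text[2:-1]))  # decimal reference
        except ValueError:
            pass
        return text # leave as is
    return re.sub(r"&#?\w+;", fixup, text)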

>>> fix_charrefs("&#103;&#105;")

'gi'

also see:

http://effbot.org/zone/re-sub.htm#strip-html
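
The parser route mentioned at the top of this reply can be sketched with the standard library. This is only a minimal sketch, assuming Python 3.4 or later (which is newer than this thread); TextCollector and decode_with_parser are hypothetical names chosen for illustration, not part of any library.

```python
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Collect text content; character references are decoded by the parser."""

    def __init__(self):
        # convert_charrefs=True makes the parser turn &#103; and friends
        # into characters before handle_data is called
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def decode_with_parser(markup):
    # hypothetical helper: returns the text of the markup with tags dropped
    parser = TextCollector()
    parser.feed(markup)
    parser.close()  # flush any text the parser is still buffering
    return "".join(parser.parts)


plain = decode_with_parser("&#103;&#105;")
tagged = decode_with_parser("<b>&#x67;i</b>")
```

Note that this approach also discards the tags themselves, so it suits cases where only the text is wanted.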

> I've tried urllib.unencode and it doesn't work.

those are HTML/XML character references, not encoded URL characters.
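
To make the distinction concrete, here is a small sketch that runs both kinds of decoder over both kinds of input. It assumes Python 3.4 or later, where html.unescape and urllib.parse.unquote are available; the sample strings are made up for illustration.

```python
from html import unescape
from urllib.parse import unquote

url_encoded = "%67%69"          # percent-encoding, as used in URLs
html_encoded = "&#103;&#105;"   # numeric character references, as used in HTML/XML

# each decoder only understands its own notation
from_url = unquote(url_encoded)            # percent escapes decoded
from_html = unescape(html_encoded)         # character references decoded

# the wrong decoder leaves the input untouched
url_tool_on_html = unquote(html_encoded)
html_tool_on_url = unescape(url_encoded)
```

Applying the URL decoder to character references is a no-op, which matches the symptom described in the question.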

</F>

Oct 17 '06 #2

Sybren Stuvel

Dave enlightened us with:

> How can I translate this:
>
> &#103;&#105;
>
> to this:
>
> "gi"
>
> I've tried urllib.unencode and it doesn't work.

As you put so nicely in the subject: it is HTML encoding, not URL
encoding. Those are two very different things! Try an HTML decoder,
you'll have more luck with that...
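
As one possible HTML decoder, later Python versions ship html.unescape in the standard library. A minimal sketch, assuming Python 3.4 or later; the sample strings are invented for illustration:

```python
from html import unescape

samples = [
    "&#103;&#105;",        # decimal character references
    "&#x67;&#x69;",        # hexadecimal character references
    "&quot;gi&quot;",      # named entities
    "fish &amp; chips",    # references mixed with ordinary text
]

# unescape handles decimal, hexadecimal and named references alike
decoded = [unescape(s) for s in samples]
```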

Sybren
--
Sybren Stüvel
Stüvel IT - http://www.stuvel.eu/

Oct 17 '06 #3

Dave

Got it, great. This worked like a charm. I knew I was barking up the
wrong tree with urllib, but I didn't know which tree to bark up...

Thanks!

Oct 17 '06 #4
