Problems with <link> in a 0.91 RSS

Francesco Moi

Hello.

I'm trying to build a RSS feed for my website. It starts:

----------------//---------------------
<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE rss PUBLIC "-//Netscape Communications//DTD RSS 0.91//EN"
"http://my.netscape.com/publish/formats/rss-0.91.dtd">
<rss version="0.91">
----------------//----------------------

And an item could be:
--------//--------------
<item>
<link>http://www.mydomain.com</link>
<title>Foo</title>
</item>
------//---------------

If instead of 'http://www.mydomain.com', I set
'http://www.mydomain.com/mypage.aspx?ID=1&cod=9&num=20031206'
I get problems of validation (some RSS readers do not read it).

Does exist any problem with these kind of URLs?

Thank you very much.

Jul 20 '05 #1

Subscribe Post Reply

2929

Andy Dingley

On 6 Dec 2003 07:48:23 -0800, fr**********@europe.com (Francesco Moi)
wrote:

If instead of 'http://www.mydomain.com', I set
'http://www.mydomain.com/mypage.aspx?ID=1&cod=9&num=20031206'
I get problems of validation (some RSS readers do not read it).

Try this instead

http://www.mydomain.com/mypage.aspx?...p;num=20031206
It's an XML entity issue, not RSS

Jul 20 '05 #2

Bill Kearney

To futher clarify, XML itself requires that a certain 5 characters should always
be entity encoded if used within an element or attribute value.

& - &
< - <

- > ' - '
" - "

The one thing to guard against is double encoding. Do not re-encode n already
encoded entity. As in don't create &amp;

This is less of an issue inside the link element than it is inside the
descriptions.

While many folks argue about this, the most commonly used and least distruptive
form is a single encoding of markup. For example, an HTML snippet of "this text
has both bold & italic text". The least harmful way to encode
this is "this text has both bold & italic
text". Sure, if the generating tool can /properly/ assure it's well-formed it's
perfectly reasonable to use XHTML instead. But most applications don't
consistently guarantee that their text will be valid, let alone well-formed. In
a perfect world it would be arguably superior to avoid using markup encoding.
Until that time arrives (don't hold your breath) using a single pass of encoding
has shown itself to be the most workable all-around.

-Bill Kearney
www.Syndic8.com - The worlds largest directory of RSS content

"Andy Dingley" <di*****@codesmiths.com> wrote in message
news:h7********************************@4ax.com... On 6 Dec 2003 07:48:23 -0800, fr**********@europe.com (Francesco Moi)
wrote:
If instead of 'http://www.mydomain.com', I set
'http://www.mydomain.com/mypage.aspx?ID=1&cod=9&num=20031206'
I get problems of validation (some RSS readers do not read it).

Try this instead

http://www.mydomain.com/mypage.aspx?...p;num=20031206
It's an XML entity issue, not RSS

Jul 20 '05 #3

Patrick TJ McPhee

In article <Jr********************@speakeasy.net>,
Bill Kearney <wk********@hotmail.com> wrote:

% To futher clarify, XML itself requires that a certain 5 characters should always
% be entity encoded if used within an element or attribute value.
%
% & - &
% < - <
% > - >
% ' - '
% " - "

You waffle a bit there (requires ... should), but I'm going to disagree
anyway. Except when used in a CDATA section, & and < must always be
encoded. On the other hand, > never needs to be encoded. and ' and "
need be encoded only in attribute values, and only when they match the
value's delimiter. It is legal to encode any of the five outside a CDATA
section, but not always required.

My personal opinion is that you're better off using the predefined
entities as little as possible. It's hard to avoid using &, but
I would always encode your example using a CDATA section

<![CDATA[this text has both bold & italic text]]>

--

Patrick TJ McPhee
East York Canada
pt**@interlog.com

Jul 20 '05 #4

Andy Dingley

On Wed, 10 Dec 2003 12:43:05 -0500, "Bill Kearney"
<wk********@hotmail.com> wrote:

The one thing to guard against is double encoding. Do not re-encode n already
encoded entity. As in don't create &amp;

You often can't avoid this happening, especially not in an RSS-like
context where you're handling material that may already be encoded.

But if it does, make sure that your de-coding and en-coding is
balanced.

Jul 20 '05 #5

Bill Kearney

The one thing to guard against is double encoding. Do not re-encode n alreadyencoded entity. As in don't create &amp;
You often can't avoid this happening, especially not in an RSS-like
context where you're handling material that may already be encoded.

Then your code better work at improving the situation. Honestly, don't pass
along crap.
But if it does, make sure that your de-coding and en-coding is
balanced.

Sure, the trick lies in making sure the input is decoded properly and passed
along with the proper encoding as well.

It's not all that hard but it can be tedious to code properly.

-Bill Kearney

Jul 20 '05 #6

Bill Kearney

> You waffle a bit there (requires ... should), but I'm going to disagree

anyway. Except when used in a CDATA section, & and < must always be
encoded. On the other hand, > never needs to be encoded. and ' and "
need be encoded only in attribute values, and only when they match the
value's delimiter. It is legal to encode any of the five outside a CDATA
section, but not always required.
Well, what's better, to worry about the if's and when's or to encode them
consistently?
My personal opinion is that you're better off using the predefined
entities as little as possible. It's hard to avoid using &, but
I would always encode your example using a CDATA section

<![CDATA[this text has both bold & italic text]]>

Sure, provided tools understand how to use CDATA properly (many don't).

Jul 20 '05 #7

Andy Dingley

On Thu, 11 Dec 2003 11:28:46 -0500, "Bill Kearney"
<wk********@hotmail.com> wrote:

[double encoding]

You often can't avoid this happening, especially not in an RSS-like
context where you're handling material that may already be encoded.
Then your code better work at improving the situation. Honestly, don't pass
along crap.

Rubbish. The _last_ thing your code should ever do is to try and "fix
up" content in transit. (Especially note the "in transit")

Multiple encoding is perfectly safe, and can be decoded perfectly by
applying the appropriate number of decodes. Where it goes wrong is
when someone breaks this number - encoding more than they should, or
less than they should. But I would _much_ rather receive the
occasional bit of extra-encoded garbage (it's semantically wrong, but
it's still well-formed XML) rather than run the risk of getting things
which have been "smart de-encoded" by something en-route that
"thought" it ought not to see an entity in that location and so
decided to decode the lot. That means it's no longer well-formed, and
that causes a lot of trouble down the line.

If you're _really_ worried about never rendering "&" on screen for
the poor squeamish user, then do this in the user agent at the very
last point, when there's _no_ risk of it being propagated further.
This is also a good time to do it, as it's clearer (sic) here what the
content author's original intent was (maybe they're writing an RSS
feed of HTML coding tips and the entity is deliberate).

Are you really part of syndic8 ? Is this their official policy ?
--
Die Gotterspammerung - Junkmail of the Gods

Jul 20 '05 #8

Patrick TJ McPhee

In article <xM********************@speakeasy.net>,
Bill Kearney <wk********@hotmail.com> wrote:
% > You waffle a bit there (requires ... should), but I'm going to disagree
% > anyway. Except when used in a CDATA section, & and < must always be
% > encoded. On the other hand, > never needs to be encoded. and ' and "
% > need be encoded only in attribute values, and only when they match the
% > value's delimiter. It is legal to encode any of the five outside a CDATA
% > section, but not always required.
%
% Well, what's better, to worry about the if's and when's or to encode them
% consistently?

I guess, it depends on your goals. If you're writing what's `required',
I think it's better to be correct. If you have trouble keeping track
of when to use pre-defined entities, then you can take comfort in
the fact that it's always allowed, and not worry about when it's required.

% > My personal opinion is that you're better off using the predefined
% > entities as little as possible. It's hard to avoid using &, but
% > I would always encode your example using a CDATA section
% >
% > <![CDATA[this text has both bold & italic text]]>
%
% Sure, provided tools understand how to use CDATA properly (many don't).

Well, why use these tools? What's the point of pretending to use XML if
you're really spending your life worrying about whether your tools can
support the basic syntax? It's fair enough to say that you'd prefer to
always use the predefined entities, but lack of CDATA support doesn't
merit consideration.

--

Patrick TJ McPhee
East York Canada
pt**@interlog.com

Jul 20 '05 #9

Andy Dingley

On Thu, 11 Dec 2003 19:59:14 +0100 (MET), pt**@interlog.com (Patrick
TJ McPhee) wrote:

What's the point of pretending to use XML if
you're really spending your life worrying about whether your tools can
support the basic syntax?

We're dealing with RSS 0.91 here. The spec for the content here is
"ASCII", not even CDATA or PCDATA (Yes, Dave Winer's lousy
spec-writing).

If you do anything vaguely clever in the RSS field, it;'s likely to
break other people's (broken) code all over the place. It sucks, but
there you have it - your call.
--
Die Gotterspammerung - Junkmail of the Gods

Jul 20 '05 #10

by: Todd Peterson | last post by:

I'm encountering some wierd behavior with a <link> tag over an HTTPS connection, vs. an HTTP connection... In an ASP/HTML page on my website, I've add a <link rel="shortcut icon"...> in order to...

HTML / CSS

by: Hernán Castelo | last post by:

hi i'm trying to do : <link rel="stylesheet" type="text/css" href="myurl..../inc/css/style.css"/> from the url field of the browser i can normally open the "style.css" with notepad but thru...

ASP / Active Server Pages

dynamically adding a <LINK> tag to an asp.net page

by: brw | last post by:

Is there a way to dynamically add a link tag to the head block of an ..aspx page? I'm aware that you can add a link tag (or literal control) statically and then dynamically modify the attributes....

ASP.NET

RSS 2.0 question - why are "=" characters not allowed in URLs, even inside the <link> tag?

by: Jake Barnes | last post by:

Very odd. Check out this RSS feed that my PHP script just built: http://www.tagcastle.com/rss/photography.xml When I had a straight URL in the <link> tag, or the <comment> tag, then "="...

PHP

'TYPE' for <link> is extended markup use -x <extension>

by: Baldoni | last post by:

It's been years since putting together a page. This line in my HTML: <link type="text/css" rel="stylesheet" href="net4801style.css"> gives this problem (via weblint HTML checker): attribute...

HTML / CSS

<LINK> tags

by: The Numerator | last post by:

I know a lot about HTML, but all this time I don't know what the <LINK> tags in the head do. There are those that call a stylesheet, favicon, etc. But what about those that state the contents of...

HTML / CSS

RSS <channel><link> element

by: John A Grandy | last post by:

When constructing an RSS 2.0 XML doc , should the <channel><link> element's value be 1. the url of the page the displays the content that the RSS feed describes : fox example:...

.NET Framework

parsing the <link> element tag

by: Doug.Sheahan | last post by:

I am attempting to parse and xml file using javascript but am running into a problem when parsing a <link></link> pair. For example, the link information in most RSS feeds is given as <link> ...

Javascript

Including CSS Stylesheets - <link> or @import?

by: Arancaytar | last post by:

I have so far seen two methods for including external resources as CSS stylesheets in a document. The first is this: <link href="/stylesheets/style.css" rel="stylesheet" type="text/css" /> And...

HTML / CSS

Cloud Servers without Credit Card and Email Registration: A Simpler Way to Get on the Cloud

by: CloudSolutions | last post by:

Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...

General

Wordpress or something else?

by: Faith0G | last post by:

I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...

Content Management Systems

Access Europe: Command bars, the Access Shortcut Tool and a simple Audit Log - Wed 3 April

by: isladogs | last post by:

The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...

General

One-click Importing Excel Data into a*Database

by: ryjfgjl | last post by:

In our work, we often need to import Excel data into databases (such as MySQL, SQL Server, Oracle) for data analysis and processing. Usually, we use database tools like Navicat or the Excel import...

Microsoft Excel

How to turn on java script in a villaon keypad mobile phone

by: Charles Arthur | last post by:

How do i turn on java script on a villaon, callus and itel keypad mobile phone

Java

Basic Javascript concepts

by: aa123db | last post by:

Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...

Javascript

Batch import of multiple excel files into the database

by: ryjfgjl | last post by:

If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...

Data Management

Merging data from multiple Excel files

by: ryjfgjl | last post by:

In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...

Data Management

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Problems with <link> in a 0.91 RSS

Similar topics