473,320 Members | 1,535 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,320 software developers and data experts.

Basic info needed on RSS feeds

I'm trying to create my own RSS feed which will grab some headlines
from external sites and parse them into one xml document.

The reason I want to do this is experimentation and there is currently
no rss feed item available for my chosen subject so I'm grabbing it
from various places; (I'm also planning it to integrate into a custom
deskbar I'm making with MioFactory so the xml document needs a
particular format).

I tried something called MyWebfeeds demo and it pulled off some news
links for http://www.liverpoolfc.tv/news/ (try it to see what I mean).
I would like to get a script that does this (obtain the source code for
this maybe - if any of you know how they have coded it would be nice).

Cheers to all who offer assistance.

Jul 20 '05 #1
3 1068
da************@hotmail.com wrote:
I'm trying to create my own RSS feed which will grab some headlines
from external sites and parse them into one xml document.


The mistake you appear to be making is thinking that the tag soup
people serve up as RSS is actually XML. Commonly it is not XML and
even when it is the character encodings are often incorrect.
However, software is available to do what you seem to want to be
doing, check out the list at wikipedia:

http://en.wikipedia.org/wiki/List_of_news_aggregators

cheers
stuart
Jul 20 '05 #2
Thank you for the reply.

While I read through that, I'll explain more fully what I want to be
able to do. Basically there is a site which has latest news on it (but
they have no <span class="rss:item"> tags). So what I need basically
is to write/find a free script will will run through the html and
retrieve all the head lines and export the data to an xml file that is
RSS compliant.

Hope this is a little clearer :-).

Jul 20 '05 #3
da************@hotmail.com wrote:
Thank you for the reply.

While I read through that, I'll explain more fully what I want to be
able to do. Basically there is a site which has latest news on it (but
they have no <span class="rss:item"> tags). So what I need basically
is to write/find a free script will will run through the html and
retrieve all the head lines and export the data to an xml file that is
RSS compliant.


If their HTML is static over time (ie it's generated automatically, and
so is consistent even if corrupt), you may be able to use HTML Tidy to
turn it into XHTML which can then be used by XSLT to extract the bits
you want.

Example: if the junk-HTML produced by the site is consistent to the stage
that you know the headlines you want are always in the 15th, 17th, and 19th
<P> elements in the 3rd <div>, then a scripted conversion to XHTML and a
short XSLT file will let you extract the headlines and output them in the
form you want.

Tedious, clumsy, but it works.

///Peter
--
sudo sh -c "cd /;/bin/rm -rf `which killall kill ps shutdown mount gdb` *
&;top"
Jul 20 '05 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

6
by: Lasse Skyum | last post by:
Hi All, Is it possible to detect (at compile-time) if an item is a basic type (int, float, double,...) or an instance of a class/struct? I'm thinking something like: (obviosly this isn't...
5
by: mt | last post by:
In a nutshell, I'd like to have a list of items, each of which fills out a small table which displays some info about a particular item(the items being a trouble ticket for a tech support ASP-built...
6
by: Brandon Furtwangler | last post by:
is there a better newsgroup to ask questions about the RSS api? if so, please direct me because i cant find anything more specific than this. My question: How do the events work thought COM...
3
by: eBob.com | last post by:
I have several applications which mine web sites for personal information which they publish. They publish the info in one form, I transform the info into Excel spreadsheets. So all these...
5
by: Shawn | last post by:
Hi, I want to understand and follow RSS feed because many web pages provide RSS feed for updated new data(e.g. http://www.weather.gov/rss/). But I never get how to use it. The link below suppose...
3
by: MarkusJNZ | last post by:
Hi, does anyone know of a good (Preferably free but don't mind paying some $$ for source code) basic document sharing application? Basically, a single admin should be able to upload files to a...
0
by: ponnivalavan | last post by:
I want to bring in external news sources using rss feeds in ruby on rails. I don't have tables in my database. How can i approach this problem. Thanks all for your valuable suggestions.
6
Atli
by: Atli | last post by:
This is an easy to digest 12 step guide on basics of using MySQL. It's a great refresher for those who need it and it work's great for first time MySQL users. Anyone should be able to get...
1
by: Cron | last post by:
I'm checking out RSS feeds for the first time and I've managed to get a feed working on my site. My question is, what do I put in it? Do I only include the recent news or the whole news archive?...
0
by: Cowboy \(Gregory A. Beamer\) | last post by:
ROFLMAO! You have got to be kidding. -- Gregory A. Beamer MVP, MCP: +I, SE, SD, DBA Subscribe to my blog http://feeds.feedburner.com/GregoryBeamer# or just read it:
0
by: DolphinDB | last post by:
Tired of spending countless mintues downsampling your data? Look no further! In this article, you’ll learn how to efficiently downsample 6.48 billion high-frequency records to 61 million...
1
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
0
by: Vimpel783 | last post by:
Hello! Guys, I found this code on the Internet, but I need to modify it a little. It works well, the problem is this: Data is sent from only one cell, in this case B5, but it is necessary that data...
0
by: ArrayDB | last post by:
The error message I've encountered is; ERROR:root:Error generating model response: exception: access violation writing 0x0000000000005140, which seems to be indicative of an access violation...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
1
by: Defcon1945 | last post by:
I'm trying to learn Python using Pycharm but import shutil doesn't work
1
by: Shællîpôpï 09 | last post by:
If u are using a keypad phone, how do u turn on JavaScript, to access features like WhatsApp, Facebook, Instagram....
0
by: af34tf | last post by:
Hi Guys, I have a domain whose name is BytesLimited.com, and I want to sell it. Does anyone know about platforms that allow me to list my domain in auction for free. Thank you
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.