473,326 Members | 2,168 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,326 software developers and data experts.

XML RSS reader with BBC Website..

I have made an RSS reader and am testing on the BBC website, and I use
this code to grab the contents of the XML file, however when I look at
the contents grabbed by my function, and the HTML source of the bbc
website XML, they are different... how is that even possible?

Anyone have an XML parser that they could test this on please? Heres a
sample link and my code:
http://newsrss.bbc.co.uk/rss/sporton...otball/rss.xml
$rss_name = "filename.xml";

$ch = curl_init($feed);
$fp = fopen($rss_name, "w");

curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);

curl_exec($ch);
curl_close($ch);
fclose($fp);

Aug 15 '07 #1
6 2412
Rik
On Wed, 15 Aug 2007 16:57:21 +0200, junkmate <ju******@gmail.comwrote:
I have made an RSS reader and am testing on the BBC website, and I use
this code to grab the contents of the XML file, however when I look at
the contents grabbed by my function, and the HTML source of the bbc
website XML, they are different... how is that even possible?

Anyone have an XML parser that they could test this on please? Heres a
sample link and my code:
http://newsrss.bbc.co.uk/rss/sporton...otball/rss.xml
$rss_name = "filename.xml";

$ch = curl_init($feed);
$fp = fopen($rss_name, "w");

curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);

curl_exec($ch);
curl_close($ch);
fclose($fp);
Viewing the feed source & the file from CURL, the only difference I see is
(understandably) <lastBuildDate />. What do you see and what do you expect?
--
Rik Wasmus
Aug 15 '07 #2
I get an old set of items... the latest items are not included...
Now I am thinking my cUrl function maybe grabbing cached versions of
the xml file? is that possible and if so, can it be switched off?

On Aug 15, 4:13 pm, Rik <luiheidsgoe...@hotmail.comwrote:
On Wed, 15 Aug 2007 16:57:21 +0200, junkmate <junkm...@gmail.comwrote:
I have made an RSS reader and am testing on the BBC website, and I use
this code to grab the contents of the XML file, however when I look at
the contents grabbed by my function, and the HTML source of the bbc
website XML, they are different... how is that even possible?
Anyone have an XML parser that they could test this on please? Heres a
sample link and my code:
http://newsrss.bbc.co.uk/rss/sporton...otball/rss.xml
$rss_name = "filename.xml";
$ch = curl_init($feed);
$fp = fopen($rss_name, "w");
curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_exec($ch);
curl_close($ch);
fclose($fp);

Viewing the feed source & the file from CURL, the only difference I see is
(understandably) <lastBuildDate />. What do you see and what do you expect?

--
Rik Wasmus

Aug 15 '07 #3
Rik
On Wed, 15 Aug 2007 17:23:05 +0200, junkmate <ju******@gmail.comwrote:
On Aug 15, 4:13 pm, Rik <luiheidsgoe...@hotmail.comwrote:
>On Wed, 15 Aug 2007 16:57:21 +0200, junkmate <junkm...@gmail.comwrote:
I have made an RSS reader and am testing on the BBC website, and I use
this code to grab the contents of the XML file, however when I lookat
the contents grabbed by my function, and the HTML source of the bbc
website XML, they are different... how is that even possible?
Anyone have an XML parser that they could test this on please? Heres a
sample link and my code:
http://newsrss.bbc.co.uk/rss/sporton...otball/rss.xml
$rss_name = "filename.xml";
$ch = curl_init($feed);
$fp = fopen($rss_name, "w");
curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_exec($ch);
curl_close($ch);
fclose($fp);

Viewing the feed source & the file from CURL, the only difference I see
is
(understandably) <lastBuildDate />. What do you see and what do you
expect?
(topposting fixed)
I get an old set of items... the latest items are not included...
Now I am thinking my cUrl function maybe grabbing cached versions of
the xml file? is that possible and if so, can it be switched off?
No such problem here, though it might depend on sever setup. Are you sure
that what CURL gets is cached data, and it is not your own output on the
web which is? (i.e. your file gets updated, browser still shows old file)
--
Rik Wasmus
Aug 15 '07 #4
No, I have a button which grabs a fresh XML file and writes a fresh
htm file to be included every time via AJAX.

I did find this:
curl_setopt($ch, CURLOPT_DNS_CACHE_TIMEOUT, 0);

Since adding that, I get the latest results... which means one of two
things:
1) The cache finally ran out and it refreshed anyway!
2) its fixed...

On Aug 15, 4:44 pm, Rik <luiheidsgoe...@hotmail.comwrote:
On Wed, 15 Aug 2007 17:23:05 +0200, junkmate <junkm...@gmail.comwrote:
On Aug 15, 4:13 pm, Rik <luiheidsgoe...@hotmail.comwrote:
On Wed, 15 Aug 2007 16:57:21 +0200, junkmate <junkm...@gmail.comwrote:
I have made an RSS reader and am testing on the BBC website, and I use
this code to grab the contents of the XML file, however when I look at
the contents grabbed by my function, and the HTML source of the bbc
website XML, they are different... how is that even possible?
Anyone have an XML parser that they could test this on please? Heres a
sample link and my code:
http://newsrss.bbc.co.uk/rss/sporton...otball/rss.xml
$rss_name = "filename.xml";
$ch = curl_init($feed);
$fp = fopen($rss_name, "w");
curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_exec($ch);
curl_close($ch);
fclose($fp);
Viewing the feed source & the file from CURL, the only difference I see
is
(understandably) <lastBuildDate />. What do you see and what do you
expect?

(topposting fixed)
I get an old set of items... the latest items are not included...
Now I am thinking my cUrl function maybe grabbing cached versions of
the xml file? is that possible and if so, can it be switched off?

No such problem here, though it might depend on sever setup. Are you sure
that what CURL gets is cached data, and it is not your own output on the
web which is? (i.e. your file gets updated, browser still shows old file)
--
Rik Wasmus

Aug 15 '07 #5
No i just tried on a brand new fresh feed:
http://newsrss.bbc.co.uk/rss/newsonl...t_page/rss.xml

The second item is different...


On Aug 15, 4:55 pm, junkmate <junkm...@gmail.comwrote:
No, I have a button which grabs a fresh XML file and writes a fresh
htm file to be included every time via AJAX.

I did find this:
curl_setopt($ch, CURLOPT_DNS_CACHE_TIMEOUT, 0);

Since adding that, I get the latest results... which means one of two
things:
1) The cache finally ran out and it refreshed anyway!
2) its fixed...

On Aug 15, 4:44 pm, Rik <luiheidsgoe...@hotmail.comwrote:
On Wed, 15 Aug 2007 17:23:05 +0200, junkmate <junkm...@gmail.comwrote:
On Aug 15, 4:13 pm, Rik <luiheidsgoe...@hotmail.comwrote:
>On Wed, 15 Aug 2007 16:57:21 +0200, junkmate <junkm...@gmail.comwrote:
I have made an RSS reader and am testing on the BBC website, and I use
this code to grab the contents of the XML file, however when I look at
the contents grabbed by my function, and the HTML source of the bbc
website XML, they are different... how is that even possible?
Anyone have an XML parser that they could test this on please? Heres a
sample link and my code:
>http://newsrss.bbc.co.uk/rss/sporton...otball/rss.xml
$rss_name = "filename.xml";
$ch = curl_init($feed);
$fp = fopen($rss_name, "w");
curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_exec($ch);
curl_close($ch);
fclose($fp);
>Viewing the feed source & the file from CURL, the only difference I see
>is
>(understandably) <lastBuildDate />. What do you see and what do you
>expect?
(topposting fixed)
I get an old set of items... the latest items are not included...
Now I am thinking my cUrl function maybe grabbing cached versions of
the xml file? is that possible and if so, can it be switched off?
No such problem here, though it might depend on sever setup. Are you sure
that what CURL gets is cached data, and it is not your own output on the
web which is? (i.e. your file gets updated, browser still shows old file)
--
Rik Wasmus

Aug 15 '07 #6
OK, somethings erratic... I added to my parser a date at the top which
shows the LastBuildDate of the XML file being parsed. It changes as
you click on refresh... and is always different to the one found in
the actual XML source found by clicking the rss button.

Is it my browser? Is it my page being cached? I dont know. Any ideas?
Here: http://dev.oldsushi.com/joe
The top one, labeled BBC News
(the actual RSS feed can be accessed by clicking the rss button in the
top right)


On Aug 15, 5:01 pm, junkmate <junkm...@gmail.comwrote:
No i just tried on a brand new fresh feed:http://newsrss.bbc.co.uk/rss/newsonl...t_page/rss.xml

The second item is different...

On Aug 15, 4:55 pm, junkmate <junkm...@gmail.comwrote:
No, I have a button which grabs a fresh XML file and writes a fresh
htm file to be included every time via AJAX.
I did find this:
curl_setopt($ch, CURLOPT_DNS_CACHE_TIMEOUT, 0);
Since adding that, I get the latest results... which means one of two
things:
1) The cache finally ran out and it refreshed anyway!
2) its fixed...
On Aug 15, 4:44 pm, Rik <luiheidsgoe...@hotmail.comwrote:
On Wed, 15 Aug 2007 17:23:05 +0200, junkmate <junkm...@gmail.comwrote:
On Aug 15, 4:13 pm, Rik <luiheidsgoe...@hotmail.comwrote:
On Wed, 15 Aug 2007 16:57:21 +0200, junkmate <junkm...@gmail.comwrote:
I have made an RSS reader and am testing on the BBC website, and I use
this code to grab the contents of the XML file, however when I look at
the contents grabbed by my function, and the HTML source of the bbc
website XML, they are different... how is that even possible?
Anyone have an XML parser that they could test this on please? Heres a
sample link and my code:
http://newsrss.bbc.co.uk/rss/sporton...otball/rss.xml
$rss_name = "filename.xml";
$ch = curl_init($feed);
$fp = fopen($rss_name, "w");
curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_exec($ch);
curl_close($ch);
fclose($fp);
Viewing the feed source & the file from CURL, the only difference I see
is
(understandably) <lastBuildDate />. What do you see and what do you
expect?
(topposting fixed)
I get an old set of items... the latest items are not included...
Now I am thinking my cUrl function maybe grabbing cached versions of
the xml file? is that possible and if so, can it be switched off?
No such problem here, though it might depend on sever setup. Are you sure
that what CURL gets is cached data, and it is not your own output on the
web which is? (i.e. your file gets updated, browser still shows old file)
--
Rik Wasmus

Aug 15 '07 #7

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

2
by: Kai | last post by:
Hi, I am looking for a Bar code reader softwre to read (Scan) bar code to MS-Access database table, I tried on the Web, but donn't know which one is good. Please help for recomanding one. ...
1
by: juli jul | last post by:
Hello, I am using a reader in order to read query results: SqlDataReader rdr = null; SqlDataReader rdr_rows=null; this.conn.Open(); SqlCommand cmd = new SqlCommand("select name from...
0
by: | last post by:
Hi, I'm looking for a web based newsgroup reader to integrate into my ASP.net website. Any suggestions? Do
5
by: sameer | last post by:
Hi, I have a client who wants to use barcode scanners with web pages( internet website). They want to scan the barcode no off a label and want this no to go straight to the focused text field on...
5
by: Serdar C. | last post by:
hello everyone, i have a question about retrieving data from a bar code reader plugged in keyboard port (ps/2) i really dont know how to retrieve data from keyboard, i tried some methods but all i...
0
by: wannieb | last post by:
Does any one know of any ticker tape RSS feed reader scripts to go on a website, as I can only fine vertical one. I know you can do a ticker tape style as most RSS readers which go to desktop are...
4
by: Wade | last post by:
Does anyone know where I can get an open source rss feed reader?
10
by: Grizlyk | last post by:
Google makes new interface, it results in web page refreshing about 3-10 minutes, sometimes hangs during refresh. !@#$%. Is any other web reader for usenet groups?
0
by: ashes | last post by:
Hi, i created a register form on a website. The information from the form has to go in the customer table in a database in MS Access. the information is going into the table perfectly. ...
0
by: ryjfgjl | last post by:
ExcelToDatabase: batch import excel into database automatically...
0
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
1
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
0
by: Vimpel783 | last post by:
Hello! Guys, I found this code on the Internet, but I need to modify it a little. It works well, the problem is this: Data is sent from only one cell, in this case B5, but it is necessary that data...
0
by: jfyes | last post by:
As a hardware engineer, after seeing that CEIWEI recently released a new tool for Modbus RTU Over TCP/UDP filtering and monitoring, I actively went to its official website to take a look. It turned...
1
by: PapaRatzi | last post by:
Hello, I am teaching myself MS Access forms design and Visual Basic. I've created a table to capture a list of Top 30 singles and forms to capture new entries. The final step is a form (unbound)...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
1
by: Shællîpôpï 09 | last post by:
If u are using a keypad phone, how do u turn on JavaScript, to access features like WhatsApp, Facebook, Instagram....
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.