473,507 Members | 2,472 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Comparing Pages and Returning Differences as Regular Expressions

The concept - a HTML file changes regularly. However, only certain
parts of the page change. I want to find a regular expression that
describes that changed area.

Can anyone here think of a rough way to go about this? I'm thinking
grab the page once a day, put it in a table with a date stamp, and
then do something.
Jul 17 '05 #1
4 1418
matt cook wrote:
The concept - a HTML file changes regularly. However, only certain
parts of the page change. I want to find a regular expression that
describes that changed area.

Can anyone here think of a rough way to go about this? I'm thinking
grab the page once a day, put it in a table with a date stamp, and
then do something.


How about using the power of PHP? no HTML file at all, instead use PHP
to dynamically generate the webpage every time it's requested from the
webserver... you could objectify the webpage to your hearts desire and
have any or all parts of it change for any given time interval.
--
Ben Cottrell AKA Bench

Disclaimer:
This post may contain explicit depictions of things which are "real".
These "real" things are commonly known as 'life'! So, if it sounds
sarcastic, don't take it seriously. If it sounds hazardous, Do not try
this at home or at all. And if it offends you, just don't read it.
Jul 17 '05 #2
"matt cook" wrote:
The concept - a HTML file changes regularly. However, only certain
parts of the page change. I want to find a regular expression that
describes that changed area.

Can anyone here think of a rough way to go about this? I’m
thinking
grab the page once a day, put it in a table with a date stamp, and
then do something.


I think this is beyond regular expressions. You probably need unix
"diff". You can easily call that command by calling shell.

--
http://www.dbForumz.com/ This article was posted by author's request
Articles individually checked for conformance to usenet standards
Topic URL: http://www.dbForumz.com/PHP-Comparin...ict133951.html
Visit Topic URL to contact author (reg. req'd). Report abuse: http://www.dbForumz.com/eform.php?p=447226
Jul 17 '05 #3
On Wed, 28 Jul 2004 01:36:00 -0000
steve <Us************@dbForumz.com> wrote:

[snip]
> Can anyone here think of a rough way to go about this? I’m
> thinking
> grab the page once a day, put it in a table with a date stamp, and
> then do something.


I think this is beyond regular expressions. You probably need unix
"diff". You can easily call that command by calling shell.


Yeah, 'diff' rocks! ;)

Madsen

--
Anders K. Madsen --- http://lillesvin.linux.dk

"There are 10 types of people in the world.
Those who understand binary - and those who don't."

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)

iD8DBQFBBwQMlNHJe/JASHcRAj+AAJ969WLEartHDWnBgiz7VeT/RDk+hACfdlyR
nplpk7Tc3nMBdhwwh7iFMSs=
=CipU
-----END PGP SIGNATURE-----

Jul 17 '05 #4
Ben Cottrell wrote:
matt cook wrote:
The concept - a HTML file changes regularly. However, only certain
parts of the page change. I want to find a regular expression that
describes that changed area.

Can anyone here think of a rough way to go about this? I'm thinking
grab the page once a day, put it in a table with a date stamp, and
then do something.


<Snip>

Okay, I think i misunderstood the original 'question'.. nevermind :-)
--
Ben Cottrell AKA Bench

Disclaimer:
This post may contain explicit depictions of things which are "real".
These "real" things are commonly known as 'life'! So, if it sounds
sarcastic, don't take it seriously. If it sounds hazardous, Do not try
this at home or at all. And if it offends you, just don't read it.
Jul 17 '05 #5

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
4155
by: Kenneth McDonald | last post by:
I'm working on the 0.8 release of my 'rex' module, and would appreciate feedback, suggestions, and criticism as I work towards finalizing the API and feature sets. rex is a module intended to make...
41
3901
by: Odd-R. | last post by:
I have to lists, A and B, that may, or may not be equal. If they are not identical, I want the output to be three new lists, X,Y and Z where X has all the elements that are in A, but not in B, and...
0
958
by: Daniel Patriak | last post by:
Hi all! I need to compare two regular expressions, because I need two know if they cover in part or in all. Is there any way to do it? Thanks Daniel
4
2802
by: Earl T | last post by:
When I try to get the netscape version for version 7, I get the HttpBrowserCapabilities class returning the version as 5 and not 7. (see code and output below) CODE HttpBrowserCapabilities...
9
3028
by: Rob Panosh | last post by:
Hello, I am looking for a routine that would compare the following: sTest1 = "This is a test" sTest2 = "This is a long test" and return the differences between the two: long
2
4990
by: Sehboo | last post by:
Hi, I have several regular expressions that I need to run against documents. Is it possible to combine several expressions in one expression in Regex object. So that it is faster, or will I...
7
3794
by: Billa | last post by:
Hi, I am replaceing a big string using different regular expressions (see some example at the end of the message). The problem is whenever I apply a "replace" it makes a new copy of string and I...
4
3846
by: seberino | last post by:
I'm trying to extract some data from an XHTML Transitional web page. What is best way to do this? xml.dom.minidom.parseString("text of web page") gives errors about it not being well formed...
25
12989
by: J Caesar | last post by:
In C you can compare two pointers, p<q, as long as they come from the same array or the same malloc()ated block. Otherwise you can't. What I'd like to do is write a function int comparable(void...
0
7321
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
7377
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
1
7034
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
7488
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
5623
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...
0
4702
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...
0
1544
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated ...
1
762
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
0
412
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.