Connecting Tech Pros Worldwide Forums | Help | Site Map

How to harness information of the web

Newbie
 
Join Date: Oct 2008
Posts: 15
#1: Oct 21 '08
Hello Everyone,

I am not sure if I posting this to the right forum or not. So, I apologize before hand if this is not the forum for posting this question.

I want a script that would run weather in Windows or Linux that will gather the information of a website and save it in a file (maybe to a .xls file). I want to use a program because I want to search more than 1000s of pages. However, the good thing is that each page is identical to the other. For example, the program browses: http://www.xxxxxx.yyy/1.html and grabs a variable's content from inside and saves it to a file. Next it goes to: http://www.xxxxxx.yyy/2.html and does the same thing.

Can php do this?

Thanks

Markus's Avatar
Moderator
 
Join Date: Jun 2007
Location: York, England, with wolves.
Posts: 4,947
#2: Oct 21 '08

re: How to harness information of the web


PHP can get the contents of a file using file_get_contents() and then you might use a regex to extract the information. So, yes, it is possible.
Reply