473,395 Members | 1,343 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,395 software developers and data experts.

Finding Orphaned Files On A Website

I have a golden list of urls to each and every file on our corporate
website. This includes all webpages and file resources. I am looking
for a tool whereby I can plug in the base url of our website and also
plug in this golden list of urls and have the program figure out which
files are orphaned and not linked to any other webpage on our site. Is
anyone aware of a tool that will do this?

Thanks

Feb 8 '06 #1
7 6030
Els
newspost2000 wrote:
I have a golden list of urls to each and every file on our corporate
website. This includes all webpages and file resources. I am looking
for a tool whereby I can plug in the base url of our website and also
plug in this golden list of urls and have the program figure out which
files are orphaned and not linked to any other webpage on our site. Is
anyone aware of a tool that will do this?

Thanks


Xenulink does that, without the need for the golden list of urls.
(needs ftp access)
http://home.snafu.de/tilman/xenulink.html

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Feb 8 '06 #2
My site is contained in a Notes Database. FTP will not do because all
of the files and contents or our website are not contained as
individual files on a web server but are contained in one file which is
a Lotus Notes Database (.nsf). This is why the only other option that
I have is produce a comma separated list of urls that I can pull into a
system and then that system can find which files are on the list that
were not found through a the public search of our site and then
identify those as the orphans. Can Xenu do that?

Feb 8 '06 #3
Els
newspost2000 wrote:
My site is contained in a Notes Database. FTP will not do because all
of the files and contents or our website are not contained as
individual files on a web server but are contained in one file which is
a Lotus Notes Database (.nsf). This is why the only other option that
I have is produce a comma separated list of urls that I can pull into a
system and then that system can find which files are on the list that
were not found through a the public search of our site and then
identify those as the orphans. Can Xenu do that?


I don't know - but Xenu gives you an entire list of valid urls too.
Seems to me that once you have that, it's just comparing one list with
the other and the difference should be the orphans.

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Feb 8 '06 #4
Reading the Xenu info I can see no reference to the program being able to
identify "orphan" files.
I understand the term "orphan" to mean files, such as for example images,
that do not have a link pointing to them.
I hope that I am wrong and that it can do this.
I am looking for such a program, other than Dreamweaver.
Brian Tozer

"Els" <el*********@tiscali.nl> wrote in message
news:1b*******************************@40tude.net. ..
newspost2000 wrote:
I have a golden list of urls to each and every file on our corporate
website. This includes all webpages and file resources. I am looking
for a tool whereby I can plug in the base url of our website and also
plug in this golden list of urls and have the program figure out which
files are orphaned and not linked to any other webpage on our site. Is
anyone aware of a tool that will do this?

Thanks


Xenulink does that, without the need for the golden list of urls.
(needs ftp access)
http://home.snafu.de/tilman/xenulink.html

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -

Feb 13 '06 #5
Els
KiwiBrian wrote:
"Els" <el*********@tiscali.nl> wrote in message
news:1b*******************************@40tude.net. ..
newspost2000 wrote:
I have a golden list of urls to each and every file on our corporate
website. This includes all webpages and file resources. I am looking
for a tool whereby I can plug in the base url of our website and also
plug in this golden list of urls and have the program figure out which
files are orphaned and not linked to any other webpage on our site. Is
anyone aware of a tool that will do this?


Xenulink does that, without the need for the golden list of urls.
(needs ftp access)
http://home.snafu.de/tilman/xenulink.html


Reading the Xenu info I can see no reference to the program being able to
identify "orphan" files.
I understand the term "orphan" to mean files, such as for example images,
that do not have a link pointing to them.
I hope that I am wrong and that it can do this.
I am looking for such a program, other than Dreamweaver.


Once you installed it, open the program, go to menu > options. You get
a list of things to tick/untick, and the bottom one is "orphan files".

It does ask for FTP credentials though, but that sounds logical to me.

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Now playing: Yes - It will be a good day (The River) [Live][The Ladder
Tour]
Feb 13 '06 #6
On Tue, 14 Feb 2006, KiwiBrian blurted out atop a fullquote[1]:
Reading the Xenu info I can see no reference to the program being
able to identify "orphan" files.


If you're running it over the network to an httpd, it rather obviously
cannot find files to which it has no links!

You have to allow it to see the actual files on the server. This
doesn't appear to be documented in the prog's own documentation, but
ISTM that a quick giggle for the terms xenu and orphan could have got
you to

http://members.chello.nl/f.visser3/x...ned-files.html

and a couple of other interesting-looking pages, faster than posting a
question here.

Have fun.

[1] it sets a better impression, especially when posting or
crossposting to a group in the big-8 hierarchy, if one follows the
long-standing netiquette in this regard.
Feb 13 '06 #7
On Tue, 14 Feb 2006 05:50:00 +1300, "KiwiBrian" <br******@ihug.co.nz>
wrote:
Reading the Xenu info I can see no reference to the program being able to
identify "orphan" files.
Read it again then, it can.
I understand the term "orphan" to mean files, such as for example images,
that do not have a link pointing to them.
I hope that I am wrong and that it can do this.


You are, it can. It will need ftp access though. So does WebLV to do
the same thing.

Now it may be that you don't have ftp access, no matter how improbable
that sounds, and still want to be able to identify orphan files. How
you'd do that, I dunno. But at least you've learned a bit by asking.

BB
--

http://homepage.ntlworld.com/bill.kr...rs-sitemap.htm
http://www.crystal-liaison.com/memor...day/index.html
kr***@crystal-liaison.com Gifty! Shiny! BB!
Feb 14 '06 #8

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

5
by: LarryM | last post by:
Hi, This is maybe not a pure ASP question, but has some relation: Please help me throw some light on this: Which directories and files are visible and readable for a (hacking) user at a...
3
by: CSN | last post by:
Every now and then my connection to my remote server will disconnect. If I was using the psql commandline, "ps" will show it's still running (after I log back on). Is there a way to get access to...
0
by: newspost2000 | last post by:
I have built a Domino Web Application. I am looking for a tool that will be able to scour my site and find all files that are orphaned. I have found a few tools on the web that can do this kind of...
2
by: AAJ | last post by:
Hi all I have a few questions about file locations with VS2005, localhost WidowsXP and remotehost Server2003, and I wonder if anyone can help me get started 1) If I build a web site on my...
0
by: skip | last post by:
Over the past couple days I've been trying to reduce the large number of orphaned wiki pages, deleting many, stitching many others back into the fabric. There are a bunch of orphaned PyCon-related...
1
by: aaapaul | last post by:
What is the best way to transfer the necessary files of a website to the webserver. In VS2003 there was a possibility to say Project copy -only files, which are necessary for running ! I...
8
by: K.J.Williams | last post by:
Hi, Im trying to find the latest version ( hopefully 5 ) for my W95 for offline testing. I have found binaries of version 5+ for W95 but I didnt jump to the conclusion that it would be compatible...
4
by: John Sheppard | last post by:
Hello there, I was wondering if anyone could give me some advice on orphaned rows and foreign key constraints. I am migrating a MS-Access database that didnt have any relations setup...
2
by: rockyd2 | last post by:
Here is the code } if(command.equalsIgnoreCase("home")) { if(teleblock == true){ sendMessage("A magical force stops you from teleporting."); //made by Musicmad441 } else {
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.