Bytes | Software Development & Data Engineering Community
Finding Orphaned Files On A Website

I have a golden list of urls to each and every file on our corporate
website. This includes all webpages and file resources. I am looking
for a tool whereby I can plug in the base url of our website and also
plug in this golden list of urls and have the program figure out which
files are orphaned and not linked to any other webpage on our site. Is
anyone aware of a tool that will do this?

Thanks

Feb 8 '06 #1
Els
Xenulink does that, without the need for the golden list of urls.
(needs ftp access)
http://home.snafu.de/tilman/xenulink.html

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Feb 8 '06 #2
My site is contained in a Notes Database. FTP will not do because the
files and contents of our website are not stored as individual files on
a web server; they are all contained in one file, which is a Lotus
Notes Database (.nsf). This is why the only other option that I have is
to produce a comma-separated list of urls that I can pull into a
system, so that the system can find which files on the list were not
found through a public search of our site and then identify those as
the orphans. Can Xenu do that?
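
For illustration, loading such a comma-separated list could look roughly like the sketch below. The function names, the example.com URLs, and the normalization rules (lower-casing scheme and host, dropping fragments and trailing slashes) are assumptions for the example, not anything Notes/Domino or Xenu prescribes; adjust them to however your server treats URLs.

```python
import csv
import io
from urllib.parse import urlsplit, urlunsplit


def normalize(url: str) -> str:
    """Lower-case scheme and host, drop the fragment, strip a trailing slash.

    These rules are an assumption; relax or tighten them to match the site.
    """
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/") or "/"
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), path, parts.query, "")
    )


def load_golden_list(text: str) -> set[str]:
    """Parse comma-separated URLs (possibly over several lines) into a set."""
    urls = set()
    for row in csv.reader(io.StringIO(text)):
        for cell in row:
            if cell.strip():  # skip empty cells left by trailing commas
                urls.add(normalize(cell))
    return urls
```

Normalizing first matters because the later comparison is an exact string match: two spellings of the same address would otherwise show up as a false orphan.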

Feb 8 '06 #3
Els
I don't know - but Xenu gives you an entire list of valid urls too.
Seems to me that once you have that, it's just comparing one list with
the other and the difference should be the orphans.
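
A minimal sketch of that comparison, assuming both lists are already available as plain collections of URL strings written the same way (the function name and the example.com URLs are made up for the example):

```python
def compare_url_lists(golden_urls, crawled_urls):
    """Compare the golden list against the URLs a link checker reported.

    Returns a dict with:
      orphans  - on the golden list but never reached by the crawl
      unlisted - reached by the crawl but missing from the golden list
    """
    golden, crawled = set(golden_urls), set(crawled_urls)
    return {
        "orphans": sorted(golden - crawled),
        "unlisted": sorted(crawled - golden),
    }


if __name__ == "__main__":
    result = compare_url_lists(
        ["http://example.com/", "http://example.com/old.pdf"],
        ["http://example.com/", "http://example.com/new.html"],
    )
    print(result["orphans"])
    print(result["unlisted"])
```

The second bucket is not what was asked for, but it is a cheap sanity check: anything in it suggests the golden list is incomplete or the two lists are spelled differently.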

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Feb 8 '06 #4
Reading the Xenu info I can see no reference to the program being able to
identify "orphan" files.
I understand the term "orphan" to mean files, such as for example images,
that do not have a link pointing to them.
I hope that I am wrong and that it can do this.
I am looking for such a program, other than Dreamweaver.
Brian Tozer
Feb 13 '06 #5
Els
KiwiBrian wrote:
> Reading the Xenu info I can see no reference to the program being able to
> identify "orphan" files.
> I understand the term "orphan" to mean files, such as for example images,
> that do not have a link pointing to them.
> I hope that I am wrong and that it can do this.
> I am looking for such a program, other than Dreamweaver.

Once you've installed it, open the program and go to menu > options. You get
a list of things to tick/untick, and the bottom one is "orphan files".

It does ask for FTP credentials though, but that sounds logical to me.

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Now playing: Yes - It will be a good day (The River) [Live][The Ladder
Tour]
Feb 13 '06 #6
On Tue, 14 Feb 2006, KiwiBrian blurted out atop a fullquote[1]:
> Reading the Xenu info I can see no reference to the program being
> able to identify "orphan" files.


If you're running it over the network to an httpd, it rather obviously
cannot find files to which it has no links!
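
To make that concrete, here is a rough sketch of what any HTTP-only link checker can see. It is not how Xenu is implemented; `LinkExtractor`, `reachable_urls` and the `fetch` callback are names invented for the example. `fetch(url)` is assumed to return the page's HTML as text, or None for anything that is not an HTML page.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkExtractor(HTMLParser):
    """Collect href/src attribute values from any tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def reachable_urls(start_url, fetch):
    """Breadth-first walk over links, starting from start_url.

    Only URLs that begin with start_url are followed (a simplification
    for "stay on this site").
    """
    seen = {start_url}
    queue = deque([start_url])
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:  # image, PDF, missing page, ...
            continue
        parser = LinkExtractor()
        parser.feed(html)
        for link in parser.links:
            # Resolve relative links and drop any #fragment.
            target = urldefrag(urljoin(url, link)).url
            if target.startswith(start_url) and target not in seen:
                seen.add(target)
                queue.append(target)
    return seen
```

A page that nothing links to never enters the queue, so it can never appear in the result. That is why the checker needs a second source of truth, such as a file listing from the server, to name the orphans.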

You have to allow it to see the actual files on the server. This
doesn't appear to be documented in the prog's own documentation, but
ISTM that a quick giggle for the terms xenu and orphan could have got
you to

http://members.chello.nl/f.visser3/x...ned-files.html

and a couple of other interesting-looking pages, faster than posting a
question here.

Have fun.

[1] it sets a better impression, especially when posting or
crossposting to a group in the big-8 hierarchy, if one follows the
long-standing netiquette in this regard.
Feb 13 '06 #7
On Tue, 14 Feb 2006 05:50:00 +1300, "KiwiBrian" <br******@ihug.co.nz>
wrote:
> Reading the Xenu info I can see no reference to the program being able to
> identify "orphan" files.

Read it again then, it can.

> I understand the term "orphan" to mean files, such as for example images,
> that do not have a link pointing to them.
> I hope that I am wrong and that it can do this.


You are, it can. It will need ftp access though. So does WebLV to do
the same thing.

Now it may be that you don't have ftp access, no matter how improbable
that sounds, and still want to be able to identify orphan files. How
you'd do that, I dunno. But at least you've learned a bit by asking.

BB
--

http://homepage.ntlworld.com/bill.kr...rs-sitemap.htm
http://www.crystal-liaison.com/memor...day/index.html
kr***@crystal-liaison.com Gifty! Shiny! BB!
Feb 14 '06 #8
