By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
438,521 Members | 1,420 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 438,521 IT Pros & Developers. It's quick & easy.

URL Component

P: n/a
Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
followed by the hostname or domain name (i.e. en.wikipedia.org/)
followed by possible subfolders (i.e. /wiki/) which can then be
followed by the name of a file to send the request to (i.e. index.html
or index.php or something). There can also be GET data (i.e. ?
name=value) and the anchor thing (i.e. #Chapter_1). But also,
frequently there's just a string that finishes the URL, which doesn't
appear to be a folder (no trailing forward slash) or a file (no
extension), for example:

http://en.wikipedia.org/wiki/Uniform_Resource_Locator

Can someone explain to me what this string (i.e.
"Uniform_Resource_Locator" in the above example) is, and how you can
use php to get it?

Thanks.
Oct 15 '08 #1
Share this Question
Share on Google+
2 Replies


P: n/a
Hi

On Wed, 15 Oct 2008 09:03:10 -0700, bgold12 wrote:
Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
followed by the hostname or domain name (i.e. en.wikipedia.org/)
followed by possible subfolders (i.e. /wiki/) which can then be followed
by the name of a file to send the request to (i.e. index.html or
index.php or something). There can also be GET data (i.e. ? name=value)
and the anchor thing (i.e. #Chapter_1). But also, frequently there's
just a string that finishes the URL, which doesn't appear to be a folder
(no trailing forward slash) or a file (no extension), for example:

http://en.wikipedia.org/wiki/Uniform_Resource_Locator
1) Files do not have to have have "extensions" on modern systems.

2) Web addresses do not have to refer to files.

http://example.org/fooo...barrr

is a valid URL even on some systems where foo...bar is not a valid
filename.

Moreover the part after the third slash and before the final slash does
not have to refer to a directory, although if relative URLs contain ../
then it they are treated as if it does.

PS: you duplicated the third slash - it is not part of the domain.
and how you can use php to get it?
There are all sorts of ways to open a remote resource in php. IIRC you
can just do:

$foo= file( 'http://en.wikipedia.org/wiki/Uniform_Resource_Locator' );

HTH
viza
Oct 15 '08 #2

P: n/a
bgold12 escribió:
Hey, I know that in a URL there's generally the "scheme" (i.e. http://),
followed by the hostname or domain name (i.e. en.wikipedia.org/)
followed by possible subfolders (i.e. /wiki/) which can then be
followed by the name of a file to send the request to (i.e. index.html
or index.php or something).
URLs don't need to match physical files and directories on the server
(and often don't). A simple example is when you omit "index.html" in the
URL and get the "index.html" file anyway. But you can configure your
webserver to map any URL to any resource.

So distinguishing between files and directories is normally meaningless.

Can someone explain to me what this string (i.e.
"Uniform_Resource_Locator" in the above example) is, and how you can
use php to get it?
The short answer is that you don't use PHP for this. Instead, you
configure your web server to run (e.g.)
/home/foo/fetch-article.php?query=Uniform_Resource_Locator and read
input from good old $_GET['query']. The typical option is Apache's
mod_rewrite:

http://httpd.apache.org/docs/2.2/en/rewrite/

You might also enjoy:

http://httpd.apache.org/docs/2.2/en/...gotiation.html
http://httpd.apache.org/docs/2.2/en/urlmapping.html

--
-- http://alvaro.es - Álvaro G. Vicario - Burgos, Spain
-- Mi sitio sobre programación web: http://bits.demogracia.com
-- Mi web de humor al baño María: http://www.demogracia.com
--
Oct 15 '08 #3

This discussion thread is closed

Replies have been disabled for this discussion.