473,490 Members | 2,458 Online
Bytes | Software Development & Data Engineering Community
Create Post

Home Posts Topics Members FAQ

'google' 2

The first generation of web site search engine hands-down is google.
A majority of these web
sites are static page -driven html pages.

Now, I would think more and more web-based applications are coming
into the fray, my recent
experience with google's webbot (indexing engine, a part of the
google's search service) tells
me that its indexing algorithm does not consider a database-driven web
application as part of
a qualified 'web site'. Minicking it as a html site is like a man
trying to dress in skirt,
oh, my poor Scotsman... Hence, I believe a service that would evaluate
and index web apps
(database-driven, xml-based or what not) vs. the plain HTML -driven
web sites would be
valuable to both consumers and businesses. Or maybe there's already
such a service out there?

Oct 20 '07 #1
20 2390
On Sun, 21 Oct 2007 02:38:47 -0000, ta********@gmail.com wrote:
Please take a look at my app to see what I mean, http://www.mytata.net
when you have a chance. So, in sum, the speed of the app is a major
design consideration.
Not criticising, but:
<http://validator.w3.org/check?verbose=1&uri=http%3A%2F%2Fweb.mytata.net%3A 8000%2FmyTA%2F>
won't help.
--
Locate your Mobile phone: <http://www.bizorg.co.uk/news.html>
Great gifts: <http://www.ThisBritain.com/ASOS_popup.html>
Oct 21 '07 #2
On 21 Oct, 03:38, tatata9...@gmail.com wrote:
On Oct 20, 7:20 pm, Adrienne Boswell <arb...@yahoo.comwrote:
2. Put CSS in an external stylesheet.
3. Put javascript in an external script.
Yes, I read about using include file for js, so, I just did.
Which is the complete opposite of what you were told
As for CSS, I've even eliminated them for now for speed sake.
You're mad, aren't you?

Oct 21 '07 #3
ta********@gmail.com wrote:
Now, I would think more and more web-based applications are coming into
the fray, my recent experience with google's webbot (indexing engine, a
part of the google's search service) tells me that its indexing
algorithm does not consider a database-driven web application as part of
a qualified 'web site'.
Google doesn't index your "database" pages because you didn't provide a
public access!
Otherwise, google has perfectly fine algorithms to crawl database driven
sites.
For google indexing, keep in mind that AJAX is very unfriendly to bots
(and to people who have a good browser).
As for CSS, I've even eliminated them for now for speed sake.
POSH (Plain Old Semantic HTML) visually improved by a stylesheet results
in much smaller code than table driven HTML code designed with its layout
in mind only.

Your code is not optimized.
For example, this page:
http://web.mytata.net:8000/myTA/index2_so.cfm?utype=s
Uses 6819 bytes.
It could get down to 600 bytes (you would have to remove that JavaScript
thing).
As for images, I have only a few tiny ones.
That's ok.
One thing I want to do away is a bit too many tables.
Table layouts are a bad thing.

--
If you've a question that doesn't belong to Usenet, contact me at
<ta*****************@yahoDELETETHATo.fr>
Oct 21 '07 #4
André Gillibert wrote:
It could get down to 600 bytes (you would have to remove that
JavaScript thing).
Do you mean the 22,050 bytes of urchintracker.js ? :-)

--
-bts
-Motorcycles defy gravity; cars just suck
Oct 21 '07 #5
Beauregard T. Shagnasty wrote:
André Gillibert wrote:
>It could get down to 600 bytes (you would have to remove that
JavaScript thing).

Do you mean the 22,050 bytes of urchintracker.js ? :-)
The 217 bytes of toggle.js, the 21411 bytes of urchin.js, and #6000 bytes
of useless/invisible tag soup.

--
If you've a question that doesn't belong to Usenet, contact me at
<ta*****************@yahoDELETETHATo.fr>
Oct 21 '07 #6
On Oct 21, 9:04 am, "André Gillibert"
<tabkanDELETETHIS...@yahodeletethato.frwrote:
tatata9...@gmail.com wrote:
Now, I would think more and more web-based applications are coming into
the fray, my recent experience with google's webbot (indexing engine, a
part of the google's search service) tells me that its indexing
algorithm does not consider a database-driven web application as part of
a qualified 'web site'.

Google doesn't index your "database" pages because you didn't provide a
public access!
Otherwise, google has perfectly fine algorithms to crawl database driven
sites.
For google indexing, keep in mind that AJAX is very unfriendly to bots
(and to people who have a good browser).
As for CSS, I've even eliminated them for now for speed sake.

POSH (Plain Old Semantic HTML) visually improved by a stylesheet results
in much smaller code than table driven HTML code designed with its layout
in mind only.

Your code is not optimized.
For example, this page:http://web.mytata.net:8000/myTA/index2_so.cfm?utype=s
Uses 6819 bytes.
It could get down to 600 bytes (you would have to remove that JavaScript
thing).
As for images, I have only a few tiny ones.

That's ok.
One thing I want to do away is a bit too many tables.

Table layouts are a bad thing.

--
If you've a question that doesn't belong to Usenet, contact me at
<tabkanDELETETHIS...@yahoDELETETHATo.fr>
On providing public access to google for indexing my database, could
you elaborate a bit?
Like usually it's done via sitemap.xml? or?
Then, what I should watch in terms of privacy for user data protection
(though I tend to think the only confidential data in my db would be
user email address, could I set up some sort of exclusion rule?)
How that thing work, I'd loving to know it NOW.

And thank you for the technique on POSH and CSS combo. Will work on
it...
Oct 21 '07 #7
On Oct 21, 2:52 am, David Quinton
<usenet_2005D_em...@REMOVETHISBITbizorg.co.ukwrote :
On Sun, 21 Oct 2007 02:38:47 -0000, tatata9...@gmail.com wrote:
Please take a look at my app to see what I mean,http://www.mytata.net
when you have a chance. So, in sum, the speed of the app is a major
design consideration.

Not criticising, but:
<http://validator.w3.org/check?verbose=1&uri=http%3A%2F%2Fweb.mytata.n...>
won't help.
--
Locate your Mobile phone: <http://www.bizorg.co.uk/news.html>
Great gifts: <http://www.ThisBritain.com/ASOS_popup.html>
I don't perceive it as criticism at all, on the contrary, I want to
thank you, someting on the plate...

Oct 21 '07 #8
On Oct 21, 8:16 am, Andy Dingley <ding...@codesmiths.comwrote:
On 21 Oct, 03:38, tatata9...@gmail.com wrote:
On Oct 20, 7:20 pm, Adrienne Boswell <arb...@yahoo.comwrote:
2. Put CSS in an external stylesheet.
3. Put javascript in an external script.
Yes, I read about using include file for js, so, I just did.

Which is the complete opposite of what you were told
You lost me here.
>
As for CSS, I've even eliminated them for now for speed sake.

You're mad, aren't you?
You're right, I don't see its syntax's logic (not from a designer's
background).
Oct 21 '07 #9
On Oct 21, 9:40 am, "Beauregard T. Shagnasty"
<a.nony.m...@example.invalidwrote:
André Gillibert wrote:
It could get down to 600 bytes (you would have to remove that
JavaScript thing).

Do you mean the 22,050 bytes of urchintracker.js ? :-)

--
-bts
-Motorcycles defy gravity; cars just suck
The tradeoff of not knowing site usage would be huge unless spending
time to find an alternative, have to chew on that.

Oct 21 '07 #10
On Oct 21, 9:46 am, "André Gillibert"
<tabkanDELETETHIS...@yahodeletethato.frwrote:
Beauregard T. Shagnasty wrote:
André Gillibert wrote:
It could get down to 600 bytes (you would have to remove that
JavaScript thing).
Do you mean the 22,050 bytes of urchintracker.js ? :-)

The 217 bytes of toggle.js, the 21411 bytes of urchin.js, and #6000 bytes
of useless/invisible tag soup.

--
If you've a question that doesn't belong to Usenet, contact me at
<tabkanDELETETHIS...@yahoDELETETHATo.fr>
I respectfully disagree on not using invisible tag (to me, providing
user less cluttered page would easier for consumption...)

Oct 21 '07 #11
wrote:
On Oct 21, 9:46 am, "André Gillibert"
<tabkanDELETETHIS...@yahodeletethato.frwrote:
>Beauregard T. Shagnasty wrote:
André Gillibert wrote:
>It could get down to 600 bytes (you would have to remove that
JavaScript thing).
Do you mean the 22,050 bytes of urchintracker.js ? :-)

The 217 bytes of toggle.js, the 21411 bytes of urchin.js, and #6000
bytes
of useless/invisible tag soup.

I respectfully disagree on not using invisible tag (to me, providing
user less cluttered page would easier for consumption...)
My point was to REMOVE entirely the invisible content from the page and to
replace the JavaScript link with a regular anchor pointing to a page with
the content.

--
If you've a question that doesn't belong to Usenet, contact me at
<ta*****************@yahoDELETETHATo.fr>
Oct 21 '07 #12
On Oct 21, 11:35 am, "André Gillibert"
<tabkanDELETETHIS...@yahodeletethato.frwrote:
wrote:


On Oct 21, 9:46 am, "André Gillibert"
<tabkanDELETETHIS...@yahodeletethato.frwrote:
Beauregard T. Shagnasty wrote:
André Gillibert wrote:
It could get down to 600 bytes (you would have to remove that
JavaScript thing).
Do you mean the 22,050 bytes of urchintracker.js ? :-)
The 217 bytes of toggle.js, the 21411 bytes of urchin.js, and #6000
bytes
of useless/invisible tag soup.
I respectfully disagree on not using invisible tag (to me, providing
user less cluttered page would easier for consumption...)

My point was to REMOVE entirely the invisible content from the page and to
replace the JavaScript link with a regular anchor pointing to a page with
the content.

--
If you've a question that doesn't belong to Usenet, contact me at
<tabkanDELETETHIS...@yahoDELETETHATo.fr>- Hide quoted text -

- Show quoted text -
Does this have something to do with google bot indexing? Or saving
some bytes? thks. oh, could you share your knowledge on the google
database indexing techniques? Many thks.

Oct 21 '07 #13
ta********@gmail.com wrote:
On Oct 21, 9:40 am, "Beauregard T. Shagnasty"
<a.nony.m...@example.invalidwrote:
>André Gillibert wrote:
It could get down to 600 bytes (you would have to remove that
JavaScript thing).

Do you mean the 22,050 bytes of urchintracker.js ? :-)

The tradeoff of not knowing site usage would be huge unless spending
time to find an alternative, have to chew on that.
Do your statistics with *your* server. HTTP requests are sent to your
server, I don't see why you wouldn't be able to make statistics on them?
You can get the client IP address, all the HTTP headers such as the user
agent string or the referer URI, without any client-side scripts.

This type of script make me confirm that JavaScripts are mainly used as
spywares, not for user benefit.
I understand that kind people who use JavaScript for improving user
experience defends their JavaScripts. But I don't understand how people
who use JavaScripts only to "own" the user computer can defend their
position.
Assume that 100% of the JavaScripts on earth were used for such cr*p, as a
*user*, would you really enable them in your browser profile?

Fortunately, I've disabled JavaScripts on my browser. Surely, it breaks
your statistics, but web statistics are always biaised.

--
If you've a question that doesn't belong to Usenet, contact me at
<ta*****************@yahoDELETETHATo.fr>
Oct 21 '07 #14
On Oct 21, 11:51 am, "André Gillibert"
<tabkanDELETETHIS...@yahodeletethato.frwrote:
tatata9...@gmail.com wrote:
On Oct 21, 9:40 am, "Beauregard T. Shagnasty"
<a.nony.m...@example.invalidwrote:
André Gillibert wrote:
It could get down to 600 bytes (you would have to remove that
JavaScript thing).
Do you mean the 22,050 bytes of urchintracker.js ? :-)
The tradeoff of not knowing site usage would be huge unless spending
time to find an alternative, have to chew on that.

Do your statistics with *your* server. HTTP requests are sent to your
server, I don't see why you wouldn't be able to make statistics on them?
You can get the client IP address, all the HTTP headers such as the user
agent string or the referer URI, without any client-side scripts.

This type of script make me confirm that JavaScripts are mainly used as
spywares, not for user benefit.
I understand that kind people who use JavaScript for improving user
experience defends their JavaScripts. But I don't understand how people
who use JavaScripts only to "own" the user computer can defend their
position.
Assume that 100% of the JavaScripts on earth were used for such cr*p, as a
*user*, would you really enable them in your browser profile?

Fortunately, I've disabled JavaScripts on my browser. Surely, it breaks
your statistics, but web statistics are always biaised.

--
If you've a question that doesn't belong to Usenet, contact me at
<tabkanDELETETHIS...@yahoDELETETHATo.fr>
Thank you very much. I see your point. Not really into CSS, is
there a way to 'minik' javascript function like toggle in my case with
CSS?
Oct 21 '07 #15
ta********@gmail.com wrote:
"Beauregard T. Shagnasty" wrote:
>André Gillibert wrote:
>>It could get down to 600 bytes (you would have to remove that
JavaScript thing).

Do you mean the 22,050 bytes of urchintracker.js ? :-)

The tradeoff of not knowing site usage would be huge unless spending
time to find an alternative, have to chew on that.
Your urchintracker did not register me when I visited your page. All the
google ads, google-analytics and urchin domains are in my HOSTS file.
However, your web host will have registered that I made requests to your
page.

--
-bts
-Motorcycles defy gravity; cars just suck
Oct 21 '07 #16
On Oct 21, 12:38 pm, "Beauregard T. Shagnasty"
<a.nony.m...@example.invalidwrote:
tatata9...@gmail.com wrote:
"Beauregard T. Shagnasty" wrote:
André Gillibert wrote:
It could get down to 600 bytes (you would have to remove that
JavaScript thing).
Do you mean the 22,050 bytes of urchintracker.js ? :-)
The tradeoff of not knowing site usage would be huge unless spending
time to find an alternative, have to chew on that.

Your urchintracker did not register me when I visited your page. All the
google ads, google-analytics and urchin domains are in my HOSTS file.
However, your web host will have registered that I made requests to your
page.

--
-bts
-Motorcycles defy gravity; cars just suck
Yes, I know my web server has a log for user agent requests.
Initially I'll just read the plain log file, then find a tool that
organizes data for me. Thks.

Oct 21 '07 #17
ta********@gmail.com wrote:
The first generation of web site search engine hands-down is google.
A majority of these web
sites are static page -driven html pages.

Now, I would think more and more web-based applications are coming
into the fray, my recent
experience with google's webbot (indexing engine, a part of the
google's search service) tells
me that its indexing algorithm does not consider a database-driven web
application as part of
a qualified 'web site'.
It does, but whether you get results depends on your URLs. If links to a
particular page always have the URL

http://example.com/generate.aspx?page=472

then Google should handle it as well as

http://example.com/staticURL.aspx

But take the case of Amazon, where successive searches produced the
following URLs for me for the same album:

http://www.amazon.com/Cant-Imagine-M...3066211&sr=8-2

http://www.amazon.com/Cant-Imagine-M...3066264&sr=1-2

http://www.amazon.com/Cant-Imagine-M...3066274&sr=1-7

because Amazon includes session-specific information in the query string
(such as the query ID [qid]) and the position of the item in the search
results from which it was selected and the page encoding. Google will
come across each of these links exactly once, and never have any idea
that they represent the same album. So if Amazon didn't also produce a
unique canonical URL for each article to feed to the search engines,
Amazon items couldn't be effectively indexed and ranked. (Well, I
suspect Amazon actually has a special arrangement for feeding Google,
but that doesn't alter my point.)
Oct 22 '07 #18
On Oct 22, 11:23 am, Harlan Messinger
<hmessinger.removet...@comcast.netwrote:
tatata9...@gmail.com wrote:
The first generation of web site search engine hands-down is google.
A majority of these web
sites are static page -driven html pages.
Now, I would think more and more web-based applications are coming
into the fray, my recent
experience with google's webbot (indexing engine, a part of the
google's search service) tells
me that its indexing algorithm does not consider a database-driven web
application as part of
a qualified 'web site'.

It does, but whether you get results depends on your URLs. If links to a
particular page always have the URL

http://example.com/generate.aspx?page=472

then Google should handle it as well as

http://example.com/staticURL.aspx

But take the case of Amazon, where successive searches produced the
following URLs for me for the same album:

http://www.amazon.com/Cant-Imagine-M...B000AMU0UI/ref...

http://www.amazon.com/Cant-Imagine-M...B000AMU0UI/ref...

http://www.amazon.com/Cant-Imagine-M...B000AMU0UI/ref...

because Amazon includes session-specific information in the query string
(such as the query ID [qid]) and the position of the item in the search
results from which it was selected and the page encoding. Google will
come across each of these links exactly once, and never have any idea
that they represent the same album. So if Amazon didn't also produce a
unique canonical URL for each article to feed to the search engines,
Amazon items couldn't be effectively indexed and ranked. (Well, I
suspect Amazon actually has a special arrangement for feeding Google,
but that doesn't alter my point.)
But the thing is, my web app requires user account, so, probably I'll
have to create some gimick url like
http://www.mytata.net/SupportBotQuery?qid=1234 something?

btw, could you folks try my site, http://www.mytata.net/ ? I was told
it's not accessable about 10 minutes ago (my apologies for having to
post this particular request)

Thanks.

Oct 22 '07 #19
ta********@gmail.com wrote:
On providing public access to google for indexing my database, could
you elaborate a bit?
The user specific data (e.g. activity tables) is private. It's only
accessible when logged in.
This ensures privacy of the data, but prevents (thanks goodness) google
from crawling this data. Similarly, google doesn't access to your bank
account!
This is a Good Thing(TM).

Just keep it as it is. Google can still access the public pages (e.g. the
home page), but won't access to user's private data.
Like usually it's done via sitemap.xml?
Forget about sitemap.xml; It is useless as far as every page in your site
is accessible through regular anchors. sitemap.xml is useful for databases
for which there're no HTML indexes, but only a search feature.

--
If you've a question that doesn't belong to Usenet, contact me at
<ta*****************@yahoDELETETHATo.fr>
Oct 22 '07 #20
On Oct 22, 6:16 pm, "André Gillibert"
<tabkanDELETETHIS...@yahodeletethato.frwrote:
tatata9...@gmail.com wrote:
On providing public access to google for indexing my database, could
you elaborate a bit?

The user specific data (e.g. activity tables) is private. It's only
accessible when logged in.
This ensures privacy of the data, but prevents (thanks goodness) google
from crawling this data. Similarly, google doesn't access to your bank
account!
This is a Good Thing(TM).

Just keep it as it is. Google can still access the public pages (e.g. the
home page), but won't access to user's private data.
Like usually it's done via sitemap.xml?

Forget about sitemap.xml; It is useless as far as every page in your site
is accessible through regular anchors. sitemap.xml is useful for databases
for which there're no HTML indexes, but only a search feature.

--
If you've a question that doesn't belong to Usenet, contact me at
<tabkanDELETETHIS...@yahoDELETETHATo.fr>
Ok, thank you.

Oct 23 '07 #21

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

13
2914
by: fuzzyman | last post by:
I've hacked together a 'GoogleCacheServer'. It is based on SimpleHTTPServer. Run the following script (hopefully google groups won't mangle the indentation) and set your browser proxy settings to...
3
4183
by: Alastair | last post by:
Hello guys, I've been building a search facility for an intranet site I'm part of developing and we've been building a search engine using Index Server. It mostly works, however there have been...
25
2512
by: Tor Erik Sønvisen | last post by:
Hi I need to browse the socket-module source-code. I believe it's contained in the file socketmodule.c, but I can't locate this file... Where should I look? regards tores
9
2385
by: Ray5531 | last post by:
Sorry if this is irrelevant to this website,but I didn't find a better place to ask this question.I sent an email to google as well which I didn't recieve the answer.I'm creating a website and...
2
2013
by: gen_tricomi | last post by:
THE IMPORTANCE OF MAKING THE GOOGLE INDEX DOWNLOADABLE I write here to make a request on behalf of all the programmers on earth who have been or are intending to use the Google web search API...
1
7269
by: xahlee | last post by:
Elisp Tutorial: Make Google Earth Xah Lee, 2006-12 This page shows a example of writing a emacs lisp function that creates a Google Earth file, and creates a link to the file, as well a link...
9
3556
by: maheswaran | last post by:
Hi all, I developed one application. From that application i created dynamic pages contact us , about us...(like joomla, but application is not in joomla)... These all are comes from database.In...
0
6967
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
7142
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
7181
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
1
6847
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
7352
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
5445
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...
1
4875
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...
0
4565
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...
0
272
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.