application/xhtml+xml not recognized by Google

It seems that Google is unable to "recognize" application/xhtml+xml:

http://google.com/search?q=www.unics...otation.x.html
"File Format: Unrecognized"

Then follow the link "View as HTML" to
http://google.com/search?q=cache:www...otation.x.html
and look at the source text (as compared with the original)!

Are they mad?

Apr 3 '06 #1
Andreas Prilop wrote:
It seems that Google is unable to "recognize" application/xhtml+xml:
Are they mad?


Nor can a default installation of Internet Explorer (which still holds a
majority market share), so there aren't many true XHTML documents out there.
Thus it probably isn't worth all that much to Google to chain an XML parser
into their search indexer.

--
David Dorward <http://blog.dorward.me.uk/> <http://dorward.me.uk/>
Home is where the ~/.bashrc is
Apr 3 '06 #2
On Mon, 3 Apr 2006, David Dorward wrote:
Andreas Prilop wrote:
It seems that Google is unable to "recognize" application/xhtml+xml:
Are they mad?


Nor can a default installation of Internet Explorer (which still holds a
majority market share), so there aren't many true XHTML documents out there.
Thus it probably isn't worth all that much to Google to chain an XML parser
into their search indexer.


(1)
Internet Explorer 6 on Windows XP SP2 does display
http://www.unics.uni-hannover.de/nht...otation.x.html
(because of the suffix .html)

(2)
You misquoted me! You have corrupted my text!

My question "Are they mad?" was NOT under the sentence that Google
does not recognize application/xhtml+xml. It was under the link to
Google's cached version
http://google.com/search?q=cache:www...otation.x.html

Look at the source of the above and compare with the original at
http://www.unics.uni-hannover.de/nht...otation.x.html

They *changed* my <h1> to
<p><font size="6" face="helvetica"><b>
etc. etc.

Therefore I ask "Are they mad?".

--
The 6th of June is Bill Gates Day.

Apr 4 '06 #3
Andreas Prilop <nh******@rrzn-user.uni-hannover.de> wrote:
They *changed* my <h1> to
<p><font size="6" face="helvetica"><b>
etc. etc.

Therefore I ask "Are they mad?".


Idiots savants, perhaps? Google has performed a nontrivial conversion from
XHTML 1.1 to quasi-XHTML (absurdly presentational XHTML-lookalike markup with
some syntax errors, such as a <BASE> element before the <html> element). It has
clearly parsed your XHTML (somehow) and mapped the logical elements to
presentational hacks. This uncalled-for transmogrification is "idiotic" in the
common figurative sense but has really required some (abuse of) intelligence
and mental capabilities far above the level of idiots.

--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html

Apr 4 '06 #4
On Tue, 4 Apr 2006, Jukka K. Korpela wrote:
Google has performed a nontrivial conversion from
XHTML 1.1 to quasi-XHTML (absurdly presentational XHTML-lookalike markup with
some syntax errors, such as a <BASE> element before the <html> element). It has
clearly
parsed your XHTML (somehow) and mapped the logical elements to presentational
hacks.


They make quite an effort to index non-HTML files:
http://www.google.com/help/faq_filetypes.html
It should be trivial to index XHTML 1.1, no?

Apr 5 '06 #5
Andreas Prilop <nh******@rrzn-user.uni-hannover.de> wrote:
They make quite an effort to index non-HTML files:
http://www.google.com/help/faq_filetypes.html
It should be trivial to index XHTML 1.1, no?


Indeed, especially since they obviously parse XHTML 1.1 (and then do
something nasty). I wonder why XML is not listed. It should be rather simple
to parse XML documents and index just their textual content.
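
To illustrate how little is needed, here is a minimal sketch in Python using
the standard library's xml.etree.ElementTree. It is not a description of what
Google does; the function name extract_text and the SAMPLE document are
made up for the illustration, and the sketch assumes the document is
well-formed and declares no external entities.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, standing in for any well-formed XHTML/XML file.
SAMPLE = """<?xml version="1.0"?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Notation</title></head>
  <body><h1>Mathematical notation</h1><p>Some <em>marked-up</em> text.</p></body>
</html>"""


def extract_text(xml_source: str) -> str:
    """Return only the character data of an XML document.

    Element names, attributes and namespaces are ignored; runs of
    whitespace are collapsed to single spaces.
    """
    root = ET.fromstring(xml_source)
    # itertext() yields every text node in document order.
    # Joining with a space keeps words from adjacent elements apart,
    # at the cost of splitting a word that is marked up in the middle.
    raw = " ".join(root.itertext())
    return " ".join(raw.split())


if __name__ == "__main__":
    print(extract_text(SAMPLE))
```

A malformed document makes ET.fromstring raise ParseError, so a real indexer
would still need a fallback for the many pages that merely claim to be XHTML.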

--
Yucca, http://www.cs.tut.fi/~jkorpela/
Pages about Web authoring: http://www.cs.tut.fi/~jkorpela/www.html

Apr 5 '06 #6
On Wed, 5 Apr 2006, Jukka K. Korpela wrote:
http://www.google.com/help/faq_filetypes.html


I wonder why XML is not listed. It should be rather simple
to parse XML documents and index just their textual content.


Google writes "File Format: Unrecognized" for XML:
<http://google.com/search?q=site:groups.google.com+inurl:feed>

--
The 6th of June is Bill Gates Day.
Apr 6 '06 #7
