An idea for a new HTML attribute... opinions needed...

Richard Clarke
4 New Member
Hey,

I was searching on Google earlier and came across a site whose content had changed since it was indexed, so the page was no longer relevant to the search term. I know some sites, such as eBay, have parts of some pages that change on every page load: for example, a catalogue with a New Items or Featured Items section in which 5 random items are selected from a catalogue of 500. I come across this kind of thing quite regularly, which is what made me think of this idea.

My idea is to have a "changeFreq" attribute for which the developer can specify any of the following:
  • PageLoad[:URL]
  • Daily[:hh:mm][:URL]
  • Weekly[:Mon|Tue|Wed|Thu|Fri|Sat|Sun][:hh:mm][:URL]
  • Monthly[:dd][:hh:mm][:URL]
  • Yearly[:mm[/dd]][:URL]

So what does all this mean? Well, for PageLoad, search engines simply will not index the element. For options with [:hh:mm], the time is optional; if no time is supplied, it defaults to midnight. For options with [Mon|Tue|Wed|Thu|Fri|Sat|Sun], the day is optional; if no day is supplied, it defaults to Monday. For options with [:dd] (yes, you guessed it!), it is the day of the month. If it is more than the number of days in the month, it falls back to the last day of that month. If no day is specified, it defaults to the 1st. For options with [:mm[/dd]], it means (fairly obviously) the month and day. The same rules apply for the day, and if the month is specified on its own then the day defaults to the 1st.
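To make these defaults concrete, here is a minimal sketch of how a consumer might parse the value. It only covers PageLoad, Daily and Weekly, and it assumes the colon-separated form used in the example further down (`Weekly:Mon:13:00`). The names `parseChangeFreq` and `clampDayOfMonth`, and the shape of the returned object, are illustrative assumptions and not part of any standard.

```javascript
// Hypothetical parser for the proposed changeFreq value (PageLoad, Daily, Weekly only).
const PATTERNS = new Map([
  ["PageLoad", /^PageLoad(?::(?<url>.+))?$/],
  ["Daily", /^Daily(?::(?<hh>\d{1,2}):(?<mm>\d{2}))?(?::(?<url>.+))?$/],
  ["Weekly", /^Weekly(?::(?<day>Mon|Tue|Wed|Thu|Fri|Sat|Sun))?(?::(?<hh>\d{1,2}):(?<mm>\d{2}))?(?::(?<url>.+))?$/],
]);

function parseChangeFreq(value) {
  const period = value.split(":", 1)[0];
  const pattern = PATTERNS.get(period);
  const match = pattern ? pattern.exec(value) : null;
  if (!match) throw new SyntaxError(`Unrecognised changeFreq: ${value}`);

  const { day, hh, mm, url } = match.groups;
  const rule = { period, url: url ?? null };
  if (period === "PageLoad") return rule; // no schedule: changes on every load

  // Time is optional and defaults to midnight.
  rule.hour = hh === undefined ? 0 : Number(hh);
  rule.minute = mm === undefined ? 0 : Number(mm);
  if (rule.hour > 23 || rule.minute > 59) {
    throw new RangeError(`Invalid time in changeFreq: ${value}`);
  }

  // Day of week is optional and defaults to Monday.
  if (period === "Weekly") rule.day = day ?? "Mon";
  return rule;
}

// The Monthly rule: a day past the end of the month falls back to the last day.
// `month` is 1-based (1 = January), matching the [:mm] notation.
function clampDayOfMonth(year, month, day) {
  const lastDay = new Date(Date.UTC(year, month, 0)).getUTCDate();
  return Math.min(day, lastDay);
}
```

For example, `parseChangeFreq("Weekly")` yields Monday at 00:00 with no URL, and `clampDayOfMonth(2011, 2, 31)` yields 28.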

Now for the clever bit - the [:URL]. This specifies the URL to load into the element when it is out of date. This can be used for two purposes - for more relevant searches and for better caching systems. I can hear you asking what difference this would make... it would basically allow search engines to re-index just a part of the page, and browsers to load most of a page from the cache but reload any out-dated parts. The URL is optional and, if it is not specified on an out-dated element, the whole page will be re-indexed or reloaded.
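As a rough sketch of the "out of date" check described here, the following decides whether a cached copy of an element is stale and what should be fetched to refresh it. It handles only PageLoad and Daily rules, takes an already parsed rule object, and interprets times as UTC; the function names, the rule shape and the UTC choice are all assumptions, since the proposal does not specify a time zone.

```javascript
// Most recent moment, at or before `now`, at which a Daily rule says the content changed.
// Assumption: hh:mm is interpreted in UTC.
function lastDailyChange(now, hour, minute) {
  const t = new Date(Date.UTC(
    now.getUTCFullYear(), now.getUTCMonth(), now.getUTCDate(), hour, minute));
  if (t > now) t.setUTCDate(t.getUTCDate() - 1); // today's slot has not happened yet
  return t;
}

// True when a copy fetched at `fetchedAt` predates the latest scheduled change.
function isOutOfDate(fetchedAt, now, rule) {
  if (rule.period === "PageLoad") return true; // changes on every load
  if (rule.period !== "Daily") {
    throw new Error("This sketch only handles PageLoad and Daily");
  }
  return fetchedAt < lastDailyChange(now, rule.hour, rule.minute);
}

// With a URL, only the fragment is refetched; without one, the whole page is.
function refreshTarget(rule, pageUrl) {
  return rule.url ?? pageUrl;
}
```

A crawler or browser cache would call `isOutOfDate` per element and then fetch `refreshTarget(rule, pageUrl)` only for the stale ones.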

Oh and this attribute, if a URL is specified, could be used to load any outdated parts of the page with jQuery and similar frameworks... other options would have to be available for this, such as Hourly, Minutely, Secondly and so on.

Inheritance will be applied to all elements where there is no changeFreq specified. Any elements within an element set to PageLoad will not be indexed (even if they have a different changeFreq specified). Consider the following code:

<div changeFreq="Weekly:Mon:13:00">
    <div id="TodayOnlyOffers" changeFreq="Daily:13:30:/Parts/TodayOnly.aspx">
        Content will be updated with the URL /Parts/TodayOnly.aspx at 13:30 every day, but this tag will be replaced weekly every Monday at 13:00.
    </div>
    <div changeFreq="PageLoad">
        Content will not be indexed, and this tag will also be replaced weekly every Monday at 13:00.
        <div changeFreq="Daily:15:00:/Parts/Something-Else.aspx">
            This tag will also not be indexed because it is within a PageLoad tag. No matter what the changeFreq of this tag is set to, the search engine will still think it changes every page load. A solution would be to put the content outside this tag, but within the parent tag, within its own tag set to PageLoad and remove the changeFreq of the parent tag.
        </div>
    </div>
</div>

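The inheritance rule can be sketched as a walk up the element tree. This is only an illustration: elements are modelled as plain objects with a `parent` link and an optional `changeFreq` string, and `effectiveChangeFreq` is a hypothetical name. A PageLoad ancestor overrides anything set below it; otherwise the nearest value (the element's own, or the closest ancestor's) applies.

```javascript
// Resolve the changeFreq that applies to a node under the proposed inheritance rule.
// Nodes are plain objects: { changeFreq?: string, parent?: node } (an assumed model).
function effectiveChangeFreq(node) {
  let nearest = null;
  for (let n = node; n; n = n.parent) {
    const value = n.changeFreq;
    if (!value) continue;
    // Anything at or inside a PageLoad element is treated as changing on every load.
    if (value === "PageLoad" || value.startsWith("PageLoad:")) return "PageLoad";
    if (nearest === null) nearest = value; // own value, or closest ancestor's
  }
  return nearest; // null when nothing is specified anywhere
}

// The structure from the example above, plus one element with no attribute.
const outer = { changeFreq: "Weekly:Mon:13:00" };
const todayOnly = { changeFreq: "Daily:13:30:/Parts/TodayOnly.aspx", parent: outer };
const perLoad = { changeFreq: "PageLoad", parent: outer };
const nested = { changeFreq: "Daily:15:00:/Parts/Something-Else.aspx", parent: perLoad };
const plain = { parent: outer };

console.log(effectiveChangeFreq(todayOnly)); // its own Daily value
console.log(effectiveChangeFreq(nested));    // PageLoad, because of its parent
console.log(effectiveChangeFreq(plain));     // inherits the Weekly value
```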
So what do people think of this idea? Are there any major flaws that I haven't thought of? I am not really up on how caching and indexing work - I know the basics but I don't know enough to know if this is actually a really good idea... Any opinions would be greatly appreciated before I even think about approaching W3C or anyone like that. And on that note, does anyone know exactly who (either firm or person) I would need to contact?

Thanks in advance.

Regards,

Richard
Oct 3 '10 #1
drhowarddrfine
7,435 Recognized Expert Expert
What you are looking at doing is duplicating HTTP headers that already do most of that, such as the Expires header. Other parts are done using .htaccess and robots.txt and sitemap.xml files. Also, you can't have parts of the HTML standard relying on outside functionality, particularly things that are not standards, such as other frameworks.

The HTML and HTTP standards expose themselves so other systems can access the information you talk about and let those programs deal with that in whatever way they want.
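For comparison, this is roughly what the existing page-level mechanism looks like. The sketch below builds `Expires` and `Cache-Control: max-age` response headers for a page whose next change time is known; both headers are standard HTTP, but the `pageCacheHeaders` helper and the way the next change time is supplied are assumptions made for illustration.

```javascript
// Build page-level caching headers from the time the page is next expected to change.
// `now` and `nextChange` are Date objects.
function pageCacheHeaders(now, nextChange) {
  const maxAge = Math.max(0, Math.floor((nextChange - now) / 1000)); // seconds
  return {
    "Expires": nextChange.toUTCString(), // HTTP-date, always in GMT
    "Cache-Control": `max-age=${maxAge}`,
  };
}

// A page served at 13:00 UTC that changes at 13:30 UTC may be cached for 30 minutes.
const headers = pageCacheHeaders(
  new Date(Date.UTC(2010, 9, 4, 13, 0)),
  new Date(Date.UTC(2010, 9, 4, 13, 30)));
console.log(headers);
```

Note that these headers apply to the whole response, which is the page-level granularity the thread goes on to discuss.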
Oct 3 '10 #2
Richard Clarke
4 New Member
Well, since the original example (in my experience) occurs so frequently, something needs to point out to search engines what, if anything, changes on page load and the like. This is basically tag-level caching. I know the amount of data to track would be vastly higher, but it could also take large chunks of data out of storage because they change on every page load.

I know techniques exist - but they are ONLY for page level caching / indexing. Nothing can signal to the bot that "this tag should not be indexed".

Richard
Oct 3 '10 #3
drhowarddrfine
7,435 Recognized Expert Expert
No standards committee will ever create something that reacts to a program. Tag level caching, like you describe, can be handled with RDF, already a standard and used quite a bit.
Oct 3 '10 #4
Richard Clarke
4 New Member
What do you mean "something that reacts to a program"? Surely it's the programs that are reacting to it - and that's how it is currently (i.e. normal caching with no-cache and the like is reacted to appropriately by browsers, and any reputable robot reacts to robots.txt).
Oct 3 '10 #5
drhowarddrfine
7,435 Recognized Expert Expert
What I meant was a standard developed so one particular implementation can use it. Search engines are just special purpose programs.
Oct 4 '10 #6
Richard Clarke
4 New Member
Ah, now I understand what you mean. As it happens, I have posted this idea on the Google Webmaster forums to see how well they accepted it - using it for search engines has generally been turned down because too many people don't keep to web standards anyway - so this probably wouldn't be any different, and would be abused.

However, I still think it could be used by web browsers to speed up page load times - and it wouldn't stop there, as it could also be used by jQuery and other frameworks for smaller timespans such as Minutely:30:/parts/part.aspx (every minute at the 30 seconds mark, load /parts/part.aspx into this tag). It would be easy enough to do that, and it could be built into jQuery to automatically detect these timespans and react accordingly.
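A minimal sketch of the `Minutely:30` case might look like this. The timing helper is plain JavaScript; the `load` callback is injected so that a page could pass something like jQuery's `.load()`. The names `msUntilNextTick` and `scheduleMinutely` are hypothetical, and nothing here is built into jQuery.

```javascript
// Milliseconds from `now` until the next time the clock's seconds equal `second`.
// Exactly on the mark counts as "just happened", so the result is a full minute.
function msUntilNextTick(now, second) {
  const elapsed = now.getUTCSeconds() * 1000 + now.getUTCMilliseconds();
  const delta = second * 1000 - elapsed;
  return delta > 0 ? delta : delta + 60000;
}

// Call load(url) once a minute at the given second mark.
// Returns a function that cancels the schedule.
function scheduleMinutely(second, url, load) {
  let timer = setTimeout(function tick() {
    load(url);
    timer = setTimeout(tick, msUntilNextTick(new Date(), second));
  }, msUntilNextTick(new Date(), second));
  return () => clearTimeout(timer);
}

// In a page using jQuery, usage might look like (assumed element id):
//   const stop = scheduleMinutely(30, "/parts/part.aspx",
//     (url) => $("#part").load(url));
```

Recomputing the delay after every tick, instead of using a fixed 60 second interval, keeps the reloads aligned to the 30 second mark even if a timer fires late.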
Oct 4 '10 #7
