Hello All,
I am currently trying to teach a web crawler how to identify blogs,
that is I am trying to determine a fairly inclusive set of criteria
that will help my crawler to identify them.
I have noticed that many Blogs include
div class=blogsomething (A format class conveniantly named blog)
xml tags
and/or php code.
I do know that cms(content management system) is used for several
blogs, does anyone else have any suggestions to help me determine
criteria.
I am aware that any criteria is subjective, especially when
considering sites such as slashdot which has been around longer than
Blogs...
thanks,
David