Pattern Recognition

Is there a public program/algorithm that can tell me the key points of a
text?

For example, suppose I entered the following text:

Web logs, or blogs, the online personal diaries where big names and no names
expound on everything from pets to presidents, are going mainstream. While
still a relatively small piece of total online activity, blogging has caught
on with affluent young adults. As Forrester Research analysts recently
noted, blogging will become increasingly common as these consumers age.

The program should give me the main keywords, such as: blog, online, people...

I know some spam filters do this, and maybe Google does too?
I don't need it to be super accurate, just enough to demonstrate that this is
possible.

Any help is greatly appreciated,
Aaron
Jul 21 '05 #1
"Aaron" <ku*****@yahoo.com> wrote in
news:uM**************@tk2msftngp13.phx.gbl:
Is there a public program/algorithm that can tell me the key points of
a text?


Do you want to know the key points or just evaluate? If you just want to
evaluate, you should look at Bayesian algorithms.
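
As a rough illustration of the Bayesian approach mentioned above, here is a minimal naive Bayes text classifier in Python. It is only a sketch: the class name, the "spam"/"ham" labels, and the four training sentences are invented for the example, and add-one (Laplace) smoothing is one common choice among several.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayes:
    """Tiny multinomial naive Bayes classifier (illustrative only)."""

    def __init__(self):
        self.word_counts = {}         # label -> Counter of word frequencies
        self.doc_counts = Counter()   # label -> number of training documents

    def train(self, text, label):
        self.doc_counts[label] += 1
        self.word_counts.setdefault(label, Counter()).update(tokenize(text))

    def classify(self, text):
        vocab = set()
        for counts in self.word_counts.values():
            vocab.update(counts)
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, counts in self.word_counts.items():
            # Log prior plus the sum of log likelihoods, with add-one smoothing.
            score = math.log(self.doc_counts[label] / total_docs)
            total_words = sum(counts.values())
            for word in tokenize(text):
                score += math.log((counts[word] + 1) / (total_words + len(vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical training data, made up for this example.
nb = NaiveBayes()
nb.train("cheap pills buy now", "spam")
nb.train("win money now", "spam")
nb.train("meeting notes attached", "ham")
nb.train("blog post about research", "ham")

print(nb.classify("buy cheap money"))   # spam
print(nb.classify("research blog"))     # ham
```

Note that this evaluates a text against known categories. It does not extract key points, which is the distinction the question above is drawing.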

--
Chad Z. Hower (a.k.a. Kudzu) - http://www.hower.org/Kudzu/
"Programming is an art form that fights back"

Empower ASP.NET with IntraWeb
http://www.atozed.com/IntraWeb/
Jul 21 '05 #2
key points

If the program were to process this post, it would output keywords like:
programming, internet...
Jul 21 '05 #3
Hello Aaron,

Fascinating. Simple keyword counting produces nearly nothing. The only
content words that occur more than once are "blogging," "names," and
"online." The word "people" that you include in your list of keywords
doesn't occur in the paragraph at all.
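
The keyword counting described above can be sketched in a few lines of Python. This is a minimal sketch, not a recommended method: the stopword list is a small hypothetical one chosen for this paragraph, and the function name `keyword_counts` is made up for the example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real one would be far longer.
STOPWORDS = {
    "a", "an", "and", "are", "as", "from", "has", "no", "of", "on", "or",
    "the", "these", "to", "where", "while", "will", "with",
}

SAMPLE = (
    "Web logs, or blogs, the online personal diaries where big names and no "
    "names expound on everything from pets to presidents, are going "
    "mainstream. While still a relatively small piece of total online "
    "activity, blogging has caught on with affluent young adults. As "
    "Forrester Research analysts recently noted, blogging will become "
    "increasingly common as these consumers age."
)


def keyword_counts(text, min_count=2):
    """Return the non-stopword words that occur at least min_count times."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return {word: n for word, n in counts.items() if n >= min_count}


print(keyword_counts(SAMPLE))
# {'online': 2, 'names': 2, 'blogging': 2}
```

Raw counts treat "blogs" and "blogging" as unrelated words and can never produce a word such as "people" that is absent from the text, which is why counting alone falls short here.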

You would need an algorithm that creates a contextual map through a lexical
tree and produces, effectively, an "understanding" of the key concept of the
paragraph. Effectively, you are entering the field of Computational
Linguistics.
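
A much cruder step than the contextual analysis described above, but one that hints at why morphology matters, is to fold word forms together before counting. The sketch below uses a hand-rolled suffix stripper written for this example. It is not a real stemming algorithm (such as Porter's) and will mangle many English words; the names `crude_stem` and `stem_counts` and the shortened sample text are assumptions for illustration.

```python
import re
from collections import Counter


def crude_stem(word):
    """Very rough suffix stripping; for illustration only."""
    if word.endswith("ing") and len(word) > 5:
        stem = word[:-3]
        # Undo consonant doubling, e.g. "blogg" -> "blog".
        if len(stem) >= 2 and stem[-1] == stem[-2]:
            stem = stem[:-1]
        return stem
    if word.endswith("s") and not word.endswith("ss") and len(word) > 3:
        return word[:-1]
    return word


def stem_counts(text):
    """Count words after folding plural and -ing forms together."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(crude_stem(w) for w in words)


# Shortened, hypothetical sample based on the paragraph in the question.
SAMPLE = (
    "Web logs, or blogs, are going mainstream. Blogging has caught on, "
    "and blogging will become common."
)

print(stem_counts(SAMPLE)["blog"])   # 3
```

With the forms merged, "blog" becomes the most frequent content word in the sample, although this still involves no understanding of the text.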

There is some fascinating research on Natural Language Processing that began
in the late 80s (and continues today) that addresses many of these ideas.
I'm sure that some of the current "search" research has raised interest
further. Microsoft Research, IBM Research, and others are very much
interested in these areas.

One example would be the Text Mining project at IBM:
http://www.trl.ibm.com/projects/textmining/index_e.htm

A good link for coding systems that follow some of these practices is here:
http://www.cl.cam.ac.uk/Research/NL/anlt.html

There is WAY too much involved, morphologically, lexically, and
linguistically, to demonstrate even the simplest of these algorithms in a
newsgroup message. Start at your local college library and/or Google for
"Natural Language Processing" and go from there.

--
--- Nick Malik [Microsoft]
MCSD, CFPS, Certified Scrummaster
http://blogs.msdn.com/nickmalik

Disclaimer: Opinions expressed in this forum are my own, and not
representative of my employer.
I do not answer questions on behalf of my employer. I'm just a
programmer helping programmers.
--
Jul 21 '05 #4
Nick, thanks for the info. I'll keep looking and let you guys know.

The best example I can think of right now is Google's AdSense. It does a
very good job of analyzing a web page and pinpointing its central meaning.
Any idea what method they use?
Jul 21 '05 #5

It sounds like you're interested in text mining. Try KDnuggets
(http://www.kdnuggets.com) in the "Software" section.

-Will Dwinnell
http://will.dwinnell.com

Jul 21 '05 #6
