Career in Data Mining?

Hi all,

I have a Bachelor's in Computer Engineering and am about to start a
Master's (majoring in knowledge-based systems). I'm quite fascinated by
data mining and knowledge-based systems, and I'd like to pursue a
career in this field. However, I'm not sure what opportunities are
available. Apart from research, what else is commonly on offer? I'd be
most interested in developing knowledge-based software (e.g. using
neural networks), but I'd also be very interested in any of the
computational side of the field.

Another question: because my Master's degree will be coursework-based
(plus a minor thesis on using neural nets to compose music), I'm still
deciding which courses to take. Should I take only computing courses
(e.g. machine learning, data mining, data warehousing, DB development,
e-commerce (?))? Or should I also take some statistics courses (e.g.
statistical inference)? Any other advice would be much appreciated.
Cheers,
Michael

Nov 12 '05 #1
Data mining lies in an area of overlap among A.I. (especially the
machine learning and pattern recognition end of A.I.), statistics (and
related applied math fields such as O.R.) and computer science. I
would suggest studying inferential statistics. If you're interested in
learning more about career opportunities in data mining, see the job
listing on KDnuggets:

http://www.kdnuggets.com/jobs/index.html

I'm not sure why you asked about this in a database group- data miners
are more likely to congregate in newsgroups such as comp.ai.neural-nets
or sci.stat.math.

-Will Dwinnell
http://will.dwinnell.com

Nov 12 '05 #2
> I'm not sure why you asked about this in a database group- data miners
> are more likely to congregate in newsgroups such as comp.ai.neural-nets
> or sci.stat.math.


Yep, but on the other hand the first step in data mining is typically
collection, integration, and cleansing. Additionally, most
organizations need more basic analytics first (hyperlinked OLAP
reporting, etc.), since most organizations have problems that they know
about and just need more info on. That's where the low-hanging fruit
is. Finding problems that they don't know about yet is great, but
should wait until you've got the known problems fixed and the
foundation set.

Further, in my experience the value of additional depth of analytics is
roughly equivalent to additional breadth of data. And the breadth of
data can usually be solved more reliably and cost-effectively via data
warehousing than the depth can be via data mining.

So, if you know data warehousing and BI you're in a great position to
deliver 80% of the analytics most organizations need today - plus
deliver the foundational components also needed by most data mining
activities.

Unfortunately, most organizations won't pay for that last 20%, but I'd
say that in the meanwhile, BI is more fun than unemployment.
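
As a rough illustration of the "collection, integration, and cleansing" step mentioned above, here is a minimal Python sketch. The two sources, the field names (`customer_id`, `email`, `region`), and the sample records are all hypothetical, invented for illustration; real projects will have their own rules for what counts as a clean record.

```python
# Hypothetical example: integrate two customer extracts and cleanse them.
# All field names and sample records are invented for illustration.

def cleanse(record):
    """Normalize one raw record; return None if it has no usable key."""
    cust_id = str(record.get("customer_id", "")).strip()
    if not cust_id:
        return None  # a record without a key cannot be integrated
    email = (record.get("email") or "").strip().lower()
    region = (record.get("region") or "unknown").strip().title()
    return {"customer_id": cust_id, "email": email, "region": region}


def integrate(*sources):
    """Merge several record lists, keeping the first clean record per key."""
    merged = {}
    for source in sources:
        for raw in source:
            clean = cleanse(raw)
            if clean is not None and clean["customer_id"] not in merged:
                merged[clean["customer_id"]] = clean
    return list(merged.values())


crm_extract = [
    {"customer_id": " 101 ", "email": "Ann@Example.com ", "region": "pacific"},
    {"customer_id": "", "email": "orphan@example.com", "region": "europe"},
]
billing_extract = [
    {"customer_id": "101", "email": "ann@example.com", "region": "Pacific"},
    {"customer_id": "102", "email": None, "region": None},
]

customers = integrate(crm_extract, billing_extract)
```

Here the keyless record is dropped, the duplicate for customer 101 is collapsed, and missing values are given explicit placeholders, which is the kind of groundwork both reporting and data mining depend on.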

Nov 12 '05 #3
Predictor wrote:
"I'm not sure why you asked about this in a database group- data miners
are more likely to congregate in newsgroups such as comp.ai.neural-nets
or sci.stat.math."

bu*********@yahoo.com responded:
"Yep, but on the other hand the first step in data mining is typically
collection, integration, and cleansing."
The amount of these tasks performed by the data miner varies:
depending on the circumstances, it may be necessary to understand a
relational database and formulate an appropriate query, or it may be
sufficient to receive a prepared flat file. Regardless, the data miner
is always responsible for the statistics. If one wants to become a
data miner, I think it makes more sense to study to become a
statistician than a DBA.
-Will Dwinnell
http://will.dwinnell.com
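
To make the two scenarios above concrete, here is a minimal Python sketch using only the standard library. The `customers` and `orders` tables, their columns, and the flat-file layout are hypothetical, chosen only to show the difference between writing the query yourself and receiving a prepared file.

```python
import csv
import io
import sqlite3

# Scenario 1: the data lives in a relational database and the analyst
# must write the query. Table and column names here are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, region TEXT)")
conn.execute("CREATE TABLE orders (customer_id INTEGER, amount REAL)")
conn.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "East"), (2, "West")])
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 20.0), (1, 30.0), (2, 5.0)])

# One row per customer, with order count and total spend.
query = """
    SELECT c.id, c.region, COUNT(o.amount) AS n_orders, SUM(o.amount) AS total
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.id
    GROUP BY c.id, c.region
    ORDER BY c.id
"""
rows_from_db = [tuple(row) for row in conn.execute(query)]
conn.close()

# Scenario 2: someone else has already prepared a flat file
# (an in-memory stand-in for a CSV file on disk).
flat_file = io.StringIO("id,region,n_orders,total\n1,East,2,50.0\n2,West,1,5.0\n")
rows_from_file = [
    (int(r["id"]), r["region"], int(r["n_orders"]), float(r["total"]))
    for r in csv.DictReader(flat_file)
]
```

Either way the analysis starts from the same one-row-per-customer table; the difference is only in who does the preparation.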

Nov 12 '05 #4
> I think it makes more sense to study to become a
> statistician than a DBA.

Agreed. I'm not recommending that someone interested in data mining
pick up database administration.

But a statistician who can consolidate and cleanse data, and possibly
provide some contextual reporting or a scoring solution, will be in a
better position than one who can't.

I'm working with a set of statisticians right now who are limited by
data logistics...

buck

Nov 12 '05 #5
bu*********@yahoo.com wrote:
> I'm not recommending that someone interested in data mining
> pick up database administration.


I think a DBA with good data mining skills would be an extremely
employable person in the next few years (maybe 5). It's my contention
that there is a huge data explosion on the way, and somebody that could
practically separate the wheat from the chaff will be very, very valued.

Nov 12 '05 #6
Mark Townsend wrote:
"I think a DBA with good data mining skills would be an extremely
employable person in the next few years (maybe 5)."

Perhaps, but I would think that would be due to opportunities which
required one or the other of these skills, not often both.

Mark Townsend wrote:
"It's my contention that there is a huge data explosion on the way, and
somebody that could practically separate the wheat from the chaff will
be very, very valued."

I'd say the data explosion has already been under way for at least 10
years, but the point is that most of a DBA's skills would be wasted as
a data miner. This will vary by the data mining task, naturally, but I
would think that a database report writer would have more than enough
skill for the overwhelming majority of data mining projects. I've
worked on a number of projects in which I was simply handed a flat
file.

Starting from scratch, becoming a capable DBA or data miner takes time.
There are only 24 hours in a day. My recommendation is to concentrate
on the math and statistics.
-Will Dwinnell
http://will.dwinnell.com

Nov 12 '05 #7
