Bytes IT Community

Tool to merge duplicate rows with slightly different data in some fields?

I have a number of spreadsheets, each with between 1,000 and 6,000 rows (each
row is a property), and they all need to be combined into a single database.
Each spreadsheet contains slightly different information, and 80% of the
properties in spreadsheet_01 also appear in spreadsheet_02. Spreadsheet_03
contains slightly different information again, and perhaps 60% of the
properties in this one also appear in other spreadsheets. The problem is not
straightforward, as data such as an address in Spreadsheet_01 may be written
slightly differently in Spreadsheet_02.

My initial thought is to give all the spreadsheets the same structure (which
will then be the database schema), cut and paste them into one large table,
and then find some tool that will find possible duplicate rows and merge the
data into one.
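
As a rough illustration of that plan (not a pointer to any particular tool), here is a minimal Python sketch using only the standard library. The column names, the abbreviation table, and the 0.9 similarity threshold are all assumptions made up for the example and would need tuning against the real data.

```python
import re
from difflib import SequenceMatcher

# Hypothetical abbreviations to expand; extend for the real data.
# Note that "st" can also mean "saint", so check the results by eye.
ABBREVIATIONS = {"rd": "road", "st": "street", "ave": "avenue"}


def normalize(address):
    """Lowercase, strip punctuation, and expand common abbreviations."""
    words = re.sub(r"[^a-z0-9 ]", " ", address.lower()).split()
    return " ".join(ABBREVIATIONS.get(word, word) for word in words)


def is_probable_duplicate(a, b, threshold=0.9):
    """Compare normalized addresses; the threshold is a guess to tune."""
    ratio = SequenceMatcher(None, normalize(a), normalize(b)).ratio()
    return ratio >= threshold


def merge_rows(rows, key="address"):
    """Fold each row into the first earlier row with a similar address,
    filling only the fields that are still empty in the kept row."""
    merged = []
    for row in rows:
        for kept in merged:
            if is_probable_duplicate(kept[key], row[key]):
                for field, value in row.items():
                    if not kept.get(field):
                        kept[field] = value
                break
        else:
            # No similar address found, so keep this row as a new record.
            merged.append(dict(row))
    return merged


# Made-up rows standing in for the combined spreadsheets.
rows = [
    {"address": "12 High St.", "price": "250000", "beds": ""},
    {"address": "12 high street", "price": "", "beds": "3"},
    {"address": "7 Mill Road", "price": "180000", "beds": "2"},
]
combined = merge_rows(rows)
```

This compares every row against every row already kept, so it gets slow as the table grows; grouping the rows by some field that matches exactly before comparing would cut that down. Conflicting non-empty values are left alone here, which is where the manual checking would come in.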

Does such a tool exist? I could even set up an ODBC connection if I need to
use a non-Access database to sort this mess out.

Any advice would be gratefully received.

Cheers

Phil

Dec 6 '07 #1



"Phil Latio" <ph********@f-in-stupid.co.uk> wrote in message
news:xB*******************@fe01.news.easynews.com...
> I have a number of spreadsheets, each with between 1,000 and 6,000 rows
> (each row is a property), and they all need to be combined into a single
> database. Each spreadsheet contains slightly different information, and 80%
> of the properties in spreadsheet_01 also appear in spreadsheet_02.
> Spreadsheet_03 contains slightly different information again, and perhaps
> 60% of the properties in this one also appear in other spreadsheets. The
> problem is not straightforward, as data such as an address in
> Spreadsheet_01 may be written slightly differently in Spreadsheet_02.
>
> My initial thought is to give all the spreadsheets the same structure
> (which will then be the database schema), cut and paste them into one
> large table, and then find some tool that will find possible duplicate
> rows and merge the data into one.
>
> Does such a tool exist? I could even set up an ODBC connection if I need
> to use a non-Access database to sort this mess out.
>
> Any advice would be gratefully received.
>
> Cheers
>
> Phil

Don't worry, I've found the "duplicates query wizard". It might take some
fiddling, but it looks like it should get me 90% of the way there, and I can
handle a bit of manual editing.
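
For anyone reading later: as far as I understand it, a find-duplicates query comes down to grouping on the chosen fields and keeping the groups that contain more than one row. Here is a minimal sketch of that idea, using Python's built-in sqlite3 module rather than Access; the table name, column names, and rows are made up for the example.

```python
import sqlite3

# In-memory database with a hypothetical "properties" table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE properties (id INTEGER PRIMARY KEY, address TEXT, price INTEGER)"
)
conn.executemany(
    "INSERT INTO properties (address, price) VALUES (?, ?)",
    [
        ("12 High Street", 250000),
        ("12 High Street", None),
        ("7 Mill Road", 180000),
        ("12 High St.", 250000),
    ],
)

# Group on the address and keep only addresses that occur more than once.
duplicates = conn.execute(
    """
    SELECT address, COUNT(*) AS copies
    FROM properties
    GROUP BY address
    HAVING COUNT(*) > 1
    """
).fetchall()

print(duplicates)
```

Grouping like this only catches addresses that match exactly, so "12 High St." is not reported alongside "12 High Street". Rows like that are the part left for manual editing.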

Cheers

Phil

Dec 6 '07 #2
