Tool to merge duplicate rows with slightly different data in some fields?

I have a number of spreadsheets, each with between 1,000 and 6,000 rows (each
row is a property), and they all need to be combined into a single database.
Each spreadsheet contains slightly different information, and 80% of the
properties in spreadsheet_01 also appear in spreadsheet_02. Spreadsheet_03
contains slightly different information again, and perhaps 60% of its
properties also appear in the other spreadsheets. The problem is not
straightforward, as data such as an address in Spreadsheet_01 may be written
slightly differently in Spreadsheet_02.

My initial thought is to make all the spreadsheets the same structure (which
will then be the database schema), cut and paste to create one large table,
and then find some tool that will find possible duplicate rows and merge the
data into one.

Does such a tool exist? I could even set up an ODBC connection if I need to
use a non-Access database to sort this mess out.

Any advice would be gratefully received.

Cheers

Phil

Dec 6 '07 #1
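
The approach outlined in the question (one common structure, one combined table, then flag and merge likely duplicates) can be sketched in Python as follows. This is only an illustration under stated assumptions: the field names, the abbreviation table, the 0.9 similarity threshold, and the "first non-empty value wins" merge rule are all hypothetical and would need tuning against the real spreadsheets.

```python
import re
from difflib import SequenceMatcher

# Hypothetical abbreviation table; extend it to suit the real data.
ABBREVIATIONS = {"rd": "road", "st": "street", "ave": "avenue", "ln": "lane"}

def normalise(address):
    """Lower-case, strip punctuation and expand common abbreviations."""
    words = re.sub(r"[^a-z0-9 ]", " ", address.lower()).split()
    return " ".join(ABBREVIATIONS.get(w, w) for w in words)

def similar(a, b, threshold=0.9):
    """True when two normalised addresses are close enough to be one property."""
    return SequenceMatcher(None, normalise(a), normalise(b)).ratio() >= threshold

def merge_rows(rows, key="address"):
    """Merge rows whose key looks alike, keeping the first non-empty value per field.

    Note: falsy values (None, "", 0) are all treated as empty here.
    """
    merged = []
    for row in rows:
        for existing in merged:
            if similar(existing[key], row[key]):
                # Fill only the gaps; never overwrite a value already present.
                for field, value in row.items():
                    if not existing.get(field):
                        existing[field] = value
                break
        else:
            # No close match found, so this is a new property.
            merged.append(dict(row))
    return merged

# Made-up sample rows standing in for the combined spreadsheets.
rows = [
    {"address": "12 High St.", "price": 250000, "bedrooms": None},
    {"address": "12 high street", "price": None, "bedrooms": 3},
    {"address": "7 Mill Lane", "price": 180000, "bedrooms": 2},
]
result = merge_rows(rows)
```

With the sample rows, the two spellings of the High Street address collapse into one record that carries both the price and the bedroom count. Every row is compared against every merged row, so for several thousand properties it would probably be worth comparing only rows that share something cheap to match exactly, such as a postcode.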
1 Reply

"Phil Latio" <ph********@f-in-stupid.co.uk> wrote in message
news:xB*******************@fe01.news.easynews.com...
> I have a number of spreadsheets, each with between 1,000 and 6,000 rows
> (each row is a property), and they all need to be combined into a single
> database. Each spreadsheet contains slightly different information, and
> 80% of the properties in spreadsheet_01 also appear in spreadsheet_02.
> Spreadsheet_03 contains slightly different information again, and perhaps
> 60% of its properties also appear in the other spreadsheets. The problem
> is not straightforward, as data such as an address in Spreadsheet_01 may
> be written slightly differently in Spreadsheet_02.
>
> My initial thought is to make all the spreadsheets the same structure
> (which will then be the database schema), cut and paste to create one
> large table, and then find some tool that will find possible duplicate
> rows and merge the data into one.
>
> Does such a tool exist? I could even set up an ODBC connection if I need
> to use a non-Access database to sort this mess out.
>
> Any advice would be gratefully received.
>
> Cheers
>
> Phil

Don't worry, I've found the "duplicates query wizard". It might take some
fiddling, but it looks like it should get me 90% of the way there, and I can
handle a bit of manual editing.

Cheers

Phil

Dec 6 '07 #2
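
For readers outside Access, a find-duplicates query boils down to grouping on the chosen fields and keeping the groups that contain more than one row. The sketch below shows that idea with Python's built-in sqlite3 module standing in for Access; the table name, column names, and sample data are hypothetical, and the SQL the wizard itself generates will look different.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for the Access table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE properties (id INTEGER PRIMARY KEY, address TEXT, postcode TEXT)"
)
conn.executemany(
    "INSERT INTO properties (address, postcode) VALUES (?, ?)",
    [
        ("12 High Street", "AB1 2CD"),
        ("12 High Street", "AB1 2CD"),
        ("7 Mill Lane", "EF3 4GH"),
    ],
)

# Group on the fields that should identify a property and keep only
# the groups that occur more than once.
duplicates = conn.execute(
    """
    SELECT address, postcode, COUNT(*) AS copies
    FROM properties
    GROUP BY address, postcode
    HAVING COUNT(*) > 1
    """
).fetchall()
```

A query like this only catches values that match exactly, so addresses written slightly differently will not group together unless they are normalised first. That is presumably where the fiddling and manual editing come in.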
