473,320 Members | 2,107 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,320 software developers and data experts.

Reconcile two similar but unequal entries as equivalent

Government data from various federal, state, and local sources shows entries referring to the same company name, address, and GPS location are similar but not equal. Such as "GE" vs "General Electric" or "12 1st Street" vs "#12 First Street" etc.
I request hints or tools to successfully reconcile these to a single database entry without manually comparing many thousands of lines.
Thanks.
May 23 '17 #1
1 644
jforbes
1,107 Expert 1GB
The United States Postal Service has a Web-service available to validate an address. Using the Web-service, you could use it's results as a way to find similar addresses. This is something that I've wanted to do in-house for years, but it hasn't become a high enough priority with our data.

This is the link to the Web Tools API.
May 23 '17 #2

Sign in to post your reply or Sign up for a free account.

Similar topics

3
by: Greg Yasko | last post by:
Hi. Does anyone know if there's an equivalent of Perl's file::find module in Python? It traverses a directory. I've googled extensively and checked this newsgroup and can't find anything like it...
6
by: Raistlin | last post by:
Hey, for c++ does any body have a way to compare two arrays? Say one has the numbers 1-5000, the other has *most* of the numbers from 1-5000, is there a way to display the numbers that are in the...
1
by: Fabio | last post by:
Hello, does anyone know what could be wrong? There's a lot of similar entries in the SQL Server current log: 2004-01-21 17:54:00.01 spid57 Starting up database 'DB_NAME' 2004-01-21...
7
by: lkrubner | last post by:
The PHP scripting language has the array_unique() function that gets the unique, non-redundant values out of an array. Does Javascript have anything similar?
2
by: osmethod | last post by:
Hello, I'm stuck! I have received help before and progressed greatly from your(Access Group) answers. I need some again please. Thanks in advance for any suggestions. Problem: Take the...
1
by: .Net Sports | last post by:
when working with ASP, I was able to use control statement loops such as "While myvar.EOF" , "Do Until myvar.EOF" , in which I would loop thru a recordset until the particular field in a database...
48
by: Daniel Crespo | last post by:
Hi! I would like to know how can I do the PHP ternary operator/statement (... ? ... : ...) in Python... I want to something like: a = {'Huge': (quantity>90) ? True : False} Any...
46
by: lovecreatesbea... | last post by:
Do you prefer malloc or calloc? p = malloc(size); Which of the following two is right to get same storage same as the above call? p = calloc(1, size); p = calloc(size, 1);
6
by: Henrik Goldman | last post by:
Hello, I have a dataset which consist of a string username and string hostname as a key and then an integer representing a count as the matching "second" value in a pair. So far I've used...
4
sicarie
by: sicarie | last post by:
I'm querying two different systems by username/email and comparing the results, however I'm not incredibly familiar with python and don't know the best way to compare the dictionary entries. The...
0
by: DolphinDB | last post by:
Tired of spending countless mintues downsampling your data? Look no further! In this article, you’ll learn how to efficiently downsample 6.48 billion high-frequency records to 61 million...
0
by: ryjfgjl | last post by:
ExcelToDatabase: batch import excel into database automatically...
0
by: Vimpel783 | last post by:
Hello! Guys, I found this code on the Internet, but I need to modify it a little. It works well, the problem is this: Data is sent from only one cell, in this case B5, but it is necessary that data...
0
by: jfyes | last post by:
As a hardware engineer, after seeing that CEIWEI recently released a new tool for Modbus RTU Over TCP/UDP filtering and monitoring, I actively went to its official website to take a look. It turned...
0
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
0
by: Defcon1945 | last post by:
I'm trying to learn Python using Pycharm but import shutil doesn't work
1
by: Shællîpôpï 09 | last post by:
If u are using a keypad phone, how do u turn on JavaScript, to access features like WhatsApp, Facebook, Instagram....
0
by: af34tf | last post by:
Hi Guys, I have a domain whose name is BytesLimited.com, and I want to sell it. Does anyone know about platforms that allow me to list my domain in auction for free. Thank you
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.