By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
429,501 Members | 831 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 429,501 IT Pros & Developers. It's quick & easy.

Need help "weeding out" junk records

P: n/a
Greetings Data Gurus! I've been asked to help find and delete junk
records from a database table that comes from online data request
forms: you know, those annoying forms you have to fill out to get a
free download. Some visitors to the particular website fill out the
required form just to get the free download. Many of them enter
garbage data to avoid contact, and don't realize they won't be
contacted unless they specifically ask for it (via on-screen option).

My challenge is to catch all of their garbage. I can easily catch
certain keywords in the name fields, like "TEST", "Eval" and "Joe
Blow", but the phone number and zipcode fields are tricky, since they
have visitors from around the world. Much of this will yet be manual,
but I at least want to catch patterns, like 654321654, which is just
me dancing on the number keypad. Any clues on how to catch the pattern
I just created (654 appears twice, at the start AND end of the entry).
Any more suggestions on trash data cleanups are greatly appreciated.

Thanks
Nov 12 '05 #1
Share this Question
Share on Google+
1 Reply


P: n/a
I have made some progress in this. I've been able to create a function
to find patterns of repeated numbers in a phone field, and can pass it
the number of repeated characters to find. This enables me to find
many suspicious patterns, such as 111, 123, 789, and 1212121212. I
then insert this function, which returns an integer of 1 if a pattern
is found, into a query to help me identify the suspect records.
However, I can't seem to sort or filter on this column without getting
a data type error of some kind, and then no records are returned. I'll
have to keep working on it. I'll post it once perfected for the
benefit of others.

Has anyone else come up with suggestions?

jm***@yahoo.com (jmev7) wrote in message news:<c7**************************@posting.google. com>...
Greetings Data Gurus! I've been asked to help find and delete junk
records from a database table that comes from online data request
forms: you know, those annoying forms you have to fill out to get a
free download. Some visitors to the particular website fill out the
required form just to get the free download. Many of them enter
garbage data to avoid contact, and don't realize they won't be
contacted unless they specifically ask for it (via on-screen option).

My challenge is to catch all of their garbage. I can easily catch
certain keywords in the name fields, like "TEST", "Eval" and "Joe
Blow", but the phone number and zipcode fields are tricky, since they
have visitors from around the world. Much of this will yet be manual,
but I at least want to catch patterns, like 654321654, which is just
me dancing on the number keypad. Any clues on how to catch the pattern
I just created (654 appears twice, at the start AND end of the entry).
Any more suggestions on trash data cleanups are greatly appreciated.

Thanks

Nov 12 '05 #2

This discussion thread is closed

Replies have been disabled for this discussion.