473,855 Members | 2,141 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Tom Lane heads up

Just dropping a quick not for Tom Lane. I sent a personal message
today, but I wasn't sure if you'd get it after I remembered all of the
spam filters you've got set up.

Sorry for the off topic post.

---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html

Nov 23 '05
69 6432
On Mon, Apr 19, 2004 at 09:19:05PM -0700, Joe Conway wrote:
Marc G. Fournier wrote:
Huh? I just use Spamassassin myself, with Razor/Pyzor/DCC and Bayes all
enabled ...


I use exactly the same setup. But recently I've noticed that the
spammers are getting smarter -- I think 20% of it is slipping by the
filters. I'm going to need something better.


No offense, but that means you're not doing it right. I use SA with
Bayes (and everything else), and I'm getting better than 98% with no
false positives. Yesterday I had 823 spams (you read that correctly)
with 9 that made it through. When I woke up this morning, I had 334
spams with 2 that made it through.

I constantly train my Bayesian filter by using an email address I set
up where I forward all false-negatives. So the few that get through
won't be doing that again. It simply runs them through sa-learn. If I
get some time, I'll post the code to my web site.

Spammers cannot outsmart a Bayesian filter. It's game-over. You don't
need to upgrade, you need to figure out how to make your current setup
work.

Make sure you have the latest SA and make sure that Bayesian filtering
is turned on and working, and make sure to train the filter. Reply to
me offlist if you need a group of 5000 or so spams to help train it.

Michael
--
Michael Darrin Chaney
md******@michae lchaney.com
http://www.michaelchaney.com/

---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html

Nov 23 '05 #41
On Wed, 21 Apr 2004, Michael Chaney wrote:
On Tue, Apr 20, 2004 at 01:30:59PM -0300, Marc G. Fournier wrote:
Also check to make sure that you don't have autolearn disabled ... you
would have had to do it manually, as it is enabled by default, but, for
instance, if you are a user on a system, the site-wide may be set to
disable autolearn, so you'd have to enable it yourself ...

I'm looking forward to 3.x coming out, as the Bayes stuff will be able to
run out of an SQL database instead of flat files ... so servers running
Cyrus IMAPd, where there are no physical user accounts, will be able to
start makng use of Bayes as well ...


You should look into MailScanner, at www.mailscanner.info. I use it as
the framework for running SA and anti-virus software, using Exim as my
mail server. There are no physical user accounts; all virtual stuff.
MailScanner let's SA, along with the Bayesian filter, work for all email
coming through.


Does it allow for per user preferences? I haven't found a clean way to do
that yet, other using using the spamcheck.py lmtpproxy ...

----
Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
Email: sc*****@hub.org Yahoo!: yscrappy ICQ: 7615664

---------------------------(end of broadcast)---------------------------
TIP 6: Have you searched our list archives?

http://archives.postgresql.org

Nov 23 '05 #42
On Wed, 21 Apr 2004, Michael Chaney wrote:
On Tue, Apr 20, 2004 at 01:30:59PM -0300, Marc G. Fournier wrote:
Also check to make sure that you don't have autolearn disabled ... you
would have had to do it manually, as it is enabled by default, but, for
instance, if you are a user on a system, the site-wide may be set to
disable autolearn, so you'd have to enable it yourself ...

I'm looking forward to 3.x coming out, as the Bayes stuff will be able to
run out of an SQL database instead of flat files ... so servers running
Cyrus IMAPd, where there are no physical user accounts, will be able to
start makng use of Bayes as well ...


You should look into MailScanner, at www.mailscanner.info. I use it as
the framework for running SA and anti-virus software, using Exim as my
mail server. There are no physical user accounts; all virtual stuff.
MailScanner let's SA, along with the Bayesian filter, work for all email
coming through.


Does it allow for per user preferences? I haven't found a clean way to do
that yet, other using using the spamcheck.py lmtpproxy ...

----
Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
Email: sc*****@hub.org Yahoo!: yscrappy ICQ: 7615664

---------------------------(end of broadcast)---------------------------
TIP 6: Have you searched our list archives?

http://archives.postgresql.org

Nov 23 '05 #43
Michael Chaney wrote:
Make sure you have the latest SA and make sure that Bayesian filtering
is turned on and working, and make sure to train the filter. Reply to
me offlist if you need a group of 5000 or so spams to help train it.


I've got the latest SA and I'm using Bayesian filtering, autolearn,
razor2, dcc, and pyzor. I'm also using relays.ordb.org ,
sbl.spamhaus.or g, bl.spamcop.net, and blackholes.five-ten-sg.com
(although I just added that last one yesterday). I've verified that
autolearn is working. I have my threshold set downward, from the default
of 5.0, to 2.5.

I get a comparible amount of spam (~600 to 1000 per day) and my setup
*was* about 98% effective until a month or so ago. These days it is more
like 80%. I've noticed many of the spam getting through appears
specifically targeted at getting by SA -- no HTML, a paragraph of
nonsense (or sometimes out of some public domain book), and a one liner
trying to sell me a mortgage or something.

The one thing I had *not* been doing, but started to do as of last
night, is to use the false-negatives to explicitly train the Bayesian
filter. It was easy enough to set up. I created an hourly cron job as
follows:

/usr/bin/sa-learn --mbox --spam /path/to/false-neg.mbox

Now I just drop all false negatives into that mailbox, and clean them
out periodically. Hopefully that will make a significant improvement.

Joe

---------------------------(end of broadcast)---------------------------
TIP 8: explain analyze is your friend

Nov 23 '05 #44
Michael Chaney wrote:
Make sure you have the latest SA and make sure that Bayesian filtering
is turned on and working, and make sure to train the filter. Reply to
me offlist if you need a group of 5000 or so spams to help train it.


I've got the latest SA and I'm using Bayesian filtering, autolearn,
razor2, dcc, and pyzor. I'm also using relays.ordb.org ,
sbl.spamhaus.or g, bl.spamcop.net, and blackholes.five-ten-sg.com
(although I just added that last one yesterday). I've verified that
autolearn is working. I have my threshold set downward, from the default
of 5.0, to 2.5.

I get a comparible amount of spam (~600 to 1000 per day) and my setup
*was* about 98% effective until a month or so ago. These days it is more
like 80%. I've noticed many of the spam getting through appears
specifically targeted at getting by SA -- no HTML, a paragraph of
nonsense (or sometimes out of some public domain book), and a one liner
trying to sell me a mortgage or something.

The one thing I had *not* been doing, but started to do as of last
night, is to use the false-negatives to explicitly train the Bayesian
filter. It was easy enough to set up. I created an hourly cron job as
follows:

/usr/bin/sa-learn --mbox --spam /path/to/false-neg.mbox

Now I just drop all false negatives into that mailbox, and clean them
out periodically. Hopefully that will make a significant improvement.

Joe

---------------------------(end of broadcast)---------------------------
TIP 8: explain analyze is your friend

Nov 23 '05 #45
On Wed, Apr 21, 2004 at 02:11:16PM -0300, Marc G. Fournier wrote:
You should look into MailScanner, at www.mailscanner.info. I use it as
the framework for running SA and anti-virus software, using Exim as my
mail server. There are no physical user accounts; all virtual stuff.
MailScanner let's SA, along with the Bayesian filter, work for all email
coming through.


Does it allow for per user preferences? I haven't found a clean way to do
that yet, other using using the spamcheck.py lmtpproxy ...


Yes, MailScanner allows per-user and per-domain preferences.

Michael
--
Michael Darrin Chaney
md******@michae lchaney.com
http://www.michaelchaney.com/

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postg resql.org so that your
message can get through to the mailing list cleanly

Nov 23 '05 #46
On Wed, Apr 21, 2004 at 02:11:16PM -0300, Marc G. Fournier wrote:
You should look into MailScanner, at www.mailscanner.info. I use it as
the framework for running SA and anti-virus software, using Exim as my
mail server. There are no physical user accounts; all virtual stuff.
MailScanner let's SA, along with the Bayesian filter, work for all email
coming through.


Does it allow for per user preferences? I haven't found a clean way to do
that yet, other using using the spamcheck.py lmtpproxy ...


Yes, MailScanner allows per-user and per-domain preferences.

Michael
--
Michael Darrin Chaney
md******@michae lchaney.com
http://www.michaelchaney.com/

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postg resql.org so that your
message can get through to the mailing list cleanly

Nov 23 '05 #47
Joe Conway wrote:
I get a comparible amount of spam (~600 to 1000 per day) and my setup
*was* about 98% effective until a month or so ago. These days it is more
like 80%. I've noticed many of the spam getting through appears
specifically targeted at getting by SA -- no HTML, a paragraph of
nonsense (or sometimes out of some public domain book), and a one liner
trying to sell me a mortgage or something.

The one thing I had *not* been doing, but started to do as of last
night, is to use the false-negatives to explicitly train the Bayesian
filter. It was easy enough to set up. I created an hourly cron job as
follows:

/usr/bin/sa-learn --mbox --spam /path/to/false-neg.mbox

Now I just drop all false negatives into that mailbox, and clean them
out periodically. Hopefully that will make a significant improvement.


I can tell you it certainly will.

--
Bruce Momjian | http://candle.pha.pa.us
pg***@candle.ph a.pa.us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073

---------------------------(end of broadcast)---------------------------
TIP 4: Don't 'kill -9' the postmaster

Nov 23 '05 #48
Joe Conway wrote:
I get a comparible amount of spam (~600 to 1000 per day) and my setup
*was* about 98% effective until a month or so ago. These days it is more
like 80%. I've noticed many of the spam getting through appears
specifically targeted at getting by SA -- no HTML, a paragraph of
nonsense (or sometimes out of some public domain book), and a one liner
trying to sell me a mortgage or something.

The one thing I had *not* been doing, but started to do as of last
night, is to use the false-negatives to explicitly train the Bayesian
filter. It was easy enough to set up. I created an hourly cron job as
follows:

/usr/bin/sa-learn --mbox --spam /path/to/false-neg.mbox

Now I just drop all false negatives into that mailbox, and clean them
out periodically. Hopefully that will make a significant improvement.


I can tell you it certainly will.

--
Bruce Momjian | http://candle.pha.pa.us
pg***@candle.ph a.pa.us | (610) 359-1001
+ If your life is a hard drive, | 13 Roberts Road
+ Christ can be your backup. | Newtown Square, Pennsylvania 19073

---------------------------(end of broadcast)---------------------------
TIP 4: Don't 'kill -9' the postmaster

Nov 23 '05 #49
Bruce Momjian wrote:
Joe Conway wrote:
The one thing I had *not* been doing, but started to do as of last
night, is to use the false-negatives to explicitly train the Bayesian
filter. It was easy enough to set up. I created an hourly cron job as
follows:

/usr/bin/sa-learn --mbox --spam /path/to/false-neg.mbox

Now I just drop all false negatives into that mailbox, and clean them
out periodically. Hopefully that will make a significant improvement.


I can tell you it certainly will.


Doesn't sa-learn also require you to teach it Ham as well? My
problem has been that sa-learn appears to ignore white-listed emails
and therefore can't learn from 90% of my Ham. Meanwhile, I get spam
that slips through SA that my Mozilla client *correctly* identifies
as Junk. Once a week, I take that Junk email, along with all Ham and
run sa-learn with the appropriate --spam/--ham switch. But it
doesn't seem to be improving. I still get spam which SA fails to
identify but which, 95% of the time, Mozilla correctly identifies.

Mike Mascari

---------------------------(end of broadcast)---------------------------
TIP 1: subscribe and unsubscribe commands go to ma*******@postg resql.org

Nov 23 '05 #50

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

0
1057
by: Eric | last post by:
I've written a spiffy class to validate my xml. it loads up a schema into an xmlvalidatingreader and runs thru the xml, firing validationevents as it goes. Friend Sub ValidationEventHandle(ByVal sender As Object, ByVal args As ValidationEventArgs) _bIsValid = False _errordata.Add(_reader.Name & "|" & _sCurrentFilename & "|" &
0
1714
by: Darryl Kerkeslager | last post by:
Access 2002 on Win98 at least, not tested elsewhere: Bug: The ListCount property of an unbound listbox may be incorrect if you use column heads. Model: Set up a simple database with two simple tables (I used employer and person). Create a bound form based on the employer table, with a txtName textbox, a txtID textbox, and an unbound listbox to the employer table . Add a second unbound listbox for the person table, with column...
5
1619
by: Lyle Fairfield | last post by:
I messed with Windows XP SP2 and for a while did not get the installation quite right. It looked for some file (I think a remnant of a Trojan file that was left on Windows/System32 after Norton had zapped the Trojan) right near the end of the installation, couldn't find it, aborted and put the system back the way it found it, or so it said. But it seems it left some if its security. When I ran my Wininet functions SP2's security blocked the...
9
1705
by: Domel | last post by:
1 Jak zaokr±glić float'a tak żeby jeżeli czę¶ć dziesiętna była większa lub równa od 0.5 to był on zaokr±glany w górę, jeżeli mniejsza to w dół np 1.6 po zaokr 2.0 1.2 po zaokr 1 prosiłbym o ile to możliwe o jakie¶ sposoby bez doł±czania bibliotek, przy użyciu jakich¶ operacji 2. dla czego jeżeli wykonuję poniższ± operację
11
308
by: Ron Vecchi | last post by:
I've used System.Web.Mail before but have never had the need to send attchemnets through it...until now. A client of mine would like a form on the website to allow a user to type up a message and upload a file. I'm staying away from mailto links. So the file and message will be uploaded to the server when the user clicks send. The new file and message will be processed and emailed from the server to my client. I'm tring to get any...
2
1161
by: Gandalf | last post by:
Say I have persisted a DataSet in the ViewState of a page, and that when the page is posted back I make several references to that dataset like so: DataSet MyDataSet = (DataSet)ViewState; //C# Does this have to deserialize the dataset every time it is referenced... or is it only done once when the page is posted back? I'm just wondering if I should assign the DataSet to a local variable in the PageLoad method or if it's OK to...
9
1902
by: chadlupkes | last post by:
I have this code from someone else, and I'm trying to make heads or tails of it because IE doesn't like it. Can anyone help? Or does anyone have a better idea? /* parse the email to check for valid form */ function parseemail(str) { str = trim(str); <?if(preg_match("/MSIE 5.0;/", $_SERVER)) // this is IE 5.0
1
1154
by: koolaid82 | last post by:
Hey java heads. I am an actionscript developer. I have this code on a button that opens a new window a certain size etc. on (release) { getURL ("javascript:NewWindow=window.open('http://myurl.html','newWin','width=650,height=570,left=200,top=200,toolbar=No,location=No,scrollbars=Yes,status=No,resizable=Yes,fullscreen=No'); NewWindow.focus(); void(0);"); I was wondering if anyone knew how to change the script to open a page _self,...
0
2001
by: CajunCoiler | last post by:
Just a heads-up to let everyone know that the new version of CodeLoader has been posted for download. Now its up to 2.0.5 and ready to fly. Two years of revisions, and still free of cost or spyware. http://www.msbdatasystems.com/Downloads/Loads/codeloader.zip
0
9903
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look ! Part I. Meaning of...
0
10692
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth. The Art of Business Website Design Your website is...
1
10767
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
0
10375
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
0
9526
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
0
7084
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
0
5754
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
1
4567
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
3
3194
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.