Tom Lane heads up

Just dropping a quick note for Tom Lane. I sent a personal message
today, but I wasn't sure if you'd get it after I remembered all of the
spam filters you've got set up.

Sorry for the off topic post.

---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html

Nov 23 '05
At 10:08 AM 4/20/2004 +0200, Karel Zak wrote:

4. Very long list of procmail filters on header and body patterns.


It must be pretty difficult to maintain these header and body patterns
and the other lists. I had the same problem and I resolved it with
"spamassassin"; it can learn and it's simpler than procmailrc
coding. Now I have about 5% of all spam in my INBOX.


My spam:ham ratio is about 98:2 (98% spam), excluding mailing lists.

So far it's manageable though rather annoying - fortunately in my situation
I can regard as spam emails that are in html (or have HTML) and not in my
whitelist. That gets rid of about 50% of the spam, the other 40% or so get
filtered via another simple filter.
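
A minimal sketch of the rule described above, assuming a message counts as probable spam when it carries an HTML part and its sender is not on the whitelist. The function name and the sample addresses are hypothetical; only Python's standard `email` package is used.

```python
from email import message_from_string
from email.utils import parseaddr


def is_probable_spam(raw_message: str, whitelist: set[str]) -> bool:
    """Flag mail that carries HTML and comes from a non-whitelisted sender."""
    msg = message_from_string(raw_message)
    # parseaddr returns (display_name, address); compare addresses in lower case.
    sender = parseaddr(msg.get("From", ""))[1].lower()
    if sender in whitelist:
        return False
    # walk() visits the message itself and every MIME sub-part.
    return any(part.get_content_type() == "text/html" for part in msg.walk())


# Hypothetical sample messages for illustration only.
html_mail = (
    "From: Stranger <stranger@example.com>\n"
    "Content-Type: text/html\n"
    "\n"
    "<html><body>Buy now</body></html>\n"
)
plain_mail = "From: stranger@example.com\nContent-Type: text/plain\n\nHello\n"

print(is_probable_spam(html_mail, {"aunt@example.org"}))
print(is_probable_spam(plain_mail, {"aunt@example.org"}))
```

Anything flagged this way would go to a separate folder for occasional review rather than being deleted, which matches the habit described in this post.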

My situation: I don't really have to answer messages to my personal email
account from ignorant strangers that send me html email. Your situation may
be different.

So far I haven't seen any html emails that were really worth reading, even
the one or two from relatives (who I white-list to not be rude ;) ). I go
through that folder once in a while and it works for me - so far I don't
recall having HTML emails from strangers that weren't spam.

I've had plain text messages from silly strangers (and a silly colleague)
that used lots of !!!! and stupid subject lines - actual content barely
worth replying to. e.g. Help!!!!!

The situation is different at work, but the company pays for antispam software.
Ironically, while we sell Sophos PureMessage (which seems to be pretty
good), it's for larger companies/orgs than us (>1000 users). ;)

The backup MX thing is not very useful in most cases. It seems similar for DNS
- it doesn't appear that useful to have your names resolvable while your site
is unreachable. OK, the error messages may be slightly less embarrassing?

Regards,
Link.


Nov 23 '05 #61
Marc G. Fournier wrote:
On Mon, 19 Apr 2004, Joe Conway wrote:
Marc G. Fournier wrote:
Huh? I just use Spamassassin myself, with Razor/Pyzor/DCC and Bayes all
enabled ...


I use exactly the same setup. But recently I've noticed that the
spammers are getting smarter -- I think 20% of it is slipping by the
filters. I'm going to need something better.


Do you force-learn the spam that gets through the cracks? I get about 20
or 30 messages that slip through the cracks, which I process with
sa-learn nightly ...
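
A minimal sketch of how such a nightly training run might be put together. The helper name and the folder path are hypothetical; `--spam` and `--mbox` are real sa-learn options, but the exact flags vary between SpamAssassin versions (older releases may expect `--dir` for directories), so check the sa-learn man page for your install.

```python
import shlex


def sa_learn_command(missed_spam_path: str, is_mbox: bool = False) -> list[str]:
    """Build the argv for training the Bayes database on missed spam."""
    argv = ["sa-learn", "--spam"]
    if is_mbox:
        # --mbox tells sa-learn the target is an mbox file, not a directory.
        argv.append("--mbox")
    argv.append(missed_spam_path)
    return argv


# Hypothetical folder where missed spam is collected by hand.
cmd = sa_learn_command("/home/user/Maildir/.missed-spam/cur")
print(shlex.join(cmd))
# A nightly cron job could run this with subprocess.run(cmd, check=True).
```

Building the argument list separately from running it keeps the command easy to inspect before it is wired into cron.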


Sorry to drag this OT thread on even longer, but it seems to be a topic
many are interested in ;-)

I wanted to report back that after just 2 days of forced (supervised)
learning, the bayesian filter is now nailing about 99% of all spam.
*Many, many, thanks* for the suggestion.

But I wonder why the autolearn feature is so conservative? At this point
I'm getting lots of stuff like this:

X-Spam-Status: Yes, hits=5.8 required=2.5 tests=BAYES_99, HTML_FONT_BIG,
    HTML_MESSAGE autolearn=no version=2.63
X-Spam-Report:
    * 0.1 HTML_MESSAGE BODY: HTML included in message
    * 0.3 HTML_FONT_BIG BODY: HTML has a big font
    * 5.4 BAYES_99 BODY: Bayesian spam probability is 99 to 100%
    *     [score: 1.0000]

Notice that, even though I get a hit on BAYES_99, I still get
autolearn=no. Ah well, I guess I should be asking that question of the
SpamAssassin guys. Also notice that this sucker would have gotten
through with a score of only 0.4 had it not been for the bayesian filter.
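
The arithmetic behind that 0.4 can be checked with a short sketch that sums the per-rule scores from the report quoted above, with and without the Bayes rule. The parsing regex and function name are illustrative assumptions, not part of SpamAssassin.

```python
import re

# Report lines copied from the X-Spam-Report header in this post.
REPORT = """\
* 0.1 HTML_MESSAGE BODY: HTML included in message
* 0.3 HTML_FONT_BIG BODY: HTML has a big font
* 5.4 BAYES_99 BODY: Bayesian spam probability is 99 to 100%
* [score: 1.0000]
"""

# Matches lines such as "* 5.4 BAYES_99 BODY: ..." and captures score and rule name.
RULE_LINE = re.compile(r"^\*\s+(-?\d+(?:\.\d+)?)\s+([A-Z0-9_]+)\b")


def rule_scores(report: str) -> dict[str, float]:
    """Map each rule name in the report to its score; other lines are skipped."""
    scores = {}
    for line in report.splitlines():
        match = RULE_LINE.match(line)
        if match:
            scores[match.group(2)] = float(match.group(1))
    return scores


scores = rule_scores(REPORT)
total = round(sum(scores.values()), 1)
without_bayes = round(
    sum(v for name, v in scores.items() if not name.startswith("BAYES_")), 1
)
print(total, without_bayes)
```

With required=2.5, the total of 5.8 is tagged as spam, while the 0.4 left after dropping BAYES_99 would not be.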

Again, thanks.

Joe

Nov 23 '05 #62
Hi,

-- Joe Conway <ma**@joeconway.com> wrote:
I use exactly the same setup. But recently I've noticed that the spammers
are getting smarter -- I think 20% of it is slipping by the filters. I'm
going to need something better.


I recently rebuilt my Bayes database because it was corrupted; I fed it
about 1000 low-scoring spam messages, and now only about two spams slip by the
filter per day while 200 to 300 are caught.

Ciao
Alvar

--
** Alvar C.H. Freude -- http://alvar.a-blast.org/ -- http://odem.org/
** Berufsverbot? http://odem.org/aktuelles/staatsanwalt.de.html
** ODEM.org-Tour: http://tour.odem.org/
** 5 Jahre Blaster: http://www.a-blast.de/ | http://www.a-blast.de/statistik/

Nov 23 '05 #64
On Fri, 23 Apr 2004, Joe Conway wrote:
Marc G. Fournier wrote:
On Mon, 19 Apr 2004, Joe Conway wrote:
Marc G. Fournier wrote:
Huh? I just use Spamassassin myself, with Razor/Pyzor/DCC and Bayes all
enabled ...

I use exactly the same setup. But recently I've noticed that the
spammers are getting smarter -- I think 20% of it is slipping by the
filters. I'm going to need something better.


Do you force-learn the spam that gets through the cracks? I get about 20
or 30 messages that slip through the cracks, which I process with
sa-learn nightly ...


Sorry to drag this OT thread on even longer, but it seems to be a topic
many are interested in ;-)

I wanted to report back that after just 2 days of forced (supervised)
learning, the bayesian filter is now nailing about 99% of all spam.
*Many, many, thanks* for the suggestion.

But I wonder why the autolearn feature is so conservative? At this point
I'm getting lots of stuff like this:

X-Spam-Status: Yes, hits=5.8 required=2.5 tests=BAYES_99, HTML_FONT_BIG,
    HTML_MESSAGE autolearn=no version=2.63
X-Spam-Report:
    * 0.1 HTML_MESSAGE BODY: HTML included in message
    * 0.3 HTML_FONT_BIG BODY: HTML has a big font
    * 5.4 BAYES_99 BODY: Bayesian spam probability is 99 to 100%
    *     [score: 1.0000]

Notice that, even though I get a hit on BAYES_99, I still get
autolearn=no. Ah well, I guess I should be asking that question of the
SpamAssassin guys. Also notice that this sucker would have gotten
through with a score of only 0.4 had it not been for the bayesian filter.


BAYES_99 means that it's already been found in the Bayes filter, so why
would it autolearn it once more? :)
----
Marc G. Fournier Hub.Org Networking Services (http://www.hub.org)
Email: sc*****@hub.org Yahoo!: yscrappy ICQ: 7615664


Nov 23 '05 #66
Marc G. Fournier wrote:
BAYES_99 means that it's already been found in the Bayes filter, so why
would it autolearn it once more? :)


To add more spam words to its vocabulary of course. Learning works both
ways...
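
One possible explanation for the autolearn=no seen earlier in the thread, offered as an assumption rather than a confirmed diagnosis: as far as I understand the SpamAssassin documentation, the autolearn decision leaves the Bayes rules' own scores out of the total and compares what remains with a separate threshold (`bayes_auto_learn_threshold_spam`, documented default 12.0 in later releases; 2.63 may differ). The sketch below is a simplification with a hypothetical function name, and it omits the additional requirement of a minimum number of header and body points.

```python
# Assumed default; SpamAssassin documents 12.0 for bayes_auto_learn_threshold_spam.
AUTOLEARN_SPAM_THRESHOLD = 12.0


def autolearn_as_spam(
    scores: dict[str, float], threshold: float = AUTOLEARN_SPAM_THRESHOLD
) -> bool:
    """Simplified: ignore Bayes rules, then compare the rest with the threshold."""
    non_bayes = sum(
        value for name, value in scores.items() if not name.startswith("BAYES_")
    )
    return non_bayes >= threshold


# Rule scores taken from the X-Spam-Report quoted earlier in the thread.
joes_message = {"HTML_MESSAGE": 0.1, "HTML_FONT_BIG": 0.3, "BAYES_99": 5.4}
print(autolearn_as_spam(joes_message))
```

Under these assumptions the message scores only 0.4 for autolearn purposes, so it would not be learned automatically no matter how confident the Bayes classifier already is.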

Greg


Nov 23 '05 #68
On Tue, Apr 20, 2004 at 01:06:18AM -0400, Tom Lane wrote:
3. I have noticed that bouncing any machine that sends "HELO
sss.pgh.pa.us" gets rid of a ton of spam and viruses. I don't know of
any real clean way to do this, but I have a sendmail.cf hack for it.

By the way, thanks very much for this tip. Almost in one hit, it made
many of our spam and virus filters redundant. Very nice on the load.
I'd noticed that some Perl mail modules appear to get this wrong, but it
efficiently catches our customers sending viruses and spam through our
relay too.

I'm using Exim 3, so I can only pick this up after the mail has been
received, but with Exim 4 I should be able to kill the email at the SMTP
stage.
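
A minimal sketch of the check behind this tip, assuming the test is simply whether a remote client's HELO argument equals one of the receiving server's own hostnames. The function name is hypothetical, and this is not the sendmail.cf or Exim configuration itself, only the logic it would implement.

```python
def helo_claims_our_name(helo_argument: str, our_names: set[str]) -> bool:
    """True if a remote client introduces itself with one of our own hostnames."""
    # Hostnames are case-insensitive; a trailing dot marks a fully qualified name.
    name = helo_argument.strip().rstrip(".").lower()
    return name in our_names


OUR_NAMES = {"sss.pgh.pa.us"}  # the host named in the quoted tip

print(helo_claims_our_name("SSS.PGH.PA.US", OUR_NAMES))
print(helo_claims_our_name("mail.example.net", OUR_NAMES))
```

A real deployment would probably exempt connections from the local machine and trusted relays, since legitimate local software can also announce the server's own name.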

--
Martijn van Oosterhout <kl*****@svana.org> http://svana.org/kleptog/
Patent. n. Genius is 5% inspiration and 95% perspiration. A patent is a
tool for doing 5% of the work and then sitting around waiting for someone
else to do the other 95% so you can sue them.


Nov 23 '05 #70
