473,382 Members | 1,692 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,382 software developers and data experts.

Spam avoidance

I've noticed that there is little to no spam in comp.lang.python and
am wondering how this is accomplished. Is there a moderator who
actively cancels spam? If so, that wouldn't seem to prevent spam from
making it through to the mailing list version of the newsgroup. Is
there an exceptionally good spam filter in place? If so, I haven't
previously seen one that works so well. Have spammers just given up
on Usenet, figuring that there are more exciting places to spam than
the Big 8 dinosaur?

The reason that I ask is that I help to run another newsgroup
(rec.music.gaffa) and we avoid spam by requiring that every
participant who wants to post has to register on a Mailman. But I
think that this approach has the seriously detrimental effect of
discouraging new participants, so I've been wondering about
alternative approaches.

|>oug
Mar 21 '06 #1
4 1403

Doug> I've noticed that there is little to no spam in comp.lang.python
Doug> and am wondering how this is accomplished.

Most mailing lists which originate on mail.python.org have SpamBayes
filtering in front of them. The one notable exception are the two
SpamBayes-related mailing lists.

Skip
Mar 21 '06 #2
[Douglas Alan]
I've noticed that there is little to no spam in comp.lang.python
and am wondering how this is accomplished.

[Skip Montanaro] Most mailing lists which originate on mail.python.org have SpamBayes
filtering in front of them.
Worth noting that the SpamBayes project started specifically because
Barry Warsaw tricked me ;-) into thinking about a way to do spam
filitering for GNU Mailman lists. My original "ham" test data was in
fact a year's worth of comp.lang.python postings. The algorithms work
extremely well for a focused tech mailing list (I don't know how they
work on other kinds of lists, because those weren't tested).

BTW, python.org uses other gimmicks too, right? For example, I think
Greg Ward set up some other gimmicks to weed out obvious viruses.
The one notable exception are the two SpamBayes-related mailing
lists.


Yup, and they do get their fair share of spam.
Mar 21 '06 #3

"Douglas Alan" <ne****@mit.edu> wrote in message
news:m2************@lexx.mit.edu...
I've noticed that there is little to no spam in comp.lang.python and
am wondering how this is accomplished. Is there a moderator who
actively cancels spam?
For the newsgroup, as opposed to the maillist, there is no moderators.
There may be readers who cancel spam.
If so, that wouldn't seem to prevent spam from
making it through to the mailing list version of the newsgroup. Is
there an exceptionally good spam filter in place? If so, I haven't
previously seen one that works so well. Have spammers just given up
on Usenet, figuring that there are more exciting places to spam than
the Big 8 dinosaur?
People who spam the 'dinosaur' groups tend to lose their newsgroup
accounts. News sites that cater to spammers and refuse to cancel can be
blacklisted and disconnected from the main body of Usenet. (I know this
was done in the 90s, and presume threat still exists.)
The reason that I ask is that I help to run another newsgroup
(rec.music.gaffa) and we avoid spam by requiring that every
participant who wants to post has to register on a Mailman. But I
think that this approach has the seriously detrimental effect of
discouraging new participants, so I've been wondering about
alternative approaches.


news.gmane.net, which turns technical mailing lists into free-access
newsgroups, sends a note to first time posters (per list, I believe) and
requires a response before sending the message onward. Was no problem for
me, as I appreciate the filtering.

Terry Jan Reedy

Mar 22 '06 #4
Tim Peters <ti********@gmail.com> wrote:
[Douglas Alan]
I've noticed that there is little to no spam in comp.lang.python
and am wondering how this is accomplished.


[Skip Montanaro]
Most mailing lists which originate on mail.python.org have SpamBayes
filtering in front of them.


BTW, python.org uses other gimmicks too, right? For example, I think
Greg Ward set up some other gimmicks to weed out obvious viruses.


I'm mostly the guilty party at the moment. Incoming mail on
mail.python.org goes through an SMTP server implemented in Python.
The server uses SpamBayes to filter spam. We disallow attachments
with executable filenames (e.g. .scr). That kills almost all virus
mail. We use a number of realtime blackhole lists; they also block
quite a lot of virus junk and some spam. There is a set of manually
maintained message patterns; those kill some annoying junk that's
hard to block in other ways. We do greylisting (two different
kinds, actually). Some IP addresses get blackholed using iptables
(e.g. zombie machines blasting out virus junk). If SpamBayes is
unsure about a message to a list then it gets held for moderation.

I suspect there are people working behind the scenes to cleanup the
NNTP feed. The short answer to Douglas's question: good tools and a
fair amount of elbow grease. :-)

Neil
Mar 22 '06 #5

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

7
by: Simon Burton | last post by:
This was so easy :) Simon Burton. #!/usr/bin/env python import sys from time import sleep from poplib import *
5
by: ThisIsNotMe | last post by:
Hi All, I am EXTREMELY desperate. I have a website. (asp.net2 with c#) User registers, their account is created in a locked state. They receive an email message to confirm that their email...
2
abdoelmasry
by: abdoelmasry | last post by:
Hi men i have big problem with microsoft exchange server 2003 i installed exchange server as main mail server to use it on in company i have static real ip address im sending and...
11
by: arturklis2 | last post by:
Best off http://fender-guitars-review.blogspot.com/
2
by: sf | last post by:
Hundsome Money Online Guaranteed payment month after month · Work part time or full time as you like . Make quick good cash working 2 to 3 hours a day . Work anywhere from home/ office without...
1
by: muammar_y | last post by:
Hello everyone, Lately, I'm concerned about internet protocols and had this big question of how such proxy avoidance websites work?? Can I create my own?? How?? Any good articles/hints would be...
5
by: Gordon | last post by:
I'm working on a reset password script for my CMS, that will generate a random password and email it to a user when they request one. The problem I am having is that the mails being sent out are...
0
by: skip | last post by:
RonI think most of us are annoyed by the recent SPAM messages that Roncrept onto our list. I'd like to suggest a possible solution, and Ronmaybe start a thread that eventually will rid us of this...
0
by: D'Arcy J.M. Cain | last post by:
On Thu, 6 Nov 2008 07:32:26 -0600 skip@pobox.com wrote: In fact my system successfully filtered out the original spam so that was not my problem. I am sure that we all appreciate your...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.