|
Spam Filtering Methods
| Filtering
Technique |
Description |
MIS
Service |
Collaborative voting |
Users vote on which messages are Spam |
MISgroupware |
Gateway interception |
Gateway device intercepts Spam before
it reaches the desktops |
MISemailserver
MISemailprotector
MISemailhost
MISgroupware |
Heuristic |
Filter scans email for junk mail tip-off
terms |
MISemailserver
MISemailprotector
MISemailhost |
Bayesian |
Filtering that learns from users definition
of Spam |
MISgroupware |
Block and Allow
Lists |
Filter blocks or allows messages based
on senders' being included in a predefined list |
MISemailserver
MISemailprotector
MISemailhost
MISgroupware |
MISemailserver, MISemailprotector,
and MISemailhost spam-identification tactics used include:
header analysis: spammers use a number of
tricks to mask their identities, fool you into thinking they've
sent a valid mail, or fool you into thinking you must have subscribed
at some stage.
text analysis: spam mails often have a characteristic
style and some characteristic disclaimers and CYA text.
Many rules are used to grade the header and text. 
blacklists: supports many useful existing
blacklists, such as mail-abuse.org, ordb.org or others. Additional
fees may apply.
Razor: Vipul's
Razor is a collaborative spam-tracking database, which works
by taking a signature of spam messages. Since spam typically
operates by sending an identical message to hundreds of people,
Razor short-circuits this by allowing the first person to receive
a spam to add it to the database -- at which point everyone
else will automatically block it. Additional fees may apply.
Once identified, the mail can then be optionally tagged as
spam for later filtering using the user's own mail user-agent
application.
MISgroupware adds the following
anti-spam tactics:
Black Lists: supports multiple DNSBL services
to check for blacklisted senders, known open relays, servers
in dialup IP ranges, etc.
White Lists:· per-user and shared whitelists
to ensure messages from particular addresses or domains are
never classified as spam
All user's contacts are automatically whitelisted.
Advanced Bayesian: filtering to classify messages
based on their text content. The filtering engine can be easily
trained by the users to improve its accuracy. Learning is as
easy as clicking on 'This is spam' and 'This is not spam' links.
User-defined spam probability threshold for spam classification
Spam folder: to quarantine messages
Daily Spam Report: to notify every user of
how many messages were classified as spam each day (can be disabled)
|