Spam Links

Spam Filtering Research

There is a lot more to filtering spam than simply blocking IP addresses or separating out email with the word “viagra” present.

Spam Filtering Mechanisms

Specific descriptions and examples on how to filter spam on a server level.

Top Index

Spam Filtering Techniques

Spam Filtering for Mail Exchangers - tldp.org/HOWTO/Spam-Filtering-for-MX/
Filtering Standards Anti Spam Research Group (ASRG) Subgroup - asrg.sp.am/subgroups/filtering.shtml
Server Index Query (SIQ) (draft) - www.milter.info/sendmail/milter-siq/
Second-Generation Anti-Spam Solutions - overcomeemailoverload.com/advice/AntiSpamTools.html
Spam filtering techniques - www.ibm.com/developerworks/linux/library/l-spamf.html?t=gr,lnxw15=SFT
E-Mail Spamming countermeasures - www.ciac.org/ciac/bulletins/i-005c.shtml
How to effectively block spam and junk mail - www.redearthsoftware.com/spam-filter-article.htm
Reverse Spam Filtering - www.ii.com/internet/messaging/spam/
Bloqueando, Filtrando - www.absoluta.org/seguranca/seg_spam.htm - Portugese
Filtering Unsolicited E-mail - ist.uwaterloo.ca/security/howto/2000-09-27/
Anti-Spam Methods & Checks - www.pivotalveracity.com/NewsRes/AntiSpam.php
Technical approaches to spam - www.taugh.com/spamtech.pdf
Technologies to Combat Spam - www.sans.org/rr/whitepapers/email/1130.php
URL filtering - www.sophos.com/pressoffice/news/articles/2004/02/sa_cutsspam.html
Anti-Spam Solutions and Security - www.securityfocus.com/infocus/1763
Stopping Email Abuse - en.wikipedia.org/wiki/Anti-spam_techniques_(e-mail)
SpamGuru Overview - www.research.ibm.com/spam/papers/spamguru-overview.pdf
A Multifaceted Approach to Spam Reduction - www.research.ibm.com/spam/papers/multifaceted-approach.pdf
Tutorial on Junk E-mail Filtering - research.microsoft.com/en-us/um/people/joshuago/ICMLTutorialAnnounce.htm
Effective Filtering - www.spamhaus.org/effective_filtering.html
Technical Standards for E-mail Delivery - postmaster.aol.com/guidelines/standards.html
Technical and Blocklist Restrictions - www.tuffmail.com/mx-restrictions.php
Mail Filtering - www.acme.com/mail_filtering/
MX+ - mxplus.org/
The Effect of Filters on Spam Mail - www.kellogg.northwestern.edu/research/math/papers/1402.pdf
What is Anti-Spam? - www.circleid.com/posts/what_is_anti_spam/
How this system filters mail - www.sput.nl/spam/filter-mail.html
Anti-Spam Technologies - www.oecd-antispam.org/article.php3?id_article=241
Computer Tyme Spam and Virus Filter: How it Works! - www.junkemailfilter.com/spam/how_it_works.html
Understanding the Network-Level Behavior of Spammers - www.nanog.org/meetings/nanog37/abstracts.php?pt=MzYxJm5hbm9nMzc=&nm=nanog37
Separating Wheat from the Chaff: A Deployable Approach to Counter Spam - www.cs.indiana.edu/~minaxi/pubs/sruti06.pdf
Kaboom — email filtering II - www.cyberdelix.net/tech/kaboom.htm
Filtering Spam At Your Leisure: post delivery filtering - www.uoregon.edu/~joe/maawg7/maawg7.ppt
FortiGuard AntiSpam Technology Overview - www.fortiguardcenter.com/antispam/antispam_info.html#spamtech
e-scribe Antispam: Technical Details - e-scribe.com/antispam/
“Default Deny” — A Paradigm Shift for E-Mail - matthias.leisi.net/
Spam Control: The Current Landscape - www.ferris.com/2007/01/02/the-commodity-status-of-spam-control/
The minimum antispam features of a modern SMTP server - utcc.utoronto.ca/~cks/space/blog/spam/MinimumSMTPFeatures
How PerfectMail works - www.xpmsoftware.com/index.php/xpm/howItWorks
Keeping Spam Out of the Network - www.avertlabs.com/research/blog/?p=194
Validating the sender domain - www.avertlabs.com/research/blog/?p=241
Email Relay Detection - mel.byu.edu/spam/
Sieve: A Mail Filtering Language - www.faqs.org/rfcs/rfc3028.html
SIEVE Email Filtering: Spamtest and VirusTest Extensions - www.faqs.org/rfcs/rfc3685.html
Understanding the Network Level Behavior of Spammers - www.nanog.org/meetings/nanog37/abstracts.php?pt=MzYxJm5hbm9nMzc=&nm=nanog37
Anti-Spam: The MagicMail Philosophy - www.linuxmagic.com/opensource/anti_spam/philosophy
Fake MX - www.fakemx.org/
High Speed Image Part Recognition (IPR) - www.comdomsoft.com/en/antispam/white-papers/high-speed-image-part-recognition-ipr.html
Sendmail Best Practices for Combating Spam - www.sendmail.com/sm/wp/spam_best_practices/
Proofpoint MLX Technology Whitepaper - www.proofpoint.com/id/mlxwp/
Fighting Back Against the Spam-Zombie Hordes - research.microsoft.com/en-us/news/features/SpamFighting.aspx
How Dynamic Are IP Addresses? - research.microsoft.com/apps/pubs/default.aspx?id=63680
Spamming Botnets: Signatures and Characteristics - research.microsoft.com/apps/pubs/default.aspx?id=63701
Gmail's spam-fighting technology - www.google.com/mail/help/fightspam/getstarted.html
SpamCompiler System - www.mailshell.com/mail/client/oem2.html/step/howitworks
Spam Challenge 2008: IBM ISS Spam Filtering Technology - www.ceas.cc/2008/papers/iffert.pdf
Anti-spam and spam filtering techniques - www.allspammedup.com/anti-spam/
Improving Image Spam Filtering Using Image Text Features - www.ceas.cc/2008/papers/ceas2008-paper-29.pdf
Empirical research on IP blacklisting - www.ceas.cc/2008/papers/ceas2008-paper-55.pdf
Detecting Known and New Salting Tricks in Unwanted Emails - www.ceas.cc/2008/papers/ceas2008-paper-48.pdf
The antispam accuracy of sender verification - blogs.msdn.com/tzink/archive/2008/11/18/the-antispam-accuracy-of-sender-verification.aspx
Some cool techniques for image filtering - blogs.msdn.com/tzink/archive/2008/11/16/some-cool-techniques-for-image-filtering.aspx
RepuScore - isr.uncc.edu/repuscore/
SMTP DNS authorization - www.crynwr.com/spam/smtp-dns-authorization.html
Can DNS-Based Blacklists Keep Up With Bots? - www.cc.gatech.edu/grads/a/avr/publications/ramachandran-ceas06.pdf
Revealing Botnet Membership Using DNSBL Counter-Intelligence - www.cc.gatech.edu/grads/a/avr/publications/ramachandran-sruti06.pdf
Filtering Spam with Behavioral Blacklisting - www.cc.gatech.edu/grads/a/avr/publications/ccs07.pdf
Anti-Spam: The MagicMail Philosophy - www.linuxmagic.com/opensource/anti_spam/philosophy
Ask Al: Checking email addresses against URIBLs? - www.spamresource.com/2009/07/ask-al-checking-email-addresses-against.html
Effective Spam Filtering - www.spamhaus.org/whitepapers/effective_filtering.html

Top of Section Top Index

Spam Filtering Case Studies

Local Mail Blocking Mechanisms - www.er6.eng.ohio-state.edu/mail_blocking.html
Anti-Spam Mechanisms on our Mail Servers - www.ultradesign.com/support/email/spamfilters.html
Mails rejected by anti-spam rules - web.ccr.jussieu.fr/anti-spam/rejet/rejet.html#english
Spam Filtering in a Small Business Environment, a Case Study - www.sans.org/rr/whitepapers/email/1213.php
Controlling Spam in a Small Business - www.sans.org/rr/whitepapers/email/1248.php
How to filter unsolicited e-mail on your mail server - www.sans.org/rr/whitepapers/email/582.php
Anti spam software – Spam Filter vs. Spam Block - spameater.com/anti-spam-software.html
Spam server details - gconnor.livejournal.com/97154.html
Spam Filtering Survey - ist-socrates.berkeley.edu:7309/public/spam_survey.html
A Study of Supervised Spam Detection - plg.uwaterloo.ca/~gvcormac/spamcormack.html
Review of Gordon Cormack's Study of Spam Detection - www.zdziarski.com/papers/cormack.html
Comparing SpamAssassin with CBDF email filtering - www.cs.bham.ac.uk/~mgl/cluk/papers/obrian.pdf
More on Spam - www.cookco.us/more_on_spam.htm
Traveler, a Spam-resistant E-mail System - www.vsta.org/spam/Traveler.html
Spam filtering best practice and how we filter spam - www.antespam.co.uk/how-we-filter-spam/
Spam Fighting at CERN - mmmtf.web.cern.ch/mmmtf/Minutes/2003-02-18/spamkiller.pdf
CERN AntiSpam Server side - https://websvc06.cern.ch/mmmservices/Antispam/ActionServer.aspx
Deployment Experience: Rolling Out a New Antispam Solution in a Large Corporation - www.ceas.cc/2006/2.pdf
Solving big problems with Open Source: e-mail - www.potentialtech.com/wmoran/spam.pdf
Email filtering with MIMEDefang - www.xs4all.nl/~johnpc/mimedefang-modular/yapceu2005.pdf
Using PostFix To Reject Spam - honeypot.net/filtering-spam-postfix
Greylisting, SpamAssassin, SpamProbe, Image Spam, DNSWL, and Viruses - www.chaosreigns.com/spam/
Fighting Spam in an ISP Environment - www.roaringpenguin.com/files/isp-spam.pdf
ASSP — extracting the ham from the spam - www.uniforum.chi.il.us/slides/assp.ppt - mirrors: 1
pi's Bogofilter page - piology.org/bogofilter/
Yahoo, Gmail and Spamcop Email & Spam Filtering Stats, 2006 - daggle.com/email-spam-filtering-stats-jan-12-2006-50
A Last Go At Spam Filtering Before Whitelisting - daggle.com/a-last-go-at-spam-filtering-before-whitelisting-56
Stats Say: Sticking With Gmail! - daggle.com/stats-say-sticking-with-gmail-58
How I Filter Spam - www.multicians.org/thvv/spamfilt.html
Postini: Google's take on e-mail security - news.cnet.com/8301-1009_3-10276548-83.html

Top of Section Top Index

Spam Filtering References

Internet Standards

Top of Section Top Index

Verisign's SiteFinder and Spam Filtering

Verisign's Wildcard Service Deployment - www.icann.org/en/general/wildcard-history.htm

Top of Section Top Index

Email and Spam Research Groups

IBM Anti-Spam Research - domino.research.ibm.com/comm/research_projects.nsf/pages/spam.index.html
DoI: Denial of Information - www.cc.gatech.edu/projects/doi/
Collaborative Center for Internet Epidemiology and Defenses (CCIED) - ccied.sysnet.ucsd.edu/
Max-Planck-Institut Informatik Machine Learning Group - www.mpi-inf.mpg.de/departments/rg2/
Microsoft Research Machine Learning and Applied Statistics (MLAS) group - research.microsoft.com/en-us/groups/mlas/
Microsoft Research S-GPS: Spammer Global Positioning System - research.microsoft.com/en-us/projects/s-gps/
Cloudmark Research Group - www.cloudmark.com/research/
Verisign Security Research Anti-spam Schema - www.verisign.com/research/Security_Research/037091.html
Knowledge Discovery and Data Mining Laboratory at UAB - www.cis.uab.edu/kddm/ - papers
Computer Forensics Research at UAB - www.cis.uab.edu/forensics/
Alek Kolcz et. al. - ir.iit.edu/~alek/publications.html
Hanoi University Anti-spam group - fit.hanu.edu.vn/antispam/research.html
Georgia Tech Networking Group - www.cc.gatech.edu/~feamster/

Top of Section Top Index

Spammer Techniques

The Spammers' Compendium - www.jgc.org/tsc/
Observed Trends in Spam Construction Techniques: A Case Study of Spam Evolution - www.ceas.cc/2006/4.pdf
Spammer Tricks - www.rickconner.net/spamweb/tricks.html
How to spot a spam website - www.rickconner.net/spamweb/spamwebsites.html
Tricks for protecting spam websites - www.rickconner.net/spamweb/web-dns-tricks.html
Spammer Tricks - gregsearle.tripod.com/spam_tech.html
The Effects of AntiSpam Methods on Spam Mail - www.ceas.cc/2006/24.pdf
Spam Techniques - st.do.homeunix.org/
A day in the life of a spammer - matthias.leisi.net/archives/126-A-day-in-the-life-of-a-spammer.html
Pathological Study of Junk Mails - junkmatcher.sourceforge.net/Pathology/
Round robin DNS - www.spamtrackers.eu/wiki/index.php?title=Round_robin
Pharmacy Alert Security Team - pharmalert.zoomshare.com/
Image spam by the numbers - www.csoonline.com/article/221254/Image_Spam_By_the_Numbers
ISP Spam Issues - www.spamhaus.org/faq/answers.lasso?section=ISP%20Spam%20Issues
Host cloaking technique used by spammers - thespamdiaries.blogspot.com/2006/02/new-host-cloaking-technique-used-by.html
Know Your Enemy: Fast-Flux Service Networks - www.honeynet.org/papers/ff/
Spamscatter: Characterizing Internet Scam Hosting Infrastructure - www.cs.ucsd.edu/~voelker/pubs/spamscatter-security07.pdf
Anatomy of Spam - anatomyofspam.spaces.live.com/ RSS
Do Zebras get more Spam than Aardvarks? - www.cl.cam.ac.uk/~rnc1/aardvark.pdf
A Survey of Modern Spam Tools - www.ceas.cc/2008/papers/ceas2008-paper-35.pdf
An Empirical Analysis of Spam Marketing Conversion - www.icsi.berkeley.edu/pubs/networking/2008-ccs-spamalytics.pdf
A Campaign-based Characterization of Spamming Strategies - www.ceas.cc/2008/papers/ceas2008-paper-45.pdf
ASCII: An artful way around spam filters - news.cnet.com/8301-1023_3-10025917-93.html
Bayesian Spam Filter Poisoning With RSS - robert.accettura.com/blog/2007/01/29/bayesian-spam-filter-poisoning-with-rss/
Understanding the Network-Level Behavior of Spammers - gatech.academia.edu/AnirudhVadakkedathRamachandran/Papers/11974/Understanding-the-network-level-behavior-of-spammers
Spammer Economy and Infrastructure - www.spamtrackers.eu/wiki/index.php?title=Spammer_Economy_and_Infrastructure
Evil Searching - www.lightbluetouchpaper.org/2009/02/25/evil-searching/
Spam in my Calendar? - www.avertlabs.com/research/blog/index.php/2008/05/07/spam-in-my-calendar/
Google Docs used in latest spam run - itknowledgeexchange.techtarget.com/security-bytes/google-docs-used-in-latest-spam-run/
Most Spam Sites Tied to a Handful of Registrars - blog.washingtonpost.com/securityfix/2008/05/most_spam_sites_tied_to_a_hand_1.html
Registrar research by Knujon - www.knujon.com/registrars/
Anonymous Domain Sales: A Spammer's Delight - blog.washingtonpost.com/securityfix/2008/06/anonymous_domain_sales_a_spamm_1.html
Spam Crisis in China - garwarner.blogspot.com/2009/06/spam-crisis-in-china.html
Report: botnets sent over 80% of all June spam - arstechnica.com/security/news/2009/06/report-botnets-send-over-80-of-all-spam-in-june.ars
Spammers Shorten Their URLs - bits.blogs.nytimes.com/2009/07/07/spammers-shorten-their-urls/
RTF file conceals spam - blog.trendmicro.com/rtf-file-conceals-spam/

Top of Section Top Index

Spam Filtering Benchmarks and Reviews

Testing the effectiveness of spam filtering.

Top Index

Spam Filter Benchmarking and Testing

VeriTest Anti-Spam Benchmark Service - www.lionbridge.com/lionbridge/en-US/services/software-product-engineering/testing-veritest.htm
TREC Spam Filter Evaluation Tool Kit - plg.uwaterloo.ca/~gvcormac/jig/
Discovery Challenge - www.ecmlpkdd2006.org/challenge.html
Generic Test for Unsolicited Bulk Email (GTUBE) - spamassassin.apache.org/gtube/
Spirent Avalanche - www.spirent.com/
Global-scale Anti-spam Testing in Your Own Back Yard - www.ceas.cc/2008/papers/ceas2008-paper-03.pdf
A Mail Client Plugin for Privacy-Preserving Spam Filter Evaluation - www.ceas.cc/2008/papers/ceas2008-paper-52.pdf
The nuances of measuring spam effectiveness - blogs.msdn.com/tzink/archive/2009/04/23/the-nuances-of-measuring-spam-effectiveness.aspx
The nuances of measuring spam effectiveness, part 2 - blogs.msdn.com/tzink/archive/2009/04/27/the-nuances-of-measuring-spam-effectiveness-part-2.aspx

Top of Section Top Index

Spam Filter Reviews

Anti-spam Tool League Table - www.jgc.org/astlt/
Security appliances keep mail stream clean - gcn.com/articles/2005/03/30/security-appliances-keep-mail-stream-clean.aspx
Spam Filters - freshmeat.net/articles/spam-filters
SC Magazine antispam - www.scmagazineus.com/spam-techniques/topic/98/0/
SC Magazine content security awards - www.scmagazineuk.com/Awards/section/341/
SC Magazine UK Email Security Group Test 2008 - www.scmagazineuk.com/Email-security-2008/GroupTest/131/
SC Magazine UK Email Content Management Group Test 2008 - www.scmagazineuk.com/Email-content-management-2008/GroupTest/129/
SC Magazine UK Email Content Filtering Group Test 2007 - www.scmagazineuk.com/Email-Content-Filtering-2007/GroupTest/93/
SC Magazine US Email Content Filtering 2007 - www.scmagazineus.com/Email-Content-Filtering-2007/GroupTest/47/
SC Magazine UK Email Content Filtering Group Test 2006 - www.scmagazineuk.com/Email-content-filtering-2006/GroupTest/86/
SC Magazine UK Anti spam Group Test 2006 - www.scmagazineuk.com/Anti-spam-2006/GroupTest/79/
Bayesian Filtering, a review - freshmeat.net/articles/spam-filters
Spam Filter Reviews - spam-filter-review.toptenreviews.com/
Winning the War on spam: Comparison of Bayesian spam filters - home.dataparty.no/kristian/reviews/bayesian/
WhichSpamFilter - www.whichspamfilter.com/
PCMAG antispam software - www.pcmag.com/category2/0,1874,4795,00.asp
Spam Filtering II - sam.holden.id.au/writings/spam2/
Four Cans of Anti-Spam - sartryck.idg.se/Art/Antispamboxar_1_NOK182005e.html
DNS Blocklist Accuracy Figures (as of July 2005) - wiki.apache.org/SpamAssassin/DnsblAccuracy082005
Enterprise Spam Filters Review - www.networkcomputing.com/showArticle.jhtml?articleId=173602950
Connection scoring beats spam filtering - windowssecrets.com/comp/060126/#story1
Wait a minute Mr. Postman! - gcn.com/articles/2006/05/31/wait-a-minute-mr-postman.aspx
Anti-Spam State of the Art - spam.ani.univie.ac.at/files/FA384018-1.pdf
Email Classification - www.massey.ac.nz/~tameyer/research/spambayes/
Network World anti-spam buyer's guide - www.networkworld.com/buyersguides/guide.php?cat=865463
Network World: Spam in the Wild, The Sequel - www.networkworld.com/reviews/2004/122004spampkg.html
Managed Anti-Spam and Content Filtering - www.isp-planet.com/technology/mssp/2006/mssp6a.html
Spam challenge: the winners! - en.onsoftware.com/spam-challenge-the-winners/

Top of Section Top Index

More Than Just Mail Filtering

Firewalls, routers, DNS and playing it slow.

Top Index

Network Based Spam Filtering

Cutting off IP connectivity to spam sources - spam.abuse.net/adminhelp/ip.shtml
Spam Blocking with a Dynamically Updated Firewall Ruleset - deny-spammers.sourceforge.net/
Packetbl - wiki.duskglow.com/tiki-index.php?page=Packetbl
MAPS RBL BGP Feed Configuration FAQ for Cisco Routers - www.pch.net/documents/tutorials/maps-rbl-bgp-cisco-config-faq.html

Top of Section Top Index

Source Device Fingerprinting

Openbsd's fingerprinting and shaping used for evil^Wgood - use.perl.org/~merlyn/journal/17094
Some p0f Data - taint.org/2006/10/03/193930a.html
Passively OS Fingerprinting Email with PF - blog.insidesystems.net/articles/2006/06/06/OS-Fingerprinting-Email
p0f analyzer - www.ijs.si/software/p0f-analyzer.pl
Exploiting Transport-Level Characteristics of Spam - www.ceas.cc/2008/papers/ceas2008-paper-17.pdf

Top of Section Top Index

Playing it Slow

Top of Section Top Index

Nolisting

Nolisting - www.joreybump.com/code/howto/nolisting.html

Top of Section Top Index

Unlisting

Unlisting: Port Knocking for SMTP - www.joreybump.com/code/howto/unlisting.html

Top of Section Top Index

Blacklisting vs. Content Filters

Some views about the benefits or otherwise of “blacklists” or content filters.

Filters vs. Blacklists - www.paulgraham.com/falsepositives.html
Who Runs The Blocklists? - www.linxnet.com/misc/spam/blocklists.html
Why Content Blocking Does Not Work - www.knujon.com/contentblock.html

Top Index

Distributed Spam Filtering

Distributed or collaborative spam filtering uses shared data to fine-tune filtering algorithms. It usually relies on some means to identify an email as belonging to a larger set of near-identical emails (bulk or campaign identification), and then deciding whether that group of emails (rather than each individual email) is spam or not. It also involves many cooperating sites or individuals who can share their understanding or opinion of messages.

Distributed [spam] Early Warning System - www.radagast.org/~dplatt/dews/dews-design-sketch.txt
Spam Agent Architecture - linux.ucla.edu/~larva/spam-agent/
Attack Resistant Trust Metric Metadata HOWTO - www.levien.com/free/tmetric-HOWTO.html
Spam Inoculation Messages - www.zdziarski.com/papers/draft-spamfilt-inoculation-03.txt
Personalised, Collaborative Spam Filtering (CASSANDRA) - https://www.cs.tcd.ie/publications/tech-reports/reports.04/TCD-CS-2004-36.pdf [1]
Reputation Network Analysis for Email Filtering - trust.mindswap.org/papers/emailPaper/ [1]
Complement Set Filtering - en.wikipedia.org/wiki/Complement_set_email_filtering
SpamWatch - www.cs.berkeley.edu/~zf/spamwatch/
Personalised, collaborative spam filtering - www.ceas.cc/papers-2004/132.pdf
Berkeley Workshop on Collaborative Filtering - www2.sims.berkeley.edu/resources/collab/
Collaborative Filtering Research Papers - jamesthornton.com/cf/
Collaborative Filtering - pespmc1.vub.ac.be/COLLFILT.html
Resolving FP-TP Conflict in Digest-Based Collaborative Spam Detection by Use of Negative Selection Algorithm - www.ceas.cc/2008/papers/ceas2008-paper-56.pdf
An Open Digest-based Technique for Spam Detection - spdp.dti.unimi.it/papers/pdcs04.pdf
Nilsimsa - ixazon.dynip.com/~cmeclax/nilsimsa.html
Nilsimsa (Ruby port) - rubyforge.org/projects/nilsimsa/
Nilsimsa (Perl port) - search.cpan.org/~vipul/Digest-Nilsimsa/Nilsimsa.pm

Top Index

Text Classification Spam Filtering

Most text classification spam filters use machine learning of some form to learn how to filter, rather than building rules manually. First implemented in several client side spam filters, it now holds much more potential as part of a server side spam filter, or as a feed for a DNSBL with less serious collateral damage. Baye's theorem is the buzzword of the day.

Spam Classification Overviews

Introduction to Bayesian Filtering - www.process.com/precisemail/bayesian_filtering.htm
SpamBayes Background Reading - spambayes.sourceforge.net/background.html
Why Bayesian filtering is the most effective - www.gfi.com/whitepapers/why-bayesian-filtering.pdf
Machine Learning for Text Classification - www.daviddlewis.com/publications/slides/lewis-2003-0117-spamconf.html
Filtering Research - www.paulgraham.com/bayeslinks.html
Spam Detection - radio.weblogs.com/0101454/stories/2002/09/16/spamDetection.html
Statistics and the war on spam. In Statistics, A Guide to the Unknown
Stopping spam with statistic - research.microsoft.com/en-us/um/people/joshuago/significance-spam_edited2-times.pdf
A Plan for Spam - www.paulgraham.com/spam.html
Better Bayesian Filtering - www.paulgraham.com/better.html
About Bayesian Spam Filtering - email.about.com/cs/bayesianfilters/a/bayesian_filter.htm

Top of Section Top Index

Spam Classification Bibliographies

Bibliography on Machine Learning for Spam Detection - liinwww.ira.uka.de/bibliography/Ai/MLSpamBibliography.html
www.iis.sinica.edu.tw/~jhwang/spam-paper.html
research.microsoft.com/en-us/um/people/joshuago/spambibliography.mht

Top of Section Top Index

Spam Classification Research

Top of Section Top Index

Spam Classification Research 2008

Adaptive Spam Filtering Using Only Naive Bayes Text Classifiers
Introduction of Fingerprint Vector based Bayesian Method for Spam Filtering
Joint NLP Lab between HIT2 and CEAS Spam-filter Challenge 2008
Toward a stochastic speech act model of email behavior
Personalized Spam Filtering for Gray Mail
Improved Phishing Detection using Model-Based Features
Filtering Email Spam in the Presence of Noisy User Feedback
Advances in online learning-based spam filtering

Top of Section Top Index

Spam Classification Research 2007

Technology Impact Assessment: Fingerprinting versus Bayesian Filtering
Intent Based Filtering of Spam
Improving Spam Filtering by Detecting Gray Mail
Hardening Fingerprinting by Context
Dirichlet-Enhanced Spam Filtering based on Biased Samples
Online Active Learning Methods for Fast Label-Efficient Spam Filtering

Top of Section Top Index

Spam Classification Research 2006

Spam Filtering with Naive Bayes — Which Naive Bayes?
Online Discriminative Spam Filter Training
Batch and Online Spam Filter Comparison
Learning at Low False Positive Rates
Fast Uncertainty Sampling for Labeling Large E-mail Corpora
Breaking Anti-Spam Systems with Parasitic Spam
An Adaptive, Semi-Structured Language Model Approach to Spam Filtering on a New Corpus
An Empirical Study of Clustering Behavior of Spammers and Group-based Anti-Spam Strategies
The challenges of service-side personalized spam filtering: scalability and beyond
Topic Models Based Personalized Spam Filter - see: PDF, slides

Top of Section Top Index

Spam Classification Research 2005

SMTP Path Analysis
Spam Corpus Creation for TREC
Spamato – An Extendable Spam Filter System
GoodWord Attacks on Statistical Spam Filters
Naive Bayes Spam Filtering Using Word-Position-Based Attributes
Scalable and Reliable Collaborative Spam Filters: Harnessing the Global Social Email Networks
Stopping Outgoing Spam by Examining Incoming Server Logs
Comparative Graph Theoretical Characterization of Networks of Spam and Legitimate Email
Spam Deobfuscation using a Hidden Markov Model
Let Your CyberAlter Ego Share Information and Manage Spam
Leveraging Social Networks to Fight Spam

Top of Section Top Index

Spam Classification Research 2004

Canning more than SPAM
A Unified Model Of Spam Filtration
Scalable Centralized Bayesian Spam Mitigation with Bogofilter
Improving spam filtering by combining Naïve Bayes with simple k-nearest neighbor searches
Chung Kwei
The Impact of Feature Selection on Signature-Driven Spam Detection
Word Stemming to Enhance Spam Filtering
Exploring Support Vector Machines and Random Forests for Spam Detection
Filtron: A Learning-Based Anti-Spam Filter
On Attacking Statistical Spam Filters
SpamBayes: Effective open-source email classification system
Trends in Spam Products and Methods
Spamguru: An enterprise anti-spam filtering system
On attacking statistical spam filters
How to beat a bayesian spam filter
Advanced language classification using chained tokens
The plateau at 99.9
The more things change: Volatility and stability in spam features
Behavior based spam detection
Email Mining Toolkit
An artificial neural network spam classifier
Spam filtering using contextual network graphs
Spam, damn spam, and statistics: Using statistical analysis to locate spam web pages
Characterizing Spam Traffic
Bayesian Noise Reduction: Contextual Symmetry Logic Utilizing Pattern Consistency Analysis
Personal Email Networks: An Effective Anti-Spam Tool
Learning to Filter Junk E-Mail from Positive and Unlabeled Examples

Top of Section Top Index

Spam Classification Research 2003

A comparison of event models for naive bayes anti-spam e-mail filtering
On memory-bound functions for fighting spam
Moderately Hard, Memory-bound Functions
A case-based approach to spam filtering that can track concept drift
'In vivo' spam filtering: A challenge problem for data mining
Using latent semantic indexing to filter spam
Parameterization of Naïve Bayes for Spam Filters
A memory-based approach to anti-spam filtering for mailing lists
Spam filters: Bayes vs. chi-squared; letters vs. words
Bayesian spam filtering tweaks
Sparse binary polynomial hash message filtering and the crm114 discriminator
Automatic feature induction for text classification

Top of Section Top Index

Spam Classification Research 2002

Robust Feature Selection by Mutual Information Distributions
Evaluating cost-sensitive unsolicited bulk email categorization

Top of Section Top Index

Spam Classification Research 2001

Boosting Trees for Anti-Spam Email Filtering
Stacking classifiers for anti-spam filtering of e-mail
A Memory-Based Approach to Anti-Spam Filtering for Mailing Lists [1]
SVM-based filtering of e-mail spam with content-specific misclassification costs

Top of Section Top Index

Spam Classification Research 2000

An evaluation of Naïve Bayesian anti-spam filtering
ifile: An application of machine learning to mail filtering [1]
Learning to filter spam-email: A comparision of a naïve Bayesian and memory-based approach
A comparative study of classification-based personal e-mail filtering
An experimental comparison of naive bayesian and keyword-based anti-spam filtering with personal e-mail messages
Combining text and heuristics for cost-sensitive spam filtering

Top of Section Top Index

Spam Classification Research 1999

Naïve-Bayes vs. Rule-Learning in Classification of Email
Performance Comparison between Genetic Programming & Naïve Bayes

Top of Section Top Index

Spam Classification Research 1998 and earlier

A Bayesian Approach to Filtering Junk E-mail [1]
SpamCop: A Spam Classification & Organization Program
Learning Rules that classify Email

Top of Section Top Index

everything you didn't want to have to know about spam

Hosted by spam.abuse.net, with help from Neil Schwartzman. Domain registration by Gregg DesElms. Logo by Art101.
Spam Links Home Creative Commons License
This work is licensed under a Creative Commons License. SPAM is a trademark of Hormel Foods.
Page last updated: 21-Mar-2010