
Web spam is mostly aimed at increasing the ranking of spammed links in search engines. To achieve this end links were first spammed in guestbooks, then blogs, referrer logs and now Wikis - anywhere that a user of a site may submit links is vulnerable to spamdexing.
Fighting Spam for the Search Engines - www.engine-spam.com/
Spamdexing, from Wikipedia - en.wikipedia.org/wiki/Spamdexing
Poly's Anti-Spam-Seite gegen Suchmaschinen-Spamming - geocities.com/polyphydra/Anti_Spam.html - German
Link Spamming - www.rickconner.net/spamweb/spam_linkspam.html
Link spamming - www.spam.co.nz/linkspamming.html
A tube for spam - www.rogelsview.com/technology-and-software/a-tube-for-spam/
SEO Egghead on spam - www.seoegghead.com/blog/category/spam/
Akismet web spam statistics - akismet.com/stats/
Stopping spambots with hashes and honeypots - nedbatchelder.com/text/stopbots.html
\n\t KittenAuth - www.thepcspy.com/kittenauth
ModSecurity - www.modsecurity.org/
ModSecurity rules - gotroot.com/tiki-index.php?page=mod_security%20rules
Protect Web Form - www.protectwebform.com/
LinkSleeve (SLV) Spam Link Verification - www.linksleeve.org/
Strider Search Defender - research.microsoft.com/SearchDefender/
Search engine SPAM detector - tool.motoricerca.info/spam-detector/
PHPrbl - phprbl.init1.nl/
Mod_Access_RBL2 - www.sosdg.org/software
Babycart - apthorpe.cynistar.net/code/babycart/
Keywords Abuse and Search Engine Submission Complaints - www.seopros.org/Processing/Complaints/offenderlist.asp
SEOmoz URL Spam Detection Algorithm - www.seomoz.org/user_files/spam-detection/
Spam vs. Accessibility - www.joedolson.com/articles/2008/06/spam-vs-accessibility/
OpenID - openid.net/
Facebook Connect - developers.facebook.com/fbconnect.php
Identity 2.0 - identity20.com/
Asirra - research.microsoft.com/asirra/
Resources for Research on Web Spam - www.yr-bcn.es/webspam/
Adversarial Information Retrieval (AIR) on the Web - airweb.cse.lehigh.edu/
Web spam challenge - webspam.lip6.fr/wiki/pmwiki.php
Introducing the Webb Spam Corpus: Using Email Spam to Identify Web Spam Automatically - www.ceas.cc/2006/6.pdf
Combating Web Spam with TrustRank - dbpubs.stanford.edu:8090/pub/2004-17
Carlos Castillo publications on web information retrieval - www.chato.cl/research/
Using Rank Propagation and Probabilistic Counting for Link-based Spam Detection - www.slideshare.net/ChaToX/linkbased-spam-detection-fws-2006-barcelona
Link analysis for Web spam detection - www.chato.cl/papers/becchetti_2007_link_analysis_web_spam_detection.pdf
Formalization of Link Farm Structure Using Graph Grammar - www.ieeexplore.ieee.org/xpl/freeabs_all.jsp?isnumber=4482669&arnumber=4482802&count=160&index=132
Microsoft Research Text Mining Search and Navigation Research - research.microsoft.com/TMSN/
Transductive Web Spam Detection - www2007.org/workshops/paper_120.pdf - slides
Robust PageRank and Locally Computable Spam Detection Features - airweb.cse.lehigh.edu/2008/submissions/andersen_2008_robust_pagerank_local_spam.pdf
Pass it on - blog.facebook.com/blog.php?post=7830237130
Web 2.0 Cracks Start to Show - www.wired.com/science/discoveries/news/2005/10/69366
Blogs were hailed as a new, open, trusting communications method, but it wasn't long before the spammers came long and wrecked open comments, pings and trackbacks.
Preventing comment spam - www.google.com/googleblog/2005/01/preventing-comment-spam.html
Blog Spam DB - www.markcarey.com/spamdb/
The "Anti-Spam Feedback" Initiative - www.tamingthebeast.net/antispam/antispam.htm
Seven quick tips for a spam-free blog - cheerleader.yoz.com/archives/000849.html
Comment spam - kalsey.com/2003/09/comment_spam/
Blacklisting Comment Spam - simonwillison.net/2003/Sep/2/blacklisting/
Documented cases of spamming - spammers.chongqed.org/
Defending Against Comment Spam - urbanmainframe.com/folders/blog/20040323/
Spam Huntress - spamhuntress.com/
Bloggers Declare War on Comment Spam, but Can They Win? - ojr.org/ojr/glaser/1095201311.php
Push back - www.rojisan.com/spam/
Damn Spam! - spam.tinyweb.net/
Blog Spammer Caught. Now What? - www.spam-blocker-resource.com/
Building a Better Spam Detector - www.seomoz.org/blog/building-a-better-spam-detector
Combating Comment Spam - codex.wordpress.org/Combating_Comment_Spam
Spam Tools - codex.wordpress.org/Plugins/Spam_Tools
Combat Comment Spam - www.tamba2.org.uk/wordpress/spam/
Spam Karma - unknowngenius.com/blog/wordpress/spam-karma/
Anti-spam Plugin - weblog.sinteur.com/2004/11/yet-another-anti-spam-measure/
Akismet - akismet.com/
SpamWords - dev.wp-plugins.org/wiki/SpamWords
Spaminator - dev.wp-plugins.org/wiki/Spaminator
Trackback Validator Plugin - seclab.cs.rice.edu/proj/trackback/trackback-validator-plugin/
Learning Movable Type: Concerning Spam - www.learningmovabletype.com/a/000246concerning_spam/
Six Apart Guide to Comment Spam - www.sixapart.com/pronet/comment_spam.html
MT-Blacklist/Comment Spam Clearinghouse - jayallen.org/comment_spam/
Keystrokes Plugin - overstated.net/projects/mt-keystrokes/
SpamLookup - bradchoate.com/projects/spamlookup/
Using mod_security to shield Movable Type from Blog Comment Spam - jeremy.zawodny.com/blog/archives/007442.html
Word banning in TypePad - www.sixapart.com/typepad/news/2006/05/word_banning_an.html
Akismet API and implementations - akismet.com/development/
Clean up link spam on your Plone site - plone.org/documentation/how-to/clean-up-link-spam-on-your-site
Akismet for Ruby on Rails - rubyforge.org/projects/ror-akismet/
Fight Trac spam - madwifi.org/wiki/FightingTracSpam
Trac spam filter - trac.edgewall.org/wiki/SpamFilter
Drupal Spam Module - drupal.org/node/11104
Defensio - defensio.com/ (Reviews: 1)
Mollom - mollom.com/
Akismet spam graphs with PHP RRD - www.ioncannon.net/php/113/akismet-spam-graphs-with-php-rrd/
More Spam Fun (Akismet) - blog.joshuaeichorn.com/archives/2006/12/21/more-spam-fun/
Trackback Spam Resources - seclab.cs.rice.edu/proj/trackback/
Fight Splog - fightsplog.blogspot.com/
Flagday - flagday.pbwiki.com/
Splog - wiki.chongqed.org/Splog
AntiSplog - antisplog.phpmagazine.net/
Destroy all Malware - www.kbcafe.com/spam/
On Spam Removals - buzz.blogger.com/2006/04/on-spam-removals.html
Splog software from Hell - ebiquity.umbc.edu/blogger/2006/04/03/splog-software-from-hell/
Detecting Spam Blogs: A Machine Learning Approach - ebiquity.umbc.edu/paper/html/id/296/Detecting-Spam-Blogs-A-Machine-Learning-Approach
Characterizing the Splogosphere - ebiquity.umbc.edu/paper/html/id/299/Characterizing-the-Splogosphere
Spam blog - en.wikipedia.org/wiki/Spam_blog
On Auto-Bloggers - www.talkbiz.net/ramblings/archives.php?id=A2006061
Twingly.com — spam-free blog search - www.twingly.com/search?q=&spam-free=Spam-free+search+(beta)
FeedThirsty - www.feedthirsty.com/
Pings, spings, splogs and the Splogosphere - ebiquity.umbc.edu/blogger/2007/02/01/pings-spings-splogs-and-the-splogosphere-2007-updates/
Welcome to the Splogosphere - ebiquity.umbc.edu/blogger/2005/12/15/welcome-to-the-splogosphere-75-of-new-blog-posts-are-spam/
Spings on Wikipedia - en.wikipedia.org/wiki/Sping
State of the Splogosphere, Part III - www.kbcafe.com/spam/?guid=20060401093834
Fight Wiki Spam - chongqed.org/fightback.html
BannedContent Discussion - www.communitywiki.org/cw/BannedContentDiscussion
SpamBusters - www.communitywiki.org/cw?SpamBusters
Wikipedia Nofollows Links - blogoscoped.com/archive/2007-01-22-n21.html
WikiSpam - www.usemod.com/cgi-bin/mb.pl?WikiSpam
WikiSpam - www.cocoadev.com/?WikiSpam
MediaWiki anti-spam features - www.mediawiki.org/wiki/Anti-spam_features
SpamBlacklist for MediaWiki - www.mediawiki.org/wiki/SpamBlacklist_extension
Blocking Spam in MediaWiki - wiki.evernex.com/index.php?title=Blocking_Spam_in_Mediawiki
MediaWiki Spam Filter project - www.mediawiki.org/wiki/Spam_Filter
Proxy blocking discussion at Wikimedia - meta.wikimedia.org/wiki/Proxy_blocking
MoinMoin AntiSpamGlobalSolution - moinmo.in/AntiSpamGlobalSolution
Kwiki DNSBL - svn.kwiki.org/jooon/Kwiki-DNSBL/
TWiki Blacklist Plugin - twiki.org/cgi-bin/view/Plugins/BlackListPlugin
Zimbra Anti-spam - wiki.zimbra.com/index.php?title=Category:Anti-spam
Much blog and wiki spam blocking is based on identifying spammy keywords, URLs or IPs, much like the blacklists of words, URLs and IPs that have been built to counter email spam. Lists designed for one wiki or blog are probably useful (with a bit of tweaking) for any other Web 2.0 application.
MoinMoin BadContent - master.moinmo.in/BadContent
Wikia Spam Blacklist - www.wikia.com/wiki/Spam_Blacklist
Mozilla wiki spam blacklist - wiki.mozilla.org/Spam_blacklist
Wikihow spam blacklist - www.wikihow.com/Spam-Blacklist
S23 spam blacklist - s23.org/wiki/Spam_blacklist
Audacity spam blacklist - audacityteam.org/wiki/index.php?title=Audacity_spam_blacklist
Perplexcity wiki spam blacklist - perplexcitywiki.com/wiki/Meta:Spam_Blacklist
Spamikaze BadContent - spamikaze.org/BadContent
Wiki Spammers - www.openwiki.com/ow.asp?WikiSpammers - IPs
Meatball Ban List - www.usemod.com/cgi-bin/mb.pl?BanList - IPs and hostnames
Wiki Vandals - rollerweblogger.org/wiki/Wiki.jsp?page=WikiVandals
\n\t WikiSpam - twiki.org/cgi-bin/view/Codev/WikiSpam
Trac bad content - trac.edgewall.org/wiki/BadContent
bbAntiSpam: phpBB antispam solution - bbantispam.com/
phpBB ree Anti-Spam Check - www.phpbb-security.com/check.php
Disable phpBB Spambots - www.phpbb.com/files/mods/disable-spambots-1.0.1.mod
ForumBan - www.sugapablo.com/forumban/
Forum Equalizer? Spam Equalizer - www.talkbiz.net/ramblings/archives.php?id=A2006111
Fight Guest Book and Public Forum Spam - www.unwantedlinks.com/GuestBookSpam.html
Analytics spam: coming to an internet near you - www.seomoz.org/blog/analytics-spam-coming-to-an-internet-near-you
REF(errer)SPAM FUCKER 3000 - docs.g-blog.net/code/RefSpamFucker/3000/refspamfucker3000.php.txt
Adminshop - visualintensity.com/adminshop-com/
Get Rid of Referer Spammers - www.topsiteswebdirectory.com/referer_spam/
These sites list hostnames that are associated with web advertising, which some consider to be as offensive as spam. The ads may be viewed in bulk, but unless they were placed onto the site in question without the permission of the owner they are not unsolicited, so they're not strictly spam.
Hostess - accs-net.com/hostess/
KillHost - www.sillysot.com/other.htm
Using the Hosts File - www.accs-net.com/hosts/
Spamblocked.com hosts file - www.spamblocked.com/hosts.html
How To Block Ads (& Web Bugs) Without Extra Software - ssmedia.com/Utilities/hosts/
List of ad servers - pgl.yoyo.org/adservers/
Blocking Unwanted Parasites with a Hosts File - www.mvps.org/winhelp2002/hosts.htm
Mike Skallas' Ad Blocking Hosts file - www.everythingisnt.com/hosts.html
someonewhocares hosts file - someonewhocares.org/hosts/
Hosts file updates - datadragon.com/banners/hosts.shtml