Spam Links

Web Spam

Web spam is mostly aimed at increasing the ranking of spammed links in search engines. To achieve this end links were first spammed in guestbooks, then blogs, referrer logs and now Wikis - anywhere that a user of a site may submit links is vulnerable to spamdexing.

Web Spam Overviews

Fighting Spam for the Search Engines - www.engine-spam.com/
Spamdexing, from Wikipedia - en.wikipedia.org/wiki/Spamdexing
Poly's Anti-Spam-Seite gegen Suchmaschinen-Spamming - geocities.com/polyphydra/Anti_Spam.html - German
Link Spamming - www.rickconner.net/spamweb/spam_linkspam.html
Link spamming - www.spam.co.nz/linkspamming.html
A tube for spam - www.rogelsview.com/technology-and-software/a-tube-for-spam/
SEO Egghead on spam - www.seoegghead.com/blog/category/spam/
Akismet web spam statistics - akismet.com/stats/

Top Index

Web Spam Prevention

Stopping spambots with hashes and honeypots - nedbatchelder.com/text/stopbots.html
\n\t KittenAuth - www.thepcspy.com/kittenauth
ModSecurity - www.modsecurity.org/
ModSecurity rules - gotroot.com/tiki-index.php?page=mod_security%20rules
Protect Web Form - www.protectwebform.com/
LinkSleeve (SLV) Spam Link Verification - www.linksleeve.org/
Strider Search Defender - research.microsoft.com/SearchDefender/
Search engine SPAM detector - tool.motoricerca.info/spam-detector/
PHPrbl - phprbl.init1.nl/
Mod_Access_RBL2 - www.sosdg.org/software
Babycart - apthorpe.cynistar.net/code/babycart/
Keywords Abuse and Search Engine Submission Complaints - www.seopros.org/Processing/Complaints/offenderlist.asp
SEOmoz URL Spam Detection Algorithm - www.seomoz.org/user_files/spam-detection/
Spam vs. Accessibility - www.joedolson.com/articles/2008/06/spam-vs-accessibility/

Top Index

Web Identity Management

OpenID - openid.net/
Facebook Connect - developers.facebook.com/fbconnect.php
Identity 2.0 - identity20.com/

Top Index

Web Spam Research

Asirra - research.microsoft.com/asirra/
Resources for Research on Web Spam - www.yr-bcn.es/webspam/
Adversarial Information Retrieval (AIR) on the Web - airweb.cse.lehigh.edu/
Web spam challenge - webspam.lip6.fr/wiki/pmwiki.php
Introducing the Webb Spam Corpus: Using Email Spam to Identify Web Spam Automatically - www.ceas.cc/2006/6.pdf
Combating Web Spam with TrustRank - dbpubs.stanford.edu:8090/pub/2004-17
Carlos Castillo publications on web information retrieval - www.chato.cl/research/
Using Rank Propagation and Probabilistic Counting for Link-based Spam Detection - www.slideshare.net/ChaToX/linkbased-spam-detection-fws-2006-barcelona
Link analysis for Web spam detection - www.chato.cl/papers/becchetti_2007_link_analysis_web_spam_detection.pdf
Formalization of Link Farm Structure Using Graph Grammar - www.ieeexplore.ieee.org/xpl/freeabs_all.jsp?isnumber=4482669&arnumber=4482802&count=160&index=132
Microsoft Research Text Mining Search and Navigation Research - research.microsoft.com/TMSN/
Transductive Web Spam Detection - www2007.org/workshops/paper_120.pdf - slides
Robust PageRank and Locally Computable Spam Detection Features - airweb.cse.lehigh.edu/2008/submissions/andersen_2008_robust_pagerank_local_spam.pdf

Top Index

Social Networking Spam

Pass it on - blog.facebook.com/blog.php?post=7830237130
Web 2.0 Cracks Start to Show - www.wired.com/science/discoveries/news/2005/10/69366

Top Index

Blog Spam/Comment Spam

Blogs were hailed as a new, open, trusting communications method, but it wasn't long before the spammers came long and wrecked open comments, pings and trackbacks.

Top Index

About Blog Spam

Preventing comment spam - www.google.com/googleblog/2005/01/preventing-comment-spam.html
Blog Spam DB - www.markcarey.com/spamdb/
The "Anti-Spam Feedback" Initiative - www.tamingthebeast.net/antispam/antispam.htm
Seven quick tips for a spam-free blog - cheerleader.yoz.com/archives/000849.html
Comment spam - kalsey.com/2003/09/comment_spam/
Blacklisting Comment Spam - simonwillison.net/2003/Sep/2/blacklisting/
Documented cases of spamming - spammers.chongqed.org/
Defending Against Comment Spam - urbanmainframe.com/folders/blog/20040323/
Spam Huntress - spamhuntress.com/
Bloggers Declare War on Comment Spam, but Can They Win? - ojr.org/ojr/glaser/1095201311.php
Push back - www.rojisan.com/spam/
Damn Spam! - spam.tinyweb.net/
Blog Spammer Caught. Now What? - www.spam-blocker-resource.com/
Building a Better Spam Detector - www.seomoz.org/blog/building-a-better-spam-detector

Top of Section Top Index

Stop Wordpress Blog Spam

Combating Comment Spam - codex.wordpress.org/Combating_Comment_Spam
Spam Tools - codex.wordpress.org/Plugins/Spam_Tools
Combat Comment Spam - www.tamba2.org.uk/wordpress/spam/
Spam Karma - unknowngenius.com/blog/wordpress/spam-karma/
Anti-spam Plugin - weblog.sinteur.com/2004/11/yet-another-anti-spam-measure/
Akismet - akismet.com/
SpamWords - dev.wp-plugins.org/wiki/SpamWords
Spaminator - dev.wp-plugins.org/wiki/Spaminator
Trackback Validator Plugin - seclab.cs.rice.edu/proj/trackback/trackback-validator-plugin/

Top of Section Top Index

Stop Movable Type Blog Spam

Learning Movable Type: Concerning Spam - www.learningmovabletype.com/a/000246concerning_spam/
Six Apart Guide to Comment Spam - www.sixapart.com/pronet/comment_spam.html
MT-Blacklist/Comment Spam Clearinghouse - jayallen.org/comment_spam/
Keystrokes Plugin - overstated.net/projects/mt-keystrokes/
SpamLookup - bradchoate.com/projects/spamlookup/
Using mod_security to shield Movable Type from Blog Comment Spam - jeremy.zawodny.com/blog/archives/007442.html

Top of Section Top Index

Stop Other Blog Spam

Word banning in TypePad - www.sixapart.com/typepad/news/2006/05/word_banning_an.html
Akismet API and implementations - akismet.com/development/
Clean up link spam on your Plone site - plone.org/documentation/how-to/clean-up-link-spam-on-your-site
Akismet for Ruby on Rails - rubyforge.org/projects/ror-akismet/
Fight Trac spam - madwifi.org/wiki/FightingTracSpam
Trac spam filter - trac.edgewall.org/wiki/SpamFilter
Drupal Spam Module - drupal.org/node/11104
Defensio - defensio.com/ (Reviews: 1)
Mollom - mollom.com/
Akismet spam graphs with PHP RRD - www.ioncannon.net/php/113/akismet-spam-graphs-with-php-rrd/
More Spam Fun (Akismet) - blog.joshuaeichorn.com/archives/2006/12/21/more-spam-fun/

Top of Section Top Index

Trackback Blog Spam

Trackback Spam Resources - seclab.cs.rice.edu/proj/trackback/

Top of Section Top Index

Spam Blogs (Splogs)

Fight Splog - fightsplog.blogspot.com/
Flagday - flagday.pbwiki.com/
Splog - wiki.chongqed.org/Splog
AntiSplog - antisplog.phpmagazine.net/
Destroy all Malware - www.kbcafe.com/spam/
On Spam Removals - buzz.blogger.com/2006/04/on-spam-removals.html
Splog software from Hell - ebiquity.umbc.edu/blogger/2006/04/03/splog-software-from-hell/
Detecting Spam Blogs: A Machine Learning Approach - ebiquity.umbc.edu/paper/html/id/296/Detecting-Spam-Blogs-A-Machine-Learning-Approach
Characterizing the Splogosphere - ebiquity.umbc.edu/paper/html/id/299/Characterizing-the-Splogosphere
Spam blog - en.wikipedia.org/wiki/Spam_blog
On Auto-Bloggers - www.talkbiz.net/ramblings/archives.php?id=A2006061
Twingly.com — spam-free blog search - www.twingly.com/search?q=&spam-free=Spam-free+search+(beta)

Top of Section Top Index

Spam Pings (Spings)

FeedThirsty - www.feedthirsty.com/
Pings, spings, splogs and the Splogosphere - ebiquity.umbc.edu/blogger/2007/02/01/pings-spings-splogs-and-the-splogosphere-2007-updates/
Welcome to the Splogosphere - ebiquity.umbc.edu/blogger/2005/12/15/welcome-to-the-splogosphere-75-of-new-blog-posts-are-spam/
Spings on Wikipedia - en.wikipedia.org/wiki/Sping
State of the Splogosphere, Part III - www.kbcafe.com/spam/?guid=20060401093834

Top of Section Top Index

Wiki Spam/LinkSpam

Top Index

About Wiki Spam

Fight Wiki Spam - chongqed.org/fightback.html
BannedContent Discussion - www.communitywiki.org/cw/BannedContentDiscussion
SpamBusters - www.communitywiki.org/cw?SpamBusters
Wikipedia Nofollows Links - blogoscoped.com/archive/2007-01-22-n21.html
WikiSpam - www.usemod.com/cgi-bin/mb.pl?WikiSpam
WikiSpam - www.cocoadev.com/?WikiSpam

Top of Section Top Index

Anti-WikiSpam Solutions and Filters

MediaWiki anti-spam features - www.mediawiki.org/wiki/Anti-spam_features
SpamBlacklist for MediaWiki - www.mediawiki.org/wiki/SpamBlacklist_extension
Blocking Spam in MediaWiki - wiki.evernex.com/index.php?title=Blocking_Spam_in_Mediawiki
MediaWiki Spam Filter project - www.mediawiki.org/wiki/Spam_Filter
Proxy blocking discussion at Wikimedia - meta.wikimedia.org/wiki/Proxy_blocking
MoinMoin AntiSpamGlobalSolution - moinmo.in/AntiSpamGlobalSolution
Kwiki DNSBL - svn.kwiki.org/jooon/Kwiki-DNSBL/
TWiki Blacklist Plugin - twiki.org/cgi-bin/view/Plugins/BlackListPlugin
Zimbra Anti-spam - wiki.zimbra.com/index.php?title=Category:Anti-spam

Top of Section Top Index

Blog and Wiki Spam Blacklists

Much blog and wiki spam blocking is based on identifying spammy keywords, URLs or IPs, much like the blacklists of words, URLs and IPs that have been built to counter email spam. Lists designed for one wiki or blog are probably useful (with a bit of tweaking) for any other Web 2.0 application.

MoinMoin BadContent - master.moinmo.in/BadContent
Wikia Spam Blacklist - www.wikia.com/wiki/Spam_Blacklist
Mozilla wiki spam blacklist - wiki.mozilla.org/Spam_blacklist
Wikihow spam blacklist - www.wikihow.com/Spam-Blacklist
S23 spam blacklist - s23.org/wiki/Spam_blacklist
Audacity spam blacklist - audacityteam.org/wiki/index.php?title=Audacity_spam_blacklist
Perplexcity wiki spam blacklist - perplexcitywiki.com/wiki/Meta:Spam_Blacklist
Spamikaze BadContent - spamikaze.org/BadContent
Wiki Spammers - www.openwiki.com/ow.asp?WikiSpammers - IPs
Meatball Ban List - www.usemod.com/cgi-bin/mb.pl?BanList - IPs and hostnames
Wiki Vandals - rollerweblogger.org/wiki/Wiki.jsp?page=WikiVandals
\n\t WikiSpam - twiki.org/cgi-bin/view/Codev/WikiSpam
Trac bad content - trac.edgewall.org/wiki/BadContent

Top Index

Forum Spam

bbAntiSpam: phpBB antispam solution - bbantispam.com/
phpBB ree Anti-Spam Check - www.phpbb-security.com/check.php
Disable phpBB Spambots - www.phpbb.com/files/mods/disable-spambots-1.0.1.mod
ForumBan - www.sugapablo.com/forumban/
Forum Equalizer? Spam Equalizer - www.talkbiz.net/ramblings/archives.php?id=A2006111

Top Index

Guestbook Spam

Fight Guest Book and Public Forum Spam - www.unwantedlinks.com/GuestBookSpam.html

Top Index

Referrer Spam

Analytics spam: coming to an internet near you - www.seomoz.org/blog/analytics-spam-coming-to-an-internet-near-you
REF(errer)SPAM FUCKER 3000 - docs.g-blog.net/code/RefSpamFucker/3000/refspamfucker3000.php.txt
Adminshop - visualintensity.com/adminshop-com/
Get Rid of Referer Spammers - www.topsiteswebdirectory.com/referer_spam/

Top Index

Web Adverts

These sites list hostnames that are associated with web advertising, which some consider to be as offensive as spam. The ads may be viewed in bulk, but unless they were placed onto the site in question without the permission of the owner they are not unsolicited, so they're not strictly spam.

Hostess - accs-net.com/hostess/
KillHost - www.sillysot.com/other.htm
Using the Hosts File - www.accs-net.com/hosts/
Spamblocked.com hosts file - www.spamblocked.com/hosts.html
How To Block Ads (& Web Bugs) Without Extra Software - ssmedia.com/Utilities/hosts/
List of ad servers - pgl.yoyo.org/adservers/
Blocking Unwanted Parasites with a Hosts File - www.mvps.org/winhelp2002/hosts.htm
Mike Skallas' Ad Blocking Hosts file - www.everythingisnt.com/hosts.html
someonewhocares hosts file - someonewhocares.org/hosts/
Hosts file updates - datadragon.com/banners/hosts.shtml

Top Index

everything you didn't want to have to know about spam

Hosted by spam.abuse.net, with help from Neil Schwartzman. Domain registration by Gregg DesElms. Logo by Art101.
Spam Links Home Creative Commons License
This work is licensed under a Creative Commons License. SPAM is a trademark of Hormel Foods.
Unsubscribe
Page last updated: 09-Aug-2008